Documentation Index
Fetch the complete documentation index at: https://docs.voxworks.ai/llms.txt
Use this file to discover all available pages before exploring further.
Why Thinking Effort Matters
Language models can struggle with certain types of reasoning, particularly:- Numbers and calculations — Arithmetic, quantities, totals, percentages
- Dates and scheduling — Day of week calculations, time differences, availability checks
- Logic and comparisons — If/then reasoning, comparing options, eligibility checks
- Multi-step reasoning — Tasks requiring several logical steps to reach a conclusion
- Fast — Approximately 200ms faster than normal
- Normal — Baseline latency
- Deep — Approximately 500ms slower than normal
What is Thinking Effort?
When the assistant generates a response, it can use different levels of reasoning:- Fast — Quick, direct responses for simple situations
- Normal — Balanced reasoning for standard interactions
- Deep — Deep reasoning for complex or important moments
Effort Levels
| Level | Response Speed | Reasoning Depth | Best For |
|---|---|---|---|
| fast | Fastest | Surface-level | Simple acknowledgments, quick replies |
| normal | Balanced | Moderate | Standard conversation, most steps |
| deep | Slower | Deep | Complex questions, important decisions |
When to Use Each Level
Fast
Use for steps where you want quicker responses:- Acknowledgments and confirmations
- Simple follow-up questions
- Transitions between topics
- Routine conversation
Normal (Default)
Use for:- Standard questions requiring context
- Responses that need to incorporate multiple factors
- Most conversational turns
Deep
Use for:- Complex questions or objections
- Sensitive topics requiring careful handling
- Important decision points
- When accuracy is critical
Interaction with Other Settings
| Combined With | Effect |
|---|---|
| Patient eagerness | Wait longer + think deeper = very deliberate |
| Keen eagerness | Fast effort is typical; deep effort adds delay |
| Patient silence tolerance | Deep effort makes sense — user is thinking too |
Best Practices
- Default to normal — Start with normal effort and adjust from there
- Elevate strategically — Use deep effort for moments that matter
- Consider step complexity — If a step has more than 3 conditions or requires quantitative/logical reasoning, consider using deep effort
- Test response quality — Verify fast effort responses are still good
Next Steps
- Response Eagerness — Control response timing
- Silence Tolerance — Handle idle users
- Overview — See all conversation dynamics

