yourang.ai Logo
Back to the platform
Documentation
Get started
  • What the platform is
  • First access
  • Platform overview
  • Quick glossary
Dashboard Assistant
  • What the dashboard assistant is
  • How to use it
  • What it can do
  • Safety and privacy
  • Plans and usage limits
Voice agent
  • Basic configuration
  • Voices and language
  • Instructions
  • Prompt tokens
  • Advanced settings
  • Evaluation criteria
  • Testing and playground
  • Updating instructions
  • External Tools
  • Built-in tools
  • MCP servers
Models offered
  • What models are and why they matter
  • Voice models
  • AI models
  • Choosing the right combination
Documents and knowledge base
  • What the knowledge base is
  • Uploading and managing documents
  • How search works during a call
  • Best practices
Calls
  • Call history
  • Transcripts and summaries
  • Audio recordings
  • Filters and search
  • Data export
Call transfers
  • When and why to transfer a call
  • Departments
  • System tools
  • AI after-hours
  • Routing rules
  • Operator app (iOS / Android)
WhatsApp
  • WhatsApp Business in yourang.ai
  • Real-time chat
  • Automations
  • Approved templates
  • WhatsApp contacts and lists
  • WhatsApp AI agents
Actions and campaigns
  • What actions are
  • SMS campaigns
  • Email campaigns
  • Scheduling and batch sends
Reservations
  • Calendar view
  • Availability rules
  • Confirmations and reminders
  • Changes and cancellations
Contacts
  • Customer directory
  • CSV import
  • Lists and segments
  • Custom fields
Shop and catalogue
  • The shop in yourang.ai
  • Product and service catalogue
  • Order management
  • OCR and price-list import
Integrations
  • Connect Apple Calendar
  • Connect HubSpot
  • Integrations overview
  • Calendar
  • WhatsApp
  • SMS and email
  • Business software and PMS
  • Outbound webhooks
Workflows
  • What workflows are
  • Nodes and blocks
  • Triggers and webhooks
  • Practical examples
Call center and dialer
  • What the yourang.ai call center does
  • Outbound campaigns
  • Human operators
  • Contact lists and live sync
  • Operator panel
External APIs and developers
  • yourang.ai for developers
  • API keys and authentication
  • Main endpoints
  • Incoming webhooks
Use cases
  • Hotels and accommodation
  • Restaurant
  • B&Bs and short-term rentals
  • Beauty center and spa
  • Travel agency
Pricing and plans
  • How pricing works
  • Subscription plans
  • Wallet and credits
  • Consumption and invoices
  • Changing, suspending, or cancelling the plan
Management
  • Account and organization
  • Billing and subscription
  • Team and roles
  • Security and privacy
  • Notifications
Business information
  • Business details
  • Location and address
  • Opening hours
  • AI assistant hours
  • Departments and team
Resources
  • Frequently asked questions
  • Complete glossary
  • Support
Documentation›Voice agent›Prompt tokens

Voice agent

Prompt tokens

Understand and manage your assistant's prompt tokens

Every assistant sends the model a fixed prompt on each call: your instructions, the active tools, the business context, and the FAQs. This prompt is measured in tokens and has a limit. Keeping it under control keeps the assistant fast, cost-effective, and reliable.

What tokens are

Tokens are the units the model uses to read text: a common word is worth about one token, while long or rare words take up more. The indicator at the top shows the prompt's static tokens (the ones always present) against the agent's limit.

The limit and the states

The indicator changes color based on how close you are to the limit:

Healthy (green)
You are well within the limit: the assistant has enough room to reason and respond.
Warning (yellow)
You are getting close to the limit. Start trimming the longest parts before adding anything else.
Over the limit (red)
You have exceeded the limit: the assistant cannot be activated until you reduce the prompt.

What consumes tokens

The summary in the indicator breaks tokens down by source, so you know where to act:

Custom instructions
The instructions you write for the assistant. Usually the heaviest item: this is where it's worth cutting first.
Tool schemas
The description of each active tool (built-in, external, or MCP). The more active tools, the more tokens.
Business context
Your business data (name, hours, address) included automatically in the prompt.
FAQs
The questions and answers from your knowledge base that get included in the prompt.

How to reduce tokens

  • Shorten the instructions: short sentences, only the cases that really matter, no repetition.
  • Turn off the tools you don't use: every active tool adds its schema to the prompt.
  • Move detailed information into the FAQs instead of putting it all in the instructions.
  • Review periodically: as you add features, the prompt grows without you noticing.

Over the limit the agent won't activate

If the prompt exceeds the limit, the assistant stays disabled until you bring it back below the threshold. Keep a margin: a prompt close to the limit leaves the model little room to reason during the conversation.

Was this page helpful?

PreviousInstructionsNextAdvanced settings