AI Agent
BYOK Costs
Cost model for AI conversations with bring-your-own-key providers.
Ticket0 tracks estimated AI cost per task in ai_usage and surfaces breakdowns in Settings → AI → Cost Overview.
Typical cost profile per conversation
Most conversations include multiple small calls:
- classification/routing/language tasks (Flash or Haiku-class)
- draft generation (Sonnet-quality)
- optional retrieval and evaluation tasks
In practice, low-complexity conversations are usually a few tenths of a cent. Complex, long-thread conversations cost more because token usage is higher.
Model price assumptions used by the service
Ticket0 estimates cost from token usage with per-1M-token rates in the AI service:
google/gemini-2.5-flash: $0.15 input / $0.60 outputclaude-3-5-haiku-20241022: $0.80 input / $4.00 outputclaude-sonnet-4-6: $3.00 input / $15.00 outputtext-embedding-3-small: $0.02 input
Cost formula
Estimated call cost:
(input_tokens * input_rate + output_tokens * output_rate) / 1_000_000
Conversation cost is the sum of all calls made for that conversation.
Optimization tips
- Keep KB content focused and deduplicated
- Raise auto-send thresholds only where needed
- Use lower-cost models for non-drafting tasks
- Trim very long historical context where possible
- Monitor high-cost task types in Cost Overview regularly
For setup details, see AI & BYOK.