Current models
Augment uses world-class large language models together with our Context Engine. We currently support the following models:- Claude Haiku 4.5 by Anthropic
- Claude Opus 4.5 by Anthropic
- Claude Opus 4.6 by Anthropic
- Claude Opus 4.7 by Anthropic
- Claude Sonnet 4 by Anthropic
- Claude Sonnet 4.5 by Anthropic
- Claude Sonnet 4.6 by Anthropic
- Gemini 3.1 Pro by Google
- GPT-5.1 by OpenAI
- GPT-5.2 by OpenAI
- GPT-5.4 by OpenAI
- GPT-5.5 by OpenAI
- Kimi K2.6 by Moonshot AI (hosted on Fireworks and Baseten)
Choosing a model
You can select the model directly using the Model Picker in the Augment.- In VS Code and JetBrains, open the Augment panel and use the model dropdown near the input box to switch models.
- In Auggie CLI, use the
/modelslash command or pass the--modelflag with the desired model. - Your selection applies only to Agent in that workspace and can be changed at any time.
Model pricing
Augment uses a credit-based pricing system. Different models consume credits at different rates based on their capabilities and costs:- Sonnet 4.6: Baseline credit consumption - latest generation Sonnet, balanced capability for medium or large tasks
- Sonnet 4.5: Baseline credit consumption - previous-generation Sonnet model, same pricing as Sonnet 4.6
- Gemini 3.1 Pro: ~92% of Sonnet’s cost - excels at structural thinking, planning, debugging, investigation, and daily execution. Runs at 2.6x cheaper per message compared to Opus 4.6
- Haiku 4.5: ~30% of Sonnet’s cost - ideal for quick, simple tasks
- GPT-5.1: ~75% of Sonnet’s cost - great for medium-complexity work
- GPT-5.4: ~72% of Sonnet’s cost - advanced GPT generation, excels at computer use and multi-agent orchestration
- Kimi K2.6: ~50% cheaper than Sonnet - best for cheap agentic work; smarter than Haiku and roughly 3-4x cheaper than Opus 4.7. By Moonshot AI, hosted on Fireworks and Baseten.
- Opus 4.5: ~167% of Sonnet’s cost - previous-generation Opus model (legacy)
- Opus 4.6: ~167% of Sonnet’s cost - previous-generation Opus model (legacy)
- Opus 4.7: ~167% of Sonnet’s cost - newest and most capable Opus model, excels at long-running tasks and complex reasoning
- GPT-5.2: ~133% of Sonnet’s cost - enhanced reasoning for complex tasks
- GPT-5.5: ~143% of Sonnet’s cost - most advanced GPT option, designed for highest-complexity coding, computer use, and multi-agent orchestration
Premium models require a credit-based usage plan and may consume credits at a higher rate than other models.
🎉 Launch Promo: Claude Opus 4.7 credits are 50% off until April 30, 2026!
Credit-Based Pricing
Learn more about credit costs, see detailed examples, and discover tips for optimizing your credit usage
Feature compatibility
All listed models support the core Augment Agent features:- Deep code understanding with Augment’s Context Engine
- Tool use (integrations and MCP), file edits, and multi-step planning
Notes and transparency
- We may roll out model updates gradually. If you’re part of a staged rollout, different workspaces or teammates may see updates at different times.
- For troubleshooting, you can share the request ID with our support team; they can confirm which model handled a specific request.