Mistral Small 3
Provider: Mistral. Mistral's cost-effective model, very affordable for general-purpose tasks.
Mistral Small 3 Pricing
| Token Type | Price per Million Tokens (USD) |
|---|---|
| Input tokens | $0.100 |
| Output tokens | $0.300 |
Estimated Cost by Project Size
Realistic cost estimates for common coding scenarios. Assumes 30% cache hit rate where caching is available.
| Scenario | Token Usage | Estimated Cost |
|---|---|---|
| Small Script (1K lines) | 50K input / 30K output | $0.01 |
| Medium Feature (10K lines) | 500K input / 200K output | $0.10 |
| Large Project (50K lines) | 2,500K input / 1,000K output | $0.47 |
| Code Review (5K lines) | 250K input / 25K output | $0.02 |
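The arithmetic behind these estimates can be sketched as follows. This is a minimal illustration, not the page's actual calculator: the per-million prices come from the pricing table above, but the page does not say how cached tokens are billed, so `cached_price_factor` is a hypothetical parameter (the price of a cached input token relative to an uncached one). The function name `estimate_cost` is likewise an assumption.

```python
# Prices from the pricing table above, in USD per million tokens.
INPUT_PRICE_PER_M = 0.100
OUTPUT_PRICE_PER_M = 0.300


def estimate_cost(input_tokens, output_tokens,
                  cache_hit_rate=0.0, cached_price_factor=0.0):
    """Estimate the cost of a job in USD.

    cache_hit_rate: fraction of input tokens served from cache (0.0 to 1.0).
    cached_price_factor: ASSUMPTION, not stated on the page. Price of a
        cached input token relative to an uncached one (0.0 means free,
        1.0 means no discount).
    """
    cached = input_tokens * cache_hit_rate
    uncached = input_tokens - cached
    billable_input = uncached + cached * cached_price_factor
    input_cost = billable_input * INPUT_PRICE_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    return input_cost + output_cost


# Token counts copied from the scenario table above.
scenarios = {
    "Small Script": (50_000, 30_000),
    "Medium Feature": (500_000, 200_000),
    "Large Project": (2_500_000, 1_000_000),
    "Code Review": (250_000, 25_000),
}

for name, (inp, out) in scenarios.items():
    uncached = estimate_cost(inp, out)
    with_cache = estimate_cost(inp, out, cache_hit_rate=0.30)
    print(f"{name}: ${uncached:.4f} uncached, "
          f"${with_cache:.4f} with 30% cache hits")
```

With a 30% hit rate and cached tokens treated as free, the sketch yields roughly $0.0125, $0.095, $0.475, and $0.025 for the four scenarios, each within a cent of the table's figures. Without any caching discount the Large Project scenario comes to $0.55, so the listed $0.47 appears to depend on the caching assumption.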
Benchmark Performance — Mistral Small 3
Aggregated scores from published third-party benchmarks, normalized to a 0-100 scale; higher is better. SWE-bench measures real GitHub issue resolution. LiveCodeBench measures competitive programming ability. HumanEval measures basic code generation. BigCodeBench measures practical, multi-step coding tasks.
Sources: SWE-bench Verified, LiveCodeBench, HumanEval, BigCodeBench
Get Access to Mistral Small 3
Ready to start using Mistral Small 3? Get API access directly from Mistral.
How Does Mistral Small 3 Compare?
| Model | Input ($/M tokens) | Medium Feature Cost |
|---|---|---|
| Mistral Small 3 | $0.100 | $0.10 |
| Gemini 2.0 Flash | $0.100 | $0.12 |
| Microsoft Phi-4 | $0.100 | $0.10 |
| Gemma 3 27B | $0.100 | $0.12 |
| Qwen Turbo | $0.080 | $0.08 |
| Gemini 1.5 Flash | $0.075 | $0.09 |
Related Models
Claude Sonnet 4
Provider: Anthropic. Anthropic's balanced model for coding and general tasks. Best price-performance ratio in the Claude family.
Claude Opus 4
Provider: Anthropic. Anthropic's most powerful model. Best for complex reasoning and challenging coding tasks.
Claude 3.5 Sonnet
Provider: Anthropic. Previous-generation Sonnet. Still excellent for coding tasks at the same price point.
Claude 3.5 Haiku
Provider: Anthropic. Fast, cost-effective model for high-volume tasks. Great for code review and simple queries.