AI Tech Suite

Mistral AI's New Code Models Beat Google, OpenAI on Performance, Price

Challenging tech titans, Mistral AI's Devstral models offer a new frontier in powerful, cost-effective AI coding assistance.

July 11, 2025

Mistral AI's New Code Models Beat Google, OpenAI on Performance, Price

French AI startup Mistral AI, in collaboration with agentic AI startup All Hands AI, has launched a new set of powerful code generation models that it claims outperform and undercut competitors like Google's Gemini 2.5 Pro on both performance and price. The release features Devstral Medium, a high-performance model available via API, and an upgraded open-source model, Devstral Small 1.1. This strategic move intensifies the competition in the rapidly evolving market for AI-powered software development tools, directly challenging the dominance of major tech players by offering what Mistral describes as a new frontier in cost-effective, powerful coding assistance. The new models are designed not merely for simple code completion but for complex, "agentic" tasks that mimic the workflows of human software engineers, tackling real-world programming challenges.

The core of the announcement centers on two distinct but related models tailored for different use cases. Devstral Medium is positioned as the flagship commercial offering, accessible through Mistral's API and designed for enterprise clients and developers who require top-tier performance.[1] This model can also be deployed directly on private infrastructure, offering enhanced data privacy and control, and can be custom fine-tuned for specific corporate codebases.[1][2] In contrast, Devstral Small 1.1 continues Mistral's commitment to the open-source community. Released under the permissive Apache 2.0 license, it is designed to be lightweight enough to run on a single high-end consumer GPU or a Mac with 32GB of RAM, making advanced AI coding tools accessible for local deployment.[3][2] While the architecture of Devstral Small remains the same at 24 billion parameters, the 1.1 version boasts significant performance improvements and enhanced versatility, supporting a wide range of applications and agentic frameworks.[1][4]

A key pillar of Mistral's claim lies in the demonstrated power of its new models on rigorous, real-world benchmarks. The company highlights performance on SWE-Bench Verified, a benchmark that evaluates an AI model's ability to solve actual issues from GitHub repositories.[3][2] On this challenging test, Devstral Medium achieves a score of 61.6%, which Mistral asserts surpasses the performance of both Gemini 2.5 Pro and GPT 4.1.[1] This benchmark is particularly significant because it moves beyond evaluating simple function generation to testing the model's capacity for higher-level reasoning and multi-step problem-solving within large, complex codebases.[5] The open-source Devstral Small 1.1 also sets a new standard for freely available models, scoring an impressive 53.6% on the same benchmark, a notable leap over its predecessor and other open models.[1][4] The agentic nature of these models, developed in partnership with All Hands AI, allows them to work with developer tools like OpenHands and SWE-Agent to better simulate and execute genuine engineering tasks, from fixing bugs to refactoring code.[5][2]

Beyond raw performance, Mistral's assault on the market is predicated on a highly aggressive pricing strategy. The company explicitly states that its new flagship, Devstral Medium, is offered at a quarter of the price of its main competitors, Gemini 2.5 Pro and GPT 4.1.[1] This positions Devstral Medium not just as a powerful alternative but as a dramatically more cost-effective one, a compelling proposition for both individual developers and large enterprises managing significant operational budgets. This focus on affordability is a consistent theme for Mistral, which has previously priced other models, like Codestral, competitively.[6][7] By creating a new point on the cost-performance curve, Mistral aims to democratize access to state-of-the-art coding assistants, potentially forcing a market-wide re-evaluation of pricing for high-end AI models and accelerating adoption among a broader user base.

The release of the Devstral models and the strategic collaboration with All Hands AI signal a calculated move to capture a significant share of the AI for software development market. All Hands AI specializes in creating AI agents that can use the same tools as human developers, including modifying code, running commands, and browsing the web.[8][9] By building models optimized for these agentic scaffolds, Mistral is focused on delivering practical, real-world utility rather than just impressive but isolated benchmark scores. This emphasis on enterprise-ready solutions, complete with options for on-premise deployment and customization, addresses key corporate concerns around data security and intellectual property that can be a barrier to adopting closed, proprietary systems.[10][11] As a prominent European AI firm, Mistral's success with this dual strategy of open-source community building and high-value enterprise offerings represents a growing challenge to the established US-based leaders in the generative AI space.

In conclusion, Mistral AI's introduction of Devstral Medium and the upgraded Devstral Small 1.1 marks a pivotal moment in the AI coding assistant landscape. By directly claiming superiority in both performance on real-world tasks and a significantly lower price point, Mistral has thrown down the gauntlet to industry giants like Google and OpenAI. The partnership with All Hands AI underscores a focus on practical, agentic capabilities that go beyond simple code suggestion to address complex software engineering workflows. This combination of open-source accessibility, enterprise-grade power, and disruptive pricing is set to fuel intense competition and innovation, ultimately empowering developers with more capable and affordable tools to build the next generation of software.