MiniMax launches M3 open-weight AI model that outperforms OpenAI and Google rivals

The startup’s new open-weight M3 model delivers elite coding and a million-token context to challenge proprietary AI giants.

June 1, 2026

MiniMax launches M3 open-weight AI model that outperforms OpenAI and Google rivals
The global artificial intelligence landscape is witnessing a significant shift as the boundaries between proprietary and open-weight models continue to blur. In a major development, the Shanghai-based artificial intelligence startup MiniMax has officially launched its new flagship foundation model, known as M3. Billed as the first open-weight system to integrate elite coding performance, a massive one-million-token context window, and native multimodality, M3 represents a direct challenge to the closed-source dominance of industry giants like OpenAI and Google[1][2]. By bringing three capabilities that have traditionally been treated as proprietary table stakes into the open-weights ecosystem, MiniMax is redefining the baseline for what developers and enterprises can expect from accessible artificial intelligence systems[3]. This release signals a new era in which high-end agentic workflows and massive data processing can be deployed with unprecedented economic efficiency[4].
At the core of the model's performance leaps is a fundamental architectural innovation known as MiniMax Sparse Attention, or MSA[3][5]. To scale a model's context window to one million tokens, artificial intelligence developers have historically faced the barrier of full-attention mechanisms, which suffer from quadratic growth in computational complexity as the input size expands[3]. MSA overcomes this constraint by utilizing a clean, highly extensible sparse attention framework that replaces standard full attention with key-value block selection[3][6]. This approach dynamically filters and targets specific sequences, allowing the model to focus only on the most relevant information within a massive dataset rather than processing every token pair[3][7]. Consequently, at its maximum context length of one million tokens, the computational requirement per token for M3 is cut to just one-twentieth of the cost seen in the company's previous-generation architectures[6][8]. This structural efficiency translates directly into massive speedups, yielding a 9.7-fold increase in prefilling speeds and a 15.6-fold boost during the decoding phase compared to earlier models[4][9].
These architectural efficiencies have not come at the expense of performance, as MiniMax M3 has demonstrated elite capabilities across a variety of rigorous technical evaluations[3][10]. Most notably, on the highly regarded SWE-bench Pro benchmark, which measures an artificial intelligence's ability to resolve complex, real-world software engineering issues, M3 achieved an impressive score of 59.0 percent[11]. This benchmark performance places the model ahead of major proprietary rivals, including OpenAI's GPT-5.5 and Google's Gemini 3.1 Pro, while positioning it just behind Anthropic's top-tier Opus 4.7[3][11]. In a practical test of its developer utility, the company tasked M3 with optimizing a highly complex FP8 General Matrix Multiply kernel on Nvidia Hopper graphics processing units, a task notoriously difficult to refine[3][10]. Through 147 iterations of autonomous troubleshooting and code generation, M3 successfully delivered a 9.4-fold performance speedup[10]. Beyond specialized software engineering, the model also registered remarkable victories on SVG-Bench for scalable vector graphics generation and OmniDocBench for document processing, consistently outperforming leading closed-source alternatives[3].
The true power of MiniMax M3 lies in its ability to translate these benchmark scores into long-horizon agentic execution, a feat made possible by its native multimodal design and deep alignment training[10][2]. Unlike models that rely on superficial multimodal add-ons, M3's entire training pipeline was restructured from the ground up, utilizing over 100 terabytes of interleaved text, image, and video data[10][2]. This deep integration allows the model to handle tasks requiring visual comprehension, complex tool use, and computer operation with high coherence[3][6]. In a dramatic demonstration of these autonomous agent capabilities, M3 was assigned the task of independently replicating a prominent academic paper detailing the learning dynamics of large language model fine-tuning[10]. Working continuously for nearly 12 hours without human intervention, the artificial intelligence parsed formulas from the document, generated and tested code, managed experiment logs, made 18 code commits, and produced 23 experimental charts, successfully replicating the study's core findings[10][2]. On the BrowseComp benchmark, which measures autonomous web browsing and information retrieval, M3 scored 83.5, once again outclassing Opus 4.7 and showcasing its utility for automated, multi-step workflows[10][2].
Despite these impressive performance metrics, the release of MiniMax M3 highlights an ongoing debate within the technology sector regarding the definition of open-source artificial intelligence[12]. While MiniMax has made M3's weights available to developers, the company has stopped short of releasing the model's underlying training code or custom inference operators[13][12]. This approach mirrors a growing industry trend where laboratories offer open-weight models under permissive commercial licenses while keeping their proprietary infrastructure and training methodologies confidential[12][14]. This strategy allows MiniMax to position itself aggressively against open-weight rivals such as DeepSeek, Moonshot AI, and Nvidia's Nemotron series, while protecting its commercial interests as it expands its business footprint[13][12]. The company's business momentum is rapidly growing, with recent annual revenues reaching 79 million dollars, representing a massive 159 percent year-on-year increase[12]. MiniMax has also forged high-profile alliances, such as integrating with Ant Group's Alipay payment infrastructure, enabling the monetization of its automated agentic services far beyond simple consumer chat applications[13].
The launch of MiniMax M3 marks a pivotal milestone in the democratization of frontier-level artificial intelligence capabilities. By demonstrating that a highly efficient, sparse-attention model can match or exceed the performance of the world's most heavily capitalized proprietary systems, MiniMax has changed the economic calculus for enterprise AI deployments[4]. As million-token context windows and autonomous agentic capabilities transition from luxury features to standard requirements, the global competitive playing field is leveling[5]. For developers and enterprises worldwide, M3 offers a powerful, cost-effective framework that proves open-weight architectures are no longer a step behind closed-source alternatives, but are actively driving the next wave of technological innovation[10][4].

Sources
Share this article