AI News Timeline
MiniMax‑M1 Debuts with Cost‑Efficient, High-Performance RL Model

MiniMax-AI has released MiniMax‑M1, a large open-weight AI model tuned for long-context reasoning and software engineering tasks. Built with hybrid attention and a novel reinforcement learning algorithm, it was trained in just three weeks for under $535K. Its public release gives developers a new long-context contender at unusually low cost.

June 16, 2025
June 30, 2025
Georg S. Kuklick

MiniMax‑M1 comes in two variants with “thinking budgets” of 40K and 80K tokens, optimized for different task complexities. The model pairs Lightning Attention and a hybrid attention mechanism with a new RL fine-tuning strategy called CISPO. The result is a model that performs competitively against top-tier open-weight peers like DeepSeek‑R1 and Qwen3‑235B.

Training ran on 512 H800 GPUs and completed in just under three weeks at a compute cost of $534,700, putting MiniMax‑M1 among the most cost-efficient efforts in the 100B+ parameter class. The model particularly excels at tasks requiring long-context comprehension and complex reasoning over code, positioning it as a useful tool for AI engineers and researchers building on transformer backbones.

Its public release via GitHub marks a deliberate open-access stance, contrasting with more closed models from enterprise labs. For developers who need long-context handling, and for teams exploring new RL fine-tuning strategies, MiniMax‑M1 offers a compelling open-source option with competitive performance and efficient scaling.
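The distinctive idea behind CISPO is to clip the importance-sampling weight (treated as a constant) rather than dropping tokens from the policy update, so every token keeps a gradient through its log-probability. As a rough single-step illustration — not the paper's exact formulation; the function name, parameters, and clip bounds here are illustrative — it might look like:

```python
import math

def cispo_loss(logp_new, logp_old, advantages, eps_low=0.0, eps_high=0.2):
    """Simplified sketch of a CISPO-style objective.

    The importance-sampling ratio is clipped and then treated as a
    constant weight (a stop-gradient in a real autograd framework),
    so every token still contributes a gradient via its log-prob.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(lp_new - lp_old)                   # IS weight
        w = min(max(ratio, 1.0 - eps_low), 1.0 + eps_high)  # clip the weight, not the token
        total += w * adv * lp_new                           # weight * advantage * log-prob
    return -total / len(logp_new)                           # mean negative objective
```

In contrast to PPO-style clipping, which can zero out the gradient for tokens whose ratio leaves the trust region, this formulation only caps how strongly such tokens are weighted.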
