StepFun open-sources Step3, a 321B-parameter VLM optimized for Chinese AI chips
StepFun has released Step3, a massive open-source vision-language model with 321 billion parameters and leading benchmark scores. The model debuts novel attention architectures that reduce inference costs and is optimized to run efficiently on domestic Chinese AI hardware.
StepFun has launched Step3, a 321-billion-parameter vision-language model that activates just 38 billion parameters per token. It pairs this sparsity with custom inference-efficiency techniques, Multi-Matrix Factorization Attention (MFA) and Attention-FFN Disaggregation (AFD), to keep serving costs down. The model, open-sourced on July 31, is positioned as a high-performance alternative to proprietary multimodal models. It scores 74.2 on MMMU and 64.8 on MathVision, marking it as one of the strongest open-access reasoning models available.
Unlike most frontier-scale VLMs, Step3 is designed with inference efficiency in mind. MFA and AFD allow it to cut decoding costs by a factor of 4–8, making it viable for real-world applications without sacrificing output quality. StepFun's release strategy also focuses on hardware-software co-design: Step3 has been tuned for Chinese AI chips from vendors such as Huawei (its Ascend line) and Cambricon, a strategic move that aligns with broader efforts in China to decouple from NVIDIA's GPU stack.
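The article does not describe MFA's internals, but the general logic behind factorized, shared-KV attention designs is easy to illustrate. The PyTorch sketch below is a generic illustration under my own assumptions (layer name, dimensions, and the MQA-style single shared key/value head are not StepFun's published design): when all query heads read from one shared key/value head, the per-token KV cache that decoding must stream every step shrinks by roughly the number of heads, which is where large decoding savings come from.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FactorizedSharedKVAttention(nn.Module):
    """Generic sketch: low-rank factorized query projection + one shared KV head.

    Per-token KV cache drops from 2 * n_heads * head_dim to 2 * head_dim values,
    which is the kind of reduction that makes autoregressive decoding much cheaper.
    (Illustrative only; not StepFun's actual MFA implementation.)
    """

    def __init__(self, d_model=4096, n_heads=32, head_dim=128, q_rank=512):
        super().__init__()
        self.n_heads, self.head_dim = n_heads, head_dim
        # Factorized query projection: d_model -> q_rank -> n_heads * head_dim
        self.q_down = nn.Linear(d_model, q_rank, bias=False)
        self.q_up = nn.Linear(q_rank, n_heads * head_dim, bias=False)
        # A single key head and value head shared across all query heads (MQA-style)
        self.k_proj = nn.Linear(d_model, head_dim, bias=False)
        self.v_proj = nn.Linear(d_model, head_dim, bias=False)
        self.o_proj = nn.Linear(n_heads * head_dim, d_model, bias=False)

    def forward(self, x):
        b, t, _ = x.shape
        q = self.q_up(self.q_down(x)).view(b, t, self.n_heads, self.head_dim).transpose(1, 2)
        # Shared KV head, broadcast to every query head
        k = self.k_proj(x).view(b, t, 1, self.head_dim).transpose(1, 2).expand(b, self.n_heads, t, self.head_dim)
        v = self.v_proj(x).view(b, t, 1, self.head_dim).transpose(1, 2).expand(b, self.n_heads, t, self.head_dim)
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        return self.o_proj(out.transpose(1, 2).reshape(b, t, -1))
```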
For enterprise developers building multimodal agents, Step3 provides an open and high-performance foundation that can run cost-effectively on local infrastructure. The model's release also signals growing maturity in China’s open-source AI stack, with co-optimization across software and silicon. StepFun is distributing Step3 via Hugging Face, GitHub, and ModelScope under a permissive license.
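For teams that want to evaluate the model, loading it should follow the familiar Hugging Face pattern. The sketch below is illustrative only: the repo id, processor interface, and prompt format are assumptions based on typical transformers VLM releases, so check StepFun's model card for the exact usage.

```python
# Hypothetical usage sketch; repo id and processor behavior are assumptions, not
# confirmed details from the article. Verify against the official model card.
import requests
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

MODEL_ID = "stepfun-ai/step3"  # assumed repo id; check StepFun's Hugging Face page

processor = AutoProcessor.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,
    device_map="auto",        # a 321B-parameter checkpoint needs multiple GPUs or offloading
    trust_remote_code=True,   # custom architecture code ships with the repository
)

image = Image.open(requests.get("https://example.com/chart.png", stream=True).raw)
inputs = processor(text="Describe this chart.", images=image, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(processor.batch_decode(output, skip_special_tokens=True)[0])
```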