GPT‑OSS: OpenAI Publishes 20B and 120B Open‑Weight Models for Local Deployment
OpenAI has released gpt‑oss‑120b and gpt‑oss‑20b, its first open‑weight models since GPT‑2. The models match or exceed OpenAI's own smaller proprietary models on several benchmarks and mark a rare moment of open-weight leadership from a U.S.-based AI lab. With support for tool use, chain‑of‑thought reasoning, and smooth MacBook deployment, gpt‑oss is designed for full local control.

OpenAI has launched two new open-weight models, gpt‑oss‑120b and gpt‑oss‑20b, under an Apache 2.0 license. The move ends a drought of nearly six years in U.S. open releases at this level of scale and quality. Both models can be fine-tuned and deployed locally or via major platforms including Hugging Face, AWS, Azure, and Databricks. The 120b model contains roughly 117 billion parameters (with 5.1 billion active per token) and runs on a single 80 GB H100 GPU. The smaller 20b variant fits in 16 GB of memory.
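For readers who want to try the smaller checkpoint locally, here is a minimal sketch using the Hugging Face transformers pipeline. The model id and memory assumptions are ours, so check the official model card before running it.

```python
# Minimal sketch: local inference with gpt-oss-20b via transformers.
# Assumes the Hub id "openai/gpt-oss-20b" and roughly 16 GB of GPU or unified memory.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="openai/gpt-oss-20b",  # assumed Hugging Face model id
    torch_dtype="auto",
    device_map="auto",           # requires accelerate; places weights on available devices
)

messages = [{"role": "user", "content": "Explain mixture-of-experts routing in two sentences."}]
result = generator(messages, max_new_tokens=128)
print(result[0]["generated_text"][-1]["content"])  # the last message is the assistant reply
```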
The release comes after weeks dominated by China's open-source leaders, including DeepSeek, Qwen, and Kimi K2. Until now, U.S. labs had lagged in making high-quality open models available. With gpt‑oss, OpenAI re-enters the open-source scene with a model that not only competes with the best available but in some areas outperforms it. It's a notable shift in momentum in the global race for open AI infrastructure.
Benchmark results shared by OpenAI show that gpt‑oss‑120b matches or exceeds o4‑mini, one of its proprietary models, on benchmarks such as MMLU, Codeforces, and HealthBench. The 20b model also competes strongly, matching or exceeding o3‑mini across several metrics. The models support agentic use cases, configurable reasoning effort, and chain-of-thought prompting, making them well suited for developers building autonomous systems or local copilots.
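OpenAI describes low, medium, and high reasoning-effort levels that are selected through the system prompt. The exact wording below is an assumption based on that description rather than a verified snippet from the model card, and the model id is again assumed.

```python
# Sketch of configurable reasoning effort via the system prompt.
from transformers import pipeline

generator = pipeline("text-generation", model="openai/gpt-oss-20b",  # assumed Hub id
                     torch_dtype="auto", device_map="auto")

messages = [
    {"role": "system", "content": "Reasoning: high"},  # assumed phrasing for the high-effort setting
    {"role": "user", "content": "Plan the steps to migrate a SQLite schema without downtime."},
]
print(generator(messages, max_new_tokens=512)[0]["generated_text"][-1]["content"])
```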
For Apple users, there’s another reason to pay attention. MLX-optimized builds of both models are already available, enabling smooth inference on Apple Silicon Macs. The models run efficiently even on consumer MacBooks, making gpt‑oss a practical foundation for desktop-based LLM applications. This lowers the barrier for indie devs and researchers who want fine-grained control without relying on cloud APIs.
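On Apple Silicon, a short sketch with the community mlx-lm package could look like the following. The quantized repo name is a guess at the mlx-community naming convention, so verify it on the Hub before downloading.

```python
# Minimal sketch: local inference on Apple Silicon with mlx-lm.
from mlx_lm import load, generate

# Hypothetical mlx-community repo name; check the Hub for the actual converted build.
model, tokenizer = load("mlx-community/gpt-oss-20b-4bit")

prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Write a haiku about running models offline."}],
    add_generation_prompt=True,
    tokenize=False,
)
print(generate(model, tokenizer, prompt=prompt, max_tokens=100))
```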
Unlike most closed-source commercial models, gpt‑oss is designed for modification. The full training recipe is not included, but OpenAI has published a detailed model card, parameter counts, architecture notes, and fine-tuning tools. Safety measures include third-party red-teaming evaluations and alignment via supervised fine-tuning and reinforcement learning.
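As a rough illustration of what "designed for modification" means in practice, here is a generic LoRA sketch using Hugging Face peft. This is not OpenAI's published fine-tuning tooling, just one common way to adapt an open-weight checkpoint; the model id is assumed.

```python
# Generic LoRA adaptation sketch with transformers + peft.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_id = "openai/gpt-oss-20b"  # assumed Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# Attach small trainable LoRA adapters instead of updating all of the base weights.
config = LoraConfig(r=8, lora_alpha=16, target_modules="all-linear", task_type="CAUSAL_LM")
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only a tiny fraction of parameters are trainable
# From here, pass `model` to a standard Trainer or SFT loop with your own dataset.
```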
OpenAI says the release is meant to support safety research, transparency, and broader access. It follows a string of proprietary launches this year and may reflect internal tension between OpenAI's closed commercial roadmap and its original open-science roots. By releasing performant open-weight models now, OpenAI also sets a benchmark that could pressure others, particularly Meta and Anthropic, to follow suit.
While it does not offer full end-to-end reproducibility, gpt‑oss delivers what many researchers and startups have asked for: a U.S.-backed, high-performance model that can be studied, deployed, and adapted under a permissive license. It may not mark a total return to openness, but it's a meaningful step toward rebuilding trust and enabling local AI development at scale.