Hugging Face Releases Full SmolLM3 Training Stack and Checkpoints
Hugging Face has open-sourced the full training and evaluation pipeline for SmolLM3, its latest small-scale language model. The release includes more than 100 intermediate checkpoints, and the model supports dual-mode reasoning, multilingual use, and long-context inference. This marks a significant step toward transparency and reproducibility in compact AI models.
Hugging Face has published the complete training stack for SmolLM3, a 3-billion-parameter language model designed for resource-efficient inference. The open repository includes scripts for pretraining with Nanotron, post-training alignment with supervised fine-tuning (SFT) and Anchored Preference Optimization (APO), and a full suite of evaluation tools. Also included are over 100 training checkpoints, allowing researchers to audit or resume training at any point in the model’s lifecycle.
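For readers who want to experiment with the released weights, below is a minimal sketch of loading the model with the transformers library. The model id HuggingFaceTB/SmolLM3-3B matches the Hub release; the commented-out revision argument for selecting one of the intermediate checkpoints is an illustrative assumption, so check the repository for the actual branch or tag names.

```python
# Minimal sketch: loading SmolLM3 for inference with transformers.
# The `revision` shown for an intermediate checkpoint is a hypothetical
# placeholder -- consult the Hub repo for the real branch/tag names.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM3-3B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    # revision="intermediate-step-XXXX",  # hypothetical: pick one of the 100+ published checkpoints
    torch_dtype="auto",
    device_map="auto",
)

inputs = tokenizer("The SmolLM3 training stack includes", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```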
This release positions SmolLM3 as a viable alternative in the small-to-medium LLM space. Unlike proprietary models, SmolLM3 is fully open and optimized for multilingual use (six languages) and long-context scenarios, with context lengths of up to 128,000 tokens. The transparency of its training pipeline makes it particularly valuable for researchers and engineers building or fine-tuning models for on-device or low-latency environments. Hugging Face’s decision to make every layer of SmolLM3 visible, from data to training to alignment, signals a deep commitment to reproducibility in the open-source AI ecosystem.
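As a rough illustration of the dual-mode reasoning mentioned above, the sketch below assumes the toggle is exposed through a system-prompt flag such as /no_think; the exact convention may differ, so consult the SmolLM3 model card on the Hub before relying on it.

```python
# Minimal sketch of SmolLM3's dual-mode reasoning, assuming the mode is
# switched via a system-prompt flag (e.g. "/no_think"). This is an
# assumption about the interface, not a confirmed API.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM3-3B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [
    {"role": "system", "content": "/no_think"},  # assumed flag: skip the extended reasoning trace
    {"role": "user", "content": "Summarize the SmolLM3 release in one sentence."},
]

input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

Skipping the reasoning trace in this way is also the lower-latency path, which is the mode most relevant to the on-device scenarios discussed above.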