Community Blog & Articles

Community Articles

How I contributed a new model to the Transformers library using Codex

Introducing Cohere-transcribe: state-of-the-art speech recognition

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

KV Caching Explained: Optimizing Transformer Inference Efficiency

Training mRNA Language Models Across 25 Species for $165

Uncensor any LLM with abliteration

Code a simple RAG from scratch

Run Gemma 4 on Intel® Arc™ GPUs Out-Of-the-Box

SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation

Mastering Tensor Dimensions in Transformers

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

NEO-unify: Building Native Multimodal Unified Models End to End

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

🌈 SKT AI LABS 🌈

Everything You Need to Know about Knowledge Distillation

Take Control of What Your LLM Knows and Does — with the EasyEdit Tool Series

From GRPO to DAPO and GSPO: What, Why, and How

multimodalon-devicegemma4

Welcome Gemma 4: Frontier multimodal intelligence on device

+3

Holo3: Breaking the Computer Use Frontier

Falcon Perception

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

trlreinforcement-learningannouncement

TRL v1.0: Post-Training Library Built to Move with the Field

guideagentsinference-providers

Liberate your OpenClaw

+4

A New Framework for Evaluating Voice Agents (EVA)

Build a Domain-Specific Embedding Model in Under a Day

State of Open Source on Hugging Face: Spring 2026

Holotron-12B - High Throughput Computer Use Agent

hubstorageannouncement

Introducing Storage Buckets on the Hugging Face Hub

+8

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+5

guidedistributed-trainingaccelerate

Ulysses Sequence Parallelism: Training with Million-Token Contexts

lerobotrobotics

LeRobot v0.5.0: Scaling Every Dimension

+6

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

How I contributed a new model to the Transformers library using Codex

Introducing Cohere-transcribe: state-of-the-art speech recognition

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

KV Caching Explained: Optimizing Transformer Inference Efficiency

Training mRNA Language Models Across 25 Species for $165

Uncensor any LLM with abliteration

Code a simple RAG from scratch

Run Gemma 4 on Intel® Arc™ GPUs Out-Of-the-Box

SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation

Mastering Tensor Dimensions in Transformers

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

NEO-unify: Building Native Multimodal Unified Models End to End

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

🌈 SKT AI LABS 🌈

Everything You Need to Know about Knowledge Distillation

Take Control of What Your LLM Knows and Does — with the EasyEdit Tool Series

From GRPO to DAPO and GSPO: What, Why, and How

View all articles