DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 26 days ago • 356
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 30 days ago • 123
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 20 days ago • 873
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 6 days ago • 153
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 338
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 22 days ago • 49