Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 151
view article Article Welcome Falcon Mamba: The first strong attention-free 7B model +4 Aug 12, 2024 • 113
OLMoE (November 2024) Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated Dec 23, 2025 • 31
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models Paper • 2108.08877 • Published Aug 19, 2021 • 2
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024 • 77
GritLM Collection Generative Representational Instruction Tuning (GRIT) • 63 items • Updated 11 days ago • 9
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension Paper • 1910.13461 • Published Oct 29, 2019 • 6
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models Paper • 2405.05374 • Published May 8, 2024 • 2
Training data-efficient image transformers & distillation through attention Paper • 2012.12877 • Published Dec 23, 2020 • 2
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published Apr 22, 2024 • 126