FlashHead

updated 3 days ago

Efficient Drop-In Replacement for the Classification Head in Language Model Inference. https://github.com/embedl/flash-head

  • embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead

    Image-Text-to-Text • 2B • Updated 3 days ago • 459 • 7

  • embedl/Qwen3-1.7B-FlashHead-W4A16

    2B • Updated 3 days ago • 95 • 3

  • embedl/gemma-3-270m-it-FlashHead

    0.3B • Updated 3 days ago • 44 • 3

  • embedl/Qwen3-0.6B-FlashHead

    0.6B • Updated 3 days ago • 48 • 4

  • embedl/Qwen3-1.7B-FlashHead

    2B • Updated 3 days ago • 47 • 3

  • embedl/Llama-3.2-1B-Instruct-FlashHead

    1B • Updated 3 days ago • 47 • 4

  • embedl/Llama-3.2-3B-Instruct-FlashHead

    3B • Updated 3 days ago • 64 • 4

  • embedl/Llama-3.2-3B-Instruct-FlashHead-W4A16

    4B • Updated 3 days ago • 79 • 4

  • embedl/Llama-3.2-1B-Instruct-FlashHead-W4A16

    2B • Updated 3 days ago • 69 • 6

  • embedl/gemma-3-1b-it-FlashHead

    1.0B • Updated 3 days ago • 65 • 3

  • embedl/gemma-3-1b-it-FlashHead-W4A16

    1B • Updated 3 days ago • 89 • 3