text-generation-inference

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

External Resources

Adyen wrote a detailed article about the interplay between TGI’s main components: router and server. LLM inference at scale with TGI (Martin Iglesias Goyanes - Adyen, 2024)