Hugging Face
Models: 6,542
Active filters: image-text-to-text
Sort: Trending
google/t5gemma-2-270m-270m • Image-Text-to-Text • 0.8B • Updated 14 days ago • 8.96k • 147
browser-use/bu-30b-a3b-preview • Image-Text-to-Text • 31B • Updated 6 days ago • 5.4k • 224
google/t5gemma-2-4b-4b • Image-Text-to-Text • 9B • Updated 12 days ago • 4.15k • 128
Qwen/Qwen3-VL-8B-Instruct • Image-Text-to-Text • 9B • Updated Oct 15 • 2.82M • 610
deepseek-ai/DeepSeek-OCR • Image-Text-to-Text • 3B • Updated Nov 4 • 3.91M • 3.02k
zai-org/GLM-4.6V-Flash • Image-Text-to-Text • 10B • Updated 21 days ago • 240k • 520
Qwen/Qwen3-VL-30B-A3B-Instruct • Image-Text-to-Text • 31B • Updated Nov 26 • 1.4M • 477
google/gemma-3-27b-it • Image-Text-to-Text • 27B • Updated Mar 21 • 1.56M • 1.78k
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated 19 days ago • 17.1k • 1.44k
google/t5gemma-2-1b-1b • Image-Text-to-Text • 2B • Updated 12 days ago • 4.17k • 59
zai-org/AutoGLM-Phone-9B • Image-Text-to-Text • 934k • Updated 21 days ago • 86.4k • 396
tencent/HunyuanOCR • Image-Text-to-Text • 1.0B • Updated 6 days ago • 880k • 695
zai-org/GLM-4.6V • Image-Text-to-Text • 108B • Updated 21 days ago • 162k • 351
janhq/Jan-v2-VL-max-FP8 • Image-Text-to-Text • 31B • Updated 8 days ago • 395 • 25
google/gemma-3-4b-it • Image-Text-to-Text • 4B • Updated Mar 21 • 852k • 1.07k
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Apr 6 • 2.59M • 1.41k
Qwen/Qwen3-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Oct 23 • 1.24M • 248
fancyfeast/llama-joycaption-beta-one-hf-llava • Image-Text-to-Text • 8B • Updated May 16 • 101k • 287
Qwen/Qwen3-VL-4B-Instruct • Image-Text-to-Text • 4B • Updated Oct 15 • 723k • 282
unsloth/GLM-4.6V-Flash-GGUF • Image-Text-to-Text • 9B • Updated 3 days ago • 70k • 62
google/gemma-3-12b-it • Image-Text-to-Text • 12B • Updated Mar 21 • 1.42M • 598
stepfun-ai/GELab-Zero-4B-preview • Image-Text-to-Text • 4B • Updated 11 days ago • 1.97k • 139
google/medgemma-27b-it • Image-Text-to-Text • 29B • Updated Jul 10 • 12.1k • 255
rednote-hilab/dots.ocr • Image-Text-to-Text • 3B • Updated Oct 31 • 847k • 1.17k
moondream/moondream3-preview • Image-Text-to-Text • 9B • Updated Oct 9 • 5.9k • 530
Qwen/Qwen3-VL-235B-A22B-Instruct • Image-Text-to-Text • 236B • Updated Nov 26 • 232k • 346
jinaai/jina-vlm • Image-Text-to-Text • 2B • Updated 25 days ago • 3.28k • 90
ServiceNow-AI/Apriel-1.6-15b-Thinker • Image-Text-to-Text • 15B • Updated 8 days ago • 8.09k • 248
microsoft/Florence-2-large • Image-Text-to-Text • 0.8B • Updated Aug 4 • 821k • 1.73k
google/medgemma-4b-it • Image-Text-to-Text • 4B • Updated Oct 28 • 375k • 806