4 6 6

Sayan Deb Sarkar

sayandsarkar

https://sayands.github.io/

AI & ML interests

3D Computer Vision, 3D Scene Understanding

Recent Activity

upvoted a paper about 1 month ago

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

upvoted a paper about 2 months ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

upvoted a paper 2 months ago

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

View all activity

Organizations

upvoted a paper about 1 month ago

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

Paper • 2603.18002 • Published Mar 18 • 13

upvoted a paper about 2 months ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 14

upvoted a paper 2 months ago

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Paper • 2602.13191 • Published Feb 13 • 31

submitted a paper to Daily Papers 2 months ago

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Paper • 2602.13191 • Published Feb 13 • 31

liked a Space 2 months ago

Vision Arena (Testing VLMs side-by-side)

🖼

562

Explore AI-powered visual tasks in Vision Arena

liked a Space 5 months ago

GuideFlow3D

🤗

A HF Space that demonstrates all use-cases for GuideFlow3D

published a Space 5 months ago

GuideFlow3D

🔥

Robust cross-category 3D appearance transfer

liked a Space 5 months ago

TRELLIS

🏢

590

Scalable and Versatile 3D Generation from images

upvoted a paper 6 months ago

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

Paper • 2510.16136 • Published Oct 17, 2025 • 5

authored a paper 6 months ago

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

Paper • 2510.16136 • Published Oct 17, 2025 • 5

commented a paper 6 months ago

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer

Paper • 2510.16136 • Published Oct 17, 2025 • 5 •

updated a model 7 months ago

gradient-spaces/CrossOver

Updated Sep 28, 2025 • 6

liked a model 8 months ago

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 25.6k • 125

liked a model 11 months ago

gradient-spaces/CrossOver

Updated Sep 28, 2025 • 6

published a model 11 months ago

gradient-spaces/CrossOver

Updated Sep 28, 2025 • 6

upvoted 2 papers about 1 year ago

Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images

Paper • 2504.08727 • Published Apr 11, 2025 • 12

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12, 2025 • 59

published a Space about 1 year ago

Gradient Spaces

🤖

commented a paper about 1 year ago

CrossOver: 3D Scene Cross-Modal Alignment

Paper • 2502.15011 • Published Feb 20, 2025 • 2 •

authored a paper about 1 year ago

CrossOver: 3D Scene Cross-Modal Alignment

Paper • 2502.15011 • Published Feb 20, 2025 • 2

Sayan Deb Sarkar

AI & ML interests

Recent Activity

Organizations

sayandsarkar's activity

Vision Arena (Testing VLMs side-by-side)

GuideFlow3D

GuideFlow3D

TRELLIS

Gradient Spaces