paper maybe useful - a dorni Collection

dorni 's Collections

paper maybe useful

paper maybe useful

updated Sep 25, 2025

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published Feb 12, 2025 • 42
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47
Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18, 2025 • 86
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

Paper • 2503.09419 • Published Mar 12, 2025 • 6
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14, 2025 • 146
Can Vision-Language Models Answer Face to Face Questions in the Real-World?

Paper • 2503.19356 • Published Mar 25, 2025 • 2
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals

Paper • 2503.19953 • Published Mar 25, 2025 • 3
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 55
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Paper • 2503.05638 • Published Mar 7, 2025 • 20
Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79
Segment Any Motion in Videos

Paper • 2503.22268 • Published Mar 28, 2025 • 19
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Paper • 2504.17789 • Published Apr 24, 2025 • 23
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263
Dreamland: Controllable World Creation with Simulator and Generative Models

Paper • 2506.08006 • Published Jun 9, 2025 • 7
Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published Jun 9, 2025 • 27
PlayerOne: Egocentric World Simulator

Paper • 2506.09995 • Published Jun 11, 2025 • 34
Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 99