Fan Yuan's picture

Fan Yuan

Leoyfan

·

Leofyfan

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

Pause or Fabricate? Training Language Models for Grounded Reasoning

upvoted a paper 5 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

upvoted a paper 10 days ago

UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

View all activity

Organizations

upvoted a paper about 24 hours ago

Pause or Fabricate? Training Language Models for Grounded Reasoning

Paper • 2604.19656 • Published 5 days ago • 10

upvoted a paper 5 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Paper • 2604.14258 • Published 11 days ago • 23

upvoted 2 papers 10 days ago

UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

Paper • 2604.14113 • Published 11 days ago • 10

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

Paper • 2604.14144 • Published 11 days ago • 62

upvoted a paper 12 days ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 13 days ago • 141

upvoted a paper 16 days ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 17 days ago • 47

upvoted a paper 2 months ago

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

upvoted a paper 6 months ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Paper • 2510.08531 • Published Oct 9, 2025 • 12

upvoted 3 papers 7 months ago

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published Sep 29, 2025 • 32

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published Sep 29, 2025 • 31

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 50

upvoted 4 papers 11 months ago

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

Paper • 2505.15801 • Published May 21, 2025 • 17

Let LLMs Break Free from Overthinking via Self-Braking Tuning

Paper • 2505.14604 • Published May 20, 2025 • 23

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Paper • 2505.14684 • Published May 20, 2025 • 24

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121