Huo's picture

7 3

Huo

Yupeng123

hyyp1

AI & ML interests

AI NLP

Recent Activity

upvoted a paper about 1 month ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

liked a model 2 months ago

openbmb/MiniCPM-SALA

upvoted a paper 3 months ago

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Paper • 2603.14465 • Published Mar 15 • 23

upvoted 3 papers 3 months ago

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

Paper • 2601.21244 • Published Jan 29 • 12

AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Paper • 2601.08323 • Published Jan 13 • 1

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published Jan 20 • 16

upvoted a paper 6 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

upvoted 2 papers 10 months ago

Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective

Paper • 2506.17930 • Published Jun 22, 2025 • 19

ReDit: Reward Dithering for Improved LLM Policy Optimization

Paper • 2506.18631 • Published Jun 23, 2025 • 7