
Huo

Yupeng123
  • hyyp1

AI & ML interests

AI NLP

Recent Activity

liked a model 2 months ago
openbmb/MiniCPM-SALA

Organizations

None yet

upvoted a paper about 1 month ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Paper • 2603.14465 • Published Mar 15 • 23
upvoted 3 papers 3 months ago

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

Paper • 2601.21244 • Published Jan 29 • 12

AtomMem: Learnable Dynamic Agentic Memory with Atomic Memory Operation

Paper • 2601.08323 • Published Jan 13 • 1

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published Jan 20 • 16
upvoted a paper 6 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40
upvoted 2 papers 10 months ago

Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective

Paper • 2506.17930 • Published Jun 22, 2025 • 19

ReDit: Reward Dithering for Improved LLM Policy Optimization

Paper • 2506.18631 • Published Jun 23, 2025 • 7