Ansh Gupta

thisisanshgupta

AI & ML interests

PyTorch | NLP

Recent Activity

published a model about 1 month ago

thisisanshgupta/txi-v4-deepqlearning

upvoted a paper about 1 month ago

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

upvoted a paper 3 months ago

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

Spaces 1

Solo Coder 20B

Models 2

thisisanshgupta/txi-v4-deepqlearning

thisisanshgupta/ppo-LunarLander-v2-100000steps

Reinforcement Learning • Updated Jun 6, 2025 • 3 • 1

Datasets 3

thisisanshgupta/CodeAlpacaSmall

Viewer • Updated Apr 22, 2023 • 2.02k • 8 • 3

thisisanshgupta/CodeAlpaca

Viewer • Updated Apr 22, 2023 • 20k • 286 • 2

thisisanshgupta/Pycode

Viewer • Updated Dec 10, 2022 • 118k • 7 • 1