arxiv:2507.22412
sijie wang
sijieaaa
AI & ML interests
None yet
Recent Activity
upvoted an article 1 day ago
Ulysses Sequence Parallelism: Training with Million-Token Contexts
upvoted an article 6 days ago
Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand
upvoted a paper 3 months ago
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics