sree

srisree

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

BytedTsinghua-SIA/CUDA-Agent-Ops-6K

liked a model 3 days ago

CohereLabs/tiny-aya-fire

liked a model 3 days ago

zeroentropy/zembed-1

Organizations

upvoted a collection 19 days ago

LLaDA2.1

Collection

3 items • Updated about 4 hours ago • 21

upvoted a paper 19 days ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published 25 days ago • 68

upvoted a collection 22 days ago

pplx-embed

Collection

Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 8 days ago • 85

upvoted an article about 2 months ago

The Optimal Architecture for Small Language Models

Article • Dec 26, 2025 • 118

upvoted an article 2 months ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Article • Dec 18, 2025 • 121

upvoted a paper 3 months ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published Dec 17, 2025 • 66

upvoted an article 3 months ago

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

Article • Dec 17, 2025

upvoted an article 4 months ago

Provence: efficient and robust context pruning for retrieval-augmented generation

Article • Jan 28, 2025

upvoted 4 papers 5 months ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16, 2025 • 118

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Paper • 2510.04212 • Published Oct 5, 2025 • 26

Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models

Paper • 2510.03561 • Published Oct 3, 2025 • 25

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26, 2025 • 80

upvoted an article 7 months ago

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

Article • May 17, 2025

upvoted 4 articles 8 months ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Article • Jul 9, 2025 • 785

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 760

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Article • Jun 19, 2025

Gemma 3n fully available in the open-source ecosystem!

Article • Jun 26, 2025 • 120

upvoted an article 12 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 791

upvoted a paper 12 months ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10, 2025 • 56

upvoted an article almost 2 years ago

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

Article • Apr 21, 2024
