|
Seeking Advice 🔥🔥: Strategy for Embedding Multiple Subjective Reviews in One-time Event Domain Recommendations
|
|
1
|
20
|
January 2, 2026
|
|
TurboTensors: Optimizing CPU LLM Performance
|
|
0
|
16
|
December 31, 2025
|
|
Significant generation degradation and repetition loops when enabling KV-cache for Qwen3-VL
|
|
1
|
21
|
December 29, 2025
|
|
Injecting multimodal embeddings into a language model breaks the `generate` function
|
|
1
|
78
|
December 28, 2025
|
|
Transformers v4 or v5 for my new project?
|
|
1
|
22
|
December 27, 2025
|
|
Assistant model is not passed on to the custom_generate method
|
|
3
|
19
|
December 25, 2025
|
|
How can I get TRANSFORMERS_CACHE in transformers v5?
|
|
2
|
26
|
December 19, 2025
|
|
CDM-CTM Fusion: A Rigorous Framework for Depth-Aware Autoregressive Control
|
|
0
|
15
|
December 13, 2025
|
|
Tensor Dimension Mismatch when using TRL GKDTrainer
|
|
3
|
16
|
December 12, 2025
|
|
Transformers.js: need for token-to-char mapping
|
|
3
|
18
|
December 11, 2025
|
|
[Pipelines] Mask Generation Parameters
|
|
2
|
74
|
December 10, 2025
|
|
Having trouble configuring the trainer for T5 model evaluation
|
|
1
|
27
|
December 9, 2025
|
|
How do I speed up my callbacks and reduce the stall before they start?
|
|
1
|
29
|
December 9, 2025
|
|
Getting 429 Too Many Requests
|
|
3
|
49
|
December 8, 2025
|
|
How to add a new language to the NLLB tokenizer in Hugging Face?
|
|
3
|
2030
|
December 6, 2025
|
|
Is it possible to remove all other languages from NLLB200 except English and German?
|
|
2
|
757
|
December 6, 2025
|
|
How to fine-tune the nllb1.3b model for bidirectional English-German translation?
|
|
2
|
115
|
December 6, 2025
|
|
SAE for CodeGemma
|
|
3
|
21
|
December 6, 2025
|
|
Obtain raw logits before decoding scaling is applied
|
|
1
|
32
|
December 5, 2025
|
|
CUDA Out Of Memory when training a DETR Object detection model with compute_metrics
|
|
4
|
171
|
December 3, 2025
|
|
How to understand the special tokens?
|
|
7
|
112
|
December 2, 2025
|
|
Getting error: AttributeError: 'InferenceClient' object has no attribute 'post'
|
|
18
|
2054
|
November 30, 2025
|
|
DoRA training taking 8x the time? Why?
|
|
2
|
112
|
November 27, 2025
|
|
ContractNLI-based NDA Risk Analyzer using RoBERTa + Chunking – Looking for Feedback
|
|
6
|
51
|
November 25, 2025
|
|
Train instance segmentation model with DINOv3 backbone
|
|
3
|
194
|
November 24, 2025
|
|
DistilBERT reaches 76% accuracy but still predicts “believable” for impossible/fantasy excuses — why?
|
|
3
|
36
|
November 23, 2025
|
|
Search query autocomplete from the queries I have in my data
|
|
1
|
1694
|
November 21, 2025
|
|
How to sample from the validation set when using Trainer?
|
|
5
|
1986
|
November 21, 2025
|
|
Evaluate subset of data during training
|
|
6
|
5997
|
November 21, 2025
|
|
NeuroTrace – GPT-2 Small Residual Attack & Defence Framework (IOI Task)
|
|
0
|
29
|
November 21, 2025
|