-
Notifications
You must be signed in to change notification settings - Fork 32.2k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Less verbose weight-loading tqdm when stdout is not a TTY (fixes #44303)
#44345
opened Feb 27, 2026 by
manavshrivastavagit
Loading…
Fix tokenizer_class in tokenizer_config.json for Qwen3.5 save_pretrained (fixes #44297)
#44344
opened Feb 27, 2026 by
manavshrivastavagit
Loading…
Fix ANSI codes in loading_report when stdout is not a TTY (fixes #44336)
#44343
opened Feb 27, 2026 by
manavshrivastavagit
Loading…
Fix ANSI color handling in loading report for interactive terminals (#44336)
#44341
opened Feb 27, 2026 by
Kokonico
Loading…
Refactor RoFormer output tracing with @capture_outputs and @can_return_tuple
#44335
opened Feb 27, 2026 by
ManasVardhan
Loading…
4 tasks done
Refactor ALBERT to use named attributes and remove redundant return_dict=True
#44333
opened Feb 27, 2026 by
ManasVardhan
Loading…
Enable Liger Kernel when doing hyperparameter search.
#44329
opened Feb 27, 2026 by
linfeng-du
Loading…
1 of 5 tasks
skip 2 invalid test cases for voxtral_realtime model
#44321
opened Feb 27, 2026 by
kaixuanliu
Loading…
docs: Add NeMo Automodel community integration docs
#44304
opened Feb 26, 2026 by
adil-a
Loading…
3 of 5 tasks
Integrate the Neuron device to TrainingArguments
#44302
opened Feb 26, 2026 by
michaelbenayoun
Loading…
Fix: Qwen3
<think> blocks not written during fine-tuning (TRL)
#44301
opened Feb 26, 2026 by
likejazz
Loading…
Dynamic weight conversion applies to VLMs and is recursive
#44300
opened Feb 26, 2026 by
zucchini-nlp
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.