Skip to content

Pull requests: huggingface/transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Name change AOPermod -> ModuleFqn
#38456 opened May 28, 2025 by drisspg Loading…
[Qwen2.5-Omni] Fix dtype of cos,sin when used with flash attention
#38453 opened May 28, 2025 by HarryHsing Loading…
5 tasks
Fix TypeError in save_pretrained error handling (fixes #38422)
#38449 opened May 28, 2025 by rahulrshetty45 Loading…
4 tasks done
fix: return next_token properly when streaming=True
#38447 opened May 28, 2025 by McPatate Loading…
Split transformers chat and transformers serve
#38443 opened May 28, 2025 by LysandreJik Loading…
Continuous batchin: offer only the next token
#38437 opened May 28, 2025 by LysandreJik Loading…
[tests] expand flex-attn test for vision models
#38434 opened May 28, 2025 by zucchini-nlp Loading…
More coverage for LossKwargs + cleaning
#38432 opened May 28, 2025 by SunMarc Loading…
GLM-4-0414 Change
#38431 opened May 28, 2025 by zRzRzRzRzRzRzR Loading…
[janus] Fix failing tests on mi3XX
#38426 opened May 28, 2025 by remi-or Loading…
[Qwen2-VL] Fix smart_resize bug
#38423 opened May 28, 2025 by rdonggroq Loading…
1 of 5 tasks
[docs] Format fix
#38414 opened May 27, 2025 by stevhliu Loading…
make it go brrrr
#38409 opened May 27, 2025 by ArthurZucker Loading…
5 tasks
fix: handle no scheduler passed by user
#38407 opened May 27, 2025 by McPatate Loading…
Add Dia model
#38405 opened May 27, 2025 by buttercrab Draft
5 tasks
[WIP] Tokenizer Refactor
#38400 opened May 27, 2025 by itazap Loading…
[generate] move SinkCache to a custom_generate repo
#38399 opened May 27, 2025 by gante Loading…
Add ColQwen2.5 to transformers 🤗
#38391 opened May 26, 2025 by qnguyen3 Loading…
10 tasks done
ProTip! Follow long discussions with comments:>50.