-
Notifications
You must be signed in to change notification settings - Fork 339
Issues: AI-Hypercomputer/maxtext
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Make maxtext_xpk_runner support other trainers
feature request
#1552
opened Apr 9, 2025 by
lukebaumann
Please create direct conversion scripts from huggingface for Gemma3 models
#1528
opened Apr 5, 2025 by
R4ZZ3
moe_lb_loss should be divided by gradient_accumulation_steps for reporting.
#1483
opened Mar 26, 2025 by
bzantium
When using dcn-DP and dcn-FSDP together got error when saving checkpoint.
#1434
opened Mar 20, 2025 by
jiagaoxiang
The default setting of
param_scan_axis=1
hurts performance and memory consumption on GPUs
#1382
opened Mar 12, 2025 by
jaro-sevcik
MFU drops significantly when using megablox with more experts
#1256
opened Feb 9, 2025 by
rodrigo-f-nogueira
llama GPU model with dcn fsdp + ici tp + cudnn flash attention broken
#1093
opened Dec 10, 2024 by
wang2yn84
Support nsys profiler upload in all cases
bug
Something isn't working
good first issue
Good for newcomers
#911
opened Sep 24, 2024 by
gobbleturk
Move maxtext docker images being built to artifact registry
enhancement
New feature or request
#904
opened Sep 20, 2024 by
parambole
Unable to recover after checkpoint saving
bug
Something isn't working
#868
opened Sep 6, 2024 by
peregilk
Previous Next
ProTip!
no:milestone will show everything without a milestone.