forked from pytorch/pytorch
-
Notifications
You must be signed in to change notification settings - Fork 66
Pull requests: ROCm/pytorch
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[release/2.5][ROCm][TunableOp] Improve identification of fastest solution (#144942)
#2018
opened Apr 4, 2025 by
naromero77amd
Loading…
[release/2.4][ROCm][TunableOp] Fix TunableOp warmup environment variable. (#147412)
#2017
opened Apr 3, 2025 by
naromero77amd
Loading…
rocthrust, rocprim, rocrand, hipcub, ck and aotriton include paths
#2001
opened Mar 26, 2025 by
renjithravindrankannath
Loading…
Revert "[ROCm] Improvements to non-vectorized elementwise kernels (#1…
#1944
opened Mar 6, 2025 by
BLOrange-AMD
Loading…
Enable input vectorization in ewk for input tensors with heterogeneou…
#1906
opened Feb 15, 2025 by
carlobertolli
Loading…
Enable load-compute-store interleaving for unrolled elementwise kernel.
#1886
opened Feb 6, 2025 by
carlobertolli
•
Draft
[Do NOT MERGE] [release/2.5] Enable tf32 testing on test_nn
#1859
opened Jan 27, 2025 by
jagadish-amd
Loading…
[ROCm] Eliminate the need for divisions in layernorm for default vector size.
#1850
opened Jan 22, 2025 by
doru1004
Loading…
[ROCm][WIP] Improve performance of casted elementwise add operations
#1805
opened Dec 20, 2024 by
doru1004
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.