Pull requests: pytorch/FBGEMM
Pull requests list
[WIP] Bm/genai rocm oss 5
ciflow/rocm
cla signed
module: rocm
#4032
opened Apr 27, 2025 by
q10
Optimize kv cache usage for yoco
cla signed
fb-exported
#4030
opened Apr 26, 2025 by
ghjeong12
Enable NaN checks on tensor arguments to kernel launches
cla signed
fb-exported
#4029
opened Apr 26, 2025 by
q10
update hipify_torch submodule for version 2
cla signed
#4028
opened Apr 26, 2025 by
jeffdaily
Add keep_orig_idx_per_feature parameter to block_bucketize_sparse_features kernel
cla signed
fb-exported
#4027
opened Apr 25, 2025 by
emlin
Migrate make_pta_acc_format() away from old macros, v2
cla signed
fb-exported
#4026
opened Apr 25, 2025 by
q10
[ROCm OSS Enablement] Update setup.py to account for targets and variants
ciflow/rocm
cla signed
fb-exported
module: rocm
#4023
opened Apr 25, 2025 by
q10
Optimize if-statements with if-constexpr
cla signed
fb-exported
#4022
opened Apr 25, 2025 by
q10
Clean up WeightRow in preparation for optimizer state offloading
cla signed
fb-exported
#4021
opened Apr 24, 2025 by
q10
fix build that excludes a bunch of features
cla signed
fb-exported
#4019
opened Apr 24, 2025 by
q10
Report TBE data configuration with EEG-based indices estimation
cla signed
fb-exported
#4018
opened Apr 24, 2025 by
gchalump
Use cudaMemset/hipMemset to setup IndexShuffling kernel.
cla signed
fb-exported
#4016
opened Apr 24, 2025 by
levendlee
Remove unused variable in gqa_attn_splitk_attn_kernel
cla signed
fb-exported
#4014
opened Apr 24, 2025 by
PatriceVignola
Gen modes: Remove -Wno-mismatched-tags
cla signed
fb-exported
#4011
opened Apr 23, 2025 by
q10
Enable FP4 CUTLASS GEMM and CUDA quantization kernels
cla signed
fb-exported
#4004
opened Apr 22, 2025 by
jiawenliu64
Back out "Add a workaround for stochastic rounding for AMD GPUs"
cla signed
fb-exported
module: rocm
#3977
opened Apr 17, 2025 by
ionuthristodorescu
Back out "Cleanups to StochasticRoundingRNGState"
cla signed
fb-exported
module: rocm
#3976
opened Apr 17, 2025 by
ionuthristodorescu
[fbgemm_gpu] Add Nova workflow for torch 2.6 compatible releases
cla signed
#3958
opened Apr 10, 2025 by
q10
[fbgemm_gpu] Add more docs scaffolding for GenAI
cla signed
#3944
opened Apr 8, 2025 by
q10
Remove unused-variable in deeplearning/fbgemm/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu +1
cla signed
fb-exported
#3939
opened Apr 8, 2025 by
q10