Skip to content

Commit 12eea6a

Browse files
q10facebook-github-bot
authored andcommitted
Remove sm_100 and sm_120 (#4024)
Summary: X-link: facebookresearch/FBGEMM#1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10
1 parent 7d7af72 commit 12eea6a

File tree

2 files changed

+2
-3
lines changed

2 files changed

+2
-3
lines changed

.github/scripts/nova_dir.bash

+1-1
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ fi
2323
if [[ "$CONDA_ENV" != "" ]]; then export CONDA_RUN="conda run --no-capture-output -p ${CONDA_ENV}" && echo "$CONDA_RUN"; fi
2424

2525
if [[ "$CU_VERSION" == "cu128" ]]; then
26-
export TORCH_CUDA_ARCH_LIST="7.0;8.0;9.0;9.0a;10.0;10.0a;12.0;12.0a"
26+
export TORCH_CUDA_ARCH_LIST="7.0;8.0;9.0;9.0a;10.0a;12.0a"
2727
echo "Set TORCH_CUDA_ARCH_LIST to: ${TORCH_CUDA_ARCH_LIST}"
2828

2929
elif [[ "$CU_VERSION" == "cu126" ]] ||

fbgemm_gpu/docs/src/general/Releases.rst

+1-2
Original file line numberDiff line numberDiff line change
@@ -15,8 +15,7 @@ older than the one that the FBGEMM release corresponds to.
1515
| | PyTorch Release | Python Versions | CUDA Versions | Architectures | ROCm Versions | ROCm Architectures |
1616
+=================+==================+==================+================+======================+===========================+===========================+
1717
| 1.2.0 | 2.7.x | 3.9, 3.10, 3.11, | 11.8, 12.6, | 7.0, 8.0, 9.0, 9.0a, | 6.1, 6.2.4, 6.3 | gfx908, gfx90a, gfx942 |
18-
| | | 3.12, 3.13 | 12.8 | 10.0, 10.0a, 12.0, | | |
19-
| | | | | 12.0a | | |
18+
| | | 3.12, 3.13 | 12.8 | 10.0a, 12.0a | | |
2019
+-----------------+------------------+------------------+----------------+----------------------+---------------------------+---------------------------+
2120
| 1.1.0 | 2.6.x | 3.9, 3.10, 3.11, | 11.8, 12.4, | 7.0, 8.0, 9.0, | 6.1, 6.2.4, 6.3 | gfx908, gfx90a, gfx942 |
2221
| | | 3.12, 3.13 | 12.6 | 9.0a | | |

0 commit comments

Comments
 (0)