quantize: improve pattern matching for allowed tensors #13033

EAddario · 2025-04-20T09:08:09Z

This PR implements @slaren's regex matching recommendation for allowed tensors. For example, --tensor-type attn=q4_k will now apply to all tensors whose names match *attn*.
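To make the matching behaviour concrete, here is a minimal sketch of how an argument such as attn=q4_k could be split into a pattern and a type, and how the pattern could then be matched against tensor names by regex search. It is not the code in this PR: the helper names `parse_override` and `matching_tensors` are hypothetical, and the tensor names in the usage note are only illustrative.

```cpp
#include <cstddef>
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper (not from the PR): split "pattern=type" at the last '='.
// Returns false when there is no '=' or when either side is empty.
static bool parse_override(const std::string & arg, std::string & pattern, std::string & type) {
    const std::size_t pos = arg.rfind('=');
    if (pos == std::string::npos || pos == 0 || pos + 1 == arg.size()) {
        return false;
    }
    pattern = arg.substr(0, pos);
    type    = arg.substr(pos + 1);
    return true;
}

// Hypothetical helper (not from the PR): return every name that contains
// a match for the regex. std::regex_search matches anywhere in the string,
// so "attn" selects any tensor with "attn" in its name.
static std::vector<std::string> matching_tensors(const std::string & pattern,
                                                 const std::vector<std::string> & names) {
    const std::regex re(pattern);
    std::vector<std::string> out;
    for (const auto & name : names) {
        if (std::regex_search(name, re)) {
            out.push_back(name);
        }
    }
    return out;
}
```

Under these assumptions, parsing "attn=q4_k" yields the pattern "attn" and the type "q4_k", and searching a list such as blk.0.attn_q.weight, blk.0.ffn_down.weight, blk.1.attn_v.weight selects the two attn tensors and skips the ffn one.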

EAddario · 2025-05-03T06:25:27Z

Apologies for the shotgun approach, @ggerganov / @slaren / @ngxson; I'm not sure what the proper process for requesting a review is. This PR addresses the deficiencies of #12511. Happy to close it or move it to draft if it's not suitable for merging.

slaren

My opinion is still that it would be better to remove all checks and just accept any regex that the user wants to use.
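One possible reading of this suggestion, sketched under assumptions: if the allowed-tensor checks were dropped, the only validation left would be whether the user's pattern compiles as a regex at all. The helper name `is_valid_regex` is hypothetical and is not taken from the PR or from llama.cpp.

```cpp
#include <regex>
#include <string>

// Hypothetical helper (not from the PR): true if `pattern` compiles as a
// regex in the default ECMAScript grammar. std::regex's constructor throws
// std::regex_error on malformed patterns, e.g. an unbalanced parenthesis.
static bool is_valid_regex(const std::string & pattern) {
    try {
        const std::regex re(pattern);
        (void) re; // only the construction matters here
        return true;
    } catch (const std::regex_error &) {
        return false;
    }
}
```

With this approach a pattern like attn_(q|k) is accepted as-is, while attn_(q is rejected because it cannot be compiled. Whether a pattern matches any tensor is then left entirely to the user.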

slaren · 2025-05-04T19:06:32Z

tools/quantize/quantize.cpp

@@ -1,5 +1,6 @@
 #include "common.h"
 #include "llama.h"
+#include "llama-quant.h"


We should avoid including private headers.

EAddario · 2025-05-05

I'll update as suggested.

EAddario added 4 commits April 17, 2025 09:57

Move struct declaration to header and update allowed tensor list

6c281ad

Improve regex matching whilst still validating allowed tensors

3e031bc

Minor cosmetic change to log output

6c13fcb

Merge branch 'master' into quantize

16923b8

github-actions bot added the examples label Apr 20, 2025

Merge branch 'master' into quantize

d54a423

Add header search path

d69391e

EAddario May 5, 2025
