Add Gemma3 #390
Conversation
Signed-off-by: vbaddi <[email protected]> Signed-off-by: Mohit Soni <[email protected]>
Signed-off-by: vbaddi <[email protected]> Signed-off-by: Mohit Soni <[email protected]>
Signed-off-by: vbaddi <[email protected]> Signed-off-by: Mohit Soni <[email protected]>
Signed-off-by: Mohit Soni <[email protected]>
Signed-off-by: Rishin Raj <[email protected]> Signed-off-by: Mohit Soni <[email protected]> Signed-off-by: Abukhoyer Shaik <[email protected]> Signed-off-by: Asmita Goswami <[email protected]> Signed-off-by: vbaddi <[email protected]> Signed-off-by: Meet Patel <[email protected]> Co-authored-by: Rishin Raj <[email protected]> Co-authored-by: Abukhoyer Shaik <[email protected]> Co-authored-by: asmigosw <[email protected]> Co-authored-by: Vinayak Baddi <[email protected]> Co-authored-by: Meet Patel <[email protected]> Signed-off-by: Mohit Soni <[email protected]>
This reverts commit 70ae12f. Signed-off-by: Mohit Soni <[email protected]>
Signed-off-by: Mohit Soni <[email protected]>
chunk_inputs["input_ids"] = lang_inputs["input_ids"][:, i * prefill_seq_len : (i + 1) * prefill_seq_len]
chunk_inputs["position_ids"] = lang_inputs["position_ids"][
    :, i * prefill_seq_len : (i + 1) * prefill_seq_len
]
outputs = lang_session.run(chunk_inputs)
chunk_inputs["index"] = outputs["index_output"]
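For context, the chunked-prefill slicing in the diff above can be sketched in isolation. This is a minimal numpy sketch: `prefill_seq_len`, `input_ids`, and `position_ids` follow the snippet, while `chunk_prefill_inputs` and the concrete shapes are illustrative, not part of the PR.

```python
import numpy as np

def chunk_prefill_inputs(lang_inputs, prefill_seq_len):
    """Yield per-chunk input_ids / position_ids slices of width prefill_seq_len,
    mirroring the loop in the diff (assumes the prompt is padded to a multiple
    of prefill_seq_len)."""
    total_len = lang_inputs["input_ids"].shape[1]
    num_chunks = total_len // prefill_seq_len
    for i in range(num_chunks):
        sl = slice(i * prefill_seq_len, (i + 1) * prefill_seq_len)
        yield {
            "input_ids": lang_inputs["input_ids"][:, sl],
            "position_ids": lang_inputs["position_ids"][:, sl],
        }

# Example: a padded prompt of 8 tokens split into chunks of 4
inputs = {
    "input_ids": np.arange(8)[None, :],
    "position_ids": np.arange(8)[None, :],
}
chunks = list(chunk_prefill_inputs(inputs, 4))
```

Each chunk keeps the leading batch dimension and covers a contiguous `prefill_seq_len`-wide window, which is exactly what the `i * prefill_seq_len : (i + 1) * prefill_seq_len` slices in the diff produce.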
Could you explain what chunk_inputs["index"] and outputs["index_output"] are?
Also, with the new approach, is batching supported?
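(On the batching part of the question: the `[:, ...]` slicing in the diff is itself batch-agnostic, since it only indexes the sequence dimension; the open question is whether the new index plumbing supports it. A quick numpy check with illustrative shapes:)

```python
import numpy as np

batch_size, total_len, prefill_seq_len = 2, 8, 4
# Two identical prompts stacked along a leading batch dimension, shape (2, 8)
input_ids = np.tile(np.arange(total_len), (batch_size, 1))

# The same slice expression used in the prefill loop: the batch dimension
# passes through untouched, only the sequence axis is chunked.
chunk = input_ids[:, 0 * prefill_seq_len : 1 * prefill_seq_len]
```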
Signed-off-by: vbaddi <[email protected]>
Signed-off-by: Dipankar Sarkar <[email protected]>
Signed-off-by: Dipankar Sarkar <[email protected]>
Signed-off-by: Ann <[email protected]>
    node_precision_info="fp32_nodes_gemma3_4b_text.yaml",
)
print(f"qpc path is {qpc_path}")
exec_info = qeff_model.generate(tokenizer, prompts=Constants.INPUT_STR, device_ids=[0])
@qcdipankar Is a match obtained between the original torch and AIC outputs? Please report the single-layer match.
@@ -0,0 +1,48 @@
# -----------------------------------------------------------------------------
We should not have a separate script for running the language model. Please remove the script and add examples covering text-only input, text + image input, and text with multi-image input using QEFFAutoModelForImageTextToText.
Please also add a test and add the model to the validated models list.
@@ -0,0 +1,6 @@
# -----------------------------------------------------------------------------
Please update it to
No description provided.