[Qwen2.5-VL] Fix empty string input crash in processor #38421

Flink-ddd · 2025-05-28T04:11:40Z

This PR fixes #38417.
When passing an empty string to the Qwen2.5 tokenizer with return_tensors="pt", the original output was a float32 tensor.
This patch ensures a consistent torch.long dtype by returning torch.empty((1, 0), dtype=torch.long) for empty input.
A test is included to validate the fix.

This is my first contribution to 🤗 Transformers. Happy to help and open to feedback!

…gface#38417)

…to ensure that `Qwen2Tokenizer` correctly handles empty string inputs...

Flink-ddd · 2025-05-28T09:23:19Z

test(Qwen2Tokenizer): Add regression test for empty string input dtype (#38417)

This commit introduces a regression test within Qwen2_5_VLProcessorTest
to verify the dtype of input_ids returned by Qwen2Tokenizer
when processing an empty string with return_tensors="pt".

The test uses the "Qwen/Qwen2-0.5B" model and asserts that the
output dtype should be torch.long (int64). Currently, this test
is expected to FAIL as it correctly reproduces the behavior
reported in issue #38417, where torch.float32 is returned instead.

This test serves to prevent future regressions and will pass once
the underlying issue in Qwen2Tokenizer is resolved.

Flink-ddd · 2025-05-28T09:53:23Z

Hi maintainers,

I've pushed an update to this PR. Here's the current status of the CI checks:

Both the tests_processors and run_tests jobs are marked as failing.
These failures are both due to the same single reason: the newly added regression test method, test_qwen2_tokenizer_empty_string_regression, is working as expected and failing with AssertionError: torch.float32 != torch.int64. This confirms it successfully reproduces the bug described in issue Tokenizer returns float32 tensor for empty string input instead of long dtype #38417 when using the "Qwen/Qwen2-0.5B" model.
All other unrelated CI checks (e.g., code formatting, quality) should now be passing.

This PR, which aims to add this regression test, is exhibiting the expected behavior for its core test (i.e., identifying the existing bug). It should now be ready for review.

Thanks!

Rocketknight1 · 2025-05-28T14:43:41Z

Thank you for the PR, but unfortunately it's a duplicate of #36555! It's unfortunate - your code looks good and I hope this doesn't discourage you from contributing in future

Flink-ddd · 2025-05-29T02:12:21Z

Hi @Rocketknight1, thank you for the quick review and the kind words!

I understand that this PR is a duplicate of #36555. I appreciate the feedback and the opportunity to go through the contribution process. It was a great learning experience for me.

I'll continue exploring other open issues and look forward to contributing more in the future!

Thanks again

[Qwen2.5-VL] Fix empty string input crash in processor

c79b8da

Flink-ddd mentioned this pull request May 28, 2025

Tokenizer returns float32 tensor for empty string input instead of long dtype #38417

Open

4 tasks

Flink-ddd added 6 commits May 28, 2025 12:31

[Qwen2.5-VL] Add regression test for empty string input issue (huggin…

b9eff13

…gface#38417)

test: add regression test for empty string input

f7cd920

This commit introduces a test case using the "Qwen/Qwen2-0.5B" model …

8191c7b

…to ensure that `Qwen2Tokenizer` correctly handles empty string inputs...

Add robust regression test for empty string handling (huggingface#38417)

4bea476

Add robust regression test for empty string handling (huggingface#38417)

ddf9a52

black format

cf8c7c4

ruff format

ada66e1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Qwen2.5-VL] Fix empty string input crash in processor #38421

[Qwen2.5-VL] Fix empty string input crash in processor #38421

Uh oh!

Flink-ddd commented May 28, 2025

Uh oh!

Flink-ddd commented May 28, 2025

Uh oh!

Flink-ddd commented May 28, 2025

Uh oh!

Rocketknight1 commented May 28, 2025

Uh oh!

Flink-ddd commented May 29, 2025

Uh oh!

Uh oh!

[Qwen2.5-VL] Fix empty string input crash in processor #38421

Are you sure you want to change the base?

[Qwen2.5-VL] Fix empty string input crash in processor #38421

Uh oh!

Conversation

Flink-ddd commented May 28, 2025

Uh oh!

Flink-ddd commented May 28, 2025

Uh oh!

Flink-ddd commented May 28, 2025

Uh oh!

Rocketknight1 commented May 28, 2025

Uh oh!

Flink-ddd commented May 29, 2025

Uh oh!

Uh oh!