[EMB][multimodal]gme-qwen2-vl #3226

Minamiyama · 2025-04-10T15:11:46Z

Feature request / 功能建议

GME: General Multimodal Embedding

The GME models support three types of input: text, image, and image-text pair, all of which can produce universal vector representations and have powerful retrieval performance.

Motivation / 动机

https://huggingface.co/Alibaba-NLP/gme-Qwen2-VL-7B-Instruct

https://huggingface.co/Alibaba-NLP/gme-Qwen2-VL-2B-Instruct

Your contribution / 您的贡献

🉑

Minamiyama added feature new model labels Apr 10, 2025

XprobeBot added this to the v1.x milestone Apr 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EMB][multimodal]gme-qwen2-vl #3226

[EMB][multimodal]gme-qwen2-vl #3226

Minamiyama commented Apr 10, 2025

[EMB][multimodal]gme-qwen2-vl #3226

[EMB][multimodal]gme-qwen2-vl #3226

Comments

Minamiyama commented Apr 10, 2025

Feature request / 功能建议

Motivation / 动机

Your contribution / 您的贡献