You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The GME models support three types of input: text, image, and image-text pair, all of which can produce universal vector representations and have powerful retrieval performance.
Feature request / 功能建议
GME: General Multimodal Embedding
The GME models support three types of input: text, image, and image-text pair, all of which can produce universal vector representations and have powerful retrieval performance.
Motivation / 动机
https://huggingface.co/Alibaba-NLP/gme-Qwen2-VL-7B-Instruct
https://huggingface.co/Alibaba-NLP/gme-Qwen2-VL-2B-Instruct
Your contribution / 您的贡献
🉑
The text was updated successfully, but these errors were encountered: