On August 25,Secret Confessions (2025) Spongkey Episode 44 Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
Why 'Which is your favorite Harry Potter book and why?' is the perfect dating app opener
2025-06-26 16:46
310 views
Read More
Amazon Big Spring Sale 2025: Best air purifier deals from Dyson, Shark, LG, and more
2025-06-26 16:00
1565 views
Read More
'Westworld' review: Bring yourself back online for a phenomenal Season 4
2025-06-26 15:59
506 views
Read More
Apple's iOS 16 to automatically bypass CAPTCHA requests on iPhone
2025-06-26 15:34
2564 views
Read More
'Marcel the Shell with Shoes On' review: Cutesy meme becomes cozy meditation on loss
2025-06-26 14:49
203 views
Read More
UGREEN Nexode 25000mAh 200W power bank drops to $79.99 at Amazon
2025-06-26 14:35
839 views
Read More