Qwen3-VL-8b Image Recognition Issues and Gemma3-27b Mixed Text+Image Content

Hello, I found the topic Managing Images in AI context and would like to know more about how this works.

Could someone clarify the current logic for understanding images?


  1. I use Qwen3-VL-8b with LM Studio via its OpenAI-compatible API. The hint below says that images are supported by Anthropic, Google and OpenAI models. No chance for Qwen, right?

  2. Qwen3-VL-8b: a new, confusing message appears when the model cannot recognize a picture/document.

In 3.6.0.beta2:

With both vision enabled = true and vision enabled = false, the AI bot handles the image recognition request correctly, without any exception.

In v2025.12.0-latest (which adds a new allowed attachments option):

Now, with vision enabled = true, it returns an error in the dialog:

{"error":"Invalid 'content': 'content' objects must have a 'type' field that is either 'text' or 'image_url'."}
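For reference, here is a minimal sketch of the request shape that error is asking for: every entry in content carries a type of either text or image_url. I am sending it straight to LM Studio's OpenAI-compatible endpoint with the openai Python client; the base URL, API key, and model identifier are assumptions from my local setup, not something the plugin dictates.

```python
# Minimal sketch: a chat completions request whose 'content' objects all
# carry the required 'type' field ('text' or 'image_url').
import base64

from openai import OpenAI

# Base URL, API key, and model name are assumptions for a local LM Studio server.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

with open("scan.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("ascii")

response = client.chat.completions.create(
    model="qwen3-vl-8b",  # assumed model identifier in LM Studio
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe the contents of this document."},
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/png;base64,{image_b64}"},
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)
```

My guess is that if the new version builds the content array without that type field (or with a different field name), the endpoint would reject it with exactly this kind of error, but I have not confirmed that.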
  3. Gemma3-27b. Some thoughts on recognizing mixed text+image content. The response currently supports text only. When I ask the model to return the text from the OCR layer of a PDF with separate images, it returns:

There is nothing at this URL; the model made up a fake link.
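Since the model cannot actually fetch anything from a URL, one workaround I am considering is to pull the OCR text layer out of the PDF locally and send it as plain text. A rough sketch, assuming the pypdf package; the file name and model identifier are placeholders:

```python
# Rough sketch: extract the PDF's embedded text layer locally and send it
# as plain text, since the model cannot fetch files from a URL.
from openai import OpenAI
from pypdf import PdfReader

reader = PdfReader("scanned_report.pdf")  # placeholder file name
ocr_text = "\n\n".join(page.extract_text() or "" for page in reader.pages)

# Base URL, API key, and model name are assumptions for a local LM Studio server.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")
response = client.chat.completions.create(
    model="gemma-3-27b",  # assumed model identifier
    messages=[
        {"role": "user", "content": f"Summarize this document:\n\n{ocr_text}"},
    ],
)
print(response.choices[0].message.content)
```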

Thanks!

LM Studio does not have PDF support in the completions or responses API.

It only supports image/text from what I can tell.
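If you need the PDF content anyway, one client-side workaround is to render each page to an image yourself and send the pages as image_url parts, since image and text are the content types that do go through. A rough sketch, assuming PyMuPDF (the pymupdf package); the path, DPI, and model name are placeholders:

```python
# Sketch: render each PDF page to a PNG with PyMuPDF and send the pages as
# image_url parts, since only image/text content is accepted.
import base64

import fitz  # PyMuPDF
from openai import OpenAI

# Base URL, API key, and model name are assumptions for a local LM Studio server.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

image_parts = []
with fitz.open("scanned_report.pdf") as doc:  # placeholder file name
    for page in doc:
        png_bytes = page.get_pixmap(dpi=150).tobytes("png")
        b64 = base64.b64encode(png_bytes).decode("ascii")
        image_parts.append(
            {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{b64}"}}
        )

response = client.chat.completions.create(
    model="qwen3-vl-8b",  # assumed vision-capable model
    messages=[
        {
            "role": "user",
            "content": [{"type": "text", "text": "Transcribe these pages."}] + image_parts,
        }
    ],
)
print(response.choices[0].message.content)
```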


Thank you for the reply! I will mark it as solved and leave a comment here noting that this was correct for LM Studio 0.3.x. The LM Studio team is currently working on version 0.4.0 with a new REST API, and I hope they add PDF support to its responses.
