I’m trying to add a custom LLM to the Discourse AI plugin. When I press the ‘Test’ button I get an “Internal Server Error”.
Is there a way of debugging this, or of getting a better error message? When I go into the Docker container and curl /v1/models, the model list comes back correctly.
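For anyone else debugging this: the full backtrace usually lands in the Rails logs rather than in the UI. A minimal sketch of where to look, assuming a standard Docker-based install with the default `app` container name (adjust paths and names to your setup):

```bash
# Assumes a standard Discourse Docker install under /var/discourse
# with the default container name "app"; adjust to your setup.
cd /var/discourse
./launcher enter app
# Inside the container, the full Rails backtrace is written here:
tail -n 100 /var/www/discourse/log/production.log
```

The same errors are also visible at /logs when you are logged in as an admin.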
The model name is “models/Meta-Llama-3-8B-Instruct.Q6_K.gguf” and I’m not sure whether there could be any issue with special characters.
This was the issue. Note that in LLM software it is customary to include only up to /v1 in the endpoint URL; the /chat/completions path and so on is then normally appended by the software itself.
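A quick way to sanity-check this from the command line: configure only the base URL (ending at /v1) in the plugin, and confirm the full chat path responds when you append it manually. The hostname and port below are placeholders for whatever your server uses:

```bash
# The plugin setting should hold only the base URL, e.g. http://localhost:8080/v1;
# the client then appends /chat/completions itself. Verifying manually:
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "models/Meta-Llama-3-8B-Instruct.Q6_K.gguf",
        "messages": [{"role": "user", "content": "Say hello"}]
      }'
```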
```
NameError (undefined local variable or method 'tokenizer' for an instance of DiscourseAi::Completions::Dialects::ChatGpt)
app/controllers/application_controller.rb:424:in 'block in with_resolved_local
```
Hmm. The one that works is a model that I quantized myself. I’ll try quantizing the others to see whether it’s a model-format issue.
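For anyone following along, the re-quantization itself looks roughly like this with llama.cpp. The script and binary names have shifted between versions (older builds ship `quantize` instead of `llama-quantize`), and the filenames here are illustrative only:

```bash
# Convert the original HF checkpoint to an f16 GGUF, then quantize to Q6_K.
# Script/binary names vary by llama.cpp version; filenames are examples only.
python convert_hf_to_gguf.py ./Meta-Llama-3-8B-Instruct --outfile llama3-8b-f16.gguf
./llama-quantize llama3-8b-f16.gguf Meta-Llama-3-8B-Instruct.Q6_K.gguf Q6_K
```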