How do I use hugging face paid inference endpoints as Discourse custom LLMs

I have several open-source LLM models running on the Hugging Face Inference Endpoints service (essentially managed AWS) …

For all the models I have tested (Llama, Phi, Gemma, etc.) … I’m able to connect from the Discourse LLM settings page, but inference doesn’t work. Here’s the error:

“Trying to contact the model returned this error: Failed to deserialize the JSON body into the target type: missing field `inputs` at line 1 column 163”
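For anyone hitting this later, here is my reading of the error (an inference on my part, not confirmed in this thread): the endpoint’s deserializer is looking for an `inputs` field, which is the shape of Hugging Face’s native text-generation payload, while Discourse’s LLM providers send a chat-completion-style body. A minimal comparison of the two shapes, with illustrative values:

```python
# Chat-completion-style body, roughly what an OpenAI-dialect client sends.
# (Field values here are placeholders for illustration.)
openai_payload = {
    "model": "my-model",
    "messages": [{"role": "user", "content": "Hello"}],
}

# Native Hugging Face text-generation body: the prompt goes in "inputs".
hf_payload = {
    "inputs": "Hello",
    "parameters": {"max_new_tokens": 64},
}

# The error says the server could not find "inputs" in the JSON it received,
# which is consistent with it having been sent the chat-style body instead.
print("inputs" in openai_payload)  # False
print("inputs" in hf_payload)      # True
```

So the mismatch is between request formats, not credentials or connectivity, which is why the connection test succeeds but inference fails.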

What am I doing wrong!? Thanks much…

View from Hugging Face:

View from Discourse:

It’s been over a year since I last tried their API. Is it OpenAI compatible nowadays? If so you can try setting Provider to OpenAI and pointing to their endpoint.
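To expand on that suggestion: if the endpoint is running Text Generation Inference (TGI), recent versions also expose an OpenAI-compatible Messages API under `/v1/chat/completions`, so the usual stumbling block is pointing the client at the right path rather than the endpoint root. A sketch, where the endpoint URL is a placeholder you would replace with your own:

```python
# Hypothetical Inference Endpoint URL -- substitute your own.
base_url = "https://my-endpoint.us-east-1.aws.endpoints.huggingface.cloud"

# TGI's OpenAI-compatible route lives under /v1/chat/completions,
# so an OpenAI-style provider should use base_url + "/v1" as its API base.
chat_url = f"{base_url}/v1/chat/completions"

# OpenAI-style request body (contrast with the native {"inputs": ...} shape
# the non-OpenAI route expects).
payload = {
    "model": "tgi",  # TGI serves one model, so the name here is a placeholder
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 64,
}
print(chat_url)
```

If the endpoint is running an older TGI image or a different task handler, this route may not exist, in which case only the native `inputs` format will be accepted.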

I have tried almost all the providers available on the Discourse LLM setup screen, including OpenAI…

They either return the “Failed to deserialize the JSON body into the target type” error or “Internal Server Error”.

I also tried an actual OpenAI model on the HF endpoint service (GPT-2! :slight_smile:) but that didn’t work… same sort of errors.