Exploring Reranking Options for Discourse AI

tpetrov · September 16, 2025, 5:54am

Oh, I see now, thanks for the explanation, that’s what I was missing.

Btw, I know often there’s reranking in more advances RAG. Is there something like reranking in how Discourse handles is? Do you think adding a reranking step would have any positive effect?

sam · September 16, 2025, 6:08am

reranking is a work-in-progress.

@Falco / @Roman implemented a basic reranker in discourse:

github.com/discourse/discourse

plugins/discourse-ai/config/settings.yml

f9424a549


      
          ai_hugging_face_tei_reranker_endpoint:
            default: ""
          ai_hugging_face_tei_reranker_endpoint_srv:
            default: ""
            hidden: true
          ai_hugging_face_tei_reranker_api_key: ""

This is used in semantic search and RAG. However it is quite hidden and not easy to configure.

I think the medium term plan here (which we discussed with @awesomerobot ) was to move from LLM terminology to Models … and maybe do a bit of UI unification so you can define embedding/rerankers and llms in a single interface.

For now we only support a very specific hugging face reranker api.

It certainly improves quality of results.

tpetrov · September 16, 2025, 6:24am

Awesome!
So currently this is off by default, and can’t be configured easily on a hosted (pro) plan?

sam · September 16, 2025, 6:25am

not sure, lets wait for @Falco to answer.

tpetrov · October 2, 2025, 12:19pm

Hi @Falco
is there any way to enable the reranker for testing or maybe a timeline?

Falco · October 15, 2025, 4:16pm

We just deployed a big improvement to the underlying tech that powers semantic search in Discourse in DEV: Re-introduce PG Vector 0.8.0 upgrade by romanrizzi · Pull Request #35233 · discourse/discourse · GitHub.

Can you retry your use case now, it’s already deployed to your site. My expectation is that this will make RAG better without the need for a reranker.

We still want to make the re-ranker widely available, but we are waiting for some upstream changes to land first.

tpetrov · October 16, 2025, 3:59pm

Thanks Falco!

Will this work only for the semantic search over Discourse topics, or also for RAG documents uploaded to a persona? From my own experience, the AI works quite well with forum topics (i.e. on ask.discourse), but not that well with uploaded docs to a persona (or I haven’t found the best formula yet).

Falco · October 16, 2025, 4:16pm

It affects all uses of embeddings in Discourse, including Related Topics, RAG, Search, Composer category and tag suggestions, etc.

Falco · October 30, 2025, 9:50pm

Hey @tpetrov, did the results improve with the new embeddings model?

tpetrov · October 31, 2025, 10:33am

Hey Falco, I’m sorry, I haven’t had time to test extensively, so cannot really say.

I guess there’s no way to sitch between the two to compare?

Falco · November 2, 2025, 5:06pm

Not now, the old and new models were available for a couple of months, but we recently retired the old ones in our hosting.

Topic		Replies	Views
Discourse AI and retrieval augmented generation Feature ai	3	795	April 29, 2024
RAG capacities of discourse-ai Support ai	7	336	September 19, 2024
Why is my AI forum helper struggling to answer questions? Support ai , ai-bot	4	351	October 15, 2025
Advice on a support bot for a technical support forum (Discourse AI vs Discourse Chatbot) General ai-bot , ai	50	3881	September 19, 2024
Improving quality of search filters in Discourse AI Support ai	14	633	June 28, 2024

Exploring Reranking Options for Discourse AI

Related topics