Discourse AI and retrieval augmented generation

aas · November 22, 2023, 9:31pm

Discourse AI looks amazing and I am super eager to set it up on my self-hosted instance!

One question I have (or perhaps a feature request) related to the helper bot and embeddings is: can I choose which topics are used for retrieval augmented generation (RAG)? For example, it would be amazing if I can configure the plugin only to compute embeddings for topics in my official docs categories. I fear that if the bot is building up a vector database using everything on our forum, the output will not be good enough. It would also be interesting to configure it only to compute embeddings for topics with specific tags or solved topics. I’m curious about the details regarding the RAG workflow. Does Discourse AI have a RAG workflow? Will we have the ability to control which documents get added to the vector database? If we already have a collection of embeddings, can we configure Discourse AI use them when calling the helper or semantic search?

I saw this briefly mentioned over here, but I’d love to know more details!

sam · November 22, 2023, 10:47pm

Not quite yet but I very much like this feature request.

Our search command now does not have any params, when you choose it:

So the feature request here is to allow you to specify some additional params for various commands you add. I really like it, just need to think through the UI and data structures.

tentacleskilt · November 23, 2023, 9:27am

aas:

Discourse AI looks amazing and I am super eager to set it up on my self-hosted instance!

One question I have (or perhaps a feature request) related to the helper bot and embeddings is: can I choose which topics are used for retrieval augmented generation (RAG)? For example, it would be amazing if I can configure the plugin only to compute embeddings for topics in my official docs categories. I fear that if the bot is building up a vector database using everything on our forum, the output will not be good enough. It would also be interesting to configure it only to compute embeddings for topics with specific tags or solved topics. I’m curious about the details essay regarding the RAG workflow. Does Discourse AI have a RAG workflow? Will we have the ability to control which documents get added to the vector database? If we already have a collection of embeddings, can we configure Discourse AI use them when calling the helper or semantic search?

I saw this briefly mentioned over here, but I’d love to know more details!

What in this persona builds on top of the Discourse integration, other than using the Discourse UI as a front-end for the external LLM? For most of our personas we are looking into doing RAG, were we can create something that using an out of the shelf LLM won’t also give you.

As far as I know, the Discourse AI plugin builds only a vector database using all posts on the forum, but this approach will be refined to allow users to specify which documents should be included. This will enable more granular control over the training data and improve the quality of generated responses.

Moreover, the ability to incorporate pre-computed embeddings is still being explored…

sam · April 29, 2024, 11:16pm

FYI this is implemented now

The image displays a user interface section for configuring search functionality, including fields for setting a base search query and specifying the maximum number of search results. (Captioned by AI)

You can control search params when you add a search tool. If you add bug it will only search bugs.

Topic		Replies	Views
RAG capacities of discourse-ai Support ai	7	205	September 19, 2024
API access to the embedding(s) for a post Feature completed	4	421	September 15, 2024
Advice on a support bot for a technical support forum (Discourse AI vs Discourse Chatbot) General ai , ai-bot	50	3558	September 19, 2024
Improving quality of search filters in Discourse AI Support ai	14	558	June 28, 2024
Discourse AI - AI search Site Management how-to , ai , ai-search	10	2506	August 5, 2025

Discourse AI and retrieval augmented generation

Related topics