Discourse AI - Embeddings

Discourse · April 24, 2023, 7:40pm

This topic covers the configuration of the Embeddings module of the Discourse AI plugin. It explains what embeddings are, how they’re used, and how to set them up.

Required user level: Administrator

Embeddings are a crucial component of the Discourse AI plugin, enabling features like Related topics and AI search. This guide will walk you through the setup and use of embeddings in your Discourse instance.

What are Embeddings?

Embeddings are numerical representations of text that capture semantic meaning. In Discourse, they’re used to:

Generate related topics at the bottom of topic pages
Enable semantic search functionality
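
As a minimal illustration of what "capturing semantic meaning" looks like in practice, the sketch below compares toy vectors using cosine similarity. The three-dimensional vectors and the topic titles are invented for illustration. Real embedding models produce vectors with hundreds or thousands of dimensions, and this topic does not state which similarity metric Discourse AI uses, so treat cosine similarity here as an assumption.

```python
import math

def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical toy embeddings (made up for illustration only).
toy_embeddings = {
    "How do I reset my password?": [0.9, 0.1, 0.0],
    "I forgot my login credentials": [0.8, 0.2, 0.1],
    "Best pizza toppings": [0.0, 0.1, 0.9],
}

query = toy_embeddings["How do I reset my password?"]
for title, vector in toy_embeddings.items():
    # Higher scores mean the two texts are closer in meaning.
    print(f"{cosine_similarity(query, vector):.3f}  {title}")
```

The two account-related titles score close to each other even though they share almost no words, which is the property both Related topics and semantic search depend on.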

Setting up Embeddings

For hosted customers

If you’re a hosted customer, embeddings are pre-configured. You can simply enable the AI features that depend on them.

For self-hosted instances

If you’re self-hosting, refer to the Discourse AI self-hosted guide for detailed setup instructions.

Configuring embeddings

Navigate to Admin → Settings → Discourse AI and ensure the following settings are enabled.

ai embeddings enabled: Turn the embeddings module on or off
ai embeddings models: Select which models to use for generating embeddings

Optional settings that can be tweaked…

ai embeddings generate for pms: Decide whether to generate embeddings for private messages
ai embeddings semantic related topics enabled: Enable or disable the “Related topics” feature
ai embeddings semantic related topics: The maximum number of related topics to be shown
ai embeddings semantic related include closed topics: Inclusion of closed topics within AI search results
ai embeddings semantic search enabled: Enable full-page AI search
ai embeddings semantic search hyde model: Model used to expand keywords to get better results during a semantic search

Providers

Within the admin settings, navigate to the AI plugin → Embeddings tab to configure any provider-related settings such as API keys.

Discourse AI supports multiple Embedding providers:

Discourse hosted Embeddings (recommended and default)
OpenAI
Google
Open source models via Hugging Face
Custom options

Features

AI Search

Embeddings power the semantic search option on the full-page search interface.

Semantic search relies on HyDE (Hypothetical Document Embeddings). We expand the search term using a large language model you supply, convert the expanded text to a vector, and look for topics with similar vectors. This technique adds some latency to search but improves results.
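
The HyDE flow described above can be sketched roughly as follows. This is a toy under stated assumptions, not the plugin's implementation: embed is a bag-of-words stand-in for a real embedding model, fake_llm_expand is a hypothetical stand-in for the LLM call, and the topic titles are invented.

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def fake_llm_expand(query):
    """Hypothetical stand-in for the LLM that writes a plausible post for the query."""
    canned = {
        "login broken": "users cannot sign in because the password reset email never arrives",
    }
    return canned.get(query, query)

def hyde_search(query, topics, limit=2):
    hypothetical_doc = fake_llm_expand(query)  # 1. expand the search term
    query_vector = embed(hypothetical_doc)     # 2. embed the expansion, not the raw query
    ranked = sorted(topics, key=lambda t: cosine(query_vector, embed(t)), reverse=True)
    return ranked[:limit]                      # 3. keep the most similar topics

topics = [
    "password reset email never arrives",
    "how to change the site logo",
    "users cannot sign in after upgrade",
]
print(hyde_search("login broken", topics))
```

In this toy, the raw query "login broken" shares no words with any topic, so only the expanded text produces a useful ranking. The extra LLM call in step 1 is where the added latency comes from.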

When selecting a model for HyDE via ai embeddings semantic search hyde model, be sure to choose a fast model such as Gemini Flash, Claude Haiku, GPT-4o mini, or the latest available equivalents.

Generating embeddings

Embeddings are generated automatically for new posts. For existing content:

Embeddings are created when a page is viewed if they’re missing.
Discourse will automatically backfill embeddings for older topics.
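
The two paths for existing content can be pictured with a small in-memory sketch. Everything here is hypothetical: fake_embed stands in for a call to the configured provider, EmbeddingStore stands in for the database, and the batch size is an assumption, since this topic does not describe how the background backfill is scheduled.

```python
def fake_embed(text):
    """Hypothetical stand-in for a call to the configured embeddings provider."""
    return [float(len(text)), float(text.count(" ") + 1)]

class EmbeddingStore:
    def __init__(self):
        self.vectors = {}  # topic_id -> vector

    def ensure(self, topic_id, text):
        """Generate an embedding only if one is missing; return True if work was done."""
        if topic_id in self.vectors:
            return False
        self.vectors[topic_id] = fake_embed(text)
        return True

def backfill(store, topics, batch_size):
    """Embed up to batch_size topics that still lack a vector; return how many were processed."""
    missing = [(tid, text) for tid, text in topics.items() if tid not in store.vectors]
    for tid, text in missing[:batch_size]:
        store.ensure(tid, text)
    return min(len(missing), batch_size)

topics = {1: "old topic one", 2: "old topic two", 3: "old topic three"}
store = EmbeddingStore()
store.ensure(2, topics[2])  # topic 2 is viewed, so it is embedded on demand
while backfill(store, topics, batch_size=1):  # background runs until nothing is missing
    pass
print(sorted(store.vectors))
```

Because ensure skips topics that already have a vector, the on-view path and the backfill path can overlap without doing the same work twice.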

FAQs

Q: How are related topics determined?
A: Related topics are based solely on embeddings, which include the title, category, tags, and post content.

Q: Can I exclude certain topics from related topics?
A: Yes, there’s a site setting to remove closed topics from the results.

Q: Do embeddings work for historical posts?
A: Yes, the system will automatically backfill embeddings for all your content.
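
The first two answers above can be sketched as follows. This is a simplified illustration under assumptions: the Topic class, the way fields are concatenated, and the toy bag-of-words embedding are all hypothetical, and the limit and include_closed parameters only mirror the intent of the site settings rather than their implementation.

```python
import math
from collections import Counter
from dataclasses import dataclass

@dataclass
class Topic:
    id: int
    title: str
    category: str
    tags: list
    posts: list
    closed: bool = False

def embedding_input(topic):
    """Concatenate the fields the FAQ lists: title, category, tags, and posts."""
    return " ".join([topic.title, topic.category, *topic.tags, *topic.posts])

def toy_embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def related_topics(current, all_topics, limit=5, include_closed=True):
    """Rank other topics purely by embedding similarity to the current topic."""
    current_vec = toy_embed(embedding_input(current))
    candidates = [
        t for t in all_topics
        if t.id != current.id and (include_closed or not t.closed)
    ]
    candidates.sort(key=lambda t: cosine(current_vec, toy_embed(embedding_input(t))), reverse=True)
    return [t.id for t in candidates[:limit]]

topics = [
    Topic(1, "Backup fails on upgrade", "support", ["backup"], ["backup job crashes after upgrade"]),
    Topic(2, "Restore from backup fails", "support", ["backup"], ["restore crashes with an error"], closed=True),
    Topic(3, "Favorite themes", "community", ["themes"], ["share the themes you like"]),
]
print(related_topics(topics[0], topics, limit=1))
print(related_topics(topics[0], topics, include_closed=False))
```

Note that nothing other than the vectors enters the ranking: author, topic age, and score play no role in this sketch, matching the "based solely on embeddings" answer.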

Additional resources

Last edited by @Falco 2025-04-14T21:00:20Z

Last checked by @hugh 2024-08-06T04:16:01Z

kuaza · July 23, 2023, 10:36am

Great work, and thanks first of all. However, I can’t see similar topics under my topics. My settings are like this, and I added an OpenAI key. Semantic search works, but how can I show similar topics under topics?

Falco · July 27, 2023, 2:03pm

If you want to use OpenAI for embeddings you must set ai embeddings model to text-embedding-ada-002.

bigfudge · August 17, 2023, 9:01am

How are the jobs to generate embeddings scheduled? From the code it seems like embeddings are only generated when the page is viewed and embeddings are missing. Is there a way to generate embeddings for the whole site when turning the feature on?

Falco · August 17, 2023, 8:56pm

You can also run rake ai:embeddings:backfill to generate embeddings for all topics eagerly.

EricGT · August 18, 2023, 6:56pm

Suggestion

Sometimes when reading a topic, one knows most of the background but not every reference. Summarization already covers an entire topic up to a given point, but it would also help to have an AI option that inserts a glossary for the topic as a post near the top, and updates it whenever a user selects a word or phrase they want the AI to add to the glossary.

Today, while reading this topic, there was one reference I did not recognize, so I looked it up and added a reply with a reference for it. While I know the remaining references, I am sure there are others, especially those new to LLMs and such, who would have no idea about many of them. If the AI could help them, they would visit the site much more often.

While I know what RAG means in this starting post, how many really know that?

What is RAG

How do domain-specific chatbots work? An Overview of Retrieval Augmented Generation (RAG)

Note: Did not know with which topic to post this but since it needed embeddings to work posted it here. Please move this if it makes more sense elsewhere or as the Discourse AI plugin changes.

swong · October 27, 2023, 10:16pm

Are embeddings the only variable when determining “Related Topics”? Or are there any other factors that are considered (e.g. author, topic score, topic age, category, etc)?

Falco · October 27, 2023, 10:47pm

Only the embeddings, but those contain the title, category, tags and posts. There is a site setting to remove closed topics from the results too.

JammyDodger · December 14, 2023, 11:47am

7 posts were split to a new topic: Is full page semantic search only in English?

Falco · February 7, 2024, 5:20pm

2 posts were split to a new topic: Differences in search latency between AI semantic and keyword search

Isambard · March 23, 2024, 10:41pm

I wish I had found this a few months ago. I already created embeddings using bge-small-en-v1.5 and hosted them in an external database.

I will see if it can be shoehorned into this ‘standard’ set-up!

fokx · April 29, 2024, 3:28am

I found a small bug in the recent version that causes rake ai:embeddings:backfill to fail:

root@nbg-webxj:/var/www/discourse# rake ai:embeddings:backfill
rake aborted!
NameError: uninitialized constant Parallel (NameError)

  Parallel.each(topics.all, in_processes: args[:concurrency].to_i, progress: "Topics") do |t|
  ^^^^^^^^
/var/www/discourse/plugins/discourse-ai/lib/tasks/modules/embeddings/database.rake:27:in `block in <main>'
/usr/local/bin/bundle:25:in `load'
/usr/local/bin/bundle:25:in `<main>'
Tasks: TOP => ai:embeddings:backfill
(See full trace by running task with --trace)

I suspect the culprit is that the parallel gem is installed neither in this plugin nor in Discourse core (I only found one in the if ENV["IMPORT"] == "1" block: gem "parallel", require: false).

I found that the ruby-progressbar gem is also required to run rake ai:embeddings:backfill.

I made a simple PR on GitHub:

Hifihedgehog · May 16, 2024, 3:52pm

Note to others: this rake method seems to have been demoted/semi-deprecated, per Falco on GitHub:

Thanks for the PR @fokx, but I’ve left those out unintentionally as the rake task fell out of favor and should only be used in rare occasions by experienced operators who can easily install those out of band.

Hifihedgehog · May 16, 2024, 4:05pm

Is the semantic search option no longer shown in that dropdown, and instead included or enabled through the AI toggle?

PeakProsperity · July 18, 2024, 1:58pm

Can you confirm for me if the embeddings will only work on posts after installing or will it also allow us to semantic-search all historical posts? I’m hoping the latter! Thanks.

Falco · July 22, 2024, 2:18pm

It’s the latter, as it will automatically backfill embeddings for all your content.

packman · August 14, 2024, 10:41am

I’m trying to set up AI Embeddings using Gemini Flash but I can’t get it to work. I can’t find good descriptions/examples of all the settings fields though, so I might have missed one or two that are important. I don’t know if the ‘ai_embeddings_model’ setting is required, but if I set it to ‘gemini’ I get the following error…

I’ve not been able to find the ai_gemini_api_key setting. I do have Gemini Flash set up as an LLM with an API key and that’s working elsewhere, e.g. summarization, but I’m assuming this is wanting the API key entered somewhere else?

Overgrow · September 10, 2024, 2:35pm

I suppose this would work with OpenAI too, wouldn’t it?

It would be great if it could support their Batch API (50% discount)

Falco · September 10, 2024, 3:51pm

Yes, but nowadays we backfill automatically in the background, so this isn’t mandatory.

For price-conscious peeps, we support great open-weights models that you can run on your own hardware.

Overgrow · September 10, 2024, 6:45pm

Thanks. Do I understand it correctly that backfill is when the vectorization happens? When switching between models, do the vectors need to be recalculated (Are they “proprietary”)? I assume yes.

It’d be useful to know how the costs of using the OpenAI API stack up against investing in a GPU-powered server with an open-source solution. Is there a formula or any way to estimate the number of tokens used? We’re only using the API to vectorize posts, not for calculating vector distances, right? So, the number of tokens used depends on how much content we have, correct?

I assume that for both related topics and AI-powered search, all posts need to be vectorized only once, so I can calculate the total number of words in posts table and derive the number of tokens needed. The same process would apply to the daily addition of posts. I’m neglecting the search phrases for now.
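
A rough back-of-the-envelope version of that estimate might look like the sketch below. The numbers are assumptions, not measurements: 1.33 tokens per word is a common rule of thumb for English text, the price per million tokens is a hypothetical placeholder rather than a real quote, and the forum size is invented. Like the reasoning above, it ignores search phrases.

```python
def estimate_embedding_cost(total_words, tokens_per_word=1.33, price_per_million_tokens=0.10):
    """Estimate tokens and cost for embedding a corpus once.

    tokens_per_word is a rough rule of thumb for English text, and
    price_per_million_tokens is a hypothetical placeholder, not a real quote.
    """
    tokens = total_words * tokens_per_word
    cost = tokens / 1_000_000 * price_per_million_tokens
    return round(tokens), cost

# Hypothetical forum: 500,000 posts averaging 80 words each.
tokens, cost = estimate_embedding_cost(total_words=500_000 * 80)
print(f"{tokens:,} tokens, about ${cost:.2f} at the assumed price")
```

The same function can be applied to the average number of words posted per day to estimate the ongoing cost. Substitute your provider's current pricing and, ideally, a token count from the model's own tokenizer instead of the word-based approximation.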
