I may want the service, and likely will, but it's early days for the forum I have in mind, so there's not enough data to chew on yet.
Since you are playing with this technology, can you tell us what role tags play in training the AI? I put a ton of effort into clustering the corpus of one of my forums in order to generate labels that could then be used to categorize and tag topics. While categorization went very well, implementing tags is problematic because of the sheer number of terms involved. There's no practical way to present them all.
I would think that the AI could use those terms to improve its own results.
Discourse AI will now store the embeddings in the same DB instance we use for everything else. That makes it much easier to install and maintain, and we will automatically import the embeddings from the old database when you update. After that you can decommission the old database.
Ah, this explains the issues I now get with my setup:
```
I, [2023-07-18T09:29:11.218667 #1] INFO -- : > cd /var/www/discourse && su discourse -c 'bundle exec rake db:migrate'
------------------------------DISCOURSE AI ERROR----------------------------------
Discourse AI requires the pgvector extension on the PostgreSQL database.
Run a `./launcher rebuild app` to fix it on a standard install.
Alternatively, you can remove Discourse AI to rebuild.
------------------------------DISCOURSE AI ERROR----------------------------------
```
My database is an RDS Aurora Serverless v2 instance and hence cannot use the pgvector extension. Any chance of configuring the old behaviour?
Are you using serverless for the main Discourse DB or only for the embeddings one? Discourse AI now stores the embeddings in the main DB and requires the pgvector extension to be enabled there. It's available on RDS PostgreSQL 13.11 and greater. We don't use Aurora in production, only RDS PostgreSQL, so that's the only setup I can recommend.
RDS is a managed service from AWS, so it can't be packaged in a Docker image.
Discourse AI works with the PostgreSQL version we package in our Docker image, with Amazon RDS, or with any PostgreSQL instance that has the extension installed.
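If you run your own PostgreSQL (RDS or otherwise) and want to confirm the extension is there before rebuilding, something like the sketch below works. The host, user, and database names are placeholders, and on RDS you need a role with rds_superuser to create the extension.

```bash
# Placeholders: adjust host, user, and database names to your own instance.
# On RDS for PostgreSQL 13.11+ the vector extension ships with the engine.
psql "host=your-db.example.com user=discourse dbname=discourse" <<'SQL'
-- Is pgvector available on this server?
SELECT name, default_version, installed_version
FROM pg_available_extensions
WHERE name = 'vector';

-- Enable it in this database (no-op if it is already enabled)
CREATE EXTENSION IF NOT EXISTS vector;
SQL
```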
Do you mean recommending “Related Topics”? In that case, no, not yet. There are no embedding models based on Llama 2 yet.
Worth mentioning that the ones we ship (one open source and one via the OpenAI API) are really good and more than enough to power the Related Topics feature.
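For a rough idea of what the OpenAI-backed option involves, an embedding is just one HTTP call per text. This is a minimal sketch; the model name and example input are my assumptions, not necessarily what the plugin sends.

```bash
# Ad-hoc call to the OpenAI embeddings endpoint (placeholder input;
# the model name is an assumption, not a Discourse AI setting).
curl -s https://api.openai.com/v1/embeddings \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "text-embedding-ada-002", "input": "How do I enable Related Topics?"}'
```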
Not at the moment, as that would require me to keep two separate repos: one with the app code and another with the internal tooling to build images and push them to our internal repositories. I really couldn't find the time to set this up properly.
The API code is all visible inside the container image, though. Even if that is not the best way to peruse it, at least it's all there.
Could anyone share the exact minimum and recommended server requirements for a forum with standard traffic? Honestly, I want to give it a try, but I don't know where to start since there is no clear server requirement.
My forum has 200-250 online users and an average of 300 new posts a day, so it isn't too much; that's why I said standard. I understand what you mean, but I plan to rent a new server because the cloud server I am using now does not allow many upgrades. Thanks for your answer.
For example, if you just want to play with embeddings, a $6 droplet running them on the CPU will be enough, and that will give you access to the Similar Topics feature.
Now if you want the AI Helper and AI Bot, you can:

- pay per call on OpenAI, where the cost will depend on your usage.
- run an open-source LLM on a server you own, for privacy. A model like Llama2-70B-Chat will need a server that costs $10k ~ $25k a month, though.
- run an open-source LLM on a pay-per-hour service. You can run a quantized version of Llama 2 in Hugging Face Endpoints for $6.50 an hour, and it will automatically sleep after 15 minutes without requests (a request sketch follows below).
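To give an idea of the pay-per-hour option, once the endpoint is up a request is just an HTTP call. The sketch below assumes a text-generation-inference endpoint; the URL, token, prompt, and parameters are all placeholders.

```bash
# Placeholder URL and token for a dedicated Hugging Face Inference Endpoint
# running a quantized Llama 2 chat model behind text-generation-inference.
curl -s https://your-endpoint.endpoints.huggingface.cloud \
  -H "Authorization: Bearer $HF_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"inputs": "Summarize the discussion above in two sentences.", "parameters": {"max_new_tokens": 256, "temperature": 0.7}}'
```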
The MLOps area is moving fast, GPUs are super scarce, and new models launch every day. It's hard to predict; we are all experimenting.