We are happy to announce Discourse AI, a new plugin and our one-stop solution for integrating Artificial Intelligence and Discourse, enabling new features and enhancing existing ones. With this first release, we are shipping 7 different Discourse AI modules to help community managers, members, and moderators with tasks ranging from sentiment analysis to automated proofreading and suggested edits. Read along to find out more about each of these features, as well as what is coming up next on our roadmap!
This is an impressive body of work, @Falco and team. Really excited to see how this all works in practice and its impact on community management overall.
These are the kind of updates that feel like opening a new Christmas present.
We do not (at the time of writing) have a dedicated manager for our community, and tools like this let us continue to scale without a dedicated role.
Not to mention the features like composer helper that just elevate the user experience.
Yes, we are planning on exploring this area. The tricky thing is that we only have a small number of examples we can feed into GPT-4 given the prompt limits; staying within the token limits is really hard. There are quite a few other approaches we can take, though, and we will explore and report back.
Even with very little fine-tuning, GPT-4 does not do a terrible job assessing stuff:
Could you try it with a post that contains a long block of code or syslog output? Those get tagged as spam by Akismet all the time on our site.
Probably, but it would get super expensive to fine-tune a model. Some people get really good results simply by using embeddings; that is probably the next thing to try.
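To make the embeddings idea concrete: embed known spam and ham posts, then compare a new post's embedding to each group, with no fine-tuning involved. A minimal sketch, where the tiny 3-D vectors and the centroid classifier are toy stand-ins (a real setup would get its vectors from an embeddings API):

```python
# Sketch of the embeddings approach to spam classification: compare a
# post's embedding to the centroid of known spam and ham embeddings.
# The vectors below are toy stand-ins, not real model output.
from math import sqrt

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm

def centroid(vectors):
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def classify(post_vec, spam_vecs, ham_vecs):
    spam_sim = cosine(post_vec, centroid(spam_vecs))
    ham_sim = cosine(post_vec, centroid(ham_vecs))
    return "spam" if spam_sim > ham_sim else "ham"

# Toy 3-dimensional embeddings, for illustration only.
spam = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]]
ham = [[0.1, 0.9, 0.2], [0.0, 0.8, 0.3]]
print(classify([0.85, 0.15, 0.05], spam, ham))  # → spam
```

No training run at all here, which is why this route stays cheap: the only per-post cost is one embedding call plus a similarity lookup.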
When I checked, fine-tuning was way cheaper than I expected. It depends a lot on how much training data you plan to use, but if the comparison is with what you can fit in a single GPT-4 prompt, it's cents.
I didn't get to the point of actually using it, so chances are I missed something; please correct me if I'm wrong.
Training can be very, very expensive. In our case, just for OpenAI's recommended minimum, we'd be looking at almost $200,000 to train on a single use case.
Are new users still getting confused by TL1 limits?
If so, I think AI could be a good solution for that: let new users do more, but have the AI pay close attention to them, and put posts in the moderator queue when it's not confident they're ok.
No probs at all @Falco was doing a spike on this today and it looks very promising, even a trivial prompt does surprisingly well. Spam is just sooooo spammy.
Will leave it to Falco to share specifics.
Another interesting approach, which we can possibly combine with the above, is leaning on the vector database. If you post something and its vector is close to 20 other spam posts… well, it is probably spam. This approach also allows fine-tuning.
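That nearest-neighbour idea fits in a few lines. In this sketch the `looks_like_spam` helper, the toy 2-D vectors, and the 80% threshold are all illustrative assumptions; in practice a vector database would do the lookup over real embeddings:

```python
# Sketch of vector-similarity spam detection: embed the new post, find
# its nearest stored vectors, and flag it if most of those neighbours
# were already marked as spam.
from math import dist  # Euclidean distance, Python 3.8+

def looks_like_spam(post_vec, labelled, k=5, spam_ratio=0.8):
    """labelled: list of (embedding, is_spam) pairs for past posts."""
    neighbours = sorted(labelled, key=lambda item: dist(post_vec, item[0]))[:k]
    flagged = sum(1 for _, is_spam in neighbours if is_spam)
    return flagged / len(neighbours) >= spam_ratio

# Toy 2-D embeddings: a spam cluster near the origin, ham near (1, 1).
history = [([0.0, 0.0], True), ([0.1, 0.0], True), ([0.0, 0.1], True),
           ([0.1, 0.1], True), ([1.0, 1.0], False), ([0.9, 1.0], False),
           ([1.0, 0.9], False)]
print(looks_like_spam([0.05, 0.05], history))  # → True
```

The knobs (`k`, `spam_ratio`) are where the fine-tuning mentioned above would happen: tightening or loosening them per community trades false positives against misses without retraining anything.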
To be honest, I kind of see Akismet's future as not that bright. Matt must be stressing out about its long-term prospects.