LLM provider pricing for Discourse AI

Discourse · November 16, 2023, 6:06pm

Discourse AI requires linking to one LLM provider - this is a reference linking to the pricing for different options.

Required user level: Administrator

In order to use certain Discourse AI features, users are required to use a 3rd party Large Language Model (LLM) provider. Please see each AI feature to determine which LLMs are compatible.

The following guide links to the pricing of different LLM providers.

Note that the costs might vary based on multiple factors such as the number of requests, the length of the text, the computational resources used, the models chosen, and so on. For the most up-to-date and accurate pricing, regularly check with each provider.

OpenAI GPT pricing
Anthropic Claude pricing
Google Gemini
Azure OpenAI
AWS Bedrock with Anthropic access
HuggingFace Endpoints with Llama2-like model
Run your own OSS Llama2-like model with TGI: The cost of running your own OSS Llama2-like model with TGI would depend on various factors such as the infrastructure costs, the costs associated with fine-tuning the model, and the costs of managing and maintaining the model.

Last edited by @Saif 2024-10-31T22:44:08Z

Last checked by @hugh 2024-07-30T10:02:34Z

Check document
Perform check on document:

Jagster · November 17, 2023, 5:51am

This is defenetly not statistically acquire comparison, but based on my short testing using OpenAI GPT-4 is three times more expensive than GPT-3.5 Turbo when counted API calls and how many tokens was used — and because moneywise tokens used by GPT-4 are more expensive that difference is much bigger.

And I got no benefits with GPT-4 compared to 3.5 Turbo.

And as a disclaimer: I used finnish, so english can be different thing. Plus any AI is totally useless in chat use when used finnish, but that is totally different ball game — but means, from my point of view, all chatbots are just pure waste of money when used small languages.

Saif · November 20, 2023, 3:03am

The costs here are estimated and agreed that the costs can vary quite dramatically based on usage!

It’s important to note that for many basic tasks, the difference between GPT-4 and GPT-3.5 models may not be significant. However, GPT-4 does have some substantiated differences in terms of its capabilities, creative understanding, and raw input.

I also agree that for languages that are not popular, there is much to be desired in the model’s abilities.

Jagster · November 20, 2023, 10:43am

I think we are talking about same thing, but to be on safe side : that is an issue of AI companies and you, I or any dev can’t change that fact.

But I’m after something like we all should follow a bit how much we are spending money (if we aren’t using money from othet budget than from ours pocket ) and trying to find balance of very sujective usefullness and money.

And no, I don’t know what I’m talking about. Mainly because responses of all chat bots are basically just based on english buzz of millions fly (quantity over quality). Situation can be change - better or worse, it depends - if we have better tools to educate AI what sources it can use. Sure, we have, but it will cost huge much more that price of tokens.

And yes, that is headache of small players.

I’m wondering… is there a chance that we can get a better cost/accuracy balance with more freely prompt editing?

Tris20 · March 22, 2024, 12:32pm

Would you be comfortable disclosing roughly what the cost is for Meta at the moment? Even as a ballpark or range would be helpful.

I asked the bot to give an estimate and it provided the following:

On another topic

Assumptions for Calculation:

Average Post Length: An average post is assumed to be around 50 tokens (considering the mix of shorter and longer posts).

AI-Enabled Actions Per Post: If AI assists in composing, summarizing, or answering queries, let’s assume it’s engaged twice per post (once for drafting a reply and perhaps once for additional tasks like summarization).

Daily Active Users and Posts: Meta Discourse has a high level of engagement. For an approximation, let’s assume there are about 100 active users per day, each generating an average of 4 posts/comments (totaling 400 daily interactions).

Monthly Activity: This translates to 12,000 interactions monthly (400 interactions * 30 days).

Total Token Usage: Assuming each AI action involves processing 100 tokens (50 tokens for reading/input + 50 for generating output), and AI is used twice per post, that’s 200 tokens per post. Therefore, monthly token usage would be 2.4 million tokens (12,000 interactions * 200 tokens).

Cost Estimation:

Taking the GPT-3.5 model as a reference, which cost around $0.02 per 1,000 tokens near the end of my training data:

Monthly Cost: The cost for 2.4 million tokens would be approximately $48 (2,400 * $0.02).

I feel like that number is too low, but discounting experimental work and usage from the Team etc, perhaps this isn’t far away from what most instances of a similar size to Meta could expect?

Jagster · April 1, 2024, 7:09pm

Another stupid question but is the math itself valid? Just asking because LLM just can’t count.

My forum is using way more less AI things (via OpenAI) and my fees are over that.

bryce · April 2, 2024, 4:51am

The token price that the bot mentioned isn’t accurate. The current pricing for gpt-3.5-turbo-0125 is $0.50 per 1 million input tokens and $1.50 per 1 million output tokens. Going with the assumption of half input and half output, 2.4 million tokens should only cost $2.40. gpt-4 is $30/m input and $60/m output, which would work out to $108 for 2.4m tokens.

sam · April 2, 2024, 5:00am

Claude Haiku gets very close to GPT-4 performance and half the price of GPT-3.5.

I think you need a super compelling reason to use 3.5 over Claude 3 Haiku.

@Saif can you update the OP with latest pricing from Claude. OP is way out of date.

I am not sure it is worth carrying actual prices cause they change so often.

Saif · April 2, 2024, 7:21am

Updated the OP to just have the links, I agree the prices are ever changing and its better to get the most up-to-date info

Saif · November 4, 2024, 9:01pm

With the ever growing set of providers and LLMs, its better for users to check with the provider directly. Thus we are removing this topic.

Topic		Replies	Views
Estimating costs of using LLMs for Discourse AI Site Management how-to , price-sensitive , ai	2	665	November 14, 2024
Discourse AI - AI usage Site Management how-to , ai	0	226	January 23, 2025
What Discourse AI features are FREE to use? Support ai	14	283	September 29, 2024
What LLM to use for Discourse AI? Site Management how-to , ai	0	375	January 23, 2025
How much do you spend on OpenAI integration? General	8	930	January 15, 2024

LLM provider pricing for Discourse AI

Related topics