LLM-gebruikslimieten configureren in Discourse AI

sam · 21 januari 2025 om 06:04

This guide explains how to configure and manage usage quotas for Large Language Models (LLMs) in Discourse AI.

Required user level: Administrator

Summary

LLM Usage Quotas allow administrators to control and monitor AI resource consumption by setting limits on token usage and interactions for different user groups. This helps maintain cost efficiency while ensuring fair access to AI features across your community.

Configuration

Accessing quota settings

Navigate to your site’s admin panel
Go to Admin > Plugins > Discourse AI > LLMs
Select the LLM model you want to configure

Setting up quotas

For each user group, you can configure:

Maximum token usage
And/Or Maximum number of AI interactions
And/Or Maximum cost
Reset period duration

At least one of max tokens or max usages must be set for each quota.

Note: The “everyone” group cannot be assigned a quota. You must use specific groups (e.g., trust level groups or custom groups).

Duration options

Choose from preset reset periods:

1 hour
6 hours
24 hours
7 days
Custom duration (specified in hours)

Usage monitoring

Viewing statistics

Administrators can monitor token consumption and usage consumption at: https://SITENAME/admin/plugins/discourse-ai/ai-usage

Navigate to Admin > Plugins > Discourse AI
Select “Usage” tab
Filter by date range, user group, or specific metrics

User experience

Quota notifications

Users receive clear feedback when approaching or reaching quota limits:

Current usage status
Time until next quota reset

Error messages

When a quota is exceeded, users see:

A clear notification that the quota limit has been reached
The time remaining until their next quota reset

Best practices

Start conservative: Begin with lower quotas and adjust based on actual usage patterns
Group-based allocation: Assign different quotas based on user group needs and roles
Regular monitoring: Review usage patterns to optimize quota settings
Clear communication: Inform users about quota limits and reset periods

Common issues and solutions

Issue: Users frequently hitting limits

Solution: Consider:

Increasing quota limits for specific groups
Reducing the reset period
Creating specialized groups for high-usage users

Issue: Unused quotas

Solution:

Adjust limits downward to optimize resource allocation
Review group assignments to ensure quotas match user needs

FAQs

Q: Do unused quotas roll over?
A: No, quotas reset completely at the end of each period.

Q: Can different LLM models have different quotas?
A: Yes, quotas can be configured independently for each LLM.

Q: What happens if multiple quotas are set for a single LLM?
A: Quotas are group based and applied per user. For a user to exceed quota the user must exceed quota in all groups. This means that if you give admins a very relaxed quota and trust level 1 a more restrictive one, the admin quota will apply to admins.

Q: What if no quota is applied to an LLM?
A: Nothing special will happen all LLM usage will be unmetered

Q: What if I want different quotas for different features
A: Discourse AI allows you to define multiple LLMs which all contact the same endpoint and even can reuse keys, if you wish to give one quota to AI helper and a different to AI Agent, define 2 LLMs.

Q: How do I remove a quota?
A: Delete the quota from the LLM model’s configuration page. There is no way to temporarily “pause” or disable a quota — it must be deleted and recreated.

Additional resources

lava · 21 januari 2025 om 06:37

It seems we can’t completely prohibit a group from using a specific model by setting the quota to 0.

Could you add support for this setting?

sam · 21 januari 2025 om 06:40

Sorry can you expand here. Each feature also is group gated, so you can enable helper only for a subset of users anyway.

lava · 21 januari 2025 om 06:50

I want some premium models to be restricted to specific groups only. It would be great if we could set a model’s quota to 0 to disable access for certain groups.

sam · 21 januari 2025 om 06:53

Yeah, it’s an interesting problem. I’ll have a think about it.

You may want the helper to use GPT4o for “special group 1” and GPT4o mini for the rest of the people.

At the moment, we only allow you to select one model for the AI helper, so we would need a reasonably big change to support this.

@Falco / @Saif / @awesomerobot, something to think about.

Topic		Antwoorden	Weergaven
LLM Quotas for Discourse AI Announcements ai	0	178	21 januari 2025
Limit the number of AI tokens a user can use in a day? Feature completed , ai	11	1021	3 april 2025
Making the case for a hard cap feature on user group AI usage for AI bots and AI Helper Feature chat , completed , ai , ai-bot	11	457	21 januari 2025
Balancing Costs and Functionality in AI-Powered Forums Feature ai , ai-bot	4	861	21 januari 2025
Discourse AI - Large Language Model (LLM) settings page Site Management how-to , ai	20	3958	9 mei 2026