Эксперименты с модерацией на основе ИИ на Discourse Meta

sam · 03.Апрель.2025 03:56:10

Я не публиковал сообщения уже давно, хотя каждый день захожу в своё маленькое окно чата и оно помогает мне хотя бы раз-два в день… постоянно.

Причина моей задержки заключалась в том, что мне пришлось разобраться с этим довольно крупным изменением.

github.com/discourse/discourse-ai

FEATURE: flexible image handling within messages (#1214)

main ← better_upload_support

merged 03:39PM - 31 Mar 25 UTC

SamSaffron

+1380 -722

**1. What Led to the Change? (Problems with Previous Approach)** * **Incons…istent Context Handling:** The previous system often passed context information (like `post_id`, `user`, `private_message`, `topic_id`, `custom_instructions`) around using plain Ruby hashes (`context: {}`). This approach lacked structure, was potentially error-prone (typos in keys), and made it harder to track what context was available or required in different parts of the AI Bot system (Tools, Personas, Bot logic). Accessing context often involved `context[:key]`. * **Inflexible Image/Upload Handling:** Images associated with a user message were previously passed using a separate `upload_ids: [...]` array within the message hash. This made it difficult or impossible to represent prompts where text and images are interleaved naturally (e.g., "Describe this image {image1}, then compare it to this one {image2} and tell me the difference"). The LLM received the text and a list of associated image IDs, but not their precise relationship *within* the user's text flow. * **Complex/Decentralized Context Building:** Logic for assembling conversation history (e.g., pulling previous posts/messages, handling custom prompts, associating uploads) was somewhat spread out, notably seen in the significant changes and removals within `lib/ai_bot/playground.rb` (specifically the `conversation_context` and `chat_context` logic being refactored). **2. What New Support Does It Add? (Key Changes & Benefits)** * **Introduction of `DiscourseAi::AiBot::BotContext`:** * **What:** A dedicated class (`BotContext`) is introduced to encapsulate all contextual information for an AI Bot interaction. This includes messages, post/topic details, user information, site details (URL, title, description), time, participants, and control flags (like `skip_tool_details`). * **Why:** Provides a structured, standardized, and object-oriented way to manage and pass context. This improves code readability, maintainability, and reduces the chance of errors compared to using unstructured hashes. Access changes from `context[:key]` to `context.key`. * **Impact:** This class is now used consistently when initializing Tools (`Tool#initialize`), crafting prompts (`Persona#craft_prompt`), invoking the bot (`Bot#reply`), and within various helper methods, ensuring a uniform context object is available throughout the system. * **Enhanced Multimodal Input (Inline Images/Uploads):** * **What:** The format for representing user messages with uploads has fundamentally changed. Instead of a separate `upload_ids` array, uploads are now embedded directly *within* the `content` field, which becomes an array if uploads are present. Example: `content: ["Here is an image:", { upload_id: 123 }, "What do you see?"]`. * **Why:** This allows for precise interleaving of text and visual elements within a single user turn. It's a much more natural way to represent multimodal prompts for vision-capable LLMs, enabling more complex instructions involving multiple images referenced at specific points in the text. * **Impact:** Required changes across multiple components: * **`Prompt` Class:** Logic for handling uploads (`encoded_uploads`, `encode_upload`, `content_with_encoded_uploads`, `text_only`) was refactored to support this new inline structure. Validation was updated. * **LLM Dialects:** All relevant dialects (`ChatGpt`, `Claude`, `Gemini`, `Mistral`, `Nova`, `Ollama`, `OpenAiCompatible`) were updated to correctly parse the new `content` array format and translate it into the specific structure required by each respective LLM API (e.g., OpenAI's array of text/image_url objects, Gemini's parts array). A helper `to_encoded_content_array` was added to the base `Dialect` class. * **Modules Using Vision:** Code that passes uploads to LLMs (e.g., `LlmTriage`, `Assistant`, `SpamScanner`, `Playground`) was updated to use the new `content` format. * **Refactored Context Building:** * **What:** Logic for building conversation history from posts or chat messages seems to be increasingly centralized in `DiscourseAi::Completions::PromptMessagesBuilder`. New methods like `messages_from_post` and `messages_from_chat` appear to encapsulate this logic. * **Why:** Simplifies components like the `Playground` by abstracting away the details of fetching and formatting conversation history, including handling the new inline upload format. * **Impact:** Significant simplification in `lib/ai_bot/playground.rb`, removing large chunks of previous context-building code.

Оно обеспечивает тонкое, но критически важное улучшение для Discourse AI.

Я регулярно замечал, что бот модерации говорит о совершенно нерелевантных изображениях из-за способа, которым мы формировали контекст. Это изменение позволяет нам представлять смешанный контент (содержащий изображения и текст в правильном порядке).

Это означает, что LLM больше не путается.

Что дальше?

У нас нет возможности в автоматизации вызвать правило после того, как редактирование поста «устаканится». Вызовы LLM могут быть дорогими, и нам не нужно сканировать одно и то же снова и снова только из-за того, что кто-то исправил опечатку. Я не уверен, что это необходимо здесь, но я хотел бы предусмотреть возможность запуска автоматизации после того, как пост примет новую форму.
Инженерия промптов — текущий промпт приемлем, но для моего вкуса он слишком громкий, он меня немного раздражает, возможно, я его немного смягчу.
Улучшенный контекст — меня действительно беспокоит то, что автоматизация теперь не учитывает уровень доверия пользователя. Некоторые пользователи пользуются большим доверием в сообществе, чем другие (например, модераторы). Я хотел бы посмотреть, сможем ли мы улучшить эту ситуацию.
Возможность запускать автоматизацию на пакетах постов для быстрой итерации.
Я уверен, что появится ещё много чего.

Тема		Ответов	Просм.
Introducing Discourse AI Blog	26	4126	04.05.2023
AI integration for moderation Support	2	168	25.01.2026
AI Forum Moderation: Seeking Insights and Experiences Development ai	8	1994	27.09.2025
Have AI check for inappropriate post or at least words and flag the post Support ai , ai-toxicity	2	482	07.07.2023
Setting up NSFW detection in your community Site Management moderation , automation , how-to , ai	0	1378	10.10.2024

Эксперименты с модерацией на основе ИИ на Discourse Meta

Что дальше?

Связанные темы