ロケール検出器の戻り値の問題

RGJ · 2025 年 9 月 7 日午前 8:49

2つの異なるフォーラムで、コンテンツのローカライゼーションが投稿を元の言語に翻訳し始めるという2つの異なる問題が発生しました。\n\nこれを調査したところ、ロケール検出器がLLMから純粋な言語コードを受け取っていないことが判明しました。\n\n代わりに、それはマークダウン でラップされていました（読みやすくするためにログから関連部分をコピーしました。\n\n\n"delta":{"content":"\"}\n\"delta\":{\"content\":\"en"},\n\n\nまたは、おそらくプロンプトが `Output: \"en\"` と言っていることに混乱したため、周囲に引用符が付いていました。\n\n\n"delta":{"content":"\""}\n"delta":{"content":"en\""}\n```\n\nプロンプトの最後の行を Your response must be a language code, and nothing else. Do not wrap your response in markdown. に変更したところ役立ちましたが、おそらく LanguageDetector.detect は使用する前に回答を少しクリーンアップすべきだと思います（AZaz と - のみ許可するなど）。

sam · 2025 年 9 月 15 日午前 1:24

報告ありがとうございます。@nat が確認します。

nat · 2025 年 11 月 4 日午前 3:21

@RGJ この件についてはPR（プルリクエスト）をオープンしていますが、使用しているLLM（大規模言語モデル）を教えていただけますか？

RGJ · 2025 年 11 月 4 日午前 6:14

そのインスタンスは廃止しましたが、私の記憶が正しければ、それはミニストラル 3B でした。

nat · 2025 年 11 月 5 日午前 10:17

ここに、プロンプトの更新と、例をシステムプロンプトから適切なインタラクションへの移動を含んだ修正をマージしました。

github.com/discourse/discourse

FIX: Improve prompt and check returned value conforms to standard

main ← sanitise-locale-detection

opened 11:37AM - 03 Nov 25 UTC

nattsw

+92 -34

This commit improves the prompt, and also matches the return value against this:… - https://datatracker.ietf.org/doc/html/rfc5646#section-2.2.1 - **Primary Language Subtag**: ... Two-character primary language subtags were defined in the IANA registry according to the assignments found in the standard "ISO 639-1:2002 ... - **Extended Language Subtags**: ... Extended language subtags consist solely of three-letter subtags. Meta: https://meta.discourse.org/t/locale-detector-return-value-issues/381852

私たちのチームは現在、さまざまなLLMにわたる信頼性の向上を目指して評価に取り組んでいます。

nat · 2025 年 11 月 8 日午前 12:00

このトピックは2日後に自動的に閉じられました。返信はもう受け付けられません。

トピック		返信	表示
AI Translation skips Portuguese (pt) locale - post translated to all languages except Portuguese Bug ai , dynaloc	25	423	2026 年 4 月 22 日
Default LLM model is required prior to enabling "Chat"? Bug ai , content-localization	2	153	2025 年 9 月 15 日
Norwegian is identified as `no` by locale detector agent, content localization supported locales is `nb_NO` Bug ai , fixed	5	171	2026 年 5 月 18 日
AI Commentary on German Translations Bug ai , fixed , content-localization	2	101	2026 年 4 月 3 日
Localized content shows raw HTML or json Bug content-localization	2	121	2025 年 9 月 15 日

ロケール検出器の戻り値の問題

関連トピック