The token length of your post above, according to a few tokenizers (a rough sketch for reproducing counts like these follows the list):
- OpenAI: 45
- Mixtral: 52
- Gemini: 47
- E5: 50
- bge-large-en: 49
- bge-m3: 50
- mpnet: 49
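
The sketch below shows one way such counts could be produced. The checkpoint names are my guesses at which tokenizers these models use, not something confirmed by the numbers above, and the Gemini tokenizer isn't publicly downloadable, so it's skipped:

```python
# Sketch: count tokens for a piece of text with several tokenizers.
# Checkpoint names below are assumptions, not confirmed by the post.
import tiktoken
from transformers import AutoTokenizer

text = "..."  # the post being measured

# OpenAI embedding models use the cl100k_base encoding
openai_enc = tiktoken.get_encoding("cl100k_base")
print("OpenAI:", len(openai_enc.encode(text)))

# Open models via their Hugging Face tokenizers (repos assumed)
repos = {
    "E5": "intfloat/e5-large-v2",
    "bge-large-en": "BAAI/bge-large-en-v1.5",
    "bge-m3": "BAAI/bge-m3",
    "mpnet": "sentence-transformers/all-mpnet-base-v2",
}
for name, repo in repos.items():
    tok = AutoTokenizer.from_pretrained(repo)
    print(f"{name}:", len(tok.encode(text, add_special_tokens=False)))
```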
Looks like Mistral-embed doesn't differ much from the others. And since it supports a large 8k context window, you should be safe picking any of them, and you can leave some room to spare by limiting the context window in Discourse to 7k or 7.5k.
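
As a minimal sketch of that headroom idea: clip the text to a token budget below the 8k limit before embedding it. The 7,500 figure is illustrative rather than a specific Discourse setting, and bge-m3 is used here only as an example tokenizer:

```python
from transformers import AutoTokenizer

TOKEN_BUDGET = 7_500  # stay comfortably below the 8k context window

tokenizer = AutoTokenizer.from_pretrained("BAAI/bge-m3")  # example checkpoint

def truncate_to_budget(text: str, budget: int = TOKEN_BUDGET) -> str:
    # Encode, clip to the budget, and decode back for the embedding call.
    ids = tokenizer.encode(text, add_special_tokens=False)[:budget]
    return tokenizer.decode(ids)
```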