I’ve been struggling to set up Embeddings with Mistral AI, I suspect because Mistral requires a model to be passed. Do you know whether this is possible (and if so, how), or what should be done to make it possible?
Try setting `mistral-embed` in the “Model name” field, which appears after you select “OpenAI” as the “Provider”.
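This works because Mistral exposes an OpenAI-compatible API. As a minimal sketch (the endpoint URL and placeholder key are my assumptions, not Discourse internals), the equivalent direct call looks like this:

```python
from openai import OpenAI

# Point the standard OpenAI client at Mistral's OpenAI-compatible endpoint.
client = OpenAI(
    api_key="YOUR_MISTRAL_API_KEY",        # a Mistral key, not an OpenAI one
    base_url="https://api.mistral.ai/v1",  # assumed Mistral API base URL
)

resp = client.embeddings.create(
    model="mistral-embed",
    input=["Hello from Discourse embeddings"],
)
print(len(resp.data[0].embedding))  # mistral-embed returns 1024-dimensional vectors
```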
Thanks, that works
I’m struggling to figure out which tokenizer would be best for this use case, though. The Mixtral tokenizer isn’t selectable here. Do you have any suggestions?
Here are the token counts for your post above according to several tokenizers:

| Tokenizer | Tokens |
| --- | --- |
| OpenAI | 45 |
| Mixtral | 52 |
| Gemini | 47 |
| E5 | 50 |
| bge-large-en | 49 |
| bge-m3 | 50 |
| mpnet | 49 |
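In case it’s useful, here’s a minimal sketch of how counts like these can be reproduced. The Hugging Face repo ids are my assumptions (and some Mistral repos may require access approval), not necessarily what Discourse ships:

```python
import tiktoken
from transformers import AutoTokenizer

text = "I've been struggling to set up Embeddings with Mistral AI..."

# OpenAI-style count via tiktoken (cl100k_base is the encoding used by
# OpenAI's current embedding models).
print("OpenAI:", len(tiktoken.get_encoding("cl100k_base").encode(text)))

# Open models via Hugging Face tokenizers; repo ids below are assumptions.
for label, repo in [
    ("Mixtral", "mistralai/Mixtral-8x7B-v0.1"),
    ("bge-m3", "BAAI/bge-m3"),
    ("mpnet", "sentence-transformers/all-mpnet-base-v2"),
]:
    tok = AutoTokenizer.from_pretrained(repo)
    print(label, len(tok.encode(text, add_special_tokens=False)))
```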
Looks like `mistral-embed` doesn’t differ much from the others. And since it supports a large 8k-token context window, you should be safe picking any of them and leaving some room to spare by limiting the context window in Discourse to 7k or 7.5k.
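To illustrate the headroom idea, here’s a rough sketch of truncating to a token budget before embedding; the function name and the 7.5k default are mine, not anything Discourse exposes:

```python
from transformers import AutoTokenizer

def truncate_to_token_budget(text: str, tokenizer, budget: int = 7500) -> str:
    """Keep at most `budget` tokens, leaving headroom under the 8k limit."""
    ids = tokenizer.encode(text, add_special_tokens=False)
    return tokenizer.decode(ids[:budget])

tok = AutoTokenizer.from_pretrained("mistralai/Mixtral-8x7B-v0.1")  # assumed repo id
safe_text = truncate_to_token_budget("some very long post ...", tok)
```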
Looks like `mistral-embed` uses the same tokenizer as the first Mixtral model, and we already ship that anyway, so what do you think about enabling that tokenizer in the embeddings config page, @Roman_Rizzi?
Sure. I don’t see why not if it’s already there. This change will add it to the available options: