Semantic Search API

MarcP · December 17, 2024, 5:42am

How can we access semantic search through the API?

search.json does not seem to include semantic results (although I remember it did at some point?).

discourse-ai/embeddings/semantic-search?q= is called after search.json, and the results are correct when I open that URL with the query I just ran.

But if I call discourse-ai/embeddings/semantic-search?q=differentQuery directly, the results do not make sense at all.

What am I missing here?

sam · December 17, 2024, 6:31am

You can do a pure embeddings search using:

https://DOMAIN/discourse-ai/embeddings/semantic-search.json?hyde=false&q=....

That disables the hyde portion, so it is rate limited a lot less aggressively.

Additionally, ideally use an API key for the call, which relaxes a lot of the limits.
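
As a rough sketch of the call described above, the request could be built like this in Python. The helper name `build_semantic_search_request`, the example domain, and the fallback username `system` are assumptions for illustration; the `Api-Key` and `Api-Username` headers are the standard Discourse API authentication headers. Nothing here assumes a particular shape for the JSON response.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_semantic_search_request(base_url, query, api_key=None,
                                  api_username=None, hyde=False):
    """Build (but do not send) a request for the pure embeddings search.

    hyde defaults to False so the call falls under the looser
    non-hyde rate limit mentioned in this thread.
    """
    params = urlencode({"q": query, "hyde": "true" if hyde else "false"})
    url = (f"{base_url.rstrip('/')}"
           f"/discourse-ai/embeddings/semantic-search.json?{params}")

    headers = {"Accept": "application/json"}
    if api_key:
        # Standard Discourse API auth headers; the "system" fallback
        # is only an illustrative default.
        headers["Api-Key"] = api_key
        headers["Api-Username"] = api_username or "system"
    return Request(url, headers=headers)


# Hypothetical domain and key, for illustration only.
req = build_semantic_search_request(
    "https://forum.example.com", "rate limits", api_key="YOUR_API_KEY"
)
```

The request can then be sent with `urllib.request.urlopen(req)` and the body decoded with `json.load`.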

MarcP · December 17, 2024, 6:41am

Thanks, this works!

I whitelisted my IPs from rate limiting in app.yml. I think I read somewhere that this was also a way to bypass rate limits, if I’m correct.

sam · December 17, 2024, 6:43am

Not really, search limits are generally implemented in the app. Skipping hyde is critical here:

github.com/discourse/discourse-ai/blob/ca800f7aa3e6efcd8c7c9d8aef87ea8b2c548184/app/controllers/discourse_ai/embeddings/embeddings_controller.rb#L10-L11

    MAX_HYDE_SEARCHES_PER_MINUTE = 4
    MAX_SEARCHES_PER_MINUTE = 100

You only get 4 hyde queries a minute (where we expand the search term for you), but up to 100 non-hyde ones (provided other rate limits are relaxed).
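
On the client side, one way to stay under these two limits is a small sliding-window throttle. The sketch below is only an illustration: the class `MinuteWindowLimiter` is hypothetical, and it assumes a rolling 60-second window, which may not match how the server actually counts requests. The two constants are copied from the controller lines quoted above.

```python
import time
from collections import deque

# Values quoted from the controller above.
MAX_HYDE_SEARCHES_PER_MINUTE = 4
MAX_SEARCHES_PER_MINUTE = 100


class MinuteWindowLimiter:
    """Client-side throttle: at most max_calls per rolling window."""

    def __init__(self, max_calls, window=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.window = window
        self.clock = clock
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait_time(self):
        """Seconds to wait before the next call is allowed (0.0 if none)."""
        now = self.clock()
        # Drop timestamps that have left the window.
        while self.calls and now - self.calls[0] >= self.window:
            self.calls.popleft()
        if len(self.calls) < self.max_calls:
            return 0.0
        return self.window - (now - self.calls[0])

    def record(self):
        """Note that a call was just made."""
        self.calls.append(self.clock())


hyde_limiter = MinuteWindowLimiter(MAX_HYDE_SEARCHES_PER_MINUTE)
plain_limiter = MinuteWindowLimiter(MAX_SEARCHES_PER_MINUTE)
```

Before each search, pick the limiter that matches the `hyde` parameter, sleep for `wait_time()` seconds, then call `record()` once the request has been sent.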

MarcP · December 17, 2024, 6:46am

I will pass this param for sure.

My question was actually: is passing an API key effectively the same as excluding an IP from rate limits? Or did you mean hyde=false only works IF an API key is passed?

sam · December 17, 2024, 6:49am

The two are unrelated. The API has different knobs for rate limits, and you can relax them more than other parts of the app in global settings.

MarcP · December 17, 2024, 6:59am

Got it. The app.yml flag I was talking about seems to lift nginx rate limits (DISCOURSE_MAX_REQS_PER_IP_EXCEPTIONS).

The topic below made it a bit more clear to me:

system · January 16, 2025, 6:59am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.
