You can do a pure embeddings search using:
https://DOMAIN/discourse-ai/embeddings/semantic-search.json?hyde=false&q=....
That disables the hyde portion so it is rate limited a lot less aggressively.
Additionally ideally use an api key for the call which relaxes a lot of the limits.