It’s not resource intensive on your self-hosted instance because you can just use hosted models, e.g., openai. So you just pay for API calls for embeddings and chat.
It’s not resource intensive on your self-hosted instance because you can just use hosted models, e.g., openai. So you just pay for API calls for embeddings and chat.