Hey @sam yes indeed we are happy Discourse customers, and one of the most common pairings with GPT-4 for the exact use case you mentioned — see the logos + quotes on our homepage. Can we help you with a POC?
It seems like you’ve done your research on the costs of training, but I wanted to share my understanding based on the OpenAI fine-tuning guide. If I understand OpenAI API correctly, they recommend using Ada for classification tasks and providing 100 examples of each class. In that case, we would have a total of 200 examples (spam and not spam). Assuming an average example consists of 500 tokens, the total would be 500 * 200 = 100,000 tokens on Ada, which would cost US$ 0.04 to train. If you were to use Davinci instead, the cost would be US$ 3.00.
I guess that the pricing might be for a single step or a single epoch of training, but I couldn’t find any more detailed information on their website. Please let me know if you have any insights or if I’ve misunderstood something.