Ah now I understand the confusion.
Yes there is an API token (or key, which you need to get from OpenAI) and there are text tokens that represent parts of words. The latter are processed by the LLM. The former is so you can be authenticated by the API.
Here’s quite a good guide to the terminology from The Verge: