I wonder if the gpt-4-turbo bot is ‘gpt-4-turbo-preview’ or ‘gpt-4-vision-preview’
I.e. does it have vision?
If it’s ‘gpt-4-turbo-preview’ will vision be also added? I think it would be useful since since e.g. a screenshot can help the bot to understand a user’s query.
I don’t know the answer, and yet Vision is same as 4, it has a lot of issues and can be problematic. But AFAIK there should be own model for Vision and Discourse doesn’t use its own naming style.
At the moment gpt-4-turbo == gpt-4-0125-preview aka no vision.
@keegan is exploring some vision integration here:
Particularly adding captions to images via vision apis (both open and closed models)
I do really like the “demo” power of … starting a PM with bot, uploading an image and asking about it, but need to think through flow cause we would need to switch in and out of models.
Maybe a concept of “auto tool” can work here, so if we see an upload we automatically route to tool and then back to the main model… will have a think.
Technically wiring this up is not too complex within our architecture.