Fix OpenAI model definitions by restricting to `gpt-4o` models for multimodal support by mslinnea · Pull Request #18 · felixarntz/ai-services

mslinnea · 2024-11-17T01:52:11Z

I encountered an issue with the current model selection for the OpenAI service. It defaults to gpt-4-turbo-preview as the preferred multimodal model, but this model fails to generate image alt text, returning the error:

“Invalid content type. image_url is only supported by certain models.”

This error persists even though a valid base64-encoded image is sent in the request.

To resolve this, I’ve updated the model selection to use only gpt-4o models, which are compatible and successfully handle the image input for generating alt text.

I also excluded realtime and audio models. Realtime models are incompatiable with the v1/chat/completions endpoint (they require v1/completions endpoint). Audio models don't accept image inputs.

Let me know if this fits or if you need any more tweaks!

felixarntz

Thank you for the PR @mslinnea, this is a great catch!

Update Open AI models that support multimodal cap

188401b

mslinnea mentioned this pull request Nov 17, 2024

Add alt text generation mslinnea/ai-seo-tools#6

Merged

felixarntz added the bug Something isn't working label Nov 18, 2024

felixarntz approved these changes Nov 18, 2024

View reviewed changes

Comment thread includes/OpenAI/OpenAI_AI_Service.php Outdated

Adjust code formatting.

87a56ca

felixarntz changed the title ~~Fix OpenAI Model Support: Restrict to Compatible gpt-4o Variants for Alt Text~~ Fix OpenAI model definitions by restricting to gpt-4o models for multimodal support Nov 18, 2024

felixarntz reviewed Nov 18, 2024

View reviewed changes

Comment thread includes/OpenAI/OpenAI_AI_Service.php Outdated

Fix WPCS.

dc8a267

felixarntz merged commit 75f5d24 into felixarntz:main Nov 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix OpenAI model definitions by restricting to `gpt-4o` models for multimodal support#18

Fix OpenAI model definitions by restricting to `gpt-4o` models for multimodal support#18
felixarntz merged 3 commits intofelixarntz:mainfrom
mslinnea:openai-models-image-input

mslinnea commented Nov 17, 2024

Uh oh!

felixarntz left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mslinnea commented Nov 17, 2024

Uh oh!

felixarntz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants