Tuesday, 14 May 2024

GPT-4o released with improved text, audio and vision capabilities

GPT-4o (“o” for “omni”) is OpenAI’s latest multimodal large language model (LLM). It brings major advancements in text, voice, and image content generation, aiming for more natural interaction between users and AI.

OpenAI claims its new AI model can respond to audio inputs in as little as 232 milliseconds and that it responds significantly faster to text prompts in non-English languages, with support for over 50 languages. You can also interrupt the model with new questions or clarifications while it is talking.

GPT-4o also features a more capable, human-sounding voice assistant that responds...


http://dlvr.it/T6s6mX

