Multimodal AI Models: Their Impact and What They Will Change

Aleksandar Basara
Published in Fragment Studio
3 min read · Dec 27, 2023

Photo by Milad Fakurian on Unsplash

The emergence of multimodal AI models is a landmark in the evolution of artificial intelligence. These models, which process and integrate diverse data types such as text, images, audio, and video, are redefining the boundaries of human-computer interaction. Led by systems such as OpenAI’s GPT-4 and Google’s Gemini, these models are set to revolutionize various sectors with…
