Meta AI labs unveil ImageBind, a huge advance for AI. It allows artificial intelligence to simultaneously understand multiple sources: audio, image, text and even heat.
Generative artificial intelligence can quickly create text, with ChatGPT, or images, with Midjourney. It is sometimes possible, especially since GPT-4, to provide it with an image to understand, but the process of the AI will pass by a creation of text from this image to be able to make treatment. So we always come back to text as a method to communicate with the AI.
With ImageBind, Meta unveils a new method that could revolutionize artificial intelligence in its current form. The company wants to go much further and foresees a method that allows the AI to interpret up to 5 completely different sources simultaneously.
Approaching the human
Meta was inspired by this idea to develop ImageBind, a new artificial intelligence model that the firm wants to make open source. It is the first model capable of combining information from 6 different types of sources: text, image, audio, depth (3D), thermal (via infrared) and velocity.
0 Comments