AI Garage Jargons
Multimodal AI
AI systems that understand or generate across multiple data types like text, image, audio, and video.
Use Cases
- Image captioning
- Voice assistants
- Document + screenshot understanding
ELI5
AI that can work with more than just words, like pictures and sound too.
Why it matters
Multimodal workflows unlock richer product experiences and automation.
Back to glossary/multimodal-ai