Building Multimodal AI Agents From Scratch — Apoorva Joshi, MongoDB

AI Engineer

In this hands-on workshop, you will build a multimodal AI agent capable of processing mixed-media content—from analyzing charts and diagrams to extracting insights from documents with embedded visuals. Using MongoDB as a vector database and memory store, and Google's Gemini for multimodal reasoning,