Sagar Verma
Deltia AI | Micropilot. Cambridge, MA.
I am a Staff Computer Vision Engineer at Deltia AI, where I work on Vision-Language Models (VLMs) for manufacturing plant analysis. My work focuses on building multimodal AI systems that understand and reason about industrial environments from visual data.
Previously, I was the Chief Technology Officer and Co-Founder at Granular AI, where I led the development of GeoEngine, the world’s first Geospatial MLOps platform. Our go-to-market product, Inspect.Properties, served 900+ construction, insurance, and adjuster companies, providing automated roofing and property inspections, as well as claims documentation, using satellite, aerial, and drone imagery.
I also maintain Micropilot, a personal open-source initiative focused on learning-based control systems for prosthetics, exoskeletons, and anthropomorphic dexterous robots.
Background
I completed my Ph.D. in 2022 at CentraleSupélec, Université Paris-Saclay, under Dr. Jean-Christophe Pesquet, focusing on optimization methods for pruning neural networks for edge deployment. As part of my Ph.D., I worked on modeling the dynamics of heavy electrical motors using neural networks, guided by Dr. Marc Castella, Dr. Nicolas Henwood, and Dr. Al-Kassem Jebai. I was funded by ANRT CIFRE and Schneider Electric.
Before that, I was a Research Fellow at IIIT Delhi (2017-2018), where I worked with Dr. Chetan Arora on egocentric video understanding. I was also part of the Watson AI team at IBM Research (2017), developing NLQ systems for banking applications.
Current Research Interests
- Vision-Language Models
- Computer Vision for Manufacturing
- Reinforcement Learning
- Optimization for Learning-Based Control