I'm a research engineer working on AI-native video infrastructure.
I graduated summa cum laude from the
University of Maryland, College Park, where I earned B.S. degrees
in Computer Science and Mathematics. I am fortunate to be a part
of
Abhinav Shrivastava's Perception and Intelligence Lab.
I'm currently applying to computer vision PhD programs (Fall
2026). I would love to collaborate on image/video generation and
understanding projects, please
email me!
Previously, I interned at Amazon AWS EC2 Nitro,
building tools for detecting server issues across millions of
machines, and at Anello Photonics, working on
data compression pipelines and automated photonic gyroscopes
testing. I've also explored real-time ML applications in surgical
wearables and helped build autonomous drones.
My interests include computer vision, robotics, and deep learning.
Outside of research, I love reading sci-fi and fantasy novels
(favorites include Dune and The Way of Kings) or
training in Brazilian Jiu-Jitsu, Muay Thai, and MMA.
Publications
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal, Abhinav Shrivastava, Matthew Gwilliam
arXiv, 2025
We introduce ECAD, an evolutionary algorithm to automatically discover efficient caching schedules for
accelerating diffusion-based image generation models. ECAD achieves faster than state-of-the-art speed
and higher quality among training-free methods and generalizes across models and resolutions.
Projects
Coursework, side projects, and unpublished (failed?) research work.
We build a real-time face-blurring system that redacts faces in live and recorded video while keeping chosen identities visible. The modular pipeline combines YuNet/SCRFD detection, SORT tracking, SFace recognition, and configurable blur options for flexible speed–accuracy tradeoffs. On crowded IRL Twitch footage, it runs faster than real time on CPU and favors privacy by accepting a few extra blur boxes rather than missing a face.
Learning to Settle: Reinforcement Learning in Catan
Anirud Aggarwal, Jinhai Yan, Serena Huang, Rohit Kommuru, Monish Napa, Han Lin
unpublished , 2024
We build a custom PettingZoo environment and training stack for learning to play the board game Catan with reinforcement learning. Starting from a refactored Settlers-RL codebase, we explore both multi-agent methods (via MARLlib) and a single-agent PPO baseline with dense reward shaping. Our experiments show agents that learn to play shorter, higher-scoring games, while highlighting the remaining gap to robust multi-agent performance in non-stationary, multi-player settings.