I completed an integrated postgraduate degree (B.Tech + M.Tech) in Information Technology at IIIT Gwalior and recently submitted my Ph.D. thesis in the Department of Computer Science at IIT Jodhpur. My research lies at the intersection of Computer Vision, Multimodal AI, and Explainable Deep Learning, with applications in remote sensing image analysis.
- Vision + Language Models (e.g., LLaVA, BLIP, Kosmos, GPT, DeepSeek)
- Mathematical innovation behind LLMs
- Reinforcement Learning with LLMs
- Open-set Object Detection in Remote Sensing
- Generative AI for impactful real-world tasks
- Agentic AI & Retrieval-Augmented Generation (RAG) across domains
- Codebases for research papers (WACV, IGARSS, ECIR... and more coming soon)
- Experiments with Transformers, LLMs, Agentic AI, and Multimodal Learning
- Occasional fun side projects when inspiration strikes!
- saini.9@iitj.ac.in | nandinisaini021@gmail.com
- LinkedIn | Google Scholar
I love diving deep into AI models by day and exploring new cultures and cuisines by night.