I’m Avineet Kumar, a Staff ML Engineer (GenAI/LLM Systems) with 9+ years of experience building production-grade AI systems at scale. My expertise spans LLM orchestration, multi-agent architectures, RAG pipelines, NL2SQL engines, and cloud-native ML platforms on GCP, AWS, and Azure.
At Nagarro, I improved the accuracy of a Gemini 2.0 chatbot by 28% and reduced hallucinations by 25% through advanced prompt engineering and context-aware agents. I’ve also fine-tuned LLaMA models using LoRA, developed multi-agent systems with role-based access control (RBAC), and optimized Cloud Run and Vertex AI deployments for 400+ users.
I’m passionate about bridging applied AI research with scalable engineering, designing evaluation frameworks that produce measurable results, and mentoring teams to deliver high-impact GenAI solutions. I’m now seeking a Staff- or Principal-level AI/ML role focused on GenAI infrastructure, LLM optimization, or intelligent agent systems, where I can drive architectural innovation and real-world AI adoption.