I am an AI Engineer specializing in Generative AI systems, multi-agent architectures, and scalable backend services. I design and deploy production-ready RAG pipelines and agentic workflows using LangChain, LangGraph, and Agno (formerly Phidata).
My experience covers the full AI development lifecycle, including model fine-tuning, backend API development with FastAPI, containerization with Docker, and cloud deployment on AWS. I have hands-on experience working with EC2, ECR, and Amazon Bedrock AgentCore to build scalable AI infrastructure.
I focus on building reliable, high-performance AI systems that integrate machine learning with modern backend engineering, enabling intelligent automation and real-world AI applications.
Programming Languages: Python, C/C++, SQL, PowerShell
Frameworks and Platforms: Agno, LangGraph, LangChain, FastAPI, Streamlit, Anaconda
AI/ML Expertise: Agentic AI Systems, Multi-Agent Architectures, LLM Fine-tuning, RAG Pipelines (ChromaDB), Knowledge Graphs, NLP
Data and ML/DL Tools: Pandas, NumPy, TensorFlow, PyTorch, Neural Networks
Databases and Cloud: MySQL, SQL Server, SQLite, Vector Databases (ChromaDB), Docker, AWS EC2
Gurgaon | Dec 2025 – Present
• Designed and deployed an end-to-end Generative AI chatbot system using LLMs and Retrieval-Augmented Generation (RAG) to deliver accurate, context-aware responses across diverse user queries.
• Architected a production-ready conversational AI pipeline using Llama 3.3 via the Groq API, implementing intelligent query classification that achieved 90%+ intent detection accuracy.
• Built a RAG pipeline on a ChromaDB vector database, enabling semantic search over FAQ documentation and product knowledge to generate grounded, hallucination-resistant responses.
• Developed a natural language to SQL engine that converts user questions into executable SQL queries, enabling real-time product search across 1,000+ inventory items stored in SQLite.
• Implemented a semantic query routing system using Sentence Transformers (all-MiniLM-L6-v2) to automatically direct queries to appropriate pipelines (FAQ retrieval, SQL generation, or conversational responses).
• Built an interactive LLM-powered web application using Streamlit, supporting streaming responses, session management, and responsive UI for real-time user interaction.
• Implemented secure database access controls that restrict operations to read-only SELECT queries, guarding against SQL injection and unauthorized data manipulation.
• Developed a data ingestion and processing pipeline by scraping e-commerce product data from Flipkart, performing cleaning, deduplication, and migration into SQLite for efficient querying.
• Optimized vector embeddings and semantic retrieval algorithms, achieving sub-second response latency for knowledge retrieval operations.
Impact: Delivered a scalable AI-powered customer query automation system improving response accuracy, reliability, and system security.
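To illustrate the read-only access control mentioned above, here is a minimal sketch of how a SELECT-only guard in front of SQLite could look. It is not the project's actual code: the function names `is_read_only_select` and `run_read_only` and the `products` table are hypothetical, and the string check is deliberately conservative (it rejects any query containing a semicolon, even inside a string literal). SQLite's `PRAGMA query_only` is used as a second line of defense.

```python
import re
import sqlite3


def is_read_only_select(sql: str) -> bool:
    """Return True only for a single statement that starts with SELECT."""
    # Drop surrounding whitespace and one optional trailing semicolon.
    statement = sql.strip().rstrip(";").strip()
    # Any remaining semicolon suggests stacked statements, so reject it.
    if not statement or ";" in statement:
        return False
    return re.match(r"select\b", statement, re.IGNORECASE) is not None


def run_read_only(conn: sqlite3.Connection, sql: str, params: tuple = ()) -> list:
    """Run a generated query only if it passes the SELECT-only check."""
    if not is_read_only_select(sql):
        raise ValueError("only single SELECT statements are allowed")
    # Ask SQLite itself to refuse writes on this connection as a backstop.
    conn.execute("PRAGMA query_only = ON")
    # Values are bound as parameters rather than formatted into the SQL text.
    return conn.execute(sql, params).fetchall()
```

A caller would pass the LLM-generated SQL through `run_read_only` and treat a `ValueError` as a signal to fall back to a conversational reply. A production system would likely pair this with a proper SQL parser and a database file opened in read-only mode.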
Remote | Sep 2025 – Nov 2025
• Built an AI-powered HRMS platform using Claude AI and FastMCP, automating 15 HR workflows including employee onboarding, leave management, meeting scheduling, and internal ticketing.
• Developed multi-step backend orchestration workflows integrating HRMS systems, SMTP notifications, equipment provisioning, and calendar scheduling with conflict detection.
• Implemented natural language search capabilities using fuzzy matching and hierarchical manager relationships for intelligent employee data retrieval.
• Designed a secure credential distribution system using Gmail SMTP with TLS encryption, ensuring safe onboarding communication and account setup.
Impact: Reduced employee onboarding processing time by ~95% through AI-powered workflow automation.
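The conflict detection used for calendar scheduling can be sketched with a simple interval-overlap check. This is an illustrative example under my own assumptions, not the platform's actual implementation: meetings are modeled as `(start, end)` tuples of `datetime` values, and the helper names `overlaps` and `find_conflicts` are hypothetical.

```python
from datetime import datetime


def overlaps(start_a: datetime, end_a: datetime,
             start_b: datetime, end_b: datetime) -> bool:
    """Two half-open intervals [start, end) overlap if each starts before the other ends."""
    return start_a < end_b and start_b < end_a


def find_conflicts(existing: list, new_start: datetime, new_end: datetime) -> list:
    """Return every existing (start, end) meeting that clashes with the proposed slot."""
    if new_start >= new_end:
        raise ValueError("meeting must end after it starts")
    return [
        meeting for meeting in existing
        if overlaps(meeting[0], meeting[1], new_start, new_end)
    ]
```

Treating intervals as half-open means back-to-back meetings (one ending at 11:00, the next starting at 11:00) are not reported as conflicts, which is usually the desired behavior for scheduling.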
Feel free to get in touch. I am always open to discussing new projects, creative ideas, or opportunities to be part of your vision.