Ronak Kataria
Transforming Ideas into Intelligent Reality
AI Architect crafting production-grade AI solutions with 8+ years of expertise in Agentic AI, LLMOps, Voice AI, and MLOps. Architecting scalable systems across AWS, Azure, GCP, Databricks, and Snowflake, delivering transformative outcomes through innovation and technical excellence.
About Me
Engineering intelligent systems at the intersection of AI, Voice AI, and cloud architecture

My Journey
A Senior AI Engineering leader with 8+ years of experience building enterprise-grade AI/ML, Generative AI, and LLMOps platforms across AWS and Azure. Currently engineering cutting-edge AI/ML solutions at HARMAN International, with deep expertise in architecting real-time conversational AI and voice agent systems (STT/LLM/TTS pipelines), scalable RAG applications, and multi-agent agentic workflows.
Hands-on with LLM fine-tuning (LoRA, PEFT), fraud detection and anomaly detection models, ML-powered trading and forecasting systems, and low-latency streaming architectures. I've led 30+ AI transformation initiatives across automotive and banking verticals, delivered solutions for clients like HDFC Bank, Yes Bank, and Capricorn Group, and driven 40% productivity improvements and 37% revenue growth.
Passionate about pushing the boundaries of what's possible with Generative AI, Voice AI, LLMs, and autonomous agent systems — turning ideas into intelligent reality.
The Complete Stack
An interconnected ecosystem of AI, Cloud, Data, and DevOps technologies working in harmony
AI/ML Core
- Agentic AI & Multi-Agent Systems
- LLM Fine-tuning & RAG
- Computer Vision & NLP
- Neural Networks & Deep Learning
Multi-Cloud
- Multi-Cloud Architecture Design
- Serverless & Microservices
- Cloud Cost Optimization
- Security & Compliance
Data Engineering
- Data Lakehouse Architecture
- Real-time Data Pipelines
- ETL/ELT Workflows
- Data Governance & Quality
DevOps & MLOps
- CI/CD Pipeline Automation
- Container Orchestration
- Infrastructure as Code
- Model Deployment & Monitoring
Voice & Conversational AI
- OpenAI Whisper (STT) & TTS-1
- WebRTC Real-Time Streaming
- Twilio & SIP/VoIP Integration
- Context Retention & Interruption Handling
Multi-Cloud Mastery
AWS
Azure
GCP
Experience & Achievements
8+ years of driving innovation across AI, ML, and cloud engineering
Senior AI Architect
- Leading AI architecture and strategy for next-generation intelligent systems
- Designing scalable Agentic AI and LLM-powered solutions for enterprise products
- Driving AI innovation across HARMAN's connected technologies portfolio
Senior Technical Lead – AI Engineering
- Led Automotive AI/GenAI initiatives — delivering 30+ Agentic AI interventions across automotive verticals including ADAS, in-cabin intelligence, and connected vehicle platforms
- Led, architected and developed an Agentic AI Enterprise Platform with multi-agent framework for the automotive Software Defined Vehicle (SDV) lifecycle — orchestrating autonomous AI workflows across development, testing, validation, and deployment stages
- Pioneered Edge AI for in-car LLM use cases — fine-tuning, quantizing (GPTQ, AWQ, GGUF), and deploying LLMs on Qualcomm SoCs for on-device automotive AI with low-latency inference
- Architected enterprise-scale RAG-based AI productivity tools using LangChain and LLMs (GPT-4, LLaMA, Qwen, Claude), improving employee productivity by 40%
- Designed LLMOps accelerator platform for orchestration, fine-tuning (LoRA, PEFT), deployment, and lifecycle management of enterprise LLM applications
- Architected real-time conversational AI prototypes with STT/LLM/TTS pipelines for voice-enabled in-car assistants — context retention, interruption handling, and fallback logic
- Led and mentored 12-member engineering team; worked with multi-agent architectures, embedding models (Nomic, OpenAI, HF), and vector databases (Pinecone, Milvus, FAISS)
Data Scientist - MLOps Solutions Architect
- Built multi-cloud MLOps accelerator supporting AWS, Azure, GCP, and Databricks as part of the MLOps CoE
- Architected scalable LLMOps platform for RAG pipelines and LangChain workflows across GPT, LLaMA 2, T5, and Stable Diffusion
- HDFC Bank: Designed customer churn prediction model using XGBoost, Random Forest with time series behavioral analysis
- Yes Bank: Developed credit card fraud detection system using Isolation Forest, XGBoost with real-time transaction pattern recognition
- Capricorn Group (South Africa): Built income classification ML model on Azure for automated credit decisioning
- Built NLP Document AI solution leveraging sentiment analysis, topic modeling, keyword extraction, and clustering
- Earned company-wide recognition for most viewed blog in Q4 2023 on LLM Operationalization
Sr. Software Engineer (ML & Conversational AI)
- Architected highly secure, serverless CRM application for UK Fortune 500 clients, integrating 20+ AWS services
- Led design and delivery of ML-powered Conversational AI — NLP voice/text agents, intelligent task dispatcher, real-time sentiment analysis
- Built self-learning task dispatcher that autonomously routes, prioritizes, and escalates tasks — driving 37% product revenue growth in Q4 2021
- Developed audio language conversion and real-time closed-captioning tool for live-stream healthcare video
- Ensured scalability, reliability, and data privacy compliance across AI features serving regulated industries (healthcare, finance)
Cloud & Machine Learning Engineer
- Developed end-to-end CI/CD pipelines and ML products for real-time object detection, tracking, and NLP-based keyword spotting on AWS
- Built OTA Portal and Data Science Dashboard with full-stack development (React, Python, Dash-Plotly) and DevOps integration on AWS
- Emerging Star Award (Jan'20) and two Pat On The Back Awards (Dec'19, Sept'20) for technical innovation
Machine Learning Engineer
- Designed and optimized custom neural networks using TensorFlow, Keras, and TFLite for edge deployment on low-powered devices
- Developed Alexa skills and voice-enabled AI applications
- Managed CI/CD pipelines and AWS infrastructure using CloudFormation and Terraform
Awards & Achievements
Recognized for innovation, excellence, and impactful contributions across organizations
Client Champion Award
Recognized for driving client success and cross-functional AI delivery excellence across enterprise engagements.
Leadership Award
Awarded for exceptional leadership in mentoring a 12-member engineering team and delivering 30+ Agentic AI interventions.
Most Viewed Blog - Q4 2023
Company-wide recognition for the most viewed technical blog on LLM Operationalization using Snowflake and AWS SageMaker.
Pat On The Back Award
Recognized for innovative thinking in LLMOps and Generative AI solutions.
Emerging Star Award
Recognized for exceptional performance and innovative contributions to ML and cloud engineering projects.
Pat On The Back Award
Second recognition for consistent high performance and team collaboration.
Pat On The Back Award
First recognition for outstanding dedication and technical excellence in delivering critical projects.
Certified Expertise
Industry-recognized certifications across cloud platforms, AI/ML, and deep learning
Cloud & AI Certifications
AWS Certified Solutions Architect
Associate
Amazon Web Services
AWS Certified Machine Learning
Specialty
Amazon Web Services
AWS Certified Cloud Practitioner
Foundational
Amazon Web Services
Microsoft Certified Azure AI
Fundamentals (AI-900)
Microsoft
Microsoft Certified Azure
Fundamentals (AZ-900)
Microsoft
NanoDegrees & Specializations
Machine Learning NanoDegree
Udacity (Kaggle)
Kaggle Partnership
Deep Learning NanoDegree
Udacity (Facebook AI Labs & AWS)
Facebook AI Labs + AWS
Generative Adversarial Networks (GANs)
deeplearning.ai
Academic Foundation
Strong engineering roots complemented by continuous learning and certifications
Bachelor of Technology
Electronics & Communications Engineering
Languages
Multilingual Professional
Featured Projects
Agentic AI, LLMs & Multi-Cloud solutions pushing technological boundaries
Voice-Enabled Real-Time Interactive RAG Application
Voice-enabled real-time conversational AI using RAG with LangChain, OpenAI Whisper (STT), TTS-1 and GPT-4. Low-latency streaming pipeline using WebRTC, AWS Lambda for parallel processing, and ElastiCache for caching — achieving sub-second response latency for voice-to-voice conversations.
Agentic AI Enterprise Platform for Automotive SDV
Led, architected and developed an enterprise Agentic AI platform with multi-agent framework for the automotive Software Defined Vehicle (SDV) lifecycle. Orchestrates autonomous AI workflows across development, testing, validation, and deployment stages — enabling OEMs to accelerate SDV delivery with intelligent automation.
Enterprise LLMOps Accelerator Platform
Enterprise-grade LLMOps platform for automotive AI — orchestration, fine-tuning (LoRA, PEFT), quantization (GPTQ, AWQ, GGUF), and deployment of LLMs. Supports GPT, LLaMA, Qwen, Claude with multi-agent architectures and Edge AI deployment on Qualcomm SoCs for in-car LLM use cases.
Automotive Edge AI — In-Car LLM Deployment
Fine-tuned and quantized LLMs (GPTQ, AWQ, GGUF) for on-device deployment on Qualcomm SoCs powering in-car AI use cases — ADAS copilots, in-cabin voice assistants, and real-time contextual recommendations with ultra-low-latency inference on resource-constrained automotive hardware.
Credit Card Fraud Detection (Yes Bank)
Real-time credit card fraud detection pipeline using Isolation Forest and XGBoost for anomaly detection on transaction streams. Feature engineering on transaction velocity, geolocation, and behavioral patterns. MLOps on Kubernetes + Kubeflow with A/B testing and drift monitoring.
Customer Churn Prediction (HDFC Bank)
Customer churn prediction model using ensemble ML techniques (XGBoost, Random Forest, Logistic Regression) and time series behavioral analysis across retail banking segments. MLOps pipelines on AWS with Snowflake and SageMaker for automated retraining.
Income Classification (Capricorn Group, South Africa)
Income classification ML model on Azure for automated customer segmentation and credit risk assessment. End-to-end MLOps using Azure ML Studio and MLflow for experiment tracking, model versioning, automated retraining, and production deployment.
Multi-Cloud MLOps Accelerator
Multi-cloud MLOps accelerator supporting AWS, Azure, GCP, and Databricks. Standardized model training, deployment, and monitoring pipelines. Featured in company-wide most viewed blog on LLM Operationalization using Snowflake and AWS SageMaker.
Scalable CRM with Conversational AI
Highly secure serverless CRM for UK Fortune 500 clients. ML-powered Conversational AI with NLP voice/text agents, self-learning task dispatcher, real-time sentiment analysis. Audio language conversion and closed-captioning for healthcare video streams.
IoT Device Management & Data Science Dashboard
Full-stack data science dashboard for IoT device management and big-data analytics on AWS (S3, ECS, Glue, Athena) with Apache Airflow, PySpark, Dash-Plotly, and ReactJS. Real-time device monitoring for HVAC and Water Heater IoT products.
Home-Office Automation & Surveillance
Deep learning CV models (TensorFlow, TFLite) for edge deployment on NVIDIA Jetson Nano. Azure-based REST APIs with CI/CD pipelines and IoT cloud architecture using IoT Hub, CosmosDB, and ML Studio for intelligent home automation.
Deep Learning GAN for Face Generation
Generator-Discriminator based Deep Convolutional GAN (DCGAN) architecture for generating realistic human faces. Trained on CelebA dataset with 200k+ celebrity images using PyTorch.
Case Studies
Deep dives into transformative AI/ML projects with measurable business impact
Automotive AI/GenAI, Edge AI & Agentic SDV Platform
Challenge
KPIT's automotive OEM clients needed AI-driven solutions across ADAS, in-cabin intelligence, connected vehicle platforms, and the full Software Defined Vehicle (SDV) lifecycle — including on-device LLM inference on resource-constrained automotive SoCs and autonomous multi-agent AI workflows.
Solution
- Led, architected and developed an Agentic AI Enterprise Platform with multi-agent framework for the automotive SDV lifecycle — orchestrating autonomous AI workflows across development, testing, validation, and deployment stages
- Led 30+ Agentic AI interventions across automotive verticals — ADAS, in-cabin intelligence, connected vehicle platforms, and autonomous workflow automation
- Pioneered Edge AI for in-car LLM use cases — fine-tuning, quantizing (GPTQ, AWQ, GGUF), and deploying LLMs on Qualcomm SoCs for low-latency on-device inference
- Architected enterprise-scale RAG-based AI productivity tools using LangChain and LLMs (GPT-4, LLaMA, Qwen, Claude)
- Designed LLMOps accelerator platform for orchestration, fine-tuning (LoRA, PEFT), deployment, and lifecycle management
- Architected real-time conversational AI for voice-enabled in-car assistants with STT/LLM/TTS pipelines, context retention, and interruption handling
- Worked with multi-agent architectures, embedding models (Nomic, OpenAI, HF), and vector databases (Pinecone, Milvus, FAISS)
Results & Impact
Technologies Used
Credit Card Fraud Detection System
Challenge
Yes Bank required a real-time fraud detection system capable of identifying anomalous credit card transactions with high accuracy while minimizing false positives.
Solution
- Designed real-time fraud detection pipeline using Isolation Forest and XGBoost for anomaly detection on transaction streams
- Engineered features on transaction velocity, geolocation, and behavioral patterns for high-accuracy detection
- Built end-to-end MLOps infrastructure on bank's on-premise data center using Kubernetes and Kubeflow
- Implemented automated model training, serving, A/B testing, and drift monitoring pipelines
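As a rough illustration of the transaction-velocity feature mentioned above, here is a minimal Python sketch. It is not the production pipeline: the field names (`card_id`, `ts`), the 10-minute window, and the function name `velocity_features` are assumptions made only for this example.

```python
from collections import defaultdict, deque
from datetime import datetime, timedelta


def velocity_features(transactions, window=timedelta(minutes=10)):
    """Attach a 'velocity' count to each transaction.

    'velocity' is the number of transactions seen on the same card within
    the trailing window, including the current one. Field names and the
    window size are illustrative assumptions.
    """
    recent = defaultdict(deque)  # card_id -> timestamps still inside the window
    enriched = []
    for txn in sorted(transactions, key=lambda t: t["ts"]):
        stamps = recent[txn["card_id"]]
        # Drop timestamps that have fallen out of the trailing window.
        while stamps and txn["ts"] - stamps[0] > window:
            stamps.popleft()
        stamps.append(txn["ts"])
        enriched.append({**txn, "velocity": len(stamps)})
    return enriched
```

A feature like this would typically be one input among several (alongside geolocation and behavioral signals) to the downstream anomaly detection model.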
Results & Impact
Technologies Used
Customer Churn Prediction Platform
Challenge
HDFC Bank needed proactive retention strategies by predicting customer churn across retail banking segments using behavioral and transactional data.
Solution
- Designed customer churn prediction model using ensemble ML techniques (XGBoost, Random Forest, Logistic Regression)
- Applied time series behavioral analysis across retail banking segments for proactive retention strategies
- Built end-to-end MLOps pipelines on AWS with Snowflake data warehousing
- Implemented automated retraining triggers based on model performance degradation
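A retraining trigger based on performance degradation can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the deployed logic: the function name `should_retrain`, the metric being compared, the 0.05 tolerance, and the minimum sample count are all hypothetical.

```python
from statistics import mean


def should_retrain(recent_scores, baseline, tolerance=0.05, min_samples=3):
    """Return True when recent model quality has degraded past a tolerance.

    recent_scores: latest evaluation scores (for example AUC per batch).
    baseline: the score recorded when the current model was deployed.
    tolerance and min_samples are illustrative defaults, not tuned values.
    """
    if len(recent_scores) < min_samples:
        # Too little evidence to decide; avoid retraining on noise.
        return False
    return mean(recent_scores) < baseline - tolerance
```

In practice a check like this would run on a schedule against monitored metrics and start the retraining pipeline when it returns True.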
Results & Impact
Technologies Used
Multi-Cloud MLOps Accelerator
Challenge
Enterprise clients required a unified MLOps platform that could seamlessly operate across AWS, Azure, GCP, and Databricks without vendor lock-in.
Solution
- Architected multi-cloud MLOps accelerator with abstraction layer for cloud services as part of MLOps CoE
- Built scalable LLMOps platform for RAG and LangChain workflows across GPT, LLaMA 2, T5, and Stable Diffusion
- Built NLP Document AI solution leveraging sentiment analysis, topic modeling, keyword extraction, and clustering
- Built serverless inference endpoints on AWS SageMaker with auto-scaling, A/B testing, and model versioning
- Published most-viewed technical blog on LLM operationalization with Snowflake and AWS SageMaker
Results & Impact
Technologies Used
AI-Powered CRM with Conversational Intelligence
Challenge
UK Fortune 500 clients needed a highly secure, scalable CRM with AI-driven automation for regulated industries including healthcare and finance.
Solution
- Architected fully serverless CRM integrating 20+ AWS services with automated deployment via CloudFormation and CodePipeline
- Led design of ML-powered Conversational AI — NLP voice/text agents, intelligent task dispatcher, real-time sentiment analysis
- Built self-learning task dispatcher that autonomously routes, prioritizes, and escalates tasks
- Developed audio language conversion and real-time closed-captioning for healthcare live-stream video
- Ensured data privacy compliance across AI features serving enterprise clients in regulated industries
Results & Impact
Technologies Used
Want to discuss your project and explore how we can achieve similar results?
Get In Touch
Publications & Blogs
Sharing knowledge and insights on AI/ML, LLMOps, and cloud architecture
Embracing the Future with Generative AI: Operationalizing Large Language Models using Snowflake and AWS SageMaker
A comprehensive guide on operationalizing LLMs at scale using Snowflake's data platform and AWS SageMaker. Covers architecture patterns, best practices, and real-world implementation strategies for enterprise GenAI solutions.
More publications coming soon on Agentic AI, LLMOps, and Multi-Cloud Architecture
Need an AI Architect for
Agentic AI, LLMOps, or Voice Systems?
Book a 1:1 session to get expert guidance on enterprise AI architecture, RAG pipelines, multi-agent systems, or multi-cloud strategy.
1:1 Consultation
Personalized guidance on your AI/ML challenges
Flexible Scheduling
Book a time that works for you
Expert Insights
8+ years of AI/ML and cloud experience
Topics I can help with:
Work With Me
Whether you need an AI architecture review, a production RAG system, or ongoing strategic advisory — I bring 8+ years of hands-on experience to deliver results that matter.
AI Strategy & Architecture
End-to-end AI architecture design — from problem framing and model selection to production deployment and scaling across enterprise environments.
Agentic AI & Multi-Agent Systems
Design and build autonomous AI agent workflows — multi-agent orchestration, tool-use patterns, and enterprise automation platforms.
Voice AI & Conversational Systems
Real-time STT/LLM/TTS pipelines, WebRTC streaming, context retention, interruption handling, and voice-enabled enterprise applications.
LLMOps & MLOps Modernization
Production-grade ML lifecycle — fine-tuning (LoRA, PEFT), quantization, deployment, monitoring, and CI/CD for LLM applications.
RAG & Enterprise Search
Scalable RAG pipelines with vector databases (Pinecone, Milvus, FAISS), embedding strategies, and retrieval-augmented generation for enterprise knowledge.
AI Workshops & Team Enablement
Hands-on workshops for engineering teams on Agentic AI, LLMOps, RAG, and production AI — from fundamentals to advanced architecture patterns.
How to Work With Me
Flexible engagement models tailored to your needs — from a single consultation to long-term strategic partnerships.
Paid Consultation
One-on-one expert sessions for AI architecture reviews, technology assessments, or strategic advisory.
Book a Session
Project-Based Engagement
End-to-end delivery of AI projects — from architecture design to production deployment and handover.
Discuss a Project
Fractional AI Architect
Ongoing strategic partnership with monthly advisory, architecture oversight, and continuous optimization.
Explore Partnership
Save My Contact
Download my vCard, share my profile, or scan the QR code

Ronak Kataria
Senior AI Architect | GenAI, LLMOps & Voice AI
HARMAN International
Let's Collaborate
Ready to bring your AI vision to life? Let's discuss how we can work together.
Phone
+91-8758682083
Location
Pune, India
Available for Remote Work Globally
Open for Opportunities
AWS Certified Solutions Architect (Associate) with Azure certifications (AI-900, AZ-900) and 8+ years of experience in AI/ML, LLMOps, and multi-cloud solutions. Available for consulting, architecture design, Agentic AI development, and LLM fine-tuning projects.
Download CV