Ronak
Kataria
Transforming Ideas into Intelligent Reality
AI Architect crafting production-grade AI solutions with 8+ years of expertise in Agentic AI, LLMOps, Voice AI, and MLOps. Architecting scalable systems across AWS, Azure, GCP, Databricks & Snowflake— delivering transformative outcomes through innovation and technical excellence.
About Me
Engineering intelligent systems at the intersection of AI, Voice AI, and cloud architecture

My Journey
A Senior AI Engineering leader with 8+ years of experience building enterprise-grade AI/ML, Generative AI, and LLMOps platforms across AWS and Azure. Currently engineering cutting-edge AI/ML solutions at HARMAN International, with deep expertise in architecting real-time conversational AI and voice agent systems (STT/LLM/TTS pipelines), scalable RAG applications, and multi-agent agentic workflows.
Hands-on with LLM fine-tuning (LoRA, PEFT), fraud detection and anomaly detection models, ML-powered trading and forecasting systems, and low-latency streaming architectures. I've led 30+ AI transformation initiatives across automotive and banking verticals, delivered solutions for clients like HDFC Bank, Yes Bank, and Capricorn Group, and driven 40% productivity improvements and 37% revenue growth.
Passionate about pushing the boundaries of what's possible with Generative AI, Voice AI, LLMs, and autonomous agent systems — turning ideas into intelligent reality.
What I Bring to the Table
AI & ML Engineering
Designing production-grade AI systems — from LLM fine-tuning (LoRA, PEFT) to multi-agent orchestration and fraud detection
Multi-Cloud Architecture
Building scalable solutions across AWS, Azure, GCP, Databricks, and Snowflake with deep platform expertise
Voice & Conversational AI
Real-time STT/LLM/TTS pipelines, WebRTC streaming, Whisper, context retention, and interruption handling
Agentic AI Systems
Creating autonomous AI agents and RAG pipelines with vector databases (Pinecone, Milvus, FAISS) for enterprise workflows
MLOps & LLMOps
End-to-end ML lifecycle — from experimentation to production with CI/CD, Kubeflow, MLflow, and monitoring
By The Numbers
Measurable impact driving innovation and business growth
Years of Experience
Building AI/ML solutions
AI Interventions Delivered
Across automotive verticals
ML Models Deployed
Production-grade models
Productivity Boost
With RAG-based tools
Cloud Platforms
Multi-cloud expertise
Users Impacted
Enterprise-scale reach
The Complete Stack
An interconnected ecosystem of AI, Cloud, Data, and DevOps technologies working in harmony
AI/ML Core
- Agentic AI & Multi-Agent Systems
- LLM Fine-tuning & RAG
- Computer Vision & NLP
- Neural Networks & Deep Learning
Multi-Cloud
- Multi-Cloud Architecture Design
- Serverless & Microservices
- Cloud Cost Optimization
- Security & Compliance
Data Engineering
- Data Lakehouse Architecture
- Real-time Data Pipelines
- ETL/ELT Workflows
- Data Governance & Quality
DevOps & MLOps
- CI/CD Pipeline Automation
- Container Orchestration
- Infrastructure as Code
- Model Deployment & Monitoring
AI/ML Core
- Agentic AI & Multi-Agent Systems
- LLM Fine-tuning & RAG
- Computer Vision & NLP
- Neural Networks & Deep Learning
Multi-Cloud
- Multi-Cloud Architecture Design
- Serverless & Microservices
- Cloud Cost Optimization
- Security & Compliance
Data Engineering
- Data Lakehouse Architecture
- Real-time Data Pipelines
- ETL/ELT Workflows
- Data Governance & Quality
DevOps & MLOps
- CI/CD Pipeline Automation
- Container Orchestration
- Infrastructure as Code
- Model Deployment & Monitoring
Voice & Conv. AI
- OpenAI Whisper (STT) & TTS-1
- WebRTC Real-Time Streaming
- Twilio & SIP/VoIP Integration
- Context Retention & Interruption Handling
Multi-Cloud Mastery
AWS
Azure
GCP
Experience & Achievements
8+ years of driving innovation across AI, ML, and cloud engineering
Senior AI Architect
- Leading AI architecture and strategy for next-generation intelligent systems
- Designing scalable Agentic AI and LLM-powered solutions for enterprise products
- Driving AI innovation across HARMAN's connected technologies portfolio
Senior Technical Lead – AI Engineering
- Led Automotive AI/GenAI initiatives — delivering 30+ Agentic AI interventions across automotive verticals including ADAS, in-cabin intelligence, and connected vehicle platforms
- Led, architected and developed an Agentic AI Enterprise Platform with multi-agent framework for the automotive Software Defined Vehicle (SDV) lifecycle — orchestrating autonomous AI workflows across development, testing, validation, and deployment stages
- Pioneered Edge AI for in-car LLM use cases — fine-tuning, quantizing (GPTQ, AWQ, GGUF), and deploying LLMs on Qualcomm SoCs for on-device automotive AI with low-latency inference
- Architected enterprise-scale RAG-based AI productivity tools using LangChain and LLMs (GPT-4, LLaMA, Qwen, Claude), improving employee productivity by 40%
- Designed LLMOps accelerator platform for orchestration, fine-tuning (LoRA, PEFT), deployment, and lifecycle management of enterprise LLM applications
- Architected real-time conversational AI prototypes with STT/LLM/TTS pipelines for voice-enabled in-car assistants — context retention, interruption handling, and fallback logic
- Led and mentored 12-member engineering team; worked with multi-agent architectures, embedding models (Nomic, OpenAI, HF), and vector databases (Pinecone, Milvus, FAISS)
Data Scientist - MLOps Solutions Architect
- Built multi-cloud MLOps accelerator supporting AWS, Azure, GCP, and Databricks as part of the MLOps CoE
- Architected scalable LLMOps platform for RAG pipelines and LangChain workflows across GPT, LLaMA 2, T5, and Stable Diffusion
- HDFC Bank: Designed customer churn prediction model using XGBoost, Random Forest with time series behavioral analysis
- Yes Bank: Developed credit card fraud detection system using Isolation Forest, XGBoost with real-time transaction pattern recognition
- Capricorn Group (South Africa): Built income classification ML model on Azure for automated credit decisioning
- Built NLP Document AI solution leveraging sentiment analysis, topic modeling, keyword extraction, and clustering
- Earned company-wide recognition for most viewed blog in Q4 2023 on LLM Operationalization
Sr. Software Engineer (ML & Conversational AI)
- Architected highly secure, serverless CRM application for UK Fortune 500 clients, integrating 20+ AWS services
- Led design and delivery of ML-powered Conversational AI — NLP voice/text agents, intelligent task dispatcher, real-time sentiment analysis
- Built self-learning task dispatcher that autonomously routes, prioritizes, and escalates tasks — driving 37% product revenue growth in Q4 2021
- Developed audio language conversion and real-time closed-captioning tool for live-stream healthcare video
- Ensured scalability, reliability, and data privacy compliance across AI features serving regulated industries (healthcare, finance)
Cloud & Machine Learning Engineer
- Developed end-to-end CI/CD pipelines and ML products for real-time object detection, tracking, and NLP-based keyword spotting on AWS
- Built OTA Portal and Data Science Dashboard with full-stack development (React, Python, Dash-Plotly) and DevOps integration on AWS
- Emerging Star Award (Jan'20) and two Pat On The Back Awards (Dec'19, Sept'20) for technical innovation
Machine Learning Engineer
- Designed and optimized custom neural networks using TensorFlow, Keras, and TFLite for edge deployment on low-powered devices
- Developed Alexa skill sets and voice-enabled AI applications
- Managed CI/CD pipelines and AWS infrastructure using CloudFormation and Terraform
Awards & Achievements
Recognized for innovation, excellence, and impactful contributions across organizations
Client Champion Award
Recognized for driving client success and cross-functional AI delivery excellence across enterprise engagements.
Leadership Award
Awarded for exceptional leadership in mentoring a 12-member engineering team and delivering 30+ Agentic AI interventions.
Most Viewed Blog - Q4 2023
Company-wide recognition for the most viewed technical blog on LLM Operationalization using Snowflake and AWS SageMaker.
Pat On The Back Award
Recognized for innovative thinking in LLMOps and Generative AI solutions.
Emerging Star Award
Recognized for exceptional performance and innovative contributions to ML and cloud engineering projects.
Pat On The Back Award
Second recognition for consistent high performance and team collaboration.
Pat On The Back Award
First recognition for outstanding dedication and technical excellence in delivering critical projects.
Certified Expertise
Industry-recognized certifications across cloud platforms, AI/ML, and deep learning
Cloud & AI Certifications
AWS Certified Solutions Architect
Associate
Amazon Web Services
AWS Certified Machine Learning
Specialty
Amazon Web Services
AWS Certified Cloud Practitioner
Foundational
Amazon Web Services
Microsoft Certified Azure AI
Fundamentals (AI-900)
Microsoft
Microsoft Certified Azure
Fundamentals (AZ-900)
Microsoft
NanoDegrees & Specializations
Machine Learning NanoDegree
Udacity (Kaggle)
Kaggle PartnershipDeep Learning NanoDegree
Udacity (Facebook AI Labs & AWS)
Facebook AI Labs + AWSGenerative Adversarial Networks (GANs)
deeplearning.ai
deeplearning.aiAcademic Foundation
Strong engineering roots complemented by continuous learning and certifications
Bachelor of Technology
Electronics & Communications Engineering
Languages
Multilingual Professional
Featured Projects
Agentic AI, LLMs & Multi-Cloud solutions pushing technological boundaries
Voice-Enabled Real-Time Interactive RAG Application
Voice-enabled real-time conversational AI using RAG with LangChain, OpenAI Whisper (STT), TTS-1 and GPT-4. Low-latency streaming pipeline using WebRTC, AWS Lambda for parallel processing, and ElastiCache for caching — achieving sub-second response latency for voice-to-voice conversations.
Agentic AI Enterprise Platform for Automotive SDV
Led, architected and developed an enterprise Agentic AI platform with multi-agent framework for the automotive Software Defined Vehicle (SDV) lifecycle. Orchestrates autonomous AI workflows across development, testing, validation, and deployment stages — enabling OEMs to accelerate SDV delivery with intelligent automation.
Enterprise LLMOps Accelerator Platform
Enterprise-grade LLMOps platform for automotive AI — orchestration, fine-tuning (LoRA, PEFT), quantization (GPTQ, AWQ, GGUF), and deployment of LLMs. Supports GPT, LLaMA, Qwen, Claude with multi-agent architectures and Edge AI deployment on Qualcomm SoCs for in-car LLM use cases.
Automotive Edge AI — In-Car LLM Deployment
Fine-tuned and quantized LLMs (GPTQ, AWQ, GGUF) for on-device deployment on Qualcomm SoCs powering in-car AI use cases — ADAS copilots, in-cabin voice assistants, and real-time contextual recommendations with ultra-low-latency inference on resource-constrained automotive hardware.
Credit Card Fraud Detection (Yes Bank)
Real-time credit card fraud detection pipeline using Isolation Forest and XGBoost for anomaly detection on transaction streams. Feature engineering on transaction velocity, geolocation, and behavioral patterns. MLOps on Kubernetes + Kubeflow with A/B testing and drift monitoring.
Customer Churn Prediction (HDFC Bank)
Customer churn prediction model using ensemble ML techniques (XGBoost, Random Forest, Logistic Regression) and time series behavioral analysis across retail banking segments. MLOps pipelines on AWS with Snowflake and SageMaker for automated retraining.
Income Classification (Capricorn Group, South Africa)
Income classification ML model on Azure for automated customer segmentation and credit risk assessment. End-to-end MLOps using Azure ML Studio and MLflow for experiment tracking, model versioning, automated retraining, and production deployment.
Multi-Cloud MLOps Accelerator
Multi-cloud MLOps accelerator supporting AWS, Azure, GCP, and Databricks. Standardized model training, deployment, and monitoring pipelines. Featured in company-wide most viewed blog on LLM Operationalization using Snowflake and AWS SageMaker.
Scalable CRM with Conversational AI
Highly secure serverless CRM for UK Fortune 500 clients. ML-powered Conversational AI with NLP voice/text agents, self-learning task dispatcher, real-time sentiment analysis. Audio language conversion and closed-captioning for healthcare video streams.
IoT Device Management & Data Science Dashboard
Full-stack data science dashboard for IoT device management and big-data analytics on AWS (S3, ECS, Glue, Athena) with Apache Airflow, PySpark, Dash-Plotly, and ReactJS. Real-time device monitoring for HVAC and Water Heater IoT products.
Home-Office Automation & Surveillance
Deep learning CV models (TensorFlow, TFLite) for edge deployment on NVIDIA Jetson Nano. Azure-based REST APIs with CI/CD pipelines and IoT cloud architecture using IoT Hub, CosmosDB, and ML Studio for intelligent home automation.
Case Studies
Deep dives into transformative AI/ML projects with measurable business impact
Automotive AI/GenAI, Edge AI & Agentic SDV Platform
Challenge
KPIT's automotive OEM clients needed AI-driven solutions across ADAS, in-cabin intelligence, connected vehicle platforms, and the full Software Defined Vehicle (SDV) lifecycle — including on-device LLM inference on resource-constrained automotive SoCs and autonomous multi-agent AI workflows.
Solution
- Led, architected and developed an Agentic AI Enterprise Platform with multi-agent framework for the automotive SDV lifecycle — orchestrating autonomous AI workflows across development, testing, validation, and deployment stages
- Led 30+ Agentic AI interventions across automotive verticals — ADAS, in-cabin intelligence, connected vehicle platforms, and autonomous workflow automation
- Pioneered Edge AI for in-car LLM use cases — fine-tuning, quantizing (GPTQ, AWQ, GGUF), and deploying LLMs on Qualcomm SoCs for low-latency on-device inference
- Architected enterprise-scale RAG-based AI productivity tools using LangChain and LLMs (GPT-4, LLaMA, Qwen, Claude)
- Designed LLMOps accelerator platform for orchestration, fine-tuning (LoRA, PEFT), deployment, and lifecycle management
- Architected real-time conversational AI for voice-enabled in-car assistants with STT/LLM/TTS pipelines, context retention, and interruption handling
- Worked with multi-agent architectures, embedding models (Nomic, OpenAI, HF), and vector databases (Pinecone, Milvus, FAISS)
Results & Impact
Technologies Used
Credit Card Fraud Detection System
Challenge
Yes Bank required a real-time fraud detection system capable of identifying anomalous credit card transactions with high accuracy while minimizing false positives.
Solution
- Designed real-time fraud detection pipeline using Isolation Forest and XGBoost for anomaly detection on transaction streams
- Engineered features on transaction velocity, geolocation, and behavioral patterns for high-accuracy detection
- Built end-to-end MLOps infrastructure on bank's on-premise data center using Kubernetes and Kubeflow
- Implemented automated model training, serving, A/B testing, and drift monitoring pipelines
Results & Impact
Technologies Used
Customer Churn Prediction Platform
Challenge
HDFC Bank needed proactive retention strategies by predicting customer churn across retail banking segments using behavioral and transactional data.
Solution
- Designed customer churn prediction model using ensemble ML techniques (XGBoost, Random Forest, Logistic Regression)
- Applied time series behavioral analysis across retail banking segments for proactive retention strategies
- Built end-to-end MLOps pipelines on AWS with Snowflake data warehousing
- Implemented automated retraining triggers based on model performance degradation
Results & Impact
Technologies Used
Multi-Cloud MLOps Accelerator
Challenge
Enterprise clients required a unified MLOps platform that could seamlessly operate across AWS, Azure, GCP, and Databricks without vendor lock-in.
Solution
- Architected multi-cloud MLOps accelerator with abstraction layer for cloud services as part of MLOps CoE
- Built scalable LLMOps platform for RAG and LangChain workflows across GPT, LLaMA 2, T5, and Stable Diffusion
- Built NLP Document AI solution leveraging sentiment analysis, topic modeling, keyword extraction, and clustering
- Built serverless inference endpoints on AWS SageMaker with auto-scaling, A/B testing, and model versioning
- Published most-viewed technical blog on LLM operationalization with Snowflake and AWS SageMaker
Results & Impact
Technologies Used
AI-Powered CRM with Conversational Intelligence
Challenge
UK Fortune 500 clients needed a highly secure, scalable CRM with AI-driven automation for regulated industries including healthcare and finance.
Solution
- Architected fully serverless CRM integrating 20+ AWS services with automated deployment via CloudFormation and CodePipeline
- Led design of ML-powered Conversational AI — NLP voice/text agents, intelligent task dispatcher, real-time sentiment analysis
- Built self-learning task dispatcher that autonomously routes, prioritizes, and escalates tasks
- Developed audio language conversion and real-time closed-captioning for healthcare live-stream video
- Ensured data privacy compliance across AI features serving enterprise clients in regulated industries
Results & Impact
Technologies Used
Want to discuss your project and explore how we can achieve similar results?
Get In TouchPublications & Blogs
Sharing knowledge and insights on AI/ML, LLMOps, and cloud architecture
Embracing the Future with Generative AI: Operationalizing Large Language Models using Snowflake and AWS SageMaker
A comprehensive guide on operationalizing LLMs at scale using Snowflake's data platform and AWS SageMaker. Covers architecture patterns, best practices, and real-world implementation strategies for enterprise GenAI solutions.
More publications coming soon on Agentic AI, LLMOps, and Multi-Cloud Architecture
Ready to Transform Your
AI Vision into Reality?
Book a 1:1 session to discuss your AI/ML projects, LLMOps strategy, or multi-cloud architecture. Let's build something extraordinary together.
1:1 Consultation
Personalized guidance on your AI/ML challenges
Flexible Scheduling
Book a time that works for you
Expert Insights
8+ years of AI/ML and cloud experience
Topics I can help with:
Save My Contact
Download my vCard, share my profile, or scan the QR code

Ronak Kataria
Senior AI Architect | GenAI, LLMOps & Voice AI
KPIT Technologies (CTO AI Office)
Add to your contacts • Share with colleagues • Scan QR code
Let's Collaborate
Ready to bring your AI vision to life? Let's discuss how we can work together.
Phone
+91-8758682083Location
Pune, India
Available for Remote Work Globally
Open for Opportunities
AWS & Azure Certified Solutions Architect with 8+ years of experience in AI/ML, LLMOps, and multi-cloud solutions. Available for consulting, architecture design, Agentic AI development, and LLM fine-tuning projects.
Download CV