Jersey City, NJ

Mukul Desai

I'm aData Engineer

Data Engineer with 2+ years of experience in ETL/ELT pipeline design, cloud data architecture, and AI-powered system development across healthcare and financial services, delivering reliable data infrastructure that processes millions of records and drives faster operational decisions.

ML for Business ImpactAnalytics EngineeringReal-time Insights
Mukul Desai
Available for work
NJ
Jersey City, NJ

Work Experience

A concise, role-focused summary of experience across enterprise data engineering, healthcare analytics, and AI systems.

Enterprise Data Engineering

Johnson & Johnson

1 role

Data Engineer

Completed
Nov 2025 – PresentRaritan, NJ
  • Developed event-driven monitoring and validation pipelines across 6+ enterprise applications, processing 200+ files and 100K+ records daily with automated schema and anomaly checks.
  • Standardized release workflows by engineering Jenkins CI/CD pipelines with shared libraries across multiple data and integration repositories, reducing average deployment time by 65%.
  • Engineered cross-system identity data reconciliation logic across ServiceNow, IAM, and Salesforce, automating access provisioning workflows for hundreds of users and cutting cycle time by 60% across 3 teams.
PythonJenkinsServiceNowSalesforceIAMData Validation
Healthcare AI & Analytics

TripForCure Inc.

2 roles

AI Analytics Engineer

Completed
Jun 2025 – Oct 2025Plainsboro, NJ
  • Designed and orchestrated 8+ Airflow DAGs running Python and FastAPI data workflows for hospital recommendation systems, cutting manual data preparation time by 70%.
  • Built ingestion and embedding pipelines indexing 2K+ healthcare document chunks into a ChromaDB vector store, powering a LangChain and GPT-4 RAG system serving 500+ clinical stakeholders.
  • Migrated data infrastructure to AWS using a zero-downtime cutover while improving reliability and HIPAA-aligned compliance posture.
LangChainOpenAI GPT-4ChromaDBAWSAirflowFastAPI

Analytics Intern

Completed
Sep 2024 – May 2025Plainsboro, NJ
  • Productionized Python and SQL ETL pipelines consolidating healthcare records from 31 hospital locations into Snowflake, reducing manual consolidation effort by 80%.
  • Implemented dbt transformations and quality tests across staging and mart layers, maintaining 99.9% data quality across 7 production models.
  • Created Power BI dashboards tracking readmission rates, length of stay, and patient throughput metrics for 500+ stakeholders.
SnowflakedbtPower BISQLPythonETL
Business Intelligence

Larsen & Toubro Technology Services

1 role

Data Analyst Intern

Completed
Aug 2021 – Sep 2021Mumbai, India
  • Delivered Tableau dashboards and SQL workflows tracking project performance and financial metrics across 5+ engineering units, reducing manual reporting effort by 40%.
TableauSQLReportingAnalytics

Technical Skills

Expertise spanning data engineering, AI/ML, cloud platforms, and analytics

Data Engineering

Apache Airflow90%
dbt88%
Apache Kafka85%
Apache Flink80%
ETL/ELT Pipelines92%
Snowflake88%
Databricks82%
Apache Spark85%

AI & Machine Learning

LangChain85%
OpenAI GPT88%
ChromaDB80%
RAG Pipelines82%
Scikit-learn90%
FastAPI85%

Cloud & DevOps

AWS85%
Docker80%
Jenkins78%

Analytics & BI

Power BI90%
Tableau88%
Plotly82%
D3.js75%

Languages

Python95%
SQL92%
R80%
JavaScript78%

Databases

PostgreSQL88%
MongoDB80%

Featured Projects

Selected work across analytics engineering, AI systems, finance, healthcare, and business intelligence.

Healthcare Data Reliability Platform
Healthcare
Analytics EngineeringCompleted

Healthcare Data Reliability Platform

Apr 2026

End-to-end healthcare data platform with trusted analytics models, automated quality checks, pipeline monitoring, and an interactive dashboard for operational insights.

dbtAirflowSnowflakeStreamlitSynthea
ZeroDay – Agentic AI Developer Productivity Assistant
AI Platform
Multi-Agent AICompletedLive Demo

ZeroDay – Agentic AI Developer Productivity Assistant

Jun – Jul 2025

Agentic AI developer assistant with four collaborating agents for code search, task recommendations, learning guidance, and real-time support.

LangChainOpenAIChromaDBFastAPINext.js
QuantFlow – AI-Augmented DCF Valuation Platform
Finance
Financial AnalyticsCompletedVideo Demo

QuantFlow – AI-Augmented DCF Valuation Platform

May – Jun 2025

AI-augmented DCF valuation workflow with financial data ingestion, scenario modeling, dashboards, and investor-ready insights.

PythonFastAPIReactPostgreSQLPlotlyOpenAI
InterviewGPT – Generative Preparation Trainer
AI Platform
Applied AICompletedLive Demo

InterviewGPT – Generative Preparation Trainer

Apr 2025

Interview preparation platform with GPT-4o and Gemini-powered simulations, resume insights, job exploration, and progress tracking.

Next.jsFirebaseGPT-4oGemini 2.0 FlashTypeScript
Real-Time Financial Risk & Fraud Detection
Finance
Streaming MLCompletedVideo Demo

Real-Time Financial Risk & Fraud Detection

Feb – Mar 2025

Real-time risk and fraud analytics platform using event streaming, anomaly detection, and Power BI dashboards for continuous monitoring.

KafkaFlinkPythonScikit-learnPyODPower BI
Automated Financial Data Pipeline & Risk Analytics
Finance
Analytics EngineeringCompletedLive Demo

Automated Financial Data Pipeline & Risk Analytics

Jan – Feb 2025

Automated ETL, anomaly detection, and portfolio risk analytics pipeline with Airflow, dbt, and dashboard-driven investment insights.

PythonAirflowdbtPostgreSQLPlotly
Marketing & Social Media Content Tool
Marketing
Applied AICompletedVideo Demo

Marketing & Social Media Content Tool

Nov – Dec 2024

AI-assisted content generation tool for marketing workflows, combining prompt engineering, content support, and analytics-oriented outputs.

DjangoOpenAI APIChromaDBFine-TuningPython
IPL 2023 vs 2024 Tableau Analysis
Sports
Sports AnalyticsCompletedVideo Demo

IPL 2023 vs 2024 Tableau Analysis

Aug – Sep 2024

Tableau-based cricket analytics study comparing IPL 2023 and 2024 with trend analysis, run-rate views, and predictive insights.

PythonPandasTableauSQL
UAE Vehicle Market Analysis
Market Analysis
Market AnalyticsCompleted

UAE Vehicle Market Analysis

Apr 2024

Market segmentation and pricing analysis of 150,000+ UAE vehicle listings, combining clustering with interactive Power BI dashboards to surface pricing dynamics and regional patterns.

Power BIClusteringData PreprocessingMarket Segmentation

Education & Certifications

My academic foundation and professional certifications in data science and engineering

Education

Northeastern University

Master's degree, Information Systems

Aug 2023 - May 2025GPA:3.38
Key Coursework:
Data Management and Database Design
Data Science and Engineering Methods
Prompt Engineering and AI
Big Data Architecture & Governance
Application Engineering Development
Business Analysis and Information Engineering
Advances in Data Sciences and Architecture
Technical Skills:
Python
SQL
Machine Learning (Scikit-learn, TensorFlow)
LangChain & OpenAI GPT
Data Engineering (Airflow, Kafka, dbt)
Cloud (AWS, Snowflake)
PostgreSQL & MongoDB
Power BI & Tableau

Vivekanand Education Society's Institute Of Technology

Bachelor of Engineering - BE, Electronic and Telecommunication

Aug 2019 - May 2023GPA:3.29
Key Coursework:
Image Processing and Machine Vision
Artificial Neural Networks and Fuzzy Logics
Cloud Computing
Augmented and Virtual Reality
Computer Communications Network
Internet Communication Engineering
Technical Skills:
Python
R
Excel (Advanced)
SQL Basics
Data Visualization (Tableau)
Machine Learning Foundations
Linux
Activities:
Institute of Electrical and Electronics Engineering

Professional Certifications

IBM Data Engineering Specialization logo

IBM Data Engineering Specialization

IBM

Dec 2024

Google Cloud Data Analytics Specialization logo

Google Cloud Data Analytics Specialization

Google Cloud Skills Boost

Nov 2024

IBM Data Science Specialization logo

IBM Data Science Specialization

IBM

Oct 2024

Career Essentials in Data Analysis by Microsoft and LinkedIn logo

Career Essentials in Data Analysis by Microsoft and LinkedIn

Microsoft

Aug 2024

Get In Touch

Ready to discuss your next data project? Let's connect and explore how we can work together.

Contact Information

Location

Jersey City, NJ

Connect With Me

Send a Message

Ready to Collaborate?

I'm always excited to work on innovative data projects and help organizations unlock the power of their data. Let's build something amazing together!