Hi, I am Anu

Anudeep Vanjavakam

Principal/Lead Data Analyst, Data Scientist at Moulton Niguel Water District

I’m a Principal Data Analyst with 10+ years of experience designing and scaling full-stack analytics solutions that drive business impact. My journey has taken me from crunching numbers to building modern data platforms, leading projects that integrate data, ML, and automation to solve high-stakes business problems. Over the years, I’ve

  • Built end-to-end ETL pipelines and automated reporting tools using SQL, Python, R, Power BI, dbt, ML, AI models, and cloud platforms (AWS, GCP).
  • Led analytics modernization for enterprise systems, saving thousands of hours, reducing vendor dependency, and enabling data-driven decision-making.
  • Translated messy, siloed datasets into high-value insights used by executives, finance teams, and operations leaders.
  • Been a key driver of award-winning budgeting, billing, water demand & energy forecasting systems at a major California utility.

Certified Data Scientist Nanodegree
AI Programming with Python Nanodegree
AWS Machine Learning Foundations
Leadership
Team Work
Data Mining and Data Wrangling

Skills

Experiences

Laguna Hills, CA, USA

MNWD delivers high-quality drinking water, recycled water and wastewater services to more than 170,000 customers in Orange County, CA and has been named a top workplace in Orange County for four years in a row!

Principal Data Analyst

Jun 2025 - Present

Responsibilities:
  • Led the $217M+ O&M and CIP budgeting analytics lifecycle, building executive-grade financial models, automating QA/QC tools that cut 3+ hours per iteration, and earning Board recognition for achieving the GFOA Triple Crown Award.
  • Built end-to-end ETL pipelines ingesting incremental energy usage & billing data (Utility API → AWS Postgres), powering $3M+ energy forecasting, peak/off-peak optimization, and org-wide BI tools.
  • Engineered an automated flat-file ETL framework for the customer portal upgrade (v8→v11, 180K+ users), delivering daily data syncs from ERP SQL Server and saving $65K in integration consulting.
  • Co-built and launched a Power BI Budget Tracker that integrates real-time ERP data to eliminate manual PDF monitoring and give 45+ departments self-service financial insights, enhancing accountability, saving hours of effort, and improving budget management efficiency.
  • Designed and deployed an interactive dashboard (R, SQL, Posit Connect) that automated the month-end GL allocation of unbilled receivables by pulling live data and calculating fund splits using dynamic parameters, eliminating 40+ hours of manual accounting effort per close cycle.
Senior Data Analyst

Aug 2020 - May 2025

Responsibilities:
  • Led short- and long-range financial planning with senior management, reducing manual processes by 40% and resulting in an AAA rating (Fitch, S&P) and a GFOA Award for the company
  • Built and deployed R Shiny apps that reduced customer bill and rebate inquiries by 22%
  • Strengthened financial reporting tools to monitor $100MM+ in investments and cash flow trends, saving almost $2.5M annually
  • Implemented a Python-based anomaly detection model that flags meters at risk of failure up to 3 months early, resulting in savings of $44K per month for customers
  • Deployed an R-Shiny tool to analyze user behavior and manage a customer acquisition and retention budget of $350K
  • Coordinated with IT to add database indices on SQL Server, cutting app load and query processing times by 36% and creating a better user experience for staff (internal reporting) and customers (website apps)
  • Influenced billing policies for 32 agencies in California by developing an R-Shiny app that generates correlation insights between rate change, surcharge, revenue, and customer bills
Data Analyst

Mar 2017 - Aug 2020

Responsibilities:
  • Improved the efficiency of annual financial reporting by 85% using an Excel model, achieving ‘Certificate of Achievement for Excellence in Financial Reporting’ GFOA award
  • Automated the rate-bill verification process, resulting in an 86% reduction in manual effort while maintaining 100% billing accuracy
  • Built a monitoring system that alerts on discrepancies between the enterprise billing system and the vendor portal, saving $68K
Data Science Intern

Aug 2016 - Mar 2017

Responsibilities:
  • Created ‘Water Use and Efficiency Charts’ to track critical metrics, customer efficiency and engagement trends, and funds spent on programs to aid data-driven policy decisions

Tempe, AZ, US

Contributed to the performance testing of machine learning algorithms across platforms such as Microsoft Azure ML, IBM SPSS, and R to determine the trade-offs of each platform for multiclass problems

Research Associate - Data Analytics

Jun 2015 - Jan 2016

Wipro Ltd.

Dec 2013 - May 2015

Hyderabad, India

Wipro is a global IT, consulting, and business services leader, leveraging cognitive computing, automation, cloud, analytics, and emerging tech for client success

Project Engineer

Dec 2013 - May 2015

Responsibilities:
  • Supported an energy and utilities project for the largest water and gas distribution company in the UK
  • Mastered data migration techniques to handle millions of business-related data records, enabling smooth transition of data across multiple platforms using SAP ABAP, LSMW, EMIGALL, and MS-Excel

Education

MS-BA (Master's in Business Analytics)
CGPA: 3.97 out of 4
Taken Courses:
  • Intro to Enterprise Analytics
  • Data Mining I
  • Intro to Applied Analytics
  • Data Driven Quality Management
  • Analytic Decision Making I
  • Data Mining II
  • Business Analytics Strategy
  • Applied Project (SCM 593)
  • Marketing Analytics
  • Analytic Decision Making II
  • Applied Project (CIS 593)
B.Tech. in Computer Science & Engineering
CGPA: 8.02 out of 10
Taken Courses:
  • Data Structures and Algorithms
  • Artificial Intelligence and Expert Systems
  • Database Management Systems
  • Parallel and Distributed Computing
  • Decision Support System
  • Information Storage and Management
  • Object Oriented Analysis and Design
  • Probability and Queuing Theory
  • Operating Systems
  • Design and Analysis of Algorithms
  • Software Engineering

Projects

Find best products and sentiment analysis on Reddit
Owner

Use this app to find out, based on Reddit, (1) the best products or services and (2) whether a product is worth buying.

Rate Comparison Tool
Co-developer

An R-Shiny tool to easily understand and compare the revenue, equity, and demand implications of different water rate structures. This is an open project of the California Data Collaborative, developed in partnership with the Moulton Niguel Water District and ARGO Labs.

Disaster Response Pipeline
Owner

This web app (with an embedded machine learning pipeline) lets an emergency worker input a new message and displays the categorized events so that the worker can send the message to an appropriate disaster relief agency

Recommender System for IBM
Owner

Analyzes the interactions that users have with articles on the IBM Watson Studio platform and recommends new articles to them

Video Games Ratings Analysis
Owner

Generated insights on what makes a video game successful based on 30K+ ratings extracted from the IGDB API

Publications

With more and better data, an individualized approach that ties rates to effects on the water system is possible, meaning utilities can capture each customer’s water‐use patterns and tailor rates accordingly

Accomplishments

Data Scientist Nanodegree
Udacity Feb 2023 - Jun 2023

Built effective machine learning models, ran ETL and NLP pipelines, designed experiments and analyzed A/B test results, built recommendation systems, and deployed solutions to the cloud with industry-aligned projects

AI Programming with Python Nanodegree
Udacity Sep 2020 - Dec 2020

Used Python to solve complex problems quickly; learnt the key tools for working with data in Python; learnt the foundational math needed for AI (vectors, calculus essentials, linear transformations, and matrices) and the linear algebra behind neural networks; learnt how to use PyTorch for deep learning; and finally built an image classifier and converted it into a Python application.

AWS Machine Learning Foundations
Udacity Aug 2021 - Oct 2021

Learnt how to prepare, build, train, and deploy high-quality machine learning (ML) models quickly with Amazon SageMaker, and learnt object-oriented programming best practices.

Machine Learning - Stanford University
Coursera (Course by Andrew Ng) Sep 2016 - Dec 2016

This course provides a broad introduction to machine learning, data mining, and statistical pattern recognition. Topics include: (i) Supervised learning (parametric/non-parametric algorithms, support vector machines, kernels, neural networks). (ii) Unsupervised learning (clustering, dimensionality reduction, recommender systems, deep learning). (iii) Best practices in machine learning (bias/variance theory; innovation process in machine learning and AI).

Learnt data visualization through Tableau 2022 and used its features to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks and to present data quickly and beautifully.

SQL (Advanced) Certificate
HackerRank Aug 2022

Cleared the advanced SQL assessment, which covers topics such as query optimization, data modeling, indexing, window functions, and pivots in SQL.