Author Image

Hi, I am Anu

Anudeep Vanjavakam

Data Scientist at Moulton Niguel Water District

I am a passionate data scientist with over 7 years of experience in developing data products that enhance systems and processes. I provide impactful insights through advanced analytics and thrive on helping people make informed decisions.

Certified Data Scientist Nanodegree
AI Programming with Python Nanodegree
AWS Machine Learning Foundations
Leadership
Team Work
Data Mining and Data Wrangling

Skills

Experiences

1
Moulton Niguel Water District

Aug 2016 - Present, Laguna Hills, CA, USA

MNWD delivers high-quality drinking water, recycled water and wastewater services to more than 170,000 customers in Orange County, CA and has been named a top workplace in Orange County for four years in a row!

Senior Data Analyst

Aug 2020 - Present

  • Led short and long-range financial planning with senior management reducing 40% of manual processes resulting in AAA rating (Fitch, S&P) and GFOA Award for the company
  • Built and deployed R shiny apps leading to a reduction in customer bill and rebate inquiries by 22%
  • Strengthened financial reporting tools to monitor 100MM+ investments and cash flow trends saving almost $2.5M annually
  • Implemented a python-based anomaly detection model to detect (up to 3 months early) meters at risk of failure resulting in savings of $44K per month for customers
  • Deployed an R-Shiny tool to analyze user behavior and manage a customer acquisition and retention budget of $350K
  • Coordinated with IT to add database indices on SQL Server that resulted in a 36% reduction in app load times and query processing times creating a better user experience for staff (internal reporting) and customers (website apps)
  • Influenced billing policies for 32 agencies in California by developing an R-Shiny app that generates correlation insights between rate change, surcharge, revenue, and customer bills
Data Analyst

Mar 2017 - Aug 2020

  • Improved the efficiency of annual financial reporting by 85% using an Excel model, achieving ‘Certificate of Achievement for Excellence in Financial Reporting’ GFOA award
  • Automated rate-bill verification process resulting in 86% reduction in manual effort while maintaining a 100% billing accuracy
  • Built a monitoring system that alerts discrepancy between the enterprise billing system and vendor portal saving $68K
Data Science Intern

Aug 2016 - Mar 2017

  • Created ‘Water Use and Efficiency Charts’ to track critical metrics, customer efficiency and engagement trends, and funds spent on programs to aid data-driven policy decisions

Research Associate - Data Analytics
Arizona State University - W.P. Carey Dept. of Information Systems

Jun 2015 - Jan 2016, Tempe, AZ, US

Contributed to the performance testing of machine learning algorithms across platforms such as Microsoft Azure ML, IBM SPSS, R to determine the trade offs of each platform for multi class problems

2

3
Project Engineer
Wipro Ltd.

Dec 2013 - May 2015, Hyderabad, India

Wipro is a global IT, consulting, and business services leader, leveraging cognitive computing, automation, cloud, analytics, and emerging tech for client success

Responsibilities:
  • Supported energy and utilities project for the largest water and gas distribution company in UK
  • Mastered Data Migration techniques to handle millions of business-related data records, enabling smooth transition of data across multiple platforms – using SAP ABAP, LSMW, EMIGALL, MS-Excel

Education

MS-BA (Master's in Business Analytics)
CGPA: 3.97 out of 4
Taken Courses
  • Intro to Enterprise Analytics
  • Data Mining I
  • Intro to Applied Analytics
  • Data Driven Quality Management
  • Analytic Decision Making I
  • Data Mining II
  • Business Analytics Strategy
  • Applied Project (SCM 593)
  • Marketing Analytics
  • Analytic Decision Making II
  • Applied Project (CIS 593)
B.Tech. in Computer Science & Engineering
CGPA: 8.02 out of 10
Taken Courses
  • Data Structures and Algorithms
  • Artificial Intelligence and Expert Systems
  • Database Management Systems
  • Parallel and Distributed Computing
  • Decision Support System
  • Information Storage and Management
  • Object Oriented Analysis and Design
  • Probability and Queuing Theory
  • Operating Systems
  • Design and Analysis of Alogirthms
  • Software Engineering

Projects

Find best products and sentiment analysis on Reddit
Find best products and sentiment analysis on Reddit
Owner

Use this app to find out 1) the best products or services 2) whether a product is worth buying based on Reddit.

Rate Comparison Tool
Rate Comparison Tool
Co-developer

An R-Shiny tool to easily understand and compare the revenue, equity, and demand implications of different water rate structures. This is an open project of the California Data Collaborative, developed in partnership with the Moulton Niguel Water District and ARGO Labs.

Disaster Response Pipeline
Disaster Response Pipeline
Owner

This web app (embedded with machine learning pipeline) lets an emergency worker input a new message and displays categorized events so that you can send the messages to an appropriate disaster relief agency

Recommendor System for IBM
Recommendor System for IBM
Owner

Interactions that users have with articles on the IBM Watson Studio platform are analyzed and new article recommendations are made to them

Video Games Ratings Analysis
Video Games Ratings Analysis
Owner

Generated insights on what makes a video game successful based on 30K+ ratings extracted from IGDB API

Publications

With more and better data, an individualized approach that ties rates to effects on the water system is possible, meaning utilities can capture each customer’s water‐use patterns and tailor rates accordingly

Accomplishments

Data Scientist Nanodegree
Udacity Feb 2023 - Jun 2023

Built effective machine learning models, ran ETL and NLP pipelines, designed experiments and analyzed A/B test results, built recommendation systems, and deployed solutions to the cloud with industry-aligned projects

AI Programming with Python Nanodegree
Udacity Sep 2020 - Dec 2020

I used Python to solve complex problems quickly, learnt all the key tools for working with data in Python, learnt foundational math needed for AI success — vectors, calculus essentials, linear transformations, and matrices—as well as the linear algebra behind neural networks, how to use PyTorch for deep learning, and finally created an image classifier and converted it into a python application.

AWS Machine Learning Foundations
Udacity Aug 2021 - Oct 2021

Learnt how to prepare, build, train, and deploy high-quality machine learning (ML) models quickly with Amazon SageMaker and learny object-oriented programming best practices.

Machine Learning - Stanford University
Coursera (Course by Andrew Ng) Sep 2016 - Dec 2016

This course provides a broad introduction to machine learning, datamining, and statistical pattern recognition. Topics include: (i) Supervised learning (parametric/non-parametric algorithms, support vector machines, kernels, neural networks). (ii) Unsupervised learning (clustering, dimensionality reduction, recommender systems, deep learning). (iii) Best practices in machine learning (bias/variance theory; innovation process in machine learning and AI).

Learnt data visualization through Tableau 2022 and used it’s features to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks and present data quickly and beautifully.

SQL (Advanced) Certificate
HackerRank Aug 2022

Cleared advanced SQL assessment which covers topics like query optimization, data modeling, Indexing, window functions, and pivots in SQL.