Akhila Muppavarapu

Bentonville

Summary

Proficient Data Engineer with a focus on Apache Airflow, SQL, and Power BI. Designed and optimized scalable data pipelines, achieving a 30% reduction in processing time.

Experienced in designing and optimizing data pipelines to ensure seamless data flow, using advanced SQL and Python skills to create and maintain robust data architectures. Track record of implementing scalable solutions that enhance data integrity and support informed decision-making.

Overview

8 years of professional experience

Work History

Data Engineer III

Walmart
Bentonville, AR
02.2024 - Current
  • Designed, developed, and maintained end-to-end data pipelines, incorporating large-scale batch processing using Apache Spark, Hive, and Scala, enabling seamless ingestion, processing, and transformation of complex datasets.
  • Developed and optimized data workflows using orchestration tools, such as Apache Airflow and Automic, ensuring efficient processing of millions of daily data records across distributed systems.
  • Collaborated cross-functionally with business teams, product managers, and engineers to build scalable consumption-layer data products, yielding actionable insights for key stakeholders.
  • Automated data workflows using SQL, Python, Airflow DAGs, and Presto, reducing data processing time by 30% and improving operational efficiency.
  • Implemented Kafka streaming solutions for real-time, event-driven pipelines to deliver low-latency, time-sensitive data.
  • Designed and governed comprehensive data schemas and metadata models to ensure data quality and compliance with organizational standards.
  • Deployed scalable data solutions on Google Cloud Platform (GCP) and Azure, optimizing performance and ensuring cost-effectiveness and system reliability.
  • Worked with data storage and querying technologies such as Hive, BigQuery, Postgres, Cassandra, and Cosmos DB, as well as distributed SQL engines like Presto and Trino.
  • Enhanced CI/CD pipelines with GitHub Actions and Sonar, enabling version control, automated testing, and streamlined deployments.
  • Leveraged AI tools such as GitHub Copilot to increase productivity, streamline code reviews, and accelerate feature delivery.

Data Engineer

USAA Bank
Plano, TX
08.2022 - 02.2024
  • Migrated on-premises ETL jobs (e.g., DataStage) to modern platforms such as dbt, improving scalability and performance.
  • Developed and implemented Snowflake models for data cleansing, Slowly Changing Dimensions, surrogate key assignments, and change data capture (CDC).
  • Built robust pipelines using dbt (data build tool) to transfer data efficiently between sources and destinations.
  • Created and orchestrated cloud-based ELT pipelines to automate copy activities and data transformation tasks.
  • Enhanced operational efficiency by managing cloud-based data platforms (e.g., Snowflake) and improving data integrity using modern processes and best practices.
  • Designed and managed large-scale ETL pipelines using PySpark and SQL, processing terabytes of structured and unstructured data daily.
  • Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.

Senior Data Analyst

Capgemini
Philadelphia, PA
11.2018 - 06.2022
  • Designed and maintained ETL processes using Pentaho to extract, transform, and load complex datasets, ensuring data quality and efficiency.
  • Automated key business processes by implementing tailor-made Pentaho Data Integration workflows, reducing manual effort and improving operational efficiency.
  • Spearheaded cross-functional collaboration efforts, leading to a 70% improvement in team efficiency and optimized data pipeline performance.
  • Played a critical role in migrating payroll processes to Workday by designing and optimizing ETL workflows using MySQL and Pentaho.
  • Conducted discovery and requirement-gathering sessions, streamlining project deliverables and ensuring timely completion.
  • Delivered training sessions on Pentaho Data Integration to build team proficiency and ensure consistent knowledge sharing.
  • Developed automated reports and dashboards with Power BI, providing actionable insights to stakeholders.
  • Leveraged Agile methodologies throughout the software development lifecycle (SDLC) to optimize workflows and streamline project execution.
  • Played a key role in the migration of Informatica applications from version 10.1 to 10.2, achieving seamless transitions within project deadlines.
  • Utilized the DVO tool to validate and test data integrity during system migration, ensuring accurate data processing.
  • Gained expertise in ETL architecture by designing and developing complex workflows and mappings for diverse datasets.

Education

Bachelor of Science - Computer Science

Hindustan University
Chennai
06.2018

Skills

  • ETL Tools: Airflow, Automic, Pentaho Data Integration, Informatica, GCP
  • Databases: MySQL, SQL, BigQuery
  • Data Visualization: Power BI, Tableau
  • Workflow Methodologies: Agile SDLC
