Summary
Overview
Work History
Education
Skills
Timeline
Generic

Kosal Ram

Data Engineer
Bentonville

Summary

Experienced Data Engineer with 8+ years of expertise in designing and implementing scalable data pipelines, ETL workflows, and data integration solutions. Skilled in big data processing for analytics and business intelligence. Proficient in Python, SQL, and Apache Spark for building reliable data systems that support real-time and batch processing.

Overview

15
15
years of professional experience

Work History

Data Engineer

WALMART (Contract)
03.2025 - 10.2025
  • Designed and maintained scalable, distributed data pipelines using Apache Spark, Hadoop, and PySpark to process large-scale retail and operational datasets.
  • Integrated Kafka, Airflow, and GCP services BigQuery, DataProc, Cloud Storage for real-time data ingestion, transformation, and storage.
  • Deployed containerized applications using Docker and Kubernetes, improving scalability and deployment efficiency.
  • Designed and optimized databases using SQL and Cassandra for structured and semi-structured data storage.
  • Collaborated with ML teams to prepare and serve data using BigQuery ML and PySpark, enabling predictive analytics.
  • Developed RESTful APIs and backend services using Java, Spring Boot, and JPA for seamless integration across systems.
  • Delivered insights via Tableau and Power BI, supporting data-driven decision-making across retail operations.
  • Implemented CI/CD pipelines using GitHub, GitHub Actions, and Agile development methodologies to automate testing, deployment, and iterative delivery.
  • Technologies Used: Python, Java, Spring Boot, Hadoop, Pyspark, Kafka, BigQuery, Airflow, SQL, Cassandra, Docker, Kubernetes, GCP

AI Engineer

FLORIDA ATLANTIC UNIVERSITY
08.2024 - 12.2024
  • Designed and implemented end-to-end LLM pipelines for RAG applications, including data ingestion, preprocessing, retrieval, and orchestration.
  • Fine-tuned open-source LLMs using proprietary datasets to improve domain-specific text generation and accuracy.
  • Developed scalable embedding generation pipelines using Spark and distributed compute frameworks to process billions of documents efficiently.
  • Engineered and optimized vector databases and indexing strategies to enable high-performance semantic search for GenAI applications.

Data Engineer

GREEN METHOD TECHNOLOGIES
12.2020 - 06.2023
  • Designed, implemented, and maintained scalable end-to-end data pipelines supporting multiple projects and business initiatives using AWS-native services and Snowflake.
  • Architected and optimized cloud-based data pipeline architectures leveraging AWS S3, Lambda, Glue, Redshift, RDS, and Snowflake for high availability and scalability.
  • Built and optimized data models across Snowflake, Databricks, Redshift, and S3, enabling efficient analytics and downstream reporting.
  • Implemented real-time and batch data ingestion pipelines using Snowpipe for continuous data loading and Snowflake Streams and Tasks for incremental processing and automated transformations.
  • Developed robust ETL workflows to read, transform, stage, and load large and complex datasets using PySpark, SQL, AWS Glue, and Snowflake SQL.
  • Integrated diverse data sources and assembled large-scale datasets to meet complex business and analytical requirements.
  • Customized and managed data integration tools, databases, data warehouses, and analytical platforms, ensuring seamless interoperability across systems.
  • Monitored and optimized data pipeline performance, uptime, cost, and scalability, implementing proactive alerting and tuning strategies.
  • Developed data quality validation, monitoring, and auditing frameworks to ensure data accuracy, consistency, and reliability.

Data Analyst

AIKAH ESTABLISHMENT
12.2014 - 09.2020
  • Applied software tools to historical data, enhancing the reliability of low voltage systems and reducing downtime.
  • Designed dashboards using Tableau, providing data driven insights for project stakeholders.
  • Enhanced data reporting capabilities by utilizing SQL for data extraction and transformation and leveraging visualization tools such as Tableau and Power BI to generate actionable insights for decision making.
  • Preprocess and analyze operational data, streamlining retrofitting processes. Collaborated with design and operations teams to ensure compliance with international electrical standards.

Energy Analyst

ENERCON INDIA LTD (WIND WORLD INDIA LTD)
04.2011 - 04.2014
  • Monitored wind turbine generator performance by integrating operational data into SQL based systems, ensuring real time tracking and analysis.
  • Created and maintained reports using MS Excel, leveraging advanced formulas and pivot tables to summarize power generation trends.
  • Conducted data cleaning and preprocessing for fault analysis, utilizing software tools to ensure high data quality for decision making.
  • Supported preventive maintenance schedules by analyzing historical data stored in access databases, reducing system downtime.
  • Worked with teams to implement data logging systems, enabling better tracking of operational parameters and fault histories.
  • Worked with design and maintenance team to provide solutions for better turbine performance.

Education

Master of Science - Data Science And Analytics

Florida Atlantic University
Florida
12-2024

Skills

  • Python, Java, Spring Boot, SQL
  • Machine Learning, LLM, GenAI, Agents
  • Cassandra, Tableau, Power BI
  • PySpark, Databricks, Snowflake
  • kafka, Airflow, Kubernetes
  • GCP, AWS

Timeline

Data Engineer

WALMART (Contract)
03.2025 - 10.2025

AI Engineer

FLORIDA ATLANTIC UNIVERSITY
08.2024 - 12.2024

Data Engineer

GREEN METHOD TECHNOLOGIES
12.2020 - 06.2023

Data Analyst

AIKAH ESTABLISHMENT
12.2014 - 09.2020

Energy Analyst

ENERCON INDIA LTD (WIND WORLD INDIA LTD)
04.2011 - 04.2014

Master of Science - Data Science And Analytics

Florida Atlantic University
Kosal RamData Engineer