Rica Mae | Data Engineer

ALittle More About Me

I'm Rica Mae 💙

Data Engineer with 5+ years of experience designing and optimizing scalable data platforms for analytics and AI/ML initiatives within the AWS ecosystem. Proficient in delivering high-performance solutions using Python, PySpark, Databricks, and AWS services (S3, Redshift, Glue, Lambda). Proven expertise in automating infrastructure with Terraform and CI/CD to enhance data quality and support business intelligence. Familiar with multi-cloud environments, including foundational knowledge of Microsoft Azure data services.

Download CV

Years Experience

20+

Projects Completed

AWS Certifications

15+

Technologies

Specialty

Data Engineering

Designing and optimizing scalable data platforms for analytics and AI/ML initiatives.

Cloud Computing

Delivering high-performance solutions using AWS services and automating infrastructure with Terraform.

AI/ML Initiatives

Engineering and scaling AI-ready data pipelines to power real-time BI dashboards and support machine learning model training.

My Skills

My Skills & Work Experiences.

Data Engineer (April 2024 - Present) - Prosource BPO

Orchestrated AWS infrastructure deployment using Terraform, designed and implemented data processing pipelines with Databricks and PySpark, and developed BuildKite CI/CD pipelines. Engineered and scaled AI-ready data pipelines to power real-time BI dashboards.

Data Engineer (June 2023 - April 2024) - InfoAlchemy

Architected and automated serverless ETL pipelines using AWS Glue and Lambda. Developed data ingestion pipelines for an AWS-based data lake. Led the strategic planning and management of data infrastructure, ensuring high levels of data quality and availability.

Data Engineer (September 2022 - May 2023) - DTN

Designed, developed, and tested features using Python, serverless framework, and AWS services. Successfully refined infrastructure deployment using AWS CloudFormation and Bamboo CI/CD and migrated an existing system from on-premise to AWS services.

Software Engineer (December 2020 - September 2022) - Accenture

Developed ETL mappings using Informatica PowerCenter, developed stored procedures, database triggers, and SQL queries. Implemented best practices and tuned SQL code for optimization. Participated in Code review and UAT tests.

Quality Assurance Engineer (August 2019 - November 2020) - Kodec

Established and maintained Quality Management Systems in Production. Prepared regular Quality Reports and developed and monitored performance metrics for all processes in the Production.

Python/Pyspark 95%

AWS 90%

SQL 95%

Databricks 85%

Terraform 80%

Data Engineering:

ETL, CI/CD, Automation, Scripting, Data Warehousing, Data Cleaning, AI/ML

Most Used Tools:

Databricks, Terraform, BuildKite, JIRA, Informatica PowerCenter, VsCode, SQL Developer

Relevant Certifications:

AWS Certified Data Engineer - Associate - Validate
AWS Certified Cloud Practitioner - Validate
AWS Certified Developer Associate - Validate
AWS Certified Solutions Architect - Validate

My Projects

AWS Serverless ETL Pipeline

Architected and automated serverless ETL pipelines using AWS Glue and Lambda for real-time data processing.

Python AWS Glue Lambda S3

AI-Ready Data Pipeline

Engineered and scaled data pipelines to power real-time BI dashboards and support ML model training.

PySpark Databricks Redshift

Infrastructure as Code

Orchestrated AWS infrastructure deployment using Terraform with automated CI/CD pipelines.

Terraform AWS BuildKite

AWS Data Lake Architecture

Developed data ingestion pipelines for an AWS-based data lake with high data quality and availability.

AWS S3 Glue Athena

Cloud Migration Project

Successfully migrated an existing system from on-premise to AWS services using CloudFormation.

CloudFormation EC2 RDS

Informatica ETL Solution

Developed ETL mappings and optimized SQL code for enterprise data warehousing solutions.

Informatica SQL Oracle

My Blog

15 Jan

Data Engineering 5 min read

Building Scalable ETL Pipelines: Best Practices

Learn essential best practices for designing and implementing scalable ETL pipelines using AWS services and modern data engineering patterns.

08 Jan

Cloud 8 min read

Infrastructure as Code with Terraform on AWS

A comprehensive guide to automating AWS infrastructure deployment using Terraform, including best practices and real-world examples.

22 Dec

Data Engineering 10 min read

Designing a Modern Data Lake Architecture

Explore architectural patterns and considerations for building a robust, scalable data lake on AWS using S3, Glue, and Athena.

10 Dec

Career 6 min read

From QA to Data Engineer: My Journey

Sharing my career transition story, lessons learned, and advice for aspiring data engineers looking to break into the field.

28 Nov

Cloud 7 min read

Serverless Data Processing with AWS Lambda

Discover how to build cost-effective, serverless data processing pipelines using AWS Lambda, S3, and EventBridge.

15 Nov

Tutorial 12 min read

Getting Started with PySpark and Databricks

A beginner-friendly tutorial on using PySpark in Databricks for large-scale data processing and transformation tasks.

Contact Me

Get In Touch

Want to work together or whether you have a question or just want to say hi, I’ll try my best to get back to you!

Name

Rica Mae

Address

Mexico, Pampanga

lacsonrica@gmail.com

Follow Me

Message Me

Hi! I'm Rica