About

I have been working in various engineering fields since 2018. Started in mechanical, transitioned to software and have settled into data engineering in recent years.

Python SQL
- Python/SQL Expert
Apache Spark Apache Airflow
- Big Data and Orchestrator
Docker Kubernetes
- Data Architect
AWS AWS S3
- AWS Pro.
dbt Airbyte
- Open Source Enthusiast
>> Check my Medium for Data Projects.
  • Age: 27 years
  • City: Vitória-ES / Brazil

Interests

Web Development

Data Engineering

Cloud Engineering

DevOps Engineering

Databases

SQL: PostgresSQL, MySQL, Microsoft, Oracle...
NoSQL: MongoDB, Redis, Apache Cassandra...

ML, DL and AI

Data Analysis

Power BI

Mechanical Engineering

About

Data Engineering Articles and Projects (read on Medium & Github)

1
Deploying Apache Spark Clusters: A Comparison of EC2, EMR, Databricks & More
Apache Spark Databricks AWS EMR
2
Building Scalable Data Pipelines with Apache Spark and Airflow
Python Apache Spark Apache Airflow AWS S3
3
Deploying a Spark Cluster on AWS EC2 Instances
Apache Spark AWS EMR
4
End-to-End Data Engineering project using Spark, Airflow, Jupyer, dbt, EC2 and more
Python SQL Apache Spark Apache Airflow Docker AWS S3

WEB DEV. PROJECTS

Fast PID (Engineering App)

Flask and MongoDB web app

and HTML/CSS/JS, Docker, SocketIO, Bootstrap...
https://professionalpid.onrender.com (free hosting - wait 30s)
Flask (Python)
Docker
NoSQL
SocketIO
Bootstrap
API
Orçamentei (Budgets App)

Flask and MongoDB web app

and HTML/CSS/JS, Stripe, Maps API, Vite, oAuth, Docker, Bootstrap...
https://www.orcamentei.com/ (free hosting - wait 30s)
Flask (Python)
Docker
NoSQL
Bootstrap
API

Skills

Languages

vectorlogo.zone vectorlogo.zone vectorlogo.zone upload.wikimedia.org vectorlogo.zone

Data Tools

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Others: Delta Lake, Dagster, Mage.io, Astro Python SDK, ElasticSearch, Superset, ClickHouse, Druid, Amundsen...

Databases & Warehouses

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Cloud

vectorlogo.zone vectorlogo.zone vectorlogo.zone

Frameworks

Flask vectorlogo.zone Django vectorlogo.zone Node.js vectorlogo.zone Bootstrap vectorlogo.zone Tensorflowvectorlogo.zone upload.wikimedia.org

Tools

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Experience

Snc Lavalin (now Atkins Réalis)
[TSE: ATRL]

2019 - Present

Senior Utility Systems Analyst

  • Responsible for preparing technical documents such as Engineering Flow Diagrams and Calculation Reports for utility systems.
  • Data analysis for engineering projects.
  • Led industrial projects focused on designing complex systems like water networks, compressed air systems, and dust extraction.
  • Applied detailed simulations and calculations to optimize system performance for large-scale projects.

Data Engineer and Web Developer Freelancer>

2023 - Present

Data Engineer and Web Developer Freelancer

  • Developed data pipelines using Apache Spark and Delta Lake to efficiently process large datasets.
  • Built ETL process orchestration using Airflow, including data ingestion and storage integration with S3.
  • Implemented medallion architecture to ensure clean and structured data, from ingestion to final analysis.
  • Automated data pipelines using Docker and Docker Compose for local development and production deployment.
  • Utilized dbt for data transformation, creating models that ensure data governance and quality.
  • Developed APIs and dashboards using Flask to present insights from processed data.
  • Collaborated on web development projects with backend in Flask/Django and frontend using HTML/CSS/JS, ensuring performance and responsive design.

Two SaaS Founder

2023 - 2024

Founder

  • Orçamentei e FastPID

Online Certification

Machine Learning

Algorithms-Design and Analysis

Algorithmic Toolbox

Deep Learning with Tensorflow

Machine Learning with Python

Neural Networks and Deep Learning

Experience

Arizona State University

January 2021 - Present

Software Engineer

  • Managed large‑scale deployment of JupyterHub with Nbgrader and webwork software, facilitating approx 5500 students.
  • Configured, troubleshot, and administered server‑side web applications for the statistics department.
  • Handled Linux server administration and Apache configuration; automated tasks like user account creation, managing student database, and system maintenance using Shell and automation scripts, reducing manual work by 200%.

Augmenify Infotech Pvt. Ltd.

August 2020 - November 2020

Backend Developer

  • Documented and coded server‑less web application for the hotel industry and designed REST API using Flask‑based JWT authentication.
  • Redeveloped an existing system to support customer account management, scheduling, and time tracking; enabled dynamic API calls with the help of Amazon API Gateway, AWS Lambda, and DynamoDB.

Epitome Corporation Pvt. Ltd.

July 2019 - Dec 2019

Software Developer

  • Tested, designed, and developed backend APIs of WebRTC enabled multi‑party video conferencing web application and delivered the project 15 days ahead of schedule by efficiently designing the flow of the project.

Meditab Software Pvt. Ltd.

May 2018 - June 2018

Programmer Analyst

  • Optimized image processing algorithm of pill detection by creating customized MASKRCNN algorithm, increasing accuracy by 15%; trained classification algorithm with the help of triplet loss to learn the image embedding of pill, reducing the hassle of collecting data of new pills.
  • Devised a pipeline of the project to incorporate it into the product of the company. Implemented Restful APIs in Django that enabled our quality Analyst team to increase reporting speed by 46%.
  • Built a web app to onboard data from the company’s product using Flask, Postgres, and AWS S3, enabling interactive charts in real-time.
  • Mentored 2 interns to optimize the pill detection algorithm and to include the multiprocessing pipeline, increasing overall speed by 75%.

Space Application Centre, ISRO

Jan 2018 - May 2018

Research Intern

  • Implemented noise reduction algorithm on the satellite image and prepared architecture for detecting objects in high‑resolution satellite images, achieving 80% accuracy.
  • Increased accessibility of satellite image data by redesigning database and application for showcasing graphical data.

Education

Mechanical Engineering - Bachelor of Science

Janurary 2015 - 2020
Relevant Coursework
  • Mechanical Engineering theory
  • Linear Algebra
  • Programming
  • Data analysis
  • Foundation Of Algorithms

Contact

My Address

Vitória/ES - Brasil

Social Profiles

Email

prettibernardo@gmail.com

or contact me on LinkedIn!