Skip to content
View Shegzimus's full-sized avatar

Block or report Shegzimus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shegzimus/README.md

Deutsch

🌟 About Me

  • ⚙️ Data Engineer skilled in building modular cloud-based ETL pipelines.
  • 🔢 Academic background in Mathematics (BSc) & Data Science (MSc)
  • 💼 Brief career in Management & Public Health Consulting
  • 💻 Passionate about data architecture development, and containerized workflows.

🛠️ Tech Stack:

My Skills

  • Specialty: Writing readable ETL modules and configuring Virtual machines with Terraform

💡 My Design Philosophy (what you can expect)

  • Scalable and Delegable Pipelines: I believe in designing easy pipelines to operate and maintain so that teams can focus on creating new solutions and solving other challenges. This approach contrasts with the belief that only the author can maintain their code. Good pipelines require little to no author intervention after deployment.

  • Functionality before refinement: I work to get things off the ground and start with a pipeline that works—delivering value quickly—before refining it into a more polished, future-proof version. This saves time and lets me adapt designs based on real-world feedback.

  • Security and Modularity as Cornerstones: Secure and modular designs are fundamental to my work. I focus (too much sometimes) on implementing best practices like secret management, non-hardcoded paths, and modular structures to ensure pipelines are robust, compliant, and easy to maintain.

🔭 What I’m Learning

  • RegEx
  • JVM languages (Java & Scala)
  • Go (for writing Kafka producers and multithreading)

📫 Connect With Me

Pinned Loading

  1. DE_Fashion_Product_Images DE_Fashion_Product_Images Public

    Apache Airflow powered ETL Pipeline for moving about 133k images from Kaggle to GCS and BigQuery

    Python

  2. DE_NASA_NeoW_Pipeline DE_NASA_NeoW_Pipeline Public

    Airflow powered ETL pipeline for moving Near-Earth-Object data from NASA to Google Cloud

    Python

  3. ML-Video-Game-Sales-Prediction ML-Video-Game-Sales-Prediction Public

    Jupyter Notebook

  4. Masters-Thesis Masters-Thesis Public

    Python