Skip to content
View aymane-maghouti's full-sized avatar

Block or report aymane-maghouti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aymane-maghouti/README.md

Aymane Maghouti

Data Engineer · Big Data · AI Solutions

Designing scalable data platforms and AI-driven pipelines for enterprise environments

LinkedIn Portfolio YouTube Email

Profile Views


About me

I'm a data engineer specializing in the design and modernization of data platforms, real-time analytics frameworks, and AI-enabled information systems. I've contributed to large-scale transformation initiatives — from migrating critical enterprise environments to building RAG systems and semantic pipelines for complex technical content.

  • 🛫 Currently @ Airbus — Data Analytics & Engineering on the Final Assembly Line (FAL), working within the Skywise (Palantir) Big Data ecosystem
  • 🧠 Building AI-powered knowledge platforms with RAG, Vertex AI, and Gemini on GCP
  • ☁️ Working across AWS · Azure · GCP cloud environments
  • 🏀 Outside of data: basketball and football keep me going

Experience highlights

Company Role Period
✈️ Airbus Data Analyst / Data Engineer — FAL Production Analytics (Skywise) + AI Knowledge Platform (GCP/RAG) Feb 2026 — Present
📡 INWI Data Platform Migration & Real-Time Analytics — Oracle → SQL Server, Kafka, Power BI Feb 2025 — Sep 2025
🧱 Shiftbricks Data Ingestion Pipeline for AI Application — Medallion Architecture, Airflow Jun 2024 — Sep 2024
🚂 ONCF Data Analytics Intern — BigQuery, DBT, Star Schema, 10M+ records/day Jul 2024 — Sep 2024

Tech stack

Data Engineering & Streaming

Apache Kafka Apache Spark Apache Airflow Hadoop dbt PySpark

Cloud & MLOps

GCP Azure AWS Docker MLflow Vertex AI

Data Warehousing & BI

SQL Server BigQuery Snowflake Power BI SSIS

Programming & Frameworks

Python Java SQL FastAPI Spring Boot React

Databases

PostgreSQL MongoDB Oracle MySQL ClickHouse


GitHub stats

GitHub Streak


Featured projects

Project Description Stack
🏠 LeBonPrix End-to-end rental price prediction platform — scraping, Medallion architecture, XGBoost, mobile app Python · XGBoost · FastAPI · Spring Boot · React Native · Power BI
Event-Driven CDC Pipeline Real-time change data capture with Kafka, Spark Streaming, Debezium Kafka · Spark · Debezium · Spring Boot
☁️ ML on Azure Full ML solution migrated to Azure with MLOps pipelines and cloud-native deployment Azure · MLflow · FastAPI · React Native
🚀 AWS Smartphone Pipeline Big data pipeline migrated to AWS ecosystem Glue · Athena · S3 · QuickSight
📊 HR Azure Pipeline HR analytics pipeline with ADF, Databricks, and Power BI ADF · Databricks · Blob Storage · Power BI
🔁 Sales Data Pipeline ETL from SQL Server to BigQuery with Airflow orchestration and Looker Studio SQL Server · Airflow · BigQuery · Looker

"Good data infrastructure is invisible — you only notice it when it's missing."

Pinned Loading

  1. Big-Data-Project Big-Data-Project Public

    This project aims to predict smartphone prices using a combination of batch and stream processing techniques in a Big Data environment. The architecture follows the Lambda Architecture pattern, pro…

    Python 26 3

  2. Real-Time-Data-Pipeline-Using-Kafka Real-Time-Data-Pipeline-Using-Kafka Public

    This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for data storage. The pipeline collects metrics data from the loc…

    Python 15 2

  3. Students-Management-System-Spring Students-Management-System-Spring Public

    This is a simple application for managing student records built with Spring Boot. It provides basic CRUD (Create, Read, Update, Delete) functionality for student entities.

    Java 4 2

  4. Human-Resources-data-pipeline Human-Resources-data-pipeline Public

    This ETL (Extract, Transform, Load) project aims to extract human resources data, clean it using PL/SQL and SQL, integrate it into a Snowflake data warehouse on Azure Cloud using Informatica, and v…

    5 2

  5. Dice-Game Dice-Game Public

    This web application is a simple game developed using Java Enterprise Edition (JEE) components such as filters, servlets, and JavaServer Pages (JSP). The backend is implemented using JEE, while Boo…

    Java 1

  6. Sentiment-Analysis-for-Jumia-Reviews-and-Smartphone-Price-Prediction-System Sentiment-Analysis-for-Jumia-Reviews-and-Smartphone-Price-Prediction-System Public

    The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making al…

    Jupyter Notebook 2