About me:
MSc in Data Science (Tilburg University)
Certifications: Data Engineer, Data Analyst, Big Data with PySpark, AWS Practitioner (DataCamp)
Based in the Netherlands
Data Engineering: Architecting pipelines using Databricks, SQL, and Azure to move data at scale.
Applied AI/ML: Developing predictive models (XGBoost, CatBoost) and NLP solutions with a focus on explainability and bias detection.
Business Intelligence: Delivering Power BI dashboards that don't just show data, but drive decisions.
Mindset: Ownership-driven, Agile-native, and obsessed with turning complex environments into clarity.
Languages: Python, SQL, R
Data Science: pandas, NumPy, scikit-learn, Optuna, TensorFlow, Keras, spaCy, IsANLP, machine learning models, statistical analysis
Data Engineering: ETL & ELT pipelines, dbt, Airflow, HTML parsing, Docker, Azure DevOps, AWS (Practitioner)
Analytics & BI: Power BI, Tableau
Other: Git, CI/CD basics, feature engineering, ML pipelines
Research project analyzing linguistic bias in automated essay scoring & prediction.
Skills: NLP, feature extraction, ML pipeline, academic analysis
Repo link
An end-to-end analytics engineering pipeline that leverages local OLAP analysis.
Skills: Pipeline building, data orchestration, dbt
Repo link
Multiple experiments comparing ML models for customer churn prediction at a mobile game company.
Skills: EDA, ML, data visualization
Repo link
Business-focused dashboard analyzing churn drivers and customer segments at a telecom company.
Skills: DAX, data modeling, storytelling
Repo link
HR analytics dashboard analyzing employee information.
Skills: DAX, data modeling, storytelling
Repo link
Exploring IMDb data to understand ROI drivers in the film industry.
Skills: Web scraping, ETL pipeline, EDA, visualization, business insights
Repo link
Multiple experiments comparing ML models for loan default prediction.
Skills: modeling, evaluation, hyperparameter tuning
Repo link
Regression pipeline predicting rental revenue using engineered features.
Skills: regression, pipelines, metrics
Repo link
- Improving ML pipeline structure (modular code, reproducibility)
- Building more Power BI dashboards for business storytelling
- Improving my skills in dbt and Kubernetes