Skip to content
View Triet00's full-sized avatar

Block or report Triet00

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Triet00/README.md

๐Ÿ‘‹ Hi, welcome to my portfolio. I'm Triแบฟt. I build systems that make data valuable!

๐Ÿ’ก About me:

๐ŸŽ“ MSc in Data Science (Tilburg University)

๐ŸŽ“ Certifications: Data Engineer, Data Analyst, Big Data with PySpark, AWS Practitioner (DataCamp)

๐Ÿ“ Based in the Netherlands

๐Ÿ› ๏ธ Data Engineering: Architecting pipelines using Databricks, SQL, and Azure to move data at scale.

๐Ÿค– Applied AI/ML: Developing predictive models (XGBoost, CatBoost) and NLP solutions with a focus on explainability and bias detection.

๐Ÿ“Š Business Intelligence: Delivering Power BI dashboards that don't just show data, but drive decisions.

๐Ÿ’ก Mindset: Ownership-driven, Agile-native, and obsessed with turning complex environments into clarity.


๐Ÿ”ง Tech Stack

Languages: Python, SQL, R
Data Science: pandas, NumPy, scikit-learn, Optuna, TensorFlow, Keras, spaCy, IsANLP, Machine Learning Models, statistical analysis.
Data Engineering: ETL & ELT pipelines, dbt, airflow, HTML parsing, Docker, Azure DevOps, AWS (Practitioner)
Analytics & BI: Power BI, Tableau
Other: Git, CI/CD basics, Feature Engineering, ML Pipelines


๐Ÿ“Œ Featured Projects

๐Ÿ”น Systematic Bias Detection in AES (NLP, ML, SA)

Research project analyzing linguistic bias in automated essay scoring & prediction.
Skills: NLP, feature extraction, ML pipeline, academic analysis

๐Ÿ”— Repo link

๐Ÿ”น Customer Insights Data Pipeline (dbt, airflow, SQL)

An end-to-end analytics engineering pipeline that leverages local OLAP analysis.

Skills: Pipeline building, data orchestration, dbt

๐Ÿ”— Repo link

๐Ÿ”น Customer Churn Prediction (ML, SA)

Multiple experiments comparing ML models for customer churn in a mobile game company.

Skills: EDA, ML, data visualization
๐Ÿ”— Repo link

๐Ÿ”น Customer Churn Analysis Telecom (EDA, Power BI)

Business-focused dashboard analyzing churn drivers and customer segments in a Telecom.
Skills: DAX, data modeling, storytelling
๐Ÿ”— Repo link

๐Ÿ”น HR Analytics (EDA, Power BI)

HR analysis dashboard analyzing employees information.
Skills: DAX, data modeling, storytelling
๐Ÿ”— Repo link

๐Ÿ”น Movies Analytics (HTML, EDA, ML)

Exploring IMDb data to understand ROI drivers in the film industry.
Skills: Web scraping, ETL pipeline, EDA, visualization, business insights
๐Ÿ”— Repo link

๐Ÿ”น Loan Payback Prediction (ML)

Multiple experiments comparing ML models for loan default prediction.
Skills: modeling, evaluation, hyperparameter tuning
๐Ÿ”— Repo link

๐Ÿ”น Rental Price Predictor (ML)

Regression pipeline predicting rental revenue using engineered features.
Skills: regression, pipelines, metrics
๐Ÿ”— Repo link


๐Ÿ“ˆ What I'm Working On

  • Improving ML pipeline structure (modular code, reproducibility)
  • Building more Power BI dashboards for business storytelling
  • Improving my skills in dbt and Kubernetes

Pinned Loading

  1. Systematic-Bias-Detection-in-AES Systematic-Bias-Detection-in-AES Public

    This project examines the systematic bias in LLMs automated essay scoring in the IELTS writing by analyzing linguistic features.

    Jupyter Notebook 2

  2. Movies-analytics Movies-analytics Public

    This projects collect movies data from IMDb and explore the relationships between the predictors and each movie's ROI (Return on Investment).

    Jupyter Notebook 2

  3. Loan-Payback-Prediction Loan-Payback-Prediction Public

    This project conducts multiple experiments in predicting loan payback from the Kaggle's Playground Series.

    Jupyter Notebook 1

  4. Rental-price-predictor Rental-price-predictor Public

    This project uses machine learning model to predict rental revenue.

    Jupyter Notebook 2

  5. Customer-Churn-Prediction- Customer-Churn-Prediction- Public

    This project focuses on predicting player churn using gameplay data collected from January 2015 to May 2016, consisting of 163,929 rows and three columns: device, score, and time. The time variableโ€ฆ

    Jupyter Notebook

  6. Analysis-Visualization-Churn-Rate-Telecom Analysis-Visualization-Churn-Rate-Telecom Public

    This project explores customer churn patterns within a telecommunications company using Power BI. The goal is to understand why customers leave, which segments are most at risk, and what business iโ€ฆ