Skip to content

Amey-Thakur/KAGGLE-COURSES

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

133 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kaggle Courses

License: MIT Status Technology Curated by Amey Thakur and Mega Satish

Kaggle Courses: A comprehensive archive of completed Data Science, Machine Learning, and Data Manipulation courses. This repository documents mastery of the Python ecosystem through structured Kaggle exercises and certifications.

Source Code  ·  Amey's Kaggle  ·  Mega's Kaggle  ·  Certifications


Authors  ·  Overview  ·  Courses  ·  Structure  ·  Certifications  ·  Quick Start  ·  Usage Guidelines  ·  License  ·  About  ·  Acknowledgments


Important

🤝🏻 Special Acknowledgement

Special thanks to Mega Satish for her meaningful contributions, guidance, and support that helped shape this work.


Overview

This repository was created as part of the Internship at Technocolabs Softwares. Curated by Amey Thakur and Mega Satish, this project documents our journey of learning and acquiring essential Data Science and Machine Learning skills through 7 official Kaggle courses. We successfully completed all course material, from data cleaning fundamentals to machine learning explainability, earning recognized certifications as a record of our achievement.

The project demonstrates a disciplined approach to upskilling in Data Science, leveraging the Python Ecosystem (Pandas, Numpy, Scikit-Learn, XGBoost) to solve real-world analytical problems.

Learning Objectives

The curriculum is governed by strict computational data science principles:

  • Data Wrangling: Mastering pandas for efficient data manipulation, cleaning, and transformation.
  • Visual Analytics: Utilizing matplotlib and seaborn for exploratory data analysis and publication-quality visualizations.
  • Machine Learning: Implementing robust models from decision trees to gradient boosting and understanding model explainability.

Tip

Kaggle Course Completion

This repository represents the successful completion of 7 Kaggle Courses. Courses successfully completed with Mega Satish. Each directory corresponds to specific course milestones, ensuring a linear and verifiable progression of skills.


Courses

Data Manipulation (Pandas)    Certificate

  • Exercise 1 - Creating, Reading and Writing    Kaggle    Colab
  • Exercise 2 - Indexing, Selecting & Assigning    Kaggle    Colab
  • Exercise 3 - Summary Functions and Maps    Kaggle    Colab
  • Exercise 4 - Grouping and Sorting    Kaggle    Colab
  • Exercise 5 - Data Types and Missing Values    Kaggle    Colab

Data Cleaning    Certificate

  • Exercise 1 - Handling Missing Values    Kaggle    Colab
  • Exercise 2 - Scaling and Normalization    Kaggle    Colab
  • Exercise 3 - Parsing Dates    Kaggle    Colab
  • Exercise 4 - Character Encodings    Kaggle    Colab
  • Exercise 5 - Inconsistent Data Entry    Kaggle    Colab

Data Visualization    Certificate

  • Exercise 1 - Hello, Seaborn    Kaggle    Colab
  • Exercise 2 - Line Charts    Kaggle    Colab
  • Exercise 3 - Bar Charts and Heatmaps    Kaggle    Colab
  • Exercise 4 - Scatter Plots    Kaggle    Colab
  • Exercise 5 - Distributions    Kaggle    Colab
  • Exercise 6 - Choosing Plot Types and Custom Styles    Kaggle    Colab
  • Exercise 7 - Final Project    Kaggle    Colab

Intro to Machine Learning    Certificate

  • Exercise 1 - Explore Your Data    Kaggle    Colab
  • Exercise 2 - Your First Machine Learning Model    Kaggle    Colab
  • Exercise 3 - Model Validation    Kaggle    Colab
  • Exercise 4 - Underfitting and Overfitting    Kaggle    Colab
  • Exercise 5 - Random Forests    Kaggle    Colab
  • Exercise 6 - Machine Learning Competitions    Kaggle    Colab

Feature Engineering    Certificate

  • Exercise 1 - Mutual Information    Kaggle    Colab
  • Exercise 2 - Creating Features    Kaggle    Colab
  • Exercise 3 - Clustering With K-Means    Kaggle    Colab
  • Exercise 4 - Principal Component Analysis    Kaggle    Colab
  • Exercise 5 - Target Encoding    Kaggle    Colab

Intermediate Machine Learning    Certificate

  • Exercise 1 - Introduction    Kaggle    Colab
  • Exercise 2 - Missing Values    Kaggle    Colab
  • Exercise 3 - Categorical Variables    Kaggle    Colab
  • Exercise 4 - Pipelines    Kaggle    Colab
  • Exercise 5 - Cross-Validation    Kaggle    Colab
  • Exercise 6 - XGBoost    Kaggle    Colab
  • Exercise 7 - Data Leakage    Kaggle    Colab

Machine Learning Explainability    Certificate

  • Exercise 1 - Permutation Importance    Kaggle    Colab
  • Exercise 2 - Partial Plots    Kaggle    Colab
  • Exercise 3 - SHAP Values    Kaggle    Colab
  • Exercise 4 - Advanced Uses of SHAP Values    Kaggle    Colab

Note

Detailed Course Logs

Detailed code and notebooks for every single course are available in the repository structure. Refer to the directory tree below to navigate to specific topics.


Project Structure

KAGGLE/
│
├── docs/                            # Documentation Layer
│   └── SPECIFICATION.md             # Technical Architecture
│
├── Mega/                            # Attribution Assets
│   ├── Filly.jpg                    # Companion (Filly)
│   └── Mega.png                     # Profile Image (Mega Satish)
│
├── Source Code/                     # Core Learning Modules
│   ├── Data Cleaning/               # Missing Values, Dates, Encodings
│   ├── Data Manipulation/           # Pandas Mastery
│   ├── Data Visualization/          # Seaborn, Matplotlib
│   ├── Feature Engineering/         # PCA, Clustering, Encoding
│   ├── Machine Learning.../         # Intro & Intermediate ML
│   └── Machine Learning Explainability/  # SHAP, Permutation
│
├── Certificates/                    # Course Completion Credentials
│   ├── Kaggle Pandas.png
│   ├── Kaggle Data Cleaning.png
│   ├── Kaggle Data Visualization.png
│   ├── Kaggle Intro to Machine Learning.png
│   ├── Kaggle Feature Engineering.png
│   ├── Kaggle Intermediate Machine Learning.png
│   └── Kaggle Machine Learning Explainability.png
│
├── .gitattributes                   # Git LFS/Attribute Configuration
├── .gitignore                       # Project Ignore Patterns
├── CITATION.cff                     # Project Citation Manifest
├── codemeta.json                    # Metadata Standard
├── LICENSE                          # MIT License
├── README.md                        # Project Entrance
└── SECURITY.md                      # Security Protocols

Certifications

Kaggle Data Science & Machine Learning Courses
Certified completion of comprehensive Data Science curriculum through Kaggle Learn.

Pandas Certificate

Data Cleaning Certificate

Data Visualization Certificate

Intro to ML Certificate

Feature Engineering Certificate

Intermediate ML Certificate

ML Explainability Certificate


Quick Start

1. Prerequisites

Warning

Runtime Environment Guard

Python notebooks rely on relative file paths for datasets. Ensure you set your working directory to the repository root or the specific notebook folder before execution to prevent FileNotFoundError during data loading phases.

2. Installation & Setup

Step 1: Clone the Repository

Open your terminal and clone the repository:

git clone https://github.com/Amey-Thakur/KAGGLE-COURSES.git
cd KAGGLE-COURSES

Step 2: Library Synchronization

Ensure all required Python libraries are installed. Open your terminal and run:

pip install pandas numpy matplotlib seaborn scikit-learn xgboost shap

3. Execution

Navigate to the Source Code directory and launch Jupyter:

jupyter notebook

Tip

Kaggle Mastery | Internship Portfolio Hub

Experience the complete Kaggle Learning Ecosystem directly through this centralized Internship Archive. This hub serves as a scholarly gateway that orchestrates the mastery of seven core data science disciplines, providing a visual demonstration of skill evolution, credential validation, and featured project integration across the modern analytical landscape.

Explore Technocolabs Internship Specifications


Usage Guidelines

This repository is openly shared to support learning and knowledge exchange across the data science community.

For Students
Utilize this repository as a definitive roadmap for mastering the Kaggle learning platform. The structured course progression offers a rigorous, measurable pathway to transition from data cleaning fundamentals to advanced machine learning explainability.

For Educators
Adopt this curriculum architecture as a modular template for designing intensive data science workshops or accelerated machine learning bootcamps, providing a proven pedagogical framework for technical capability building.

For Researchers
Reference these artifacts as a verifiable case study in self-paced technical education, demonstrating the efficacy of structured Kaggle courses in rapid skill acquisition and applied machine learning.


License

This repository and all its creative and technical assets are made available under the MIT License. See the LICENSE file for complete terms.

Note

Summary: You are free to share and adapt this content for any purpose, even commercially, as long as you provide appropriate attribution to the original authors.

Copyright © 2021 Amey Thakur & Mega Satish


About This Repository

Created & Maintained by: Amey Thakur & Mega Satish

This repository was created as part of the Internship at Technocolabs Softwares. Together with Mega Satish, we embarked on a journey to learn and acquire essential Data Science and Machine Learning skills through structured Kaggle courses. This repository serves as a record of our learning experience and the certifications we earned along the way. It documents our progression from data cleaning and manipulation fundamentals to advanced feature engineering and model explainability.

Connect: GitHub  ·  LinkedIn  ·  ORCID

Acknowledgments

Grateful acknowledgment to Mega Satish for her exceptional collaboration and scholarly partnership during these Kaggle courses. Her intellectual agility, a veritable superpower to rapidly synthesize complex logic and articulate it with clarity, was the driving force behind the successful mastery of this intensive curriculum. She navigated advanced data science and machine learning concepts with remarkable speed, clarifying intricate details in a way that made the learning process reciprocal and effortless. Her engagement was essential for the completion of these course milestones; this educational journey would not have been possible without her steady discipline, ability to simplify the complex, and constant encouragement. Thank you, Mega, for everything you shared and taught along the way.

Special thanks to Technocolabs Softwares for facilitating this internship experience and providing the structured learning environment that made these achievements possible.


↑ Back to Top

Authors  ·  Overview  ·  Courses  ·  Structure  ·  Certifications  ·  Quick Start  ·  Usage Guidelines  ·  License  ·  About  ·  Acknowledgments


🏆 Kaggle


Computer Engineering (B.E.) - University of Mumbai

Semester-wise curriculum, laboratories, projects, and academic notes.