This repository contains a comprehensive collection of Machine Learning laboratory experiments designed to provide hands-on experience with fundamental concepts in data analysis, preprocessing, and statistical analysis using Python.
This repository includes three progressive laboratory experiments that cover essential topics in machine learning and data science. Each lab is designed to build upon previous knowledge and introduce new concepts in a structured manner.
- Objective: Introduction to Python programming fundamentals for data analysis
- Topics Covered: NumPy arrays, basic data structures, and fundamental operations
- Tools Used: Python, NumPy, Pandas
- Objective: Learn data preprocessing techniques and create meaningful visualizations
- Topics Covered: Data loading, cleaning, preprocessing, and visualization techniques
- Tools Used: Python, Pandas, Matplotlib, NumPy
- Dataset: Students Performance Dataset
- Objective: Perform comprehensive descriptive statistical analysis on datasets
- Topics Covered: Statistical measures, data distribution analysis, and advanced visualization
- Tools Used: Python, Pandas, Matplotlib, Seaborn, NumPy
- Dataset: Iris Dataset
- Objective: Implement linear regression techniques for predictive modeling
- Topics Covered: Simple Linear Regression, model evaluation, visualization
- Tools Used: Python, Scikit-learn, Pandas, Matplotlib
- Dataset: TvMarketing.csv
- Objective: Implement fundamental classification algorithms
- Topics Covered: Naive Bayes, Decision Trees, model evaluation
- Tools Used: Python, Scikit-learn, Pandas, Matplotlib
- Objective: Implement advanced classification algorithms
- Topics Covered: K-Nearest Neighbors, Support Vector Machines, hyperparameter tuning
- Tools Used: Python, Scikit-learn, Pandas, Matplotlib
- Objective: Implement ensemble learning techniques
- Topics Covered: Random Forests, Gradient Boosting, feature importance
- Tools Used: Python, Scikit-learn, Pandas, Matplotlib
- Objective: Implement unsupervised learning techniques
- Topics Covered: K-Means Clustering, PCA, dimensionality reduction
- Tools Used: Python, Scikit-learn, Pandas, Matplotlib
- Programming Language: Python 3.x
- Libraries:
- NumPy - Numerical computing
- Pandas - Data manipulation and analysis
- Matplotlib - Data visualization
- Seaborn - Statistical data visualization
- Development Environment: Google Colab
- File Formats: Jupyter Notebooks (.ipynb), PDF reports
All experiments in this repository were designed and executed in Google Colab, providing a cloud-based environment with pre-installed libraries and easy sharing capabilities.
Machine-Learning-Lab/
├── README.md
├── ML_Lab.ipynb (Original combined notebook)
├── datasets/
│ ├── README.md
│ ├── TvMarketing.csv
│ ├── advertising.csv
│ ├── vehicle_co2_dataset.csv
│ └── DATA.csv
├── Lab-01/
│ ├── README.md
│ ├── ML_LAB_1.ipynb
│ └── ML Lab-1.pdf
├── Lab-02/
│ ├── README.md
│ ├── ML_LAB_2.ipynb
│ ├── ML Lab-2.pdf
│ └── StudentsPerformance.csv
├── Lab-03/
│ ├── README.md
│ ├── ML_LAB_3.ipynb
│ ├── ML Lab-3.pdf
│ └── Iris.csv
├── Lab-04/
│ ├── README.md
│ └── ML_LAB_4.ipynb
├── Lab-05/
│ ├── README.md
│ └── ML_LAB_5.ipynb
├── Lab-06/
│ ├── README.md
│ └── ML_LAB_6.ipynb
├── Lab-07/
│ ├── README.md
│ └── ML_LAB_7.ipynb
└── Lab-08/
├── README.md
└── ML_LAB_8.ipynb
- Clone this repository to your local machine
- Open the desired lab folder
- Upload the
.ipynbfile to Google Colab - Follow the instructions in each notebook
- Refer to the corresponding PDF report for detailed analysis and results
This repository is part of a Machine Learning course curriculum and demonstrates practical implementation of theoretical concepts through hands-on programming exercises.