🩺 Kidney Disease Classification – MLflow & DVC

An end-to-end kidney disease classification system with experiment tracking, reproducible pipelines, and production-ready ML workflows.

🌐 Live App • 🔁 Workflow • 🏗️ Architecture • 🚀 Quick Start • 📊 MLflow & DVC

🎯 Overview

This project is a production-style machine learning system for kidney disease classification, designed with MLOps best practices in mind.

It focuses on:

Modular ML pipelines
Experiment tracking with MLflow
Reproducibility using DVC
Configuration-driven development
Deployment-ready Streamlit interface

The project structure closely resembles real-world industry ML systems.

🔁 Workflows

The entire pipeline is configuration-driven and modular:

Update config.yaml
Update secrets.yaml (optional)
Update params.yaml
Define entities
Update configuration manager in src/config
Update individual components
Update pipeline logic
Update main.py
Update dvc.yaml
Update app.py

This design ensures clean separation of concerns and easy experimentation.

🏗️ Architecture

Data Ingestion
      ↓
Data Validation
      ↓
Data Transformation
      ↓
Base Model Preparation (VGG16)
      ↓
Model Training
      ↓
Model Evaluation
      ↓
MLflow Logging & Registry
      ↓
Streamlit Inference App

DVC orchestrates the pipeline, while MLflow tracks experiments and models.

🛠️ Tech Stack

Layer	Technology
Language	Python 3.10
Model	VGG16 (Transfer Learning)
ML Framework	TensorFlow / Keras
Experiment Tracking	MLflow
Pipeline Orchestration	DVC
Frontend	Streamlit
Configuration	YAML
Tracking Server	DagsHub

🚀 Quick Start

Prerequisites

Conda
Python 3.10
Git
DVC

Clone the Repository

git clone https://github.com/vivek34561/kidney_disease_classification
cd kidney_disease_classification

Create Conda Environment

conda create -n myenv python=3.10 -y
conda activate myenv

Install Dependencies

pip install -r requirements.txt

Run the Application

python app.py

Open your browser and navigate to the local Streamlit URL shown in the terminal.

📊 MLflow & DVC

MLflow

Tracks experiments, parameters, metrics, and artifacts
Maintains model registry
Enables reproducibility and comparison

Useful commands:

mlflow ui

Documentation:

MLflow with DagsHub

Tracking URI:

https://dagshub.com/vivek34561/kidney_disease_classification.mlflow

Set environment variables:

export MLFLOW_TRACKING_URI=https://dagshub.com/vivek34561/kidney_disease_classification.mlflow
export MLFLOW_TRACKING_USERNAME=vivek34561
export MLFLOW_TRACKING_PASSWORD=your_token_here

Then run:

python main.py

DVC Commands

dvc init
dvc repro
dvc dag

dvc repro runs the entire ML pipeline
dvc dag visualizes pipeline dependencies

🧠 Why MLflow & DVC?

MLflow

Production-grade experiment tracking
Parameter, metric, and artifact logging
Model versioning and comparison

DVC

Lightweight pipeline orchestration
Reproducible experiments
Data and model version control
Ideal for PoC and research-to-production workflows

🔮 Future Improvements

CI/CD integration for ML pipelines
Automated model promotion rules
Cloud-based artifact storage
Model drift detection
API-based inference service

👨‍💻 Author

Vivek Kumar Gupta AI Engineering Student | ML & MLOps Enthusiast

GitHub: https://github.com/vivek34561

LinkedIn: https://linkedin.com/in/vivek-gupta-0400452b6

Portfolio: https://resume-sepia-seven.vercel.app/

📄 License

Align it directly with ML Engineer / MLOps Engineer job descriptions
Create a system design diagram explanation for interviews

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.dvc		.dvc
.github/workflows		.github/workflows
config		config
mlruns/0		mlruns/0
model		model
research		research
src/cnnClassifier		src/cnnClassifier
.dvcignore		.dvcignore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
dvc.lock		dvc.lock
dvc.yaml		dvc.yaml
fix_model.py		fix_model.py
inputImage.jpg		inputImage.jpg
main.py		main.py
params.yaml		params.yaml
requirements.txt		requirements.txt
runtime.txt		runtime.txt
scores.json		scores.json
setup.py		setup.py
template.py		template.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🩺 Kidney Disease Classification – MLflow & DVC

🎯 Overview

🔁 Workflows

🏗️ Architecture

🛠️ Tech Stack

🚀 Quick Start

Prerequisites

Clone the Repository

Create Conda Environment

Install Dependencies

Run the Application

📊 MLflow & DVC

MLflow

MLflow with DagsHub

DVC Commands

🧠 Why MLflow & DVC?

MLflow

DVC

🔮 Future Improvements

👨‍💻 Author

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🩺 Kidney Disease Classification – MLflow & DVC

🎯 Overview

🔁 Workflows

🏗️ Architecture

🛠️ Tech Stack

🚀 Quick Start

Prerequisites

Clone the Repository

Create Conda Environment

Install Dependencies

Run the Application

📊 MLflow & DVC

MLflow

MLflow with DagsHub

DVC Commands

🧠 Why MLflow & DVC?

MLflow

DVC

🔮 Future Improvements

👨‍💻 Author

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages