The Intelligent Analytics Assistant is a full-stack, AI-powered application that allows users to query a large-scale MySQL database using natural language. It leverages a powerful Large Language Model (LLM) to translate business questions into complex, executable SQL, providing real-time analytics on a 1.3 million-row dataset.
- Natural Language Query: Translate English questions into valid MySQL queries, including complex joins and window functions.
- Dual-Engine LLM: Switch easily between Gemini 2.5 Flash (via API, for reliability) and TinyLlama (via Ollama, for local execution).
- High-Volume Data: Successfully handles a database containing over 1.3 million records across three key tables (`orders`, `order_items`, `customer_meetings`).
- Full-Stack Architecture: Separates concerns across a dedicated frontend, backend API, and database layer.
- Debugging/Transparency: Displays the generated SQL query before execution for transparency and debugging.
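Before the generated SQL can be displayed or executed, the LLM's raw reply usually needs light cleanup, since models often wrap queries in Markdown code fences. The helper below is a hypothetical sketch of that step, not code taken from the project:

```python
import re

def extract_sql(llm_output: str) -> str:
    """Strip Markdown code fences and surrounding whitespace from an LLM reply."""
    # Match ```sql ... ``` or bare ``` ... ``` fences anywhere in the reply
    match = re.search(r"```(?:sql)?\s*(.*?)```", llm_output, re.DOTALL | re.IGNORECASE)
    sql = match.group(1) if match else llm_output
    # Normalize: trim whitespace and ensure exactly one trailing semicolon
    return sql.strip().rstrip(";") + ";"

print(extract_sql("```sql\nSELECT SUM(total_qty) FROM orders;\n```"))  # SELECT SUM(total_qty) FROM orders;
```

A step like this keeps the query shown in the UI identical to the one actually executed.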
The project is built on three main components:
| Component | Technology | Role |
|---|---|---|
| Frontend | Streamlit | Provides the interactive chat interface and visualizes the results. |
| Backend | FastAPI (Python) | The core API that handles all logic: accepts the natural language query, calls the LLM, and executes the final SQL against the database. |
| LLM (AI Brain) | Gemini 2.5 Flash (or TinyLlama) | Generates the required MySQL query based on the database schema. |
| Database | MySQL | Stores the 1.3M+ rows of raw analytical data. |
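The request flow through these components can be sketched end to end: accept the question, ask the LLM for SQL, run the SQL, and return both so the UI can show the query alongside the results. Everything below is illustrative, with the function names, stub return values, and schema string as assumptions, and the LLM call and MySQL execution stubbed out:

```python
def generate_sql(question: str, schema: str) -> str:
    """Stub for the LLM call (Gemini 2.5 Flash or TinyLlama in the real backend)."""
    return "SELECT SUM(total_qty) FROM orders;"

def run_query(sql: str) -> list:
    """Stub for executing the SQL against MySQL."""
    return [(42,)]

def answer(question: str) -> dict:
    schema = "orders(order_id, total_qty, ...)"  # the real backend would send the live schema
    sql = generate_sql(question, schema)
    rows = run_query(sql)
    # The generated SQL is returned alongside the rows so the UI can display it
    return {"sql": sql, "rows": rows}

result = answer("What is the total quantity of all orders?")
print(result["sql"])
```

The key design point is that the backend returns the SQL with the results rather than hiding it, which is what makes the transparency/debugging feature possible.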
Follow these steps to get the application running on your local machine.
You must have the following installed:
- Python 3.10+
- MySQL Server (and administrative access)
- Ollama (optional; only needed to run TinyLlama locally)
```
git clone https://github.com/HenryMorganDibie/intelligent-analytics-assistant.git
cd intelligent-analytics-assistant
```

Create and activate a Python virtual environment, and install the required dependencies.
```
python -m venv .venv
.\.venv\Scripts\activate
pip install -r requirements.txt
```

You must set up the MySQL database and load the data.
- Create Database: Log into your MySQL server and create an empty database named `analytics_db`.
- Configure `.env`: Fill in your MySQL credentials and LLM preferences as shown below.
Create a file named `.env` in the project root with the following content:

```
# Database Credentials
DB_NAME=analytics_db
DB_USER=root
DB_PASS="YOUR_ACTUAL_PASSWORD"
DB_HOST=localhost
DB_PORT=3306

# LLM Engine Configuration (set USE_GEMINI=True for the reliable, API-based setup)
USE_GEMINI=True
GEMINI_API_KEY="YOUR_GEMINI_API_KEY"
GEMINI_MODEL="gemini-2.5-flash"

# (Optional) Ollama Configuration
OLLAMA_HOST=http://localhost:11434
OLLAMA_MODEL=tinyllama
```

- Load Data: Use the provided script to create the tables and load the data.

```
python data_loader.py
```
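Loading 1.3M+ rows is far faster with batched inserts than with one INSERT per row. The sketch below shows the batching idea only; it is hypothetical, and the real `data_loader.py` may load the data differently:

```python
def chunked(rows, size):
    """Yield successive lists of at most `size` rows."""
    batch = []
    for row in rows:
        batch.append(row)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch  # final partial batch

# With a real MySQL connection, each chunk would go through one executemany() call:
# for batch in chunked(rows, 10_000):
#     cursor.executemany("INSERT INTO orders (order_id, total_qty) VALUES (%s, %s)", batch)
#     conn.commit()

print([len(b) for b in chunked(range(25), 10)])  # [10, 10, 5]
```

Committing once per chunk rather than once per row keeps transaction overhead low at this data volume.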
(Optional) If you are running TinyLlama locally, pull the model via Ollama:

```
# Navigate to your Ollama installation directory
cd "$env:LOCALAPPDATA\Programs\Ollama"

# Pull the required model
.\ollama pull tinyllama
```

You must run the backend and frontend in two separate terminals.
- Launch the Backend API (Terminal 1):

```
# Ensure you are in the project root and the .venv is active
uvicorn backend.main:app --reload
```

- Launch the Frontend UI (Terminal 2):

```
# Ensure you are in the project root and the .venv is active
streamlit run frontend/app.py
```

Try these questions to test its full functionality:
| Complexity | Question | Expected SQL Component |
|---|---|---|
| Simple Aggregate | "What is the total quantity of all orders?" | `SELECT SUM(total_qty) FROM orders` |
| Relational Join | "What is the average total price of orders placed by customers who have never had a meeting?" | `LEFT JOIN ... WHERE meetingid IS NULL` |
| Window Function | "For every customer, list the date of their meeting and the date of their previous meeting." | `LAG(meetingdate) OVER (PARTITION BY customer_id ...)` |
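The window-function case pairs each meeting with the same customer's previous one. The LAG semantics can be illustrated in plain Python (toy data and column names, not the real schema):

```python
# Toy rows: (customer_id, meeting_date) -- illustrative data, not the real table
meetings = [
    (1, "2024-01-05"),
    (1, "2024-03-10"),
    (2, "2024-02-01"),
    (1, "2024-06-20"),
]

# Equivalent of LAG(meetingdate) OVER (PARTITION BY customer_id ORDER BY meetingdate):
# sorting groups rows per customer in date order, and we carry the last date seen.
previous = {}  # customer_id -> most recent meeting date seen so far
result = []
for customer_id, date in sorted(meetings):
    result.append((customer_id, date, previous.get(customer_id)))  # None plays the role of SQL NULL
    previous[customer_id] = date

for row in result:
    print(row)  # first row per customer has None as its "previous meeting"
```

This is why the expected SQL partitions by `customer_id`: without the partition, LAG would leak one customer's meeting date into another customer's row.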
Henry Dibie