Data Analysis Copilot is an AI-powered tool that helps you analyze data using natural language queries. It combines the power of LLMs with SQL and Python to provide interactive data analysis capabilities.
- Natural language to SQL query conversion
- Automated data analysis code generation
- Interactive Streamlit interface
- Support for SQLite databases
- Visualization capabilities
- Clone the repository:
git clone https://github.com/yourusername/data-analysis-copilot.git
cd data-analysis-copilot
- Create a virtual environment and activate it:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
- Start the Streamlit app:
streamlit run src/app.py
-
Upload your SQLite database file through the interface.
-
Enter your analysis questions in natural language.
-
View the generated SQL queries, analysis code, and results.
data-analysis-copilot/
├── src/ # Source code
├── core/ # Core functionality
├── services/ # Business logic services
|── utils/ # Utility functions
├── requirements.txt # Project dependencies
└── README.md # Project documentation
- Add an intent classification layer to know if the user input needs code generation to perform the data analysis or only querying the database. Adding this layer reduces the execution time and also reduces the resources used.