This repository contains the Exploratory Data Analysis (EDA) project for course BMAN73701. The project involves analyzing three primary datasets to extract meaningful business insights through data cleaning, processing, and visualization.
The repository is organized into the following directories:
- Final_notebooks: Contains the finalized Jupyter Notebooks documenting each step of the analysis process.
- Final_python: Contains the final Python scripts used for the project.
- Original_datasets: The three source datasets provided for the coursework.
- Processed_datasets: Updated versions of the datasets saved after cleaning and transformation steps.
- Old_files: Archive of preliminary work and drafts contributed by the project team.
To run the notebooks and scripts in this repository, you will need:
- Python 3
- Jupyter Notebook or JupyterLab
- Pandas
- NumPy
- Matplotlib / Seaborn
- Clone the repository:
git clone https://github.com/fazlur97/Python_EDA_Group.git
- Navigate to the project folder:
cd Python_EDA_Group - Navigate to the project folder:
pip install pandas numpy matplotlib seaborn