Arif-1411/Data-analysis


Data Analysis Projects

A collection of exploratory data analysis projects in Python. Each project covers data cleaning, feature engineering, statistical analysis, and insight extraction from a real-world dataset.


Tools and Libraries

  • Python
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Scikit-learn (preprocessing and feature engineering)
  • Jupyter Notebook

Topics Covered

Data Cleaning

  • Handling missing values
  • Removing duplicates
  • Fixing inconsistent data types
  • Outlier detection and treatment
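As a rough illustration of the cleaning steps listed above, here is a minimal sketch with Pandas. The toy data, the column names (`age`, `income`), and the `clean` helper are hypothetical and not taken from any project in this repository. Median imputation and IQR-based clipping are just one common choice; the notebooks may treat missing values and outliers differently.

```python
import pandas as pd

# Hypothetical toy data; column names are illustrative only.
raw = pd.DataFrame({
    "age": ["25", "31", "31", None, "40", "29"],           # numbers stored as text
    "income": [42000.0, 55000.0, 55000.0, 61000.0, 1_000_000.0, 48000.0],
})


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Apply the four cleaning steps to a copy of df (assumed approach)."""
    # Remove exact duplicate rows.
    out = df.drop_duplicates().copy()

    # Fix the inconsistent type: parse text to numbers, invalid values become NaN.
    out["age"] = pd.to_numeric(out["age"], errors="coerce")

    # Handle missing values with the column median.
    out["age"] = out["age"].fillna(out["age"].median())

    # Detect outliers with the 1.5 * IQR rule and clip them to the fences.
    q1, q3 = out["income"].quantile([0.25, 0.75])
    iqr = q3 - q1
    out["income"] = out["income"].clip(q1 - 1.5 * iqr, q3 + 1.5 * iqr)

    return out.reset_index(drop=True)


cleaned = clean(raw)
print(cleaned)
```

Clipping keeps the row and caps the extreme value; dropping the row instead is equally valid when the outlier is a data-entry error.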

Exploratory Data Analysis (EDA)

  • Univariate and bivariate analysis
  • Distribution analysis
  • Correlation analysis
  • Statistical summaries
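The EDA steps above might look like the following sketch. The data is synthetic and the column names (`x`, `y`, `group`) are assumptions made for the example, so the numbers it prints say nothing about the datasets in this repository.

```python
import numpy as np
import pandas as pd

# Synthetic data for illustration only; y is built to depend on x.
rng = np.random.default_rng(0)
x = rng.normal(50, 10, 200)
df = pd.DataFrame({
    "x": x,
    "y": 2 * x + rng.normal(0, 5, 200),
    "group": rng.choice(["a", "b"], 200),
})

# Statistical summary of the numeric columns (count, mean, std, quartiles).
summary = df.describe()

# Distribution analysis: skewness of each numeric column.
skewness = df[["x", "y"]].skew()

# Correlation analysis: Pearson correlation matrix.
corr = df[["x", "y"]].corr()

# Bivariate analysis: a numeric column summarized per category.
by_group = df.groupby("group")["y"].agg(["mean", "median", "count"])

print(summary, skewness, corr, by_group, sep="\n\n")
```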

Feature Engineering

  • Creating new features from existing data
  • Encoding categorical variables
  • Feature scaling and normalization
  • Handling skewed distributions
  • Binning and discretization
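A minimal sketch of the feature engineering techniques listed above, using Pandas and Scikit-learn. The toy table, the column names, and the bin edges are hypothetical. A log transform, one-hot encoding, and standard scaling are common defaults, not necessarily what each project uses.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data; names and values are illustrative only.
df = pd.DataFrame({
    "price": [100.0, 250.0, 80.0, 4000.0],
    "quantity": [2, 1, 5, 1],
    "city": ["Kochi", "Delhi", "Kochi", "Mumbai"],
})

# Create a new feature from existing columns.
df["revenue"] = df["price"] * df["quantity"]

# Handle a right-skewed column with log(1 + x).
df["log_price"] = np.log1p(df["price"])

# Binning: discretize price into labeled bands (bin edges are assumptions).
df["price_band"] = pd.cut(
    df["price"], bins=[0, 100, 1000, np.inf], labels=["low", "mid", "high"]
)

# Encode the categorical column as one-hot indicator columns.
df = pd.get_dummies(df, columns=["city"], prefix="city")

# Scale numeric features to zero mean and unit variance.
scaled = StandardScaler().fit_transform(df[["price", "quantity"]])
df["price_scaled"] = scaled[:, 0]
df["quantity_scaled"] = scaled[:, 1]

print(df)
```

When a model is trained afterwards, the scaler should be fitted on the training split only and then applied to the test split, so that test statistics do not leak into training.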

Data Visualization

  • Bar charts, histograms, box plots
  • Heatmaps and correlation matrices
  • Pair plots and scatter plots
  • Time series plots

Repository Structure

data-analysis/
    project-1/
        dataset/
        notebook.ipynb
        README.md
    project-2/
        dataset/
        notebook.ipynb
        README.md
    ...

How to Run

  1. Clone the repository
git clone https://github.com/Arif-1411/Data-analysis.git
  2. Install dependencies
pip install pandas numpy matplotlib seaborn scikit-learn jupyter
  3. Open any project notebook
jupyter notebook

Author

Arifudheen M
GitHub: Arif-1411
