Skip to content

DingYX0731/STA326_Assignment1

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Assignment 1: Data Collection

Project Overview

  • Objective: To understand and apply different methods of data collection, analysis, and cleaning across various data formats.
  • Key Skills: Web Scraping, JSON and CSV Data Handling, Data Cleaning, Python Programming.

Introduction

The project is structured into four key tasks, starting with web scraping to collect raw data from websites, followed by analyzing pre-collected data in JSON and CSV formats, and concluding with a crucial data cleaning step to ensure data quality. Try to understand and finish the tasks in Assignment1.

Table of Contents

  1. Task 1: Web Scraping for Data Acquisition
  2. Task 2: Analyzing Pre-collected Data in JSON Format
  3. Task 3: Exploring Pre-collected Data in CSV Format
  4. Task 4: Data Cleaning

Task 1: Web Scraping for Data Acquisition

  • Objective: Learn how to extract data from websites using web scraping tools like Beautiful Soup.
  • Key Concepts: HTML structure, CSS selectors, Python scripting.

Task 2: Analyzing Pre-collected Data in JSON Format

  • Objective: Understand how to load, and analyze data stored in JSON files.
  • Key Concepts: JSON structure, data parsing, nested objects and arrays.

Task 3: Exploring Pre-collected Data in CSV Format

  • Objective: Master the techniques for importing, manipulating, and analyzing data in CSV files using pandas.
  • Key Concepts: CSV file structure, pandas DataFrame, data manipulation.

Task 4: Data Cleaning

  • Objective: Learn the importance of and methods for cleaning data to enhance the reliability and accuracy of analyses.
  • Key Concepts: Handling missing values, data validation.

Submission

  • Make improvements or modifications to the project and submit your code via a Pull Request (PR) back to the original repository.
  • Export your Jupyter Notebook (.ipynb file) as a PDF and submit it via the Blackboard system.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%