Skip to content

UKDataServiceOpen/Working-with-Reddit-data

 
 

Repository files navigation

Working with Reddit Data

Table of contents

About

Resources on Reddit data for research.

Reddit Data

You will need to install PRAW.

Reddit Scraper Binder Link

Click the button to open an editable version of the Reddit scraping demo. This may take a minute to start. You won't be able to run this without generating your own Reddit API keys. We recommend creating a separate Reddit account for API keys.

Binder

Reddit Preprocessing Binder Link

This notebook outlines exploration of Reddit data, tidying, basic graph plotting and wordcloud generation.

Binder

Requirements

  • Python 3.8+
  • PRAW (Python Reddit API Wrapper)
  • Jupyter / JupyterLab for notebooks
  • Install dependencies via requirements.txt or environment.yml

How to run (quick)

  1. Create a virtual environment:
    • python3 -m venv .venv
    • source .venv/bin/activate
  2. Install dependencies:
    • pip install -r requirements.txt or
    • conda env create -f environment.yml
  3. Open notebooks:
    • jupyter lab
    • or use the Binder links above (you must provide Reddit API keys).

About

How to scrape data from Reddit, legally and ethically.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%