Skip to content

There are two subreddits I am interested in: /r/News and /r/TheOnion. The first contains titles of news articles, while the second contains titles of satirical news articles. Can I build a classification model using natural language processing that can accurately predict which subreddit a given post came from?

Notifications You must be signed in to change notification settings

JCacho2007/Fake-News-Classification-NLP

Repository files navigation

Project 3 - Classification with Natural Language Processing

Author: Grace Campbell

Problem Statement

Reddit is a content aggregation website where members can submit links, text posts, images, and videos, which other members can then comment on and discuss. The posts "are organized by subject into user-created boards called 'subreddits', which cover a variety of topics including news, science, movies, video games, music, books, fitness, food, and image-sharing." (Wikipedia)

There are two subreddits I am interested in: /r/News and /r/TheOnion. The first contains titles of news articles, while the second contains titles of satirical news articles. Can I build a classification model using natural language processing that can accurately predict which subreddit a given post came from?

Project Directory

  1. Data Preparation
  2. Modeling

About

There are two subreddits I am interested in: /r/News and /r/TheOnion. The first contains titles of news articles, while the second contains titles of satirical news articles. Can I build a classification model using natural language processing that can accurately predict which subreddit a given post came from?

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published