Analyze streaming data from meetup.com using spark-streaming and store it in AWS RDS.

harxish/RSVP-Analysis

Meetup RSVP Stream Analysis

Abstract

Meetup is a service for organizing online groups that host in-person and virtual events for people with similar interests. It provides an API that streams RSVPs to these events in real time. This project uses that stream to address three goals:

  • Allow Meetup organizers to identify trending topics related to their meetups. Trending topics are computed from the descriptions of events that match the tags of interest.
  • Show which Meetup events attract the most responses within a given region.
  • Plot the grouped records.
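The first two goals can be illustrated with a small, self-contained sketch. It is not the project's code: the flat record layout (`event_name`, `description`, `country`, `response`), the sample messages, and both function names are hypothetical, and "trending" is read here as "how often each tag of interest appears in event descriptions".

```python
import json
import re
from collections import Counter

# Hypothetical RSVP messages; the real stream's field names may differ.
SAMPLE_MESSAGES = [
    json.dumps({"event_name": "PyData Night", "country": "us", "response": "yes",
                "description": "Python and Spark for data pipelines"}),
    json.dumps({"event_name": "PyData Night", "country": "us", "response": "yes",
                "description": "Python and Spark for data pipelines"}),
    json.dumps({"event_name": "Kafka Hour", "country": "us", "response": "no",
                "description": "Intro to Kafka with Python"}),
    json.dumps({"event_name": "Berlin JS", "country": "de", "response": "yes",
                "description": "JavaScript meetup"}),
]


def trending_topics(raw_messages, tags_of_interest, top_n=3):
    """Count how many RSVP records mention each tag in the event description."""
    tags = {tag.lower() for tag in tags_of_interest}
    counts = Counter()
    for raw in raw_messages:
        record = json.loads(raw)
        # Tokenize the description into lowercase words; count each tag once per record.
        words = set(re.findall(r"[a-z0-9]+", record.get("description", "").lower()))
        counts.update(tags & words)
    return counts.most_common(top_n)


def responses_per_event(raw_messages, country):
    """Count "yes" responses per event for one region (here: a country code)."""
    counts = Counter()
    for raw in raw_messages:
        record = json.loads(raw)
        if record.get("country") == country and record.get("response") == "yes":
            counts[record["event_name"]] += 1
    return counts.most_common()


print(trending_topics(SAMPLE_MESSAGES, ["python", "spark", "kafka"]))
print(responses_per_event(SAMPLE_MESSAGES, "us"))
```

In the actual project this grouping would run inside the streaming job rather than over an in-memory list; the sketch only shows the shape of the aggregation.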

Big Data Architecture




Technology Stack

  • WebSocket - Library used to read the JSON responses from the Meetup website.
  • Kafka - Collects the responses from the WebSocket module and adds them to the meetup-rsvp topic.
  • Spark Streaming - Streams data from the Kafka topic.
  • Amazon RDS - Stores the processed streaming data for plotting.
  • Matplotlib - Plots the processed data from RDS.
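To make the data flow between these components concrete, here is a rough, standard-library-only sketch of the same stages. Every piece is a stand-in and an assumption: a `queue.Queue` plays the role of the meetup-rsvp Kafka topic, a simple batching loop plays the role of Spark Streaming micro-batches, and an in-memory SQLite database plays the role of Amazon RDS. The `event_counts` table and all function names are hypothetical, not taken from the repository.

```python
import json
import queue
import sqlite3
from collections import Counter


def produce(messages, topic):
    """Stand-in for the WebSocket reader publishing JSON to the Kafka topic."""
    for message in messages:
        topic.put(json.dumps(message))


def drain_batch(topic, max_size):
    """Stand-in for one streaming micro-batch: pull up to max_size messages."""
    batch = []
    while len(batch) < max_size:
        try:
            batch.append(json.loads(topic.get_nowait()))
        except queue.Empty:
            break
    return batch


def store_counts(conn, batch):
    """Add this batch's per-event RSVP counts to the results table."""
    counts = Counter(record["event_name"] for record in batch)
    for name, n in counts.items():
        # Create the row if it is missing, then increment it.
        conn.execute(
            "INSERT OR IGNORE INTO event_counts (event_name, rsvps) VALUES (?, 0)",
            (name,),
        )
        conn.execute(
            "UPDATE event_counts SET rsvps = rsvps + ? WHERE event_name = ?",
            (n, name),
        )
    conn.commit()


def run_pipeline(messages, batch_size=2):
    """Run all stages once and return the stored (event_name, rsvps) rows."""
    topic = queue.Queue()
    conn = sqlite3.connect(":memory:")  # stand-in for the RDS database
    conn.execute(
        "CREATE TABLE event_counts (event_name TEXT PRIMARY KEY, rsvps INTEGER NOT NULL)"
    )
    produce(messages, topic)
    while True:
        batch = drain_batch(topic, batch_size)
        if not batch:
            break
        store_counts(conn, batch)
    rows = conn.execute(
        "SELECT event_name, rsvps FROM event_counts ORDER BY rsvps DESC, event_name"
    ).fetchall()
    conn.close()
    return rows
```

The rows returned by `run_pipeline` correspond to what the plotting step would read back from the database. Keeping running totals in the table, rather than one row per batch, is a design choice of this sketch and may not match how the notebooks store results.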

How to Run

  1. Run the core ipynb notebook to start listening to the stream, process the data, and upload the results to the RDS server.
  2. Run the plot ipynb notebook to read the data from RDS and plot it using Matplotlib.

Issues

Open an issue if you run into any problems :)
