Recommendation Engine For Blog Articles

Project Description

In this project, I will create a recommendation engine for users of the IBM Watson Studio blog article platform. The engine will recommend new articles to every user, regardless of whether the user is new or has already interacted with the platform.

To do this, I will use the following algorithms:

Ranked-Based
User-Based
Matrix Factorization (FunkSVD)

File Descriptions

Jupyter Notebook: Recommendation Engine for IBM Watson Studio Blog Articles.ipynb
- This Jupyter Notebook includes the following content:
  1. Exploratory Data Analysis
  2. Ranked-Based Recommendation
  3. User-Based Recommendation
  4. Matrix Factorization Recommendation
  5. Final Model
- Used Libraries:
  - import numpy as np
  - import pandas as pd
  - import matplotlib.pyplot as plt
  - import statistics
  - from collections import Counter
CSV-file: articles_community.csv
- File with all blog articles from the platform
- 5 columns (datatypes: 1 integer, 4 objects)
- 1056 rows
- Hardly any duplicates or NaN values
CSV-file: user-item-interactions.csv
- File with all user-article interactions
- 3 columns (datatypes: 1 float, 2 objects)
- 45993 rows

Exploratory Data Analysis

In the file user-item-interactions.csv, there are the following findings:

80% of articles were viewed 80 times.

Most users interacted with 1 to 20 articles.

Most users viewed 1 to 7 different articles (not counting repeated views of the same article).
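
The per-user counts behind the last two findings could be computed roughly as follows. This is only a minimal sketch on a tiny, made-up interaction log; the `interactions` list and its (user, article) layout are assumptions for illustration and not the real columns of user-item-interactions.csv.

```python
import statistics
from collections import Counter

# Hypothetical interaction log: one (user_id, article_id) pair per view.
interactions = [
    ("u1", "a"), ("u1", "a"), ("u1", "b"),
    ("u2", "a"),
    ("u3", "c"),
]

# All interactions per user, repeated views included.
total_per_user = Counter(user for user, _ in interactions)

# Different articles per user: drop duplicate (user, article) pairs first.
unique_per_user = Counter(user for user, _ in set(interactions))

# The median is a robust summary of "how many articles most users see".
median_total = statistics.median(total_per_user.values())
median_unique = statistics.median(unique_per_user.values())

print(dict(total_per_user), dict(unique_per_user))
print(median_total, median_unique)
```

Counting on the de-duplicated pairs is what separates "interactions" from "different articles viewed".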

Ranked-Based Recommendation

The algorithm uses a ranking to find the most viewed items and sorts them from most viewed to least viewed.
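
The ranking described above can be sketched in a few lines. This is an illustration under assumptions, not the notebook's code: the `interactions` list and the function name `get_top_articles` are hypothetical.

```python
from collections import Counter

# Hypothetical interaction log: one (user_id, article_id) pair per view.
interactions = [
    (1, "a"), (2, "a"), (3, "a"),
    (1, "b"), (2, "b"),
    (3, "c"),
]

def get_top_articles(interactions, n=10):
    """Return the n most viewed article ids, most viewed first."""
    view_counts = Counter(article_id for _, article_id in interactions)
    return [article_id for article_id, _ in view_counts.most_common(n)]

top_articles = get_top_articles(interactions, n=2)
print(top_articles)
```

Because the ranking ignores who is asking, every user gets the same list, which is why it suits users without any history.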

User-Based Recommendation

The algorithm searches for users with similar behavior. To measure similarity, I look at the articles a user has viewed and compare them with those of all other users to find the user with the most matches. Articles that the first user has not seen, but the similar user has, are recommended.
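
A minimal sketch of this idea, assuming each user's history is available as a set of article ids. The `views` dictionary and both function names are hypothetical, and the sketch walks through all other users in order of similarity instead of stopping at the single most similar one.

```python
# Hypothetical data: the set of article ids each user has viewed.
views = {
    "u1": {"a", "b", "c"},
    "u2": {"a", "b", "d"},
    "u3": {"a", "e"},
}

def most_similar_users(user_id, views):
    """Rank all other users by the number of articles shared with user_id."""
    seen = views[user_id]
    overlap = {
        other: len(seen & articles)
        for other, articles in views.items()
        if other != user_id
    }
    # Sort by overlap (descending), then by user id so ties are deterministic.
    return sorted(overlap, key=lambda other: (-overlap[other], other))

def user_based_recommendations(user_id, views, n=10):
    """Recommend articles that similar users viewed but user_id did not."""
    seen = views[user_id]
    recommendations = []
    for other in most_similar_users(user_id, views):
        # Only articles the target user has not seen are candidates.
        for article_id in sorted(views[other] - seen):
            if article_id not in recommendations:
                recommendations.append(article_id)
            if len(recommendations) == n:
                return recommendations
    return recommendations

print(most_similar_users("u1", views))
print(user_based_recommendations("u1", views))
```

Here similarity is simply the size of the overlap between two viewing histories, so a user with very few viewed articles gives the algorithm little to compare.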

Matrix Factorization Recommendation

The next recommendation algorithm is based on matrix factorization. For this, I use the well-known FunkSVD algorithm.

FunkSVD is based on singular value decomposition (SVD), which splits a matrix into three matrices of latent factors. The problem with standard SVD is that it cannot work with NaN values, i.e. articles that a user has not seen yet. To solve this problem, FunkSVD uses only the existing ratings of a user to update the latent factors, and the missing (NaN) values are then predicted from these factors. The updates are repeated until a minimum error is reached.
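
The update loop can be sketched with NumPy as below. This is a simplified illustration, not the notebook's implementation: the function name, the toy `ratings` matrix, the learning rate and the number of iterations are all assumptions. Note that FunkSVD learns only two matrices (users and articles) by gradient descent, without the middle matrix of classic SVD.

```python
import numpy as np

def funk_svd(ratings, latent_features=3, learning_rate=0.01, iters=500, seed=42):
    """Factorize ratings (NaN = unknown) into user and item latent factors."""
    rng = np.random.default_rng(seed)
    n_users, n_items = ratings.shape
    user_mat = rng.random((n_users, latent_features))
    item_mat = rng.random((latent_features, n_items))

    # Only cells that hold a real value take part in training.
    observed = np.argwhere(~np.isnan(ratings))
    mse_history = []

    for _ in range(iters):
        sse = 0.0
        for u, i in observed:
            error = ratings[u, i] - user_mat[u, :] @ item_mat[:, i]
            sse += error ** 2
            # Gradient step on both factor vectors, using the old user row.
            user_row = user_mat[u, :].copy()
            user_mat[u, :] += learning_rate * 2 * error * item_mat[:, i]
            item_mat[:, i] += learning_rate * 2 * error * user_row
        mse_history.append(sse / len(observed))

    return user_mat, item_mat, mse_history

# Hypothetical toy matrix: rows are users, columns are articles.
ratings = np.array([
    [1.0, 1.0, np.nan, 0.0],
    [1.0, np.nan, 0.0, 0.0],
    [np.nan, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, np.nan],
])

user_mat, item_mat, mse_history = funk_svd(ratings)
predictions = user_mat @ item_mat  # also fills the former NaN cells
print(mse_history[0], mse_history[-1])
```

The product of the two learned matrices yields a value for every user-article pair, including those that were NaN in the input, and these values serve as the predictions.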

The smallest MSE is reached with 3 latent factors.

The cutoff is defined at ± 0.000356 (the MSE at its lowest point).

Summary

The Ranked-Based Recommendation only recommends the most popular articles. For completely new users (the cold-start problem) this is sufficient, but for existing users it is not satisfying. For them, the User-Based Recommendation can be used. The problem here is that a user must have seen a minimum number of articles. If this minimum is set to 3, only 50% of the users can get a recommendation from this algorithm. The last recommendation function can solve this problem. With FunkSVD, it is possible to predict articles for a user, and just one interaction with an article can be enough. However, the accuracy depends on the number of previous user interactions, so a cutoff value is needed to keep only the articles with the most accurate predictions. Overall, a combination of all 3 algorithms is the best option for finding the best recommendations.

One possible improvement to my final model would be to restart the search in the User-Based Recommendation whenever articles are filtered out because the user has already seen them.
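
One way such a combination could look is sketched below. This is not the final model from the notebook: the function name, its parameters and the order in which the three sources are merged are assumptions made for illustration. Only the cold-start fallback and the minimum of 3 seen articles follow the description above.

```python
def combine_recommendations(seen, top_articles, user_based, svd_based,
                            min_seen=3, n=5):
    """Merge candidate lists for one user into a single recommendation list.

    seen         -- set of article ids the user has already viewed
    top_articles -- rank-based list, most viewed first
    user_based   -- candidates from similar users
    svd_based    -- FunkSVD candidates that already passed the cutoff
    """
    if not seen:
        # Cold start: no history, so popularity is all we have.
        return top_articles[:n]

    candidates = list(svd_based)
    if len(seen) >= min_seen:
        # User-based results are only trusted with enough history.
        candidates += user_based
    # Popular articles fill up the list if the other sources run short.
    candidates += top_articles

    recommendations = []
    for article_id in candidates:
        if article_id not in seen and article_id not in recommendations:
            recommendations.append(article_id)
    return recommendations[:n]

# A new user only gets the popularity ranking.
print(combine_recommendations(set(), ["a", "b", "c"], [], [], n=2))
# A user with one viewed article gets FunkSVD candidates plus popular ones.
print(combine_recommendations({"a"}, ["a", "b", "c"], ["d"], ["e"], n=3))
```

Articles the user has already seen are filtered out at the very end, so every source can contribute replacements when another one runs short.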

Acknowledgements

The dataset for this analysis was kindly provided by IBM Watson Studio: https://www.ibm.com/blogs/watson/

maximkiesel1/Recommendation_Engine
