Link to Hugging Face model: airbnb-reviews-helpfulness-classifier-roberta-base
This model is an AirBnB reviews helpfulness classifier. It predicts the helpfulness of reviews on the AirBnB website, from most helpful (A) to least helpful (C).
Our project fine-tuned FacebookAI/roberta-base for multi-class text (sequence) classification.
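A minimal inference sketch using the transformers pipeline; the repo id below mirrors the link above and may need the full `org/model-name` path on the Hub:

```python
from transformers import pipeline

# Repo id assumed from the link above; prepend the organization name if needed.
classifier = pipeline(
    "text-classification",
    model="airbnb-reviews-helpfulness-classifier-roberta-base",
)

review = "Great location, spotless apartment, and the host shared detailed check-in instructions."
# Output is a list of dicts, e.g. [{'label': 'A', 'score': 0.93}];
# label names depend on the id2label mapping stored in the model config.
print(classifier(review))
```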
5000 samples were scraped from the AirBnB website based on listing_id values from the Kaggle AirBnB Listings & Reviews dataset. Samples were translated from French to English.
Training Set: 4560 samples synthetically labelled by GPT-4 Turbo; labelling cost was approximately $60.
Test/Evaluation Set: 500 samples labelled manually by two groups (each group labelled 250 samples), with majority vote applied. A scoring rubric (shown below) was used for labelling.
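For reference, here is a minimal sketch of how the GPT-4 Turbo synthetic labelling could be scripted with the OpenAI API; the prompt wording, function name, and abbreviated rubric text are illustrative placeholders, not the exact setup used:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Placeholder prompt; the full scoring rubric is shown below.
RUBRIC = (
    "Rate the helpfulness of this AirBnB review for future guests: "
    "A = most helpful, B = somewhat helpful, C = least helpful. "
    "Answer with a single letter."
)

def label_review(review_text: str) -> str:
    """Ask GPT-4 Turbo for a single-letter helpfulness label (A/B/C)."""
    response = client.chat.completions.create(
        model="gpt-4-turbo",
        messages=[
            {"role": "system", "content": RUBRIC},
            {"role": "user", "content": review_text},
        ],
        max_tokens=1,
        temperature=0,
    )
    return response.choices[0].message.content.strip()
```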
hyperparameters = {'learning_rate': 3e-05,
'per_device_train_batch_size': 16,
'weight_decay': 1e-04,
'num_train_epochs': 4,
'warmup_steps': 500}
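A sketch of how these hyperparameters map onto the Hugging Face Trainer; the CSV file names, output directory, and column names are assumptions for illustration:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

model_name = "FacebookAI/roberta-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=3)  # classes A, B, C

# Hypothetical file names; each CSV is assumed to hold a "text" column with the
# review and an integer "label" column (0 = A, 1 = B, 2 = C).
dataset = load_dataset("csv", data_files={"train": "train.csv", "test": "test.csv"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

dataset = dataset.map(tokenize, batched=True)

args = TrainingArguments(
    output_dir="airbnb-helpfulness",  # hypothetical output directory
    learning_rate=3e-5,
    per_device_train_batch_size=16,
    weight_decay=1e-4,
    num_train_epochs=4,
    warmup_steps=500,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=dataset["train"],
    eval_dataset=dataset["test"],
    tokenizer=tokenizer,  # enables dynamic padding via the default data collator
)
trainer.train()
```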
We trained our model on Colab Pro, which cost approximately 56 compute units.
This fine-tuned RoBERTa-base model is a text classifier that predicts the helpfulness of AirBnB reviews.
Collaborators: Li Hui Cham, Nicholas Wong, Isaac Sparrow, Christopher Arraya, Lei Zhang, Leonard Yang
Credit to my wonderful teammate Li Hui for organizing our work.