Skip to content

wooing1084/DataScienceTermProject

Repository files navigation

Data Science Termproject

가천대학교 2023학년 1학기 데이터과학

Theme Park visitor Prediction(서울대공원)

Data References

  1. 기상청 기상자료개방포털
  2. 대기질측정정보
  3. 서울대공원 일일입장객수 정보

How to run?

Just run these files.

  • ensemble_gradient_boost.ipynb
  • regression_model.ipynb

Business objective

When people go out with their family or friends, people often go to amusement parks. However, there are so many people that you may not be able to ride the rides you want to ride or see what you want to see. The word "눈치게임" has even been coined to avoid this situation. We identified these problems and planned an amusement park visitor prediction model to provide convenience to people.

Architecture

Used algorithms

Regression

Generalized Additive Model (GAM)

Ensemble Learning

Gradient boosting Regressor

Result

Optimal hyperparameter n_estimators : 200, max_depth : 3, learning_rate : 0.1
Feature importances
Accuracy

Conclusion

The prediction accuracy is about 70 percent, so we think we can roughly see how much it comes numerically. However, We think it is a little difficult to know the degree of congestion by figures alone. Therefore, it would be more helpful if we created a congestion level.

Improvements There were some improvements to be made while carrying out this project. Most of these were data set issues, including. 1. In the past, many people visited Seoul Grand Park, but the number of visitors decreased over time. This phenomenon has led to a data imbalance.
  1. It does not properly reflect the expected days of many visitors, such as holidays and Children's Day. Although the day of year value was added to reflect this, it was not accurately reflected.

  2. No visitor data has been collected after the pendemic. The official Endemic Declaration has only been issued relatively recently. Therefore, the most recent trend was not reflected.

  3. Information about the park or surrounding events was not reflected. In the case of large parks, the number of visitors varies greatly depending on the event, but data could not be collected.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •