Skip to content

Vietjet website crawler using Selenium and Jupyter Notebook for extracting flight information.

Notifications You must be signed in to change notification settings

phitrann/vietjet-crawler

Repository files navigation

Vietjet Crawler using Python Selenium

This is a Jupyter notebook for crawling flight information from the Vietjet website using the Selenium library. The script automates the process of extracting flight details such as departure time, arrival time, and ticket prices.

Prerequisites

Before running the script, make sure you have the following installed:

  • Python 3.x
  • Selenium library (pip install selenium)

Usage

  1. Clone this repository or download the notebook vietjet_crawler.ipynb.

  2. Run the notebook vietjet_crawler.ipynb.

  3. Sit back and relax while the script automates the process of navigating to the Vietjet website, searching for flights, and extracting the flight details. The results will be saved in a CSV file named {datetime}.csv.

Notes

  • Please ensure that you use this script responsibly and in compliance with the Vietjet website's terms of service.

  • The script is built using Selenium, which controls the Chrome browser.

  • You can customize the script further to extract additional flight information or implement other functionalities based on your requirements.

  • For any issues or questions, please feel free to open an issue in this repository.

Happy crawling!

About

Vietjet website crawler using Selenium and Jupyter Notebook for extracting flight information.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published