tertiary-trekker

1. Background

This application crawls and scrapes data from university websites in Singapore, parses their content, and consolidates the information into an Elasticsearch database for searching. Document clustering may then be performed so that users can discover similar webpages across university websites.
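The clustering method used by this project is not described in this README. As a rough illustration only, one common way to score how similar two webpages are is cosine similarity over term-frequency vectors. The sketch below assumes plain-text page content, and its function names (tokenize, termFrequencies, cosineSimilarity) are hypothetical; they are not taken from the project's code.

```typescript
// Illustrative sketch only: names and approach are assumptions,
// not the tertiary-trekker implementation.

/** Split text into lowercase alphanumeric tokens. */
function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
}

/** Count how often each token occurs in the text. */
function termFrequencies(text: string): Map<string, number> {
  const counts = new Map<string, number>();
  for (const token of tokenize(text)) {
    counts.set(token, (counts.get(token) ?? 0) + 1);
  }
  return counts;
}

/** Euclidean length of a term-frequency vector. */
function vectorNorm(vector: Map<string, number>): number {
  let sumOfSquares = 0;
  vector.forEach((count) => {
    sumOfSquares += count * count;
  });
  return Math.sqrt(sumOfSquares);
}

/** Cosine similarity of two term-frequency vectors; 0 if either is empty. */
function cosineSimilarity(a: Map<string, number>, b: Map<string, number>): number {
  let dot = 0;
  a.forEach((count, term) => {
    dot += count * (b.get(term) ?? 0);
  });
  const denominator = vectorNorm(a) * vectorNorm(b);
  return denominator === 0 ? 0 : dot / denominator;
}
```

Two pages could then be compared with cosineSimilarity(termFrequencies(pageA), termFrequencies(pageB)), where a score closer to 1 suggests more shared vocabulary. A real pipeline would likely also weight terms (for example with TF-IDF) and remove stop words before clustering.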

This project aims to assist prospective university students by presenting course information in one consolidated place, making comparisons easier.

Made for NUS Orbital 2024.

2. External README

The external README can be found at https://docs.google.com/document/d/1WzcwicQI4hg8aESSdqEEhPX5unTr_2-EmTcJpDUixUE/edit?usp=sharing

3. Installation

Web Scraper

Clone the repository using

git clone https://github.com/C5hives/tertiary-trekker

From the project directory, navigate to the webpage-scraper folder.
Install the required dependencies using

npm install

Compile the TypeScript project files using the tsc command.
Run the compiled JavaScript files using

npm start

File Parser

Clone the repository as described under Web Scraper.
From the project directory, navigate to the webpage-parser folder.
Install the required dependencies using

mvn install

Run the unit tests using

mvn clean test

Database

See the external README.

Backend

See the external README.

Frontend

See the external README.

4. Collaborators

Jewi
