A specialized crawler for fetching chapters of Chinese light novels.
Node.js
- Initialize the server
- Open your console.
- Go to the reader-crawler folder.
- Install the required libraries by running
npm install
- Run the server with
npm run start
- Send the first URL to the server
- Open your browser
- Make a GET request to the URL
localhost:<port>/crawl?url=<url>&title=<title>&chapters=<chapters>
where
  - <port> is the port your server is listening on (default is 8888)
  - <url> is the initial URL
  - <title> is some text from the page title, used to tell the useful URLs apart from the trash (a good choice would be... the novel's title)
  - <chapters> is the approximate number of chapters the novel has
  - <encoding> is the encoding of the page (optional; the default is GBK)
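
The initial URL and the title usually contain characters such as `:`, `/`, `&` or spaces, which need to be percent-encoded before they go into the query string. As a rough sketch, the request URL could be assembled like this in Node.js. The helper name `buildCrawlUrl`, the example novel URL and the example title are assumptions for illustration and are not part of reader-crawler; the optional encoding parameter is left out because its query key is not stated here.

```javascript
// Hypothetical helper (not part of reader-crawler): builds the GET request
// URL for the /crawl endpoint, percent-encoding every query value.
function buildCrawlUrl({ port = 8888, url, title, chapters }) {
  const query = [
    `url=${encodeURIComponent(url)}`,
    `title=${encodeURIComponent(title)}`,
    `chapters=${encodeURIComponent(chapters)}`,
  ].join('&');
  return `http://localhost:${port}/crawl?${query}`;
}

// Example values are placeholders; replace them with a real novel page.
const requestUrl = buildCrawlUrl({
  url: 'http://example.com/novel/1.html',
  title: 'My Novel',
  chapters: 120,
});

console.log(requestUrl);
```

Paste the printed URL into your browser's address bar, or pass it to any HTTP client, to start the crawl.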
Victor Sin Yu Chen
Center of Informatics (CIn), UFPE