This was my first contact with web scraping: I scraped a website called Affluent, and I learned a few things worth considering before reaching for Puppeteer or any other automated browser:
- Always check whether the site exposes a public API we can use.
- If there is no public API, check whether we can reverse engineer the site's own network calls and use its internal API.
- If neither works, try plain HTTP requests first: a request uses far fewer resources than Puppeteer, Selenium, or similar tools.
- Only when there is no API, reverse engineering is not possible, and plain requests fail because the site renders its content with JavaScript, fall back to an automated browser.
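The last two points boil down to one question: does the data already appear in the raw HTML, or is it rendered client-side? A minimal sketch of that check, with a hypothetical `needsBrowser` helper (not part of this project) and made-up HTML samples:

```javascript
// Hypothetical helper: decide whether a page needs a headless browser.
// If the data we want is already in the raw HTML returned by a plain HTTP
// request, requests are enough; if the page is an empty shell filled in by
// client-side JavaScript, we need Puppeteer.
function needsBrowser(rawHtml, expectedText) {
  return !rawHtml.includes(expectedText);
}

// Static page: the data is present in the HTML itself.
const staticHtml = '<table><tr><td>revenue: $120</td></tr></table>';
// JS-rendered shell: the data only appears after scripts run.
const shellHtml = '<div id="app"></div><script src="bundle.js"></script>';

console.log(needsBrowser(staticHtml, 'revenue')); // false
console.log(needsBrowser(shellHtml, 'revenue'));  // true
```

In practice you would fetch the page once with a plain request, run a check like this against the data you expect, and only reach for Puppeteer when the static HTML comes back empty.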
Ensure you have the following installed on your local machine:
- Make sure you have [Node.js](https://nodejs.org/) installed.
- Clone the repository and install its dependencies:

  ```
  git clone https://github.com/GOlmedoFormosa/node-scraping
  cd node-scraping
  npm install
  ```
- Create/configure a `.env` file with your credentials. A sample `.env.example` file has been provided to get you started: make a duplicate of `.env.example`, rename it to `.env`, then configure your credentials.
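The variable names below are assumptions for illustration only (check the repo's actual `.env.example` for the real keys); the project needs MySQL credentials and, presumably, a login for the site being scraped:

```
DB_HOST=localhost
DB_USER=root
DB_PASSWORD=your-db-password
DB_NAME=scraping
SITE_USER=you@example.com
SITE_PASSWORD=your-site-password
```

Keep `.env` out of version control; only `.env.example` (with placeholder values) should be committed.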
- Run `npm run watch` to start the server and watch for changes.
- Open your browser and go to `localhost:8080`.
- Run `npm run processUsers`. This uses request-promise to fetch user data and store it in the MySQL database.
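The real request-promise call and the MySQL insert need live credentials, so here is only a sketch of the middle step: mapping a fetched API user object into a flat row ready to insert. The field names (`id`, `name`, `email`) are my assumption, not the project's actual schema:

```javascript
// Hypothetical mapper: shape one user object from the API response into a
// row for MySQL. In the project, the input comes from a request-promise
// call and the output goes into an INSERT statement.
function toUserRow(apiUser) {
  return {
    id: apiUser.id,
    name: apiUser.name,
    // Normalize the email so lookups in the database are case-insensitive.
    email: (apiUser.email || '').toLowerCase(),
  };
}

const sample = { id: 7, name: 'Ada', email: 'ADA@Example.com' };
console.log(toUserRow(sample)); // { id: 7, name: 'Ada', email: 'ada@example.com' }
```

Keeping the mapping in a pure function like this makes the fetch-and-store script easy to test without a network connection or a database.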
- Run `npm run processScraping`. This runs Puppeteer, gets the data, and stores it in the MySQL database.
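The scraping step looks roughly like the sketch below. The URL handling, the `#stats-table` selector, and the tab-separated row format are all assumptions for illustration; only the overall Puppeteer flow (launch, navigate, extract rendered text, close) reflects what the script does:

```javascript
// Sketch of a Puppeteer scrape, assuming the dashboard renders a stats
// table client-side. Puppeteer is loaded lazily so the pure helper below
// can be used and tested without the browser installed.
async function scrapeStats(url) {
  const puppeteer = require('puppeteer');
  const browser = await puppeteer.launch({ headless: true });
  const page = await browser.newPage();
  // networkidle2 waits until the page has (mostly) finished loading,
  // giving client-side JavaScript time to render the data.
  await page.goto(url, { waitUntil: 'networkidle2' });
  const text = await page.$eval('#stats-table', (el) => el.innerText);
  await browser.close();
  return parseRows(text);
}

// Pure helper: turn "name<TAB>clicks" lines of rendered table text into
// row objects ready to insert into MySQL.
function parseRows(text) {
  return text
    .trim()
    .split('\n')
    .map((line) => {
      const [name, clicks] = line.split('\t');
      return { name, clicks: Number(clicks) };
    });
}

console.log(parseRows('campaignA\t12\ncampaignB\t30'));
```

Separating the browser automation (`scrapeStats`) from the parsing (`parseRows`) keeps the fragile part small: if the site's markup changes, only the selector and the parser need updating.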