feat: add "web-scraping-linkedin-jobs-using-puppeteer" blog post #28
blog/ca/3_web-scraping-de-feines-a-linkedIn-utilitzant-puppeteer-i-rxjs.mdx
To iterate through the `searchParamsList` array, we first convert it from an Array to an Observable using the `fromArray` operator. We then use the `concatMap` operator to process each `searchText` and `locationText` pair sequentially. The strength of RxJS here is that switching from sequential to parallel processing only requires replacing `concatMap` with `mergeMap`. That is not recommended in this case because it would exceed LinkedIn's rate limits, but it is worth considering in other scenarios.
Same here
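The sequential-versus-parallel trade-off in the quoted paragraph can be sketched without RxJS. What follows is a plain async/await analogue, not the post's actual code: `processSequentially` behaves roughly like `concatMap` (one search at a time, in order) and `processConcurrently` roughly like `mergeMap` (all searches started at once). `SearchParams`, `fakeScrape`, `log`, and both helper names are hypothetical, and `fakeScrape` only simulates a delay instead of opening a page.

```typescript
// Hypothetical shape of one entry in searchParamsList.
interface SearchParams {
  searchText: string;
  locationText: string;
}

// Records the order in which simulated scrapes start and finish.
const log: string[] = [];

// Stand-in for a real scrape: waits a few milliseconds, then resolves.
// Assumption: a real implementation would drive Puppeteer here.
function fakeScrape(params: SearchParams, delayMs: number): Promise<string> {
  log.push(`start ${params.searchText}`);
  return new Promise((resolve) =>
    setTimeout(() => {
      log.push(`end ${params.searchText}`);
      resolve(`${params.searchText} @ ${params.locationText}`);
    }, delayMs)
  );
}

// concatMap-like: the next scrape starts only after the previous one ends.
async function processSequentially(list: SearchParams[]): Promise<string[]> {
  const results: string[] = [];
  for (const params of list) {
    results.push(await fakeScrape(params, 10));
  }
  return results;
}

// mergeMap-like: every scrape starts immediately and they run concurrently.
// Difference from mergeMap: Promise.all returns results in input order,
// whereas mergeMap emits values in completion order.
function processConcurrently(list: SearchParams[]): Promise<string[]> {
  return Promise.all(list.map((params) => fakeScrape(params, 10)));
}
```

Swapping one helper for the other is the only change a caller needs, which is the same property the paragraph attributes to `concatMap` and `mergeMap`. Against a rate-limited site, the sequential version is the safer default.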
blog/ca/3_web-scraping-de-feines-a-linkedIn-utilitzant-puppeteer-i-rxjs.mdx
### RxJS

[RxJS](https://rxjs.dev/) és una biblioteca per a la programació reactiva en JavaScript. Proporciona un conjunt d'eines i abstraccions per treballar amb fluxos de dades asincrònics. Utilitzarem RxJS en aquest exemple perquè ofereix els avantatges següents:
"asincrònics" or "asíncrons"?
😆 ChatGPT 4 up to its usual tricks... "asíncron" does sound much better, yes. "Asincrònic" also seems to be valid, but I had never heard it before.
El codi proporcionat extreu efectivament tota la informació de treball disponible de la pàgina. Encara que el codi és molt estètic, aconsegueix la feina, que és típic per a codi de "scraping" web.
"Encara que el codi és molt estètic", what do you mean by that?
It should be "Encara que el codi NO és molt estètic" (the "NO" is missing), although it would be good to rephrase the sentence a bit.
0ceba97 to fd914bb
fd914bb to cca6b0a
/assets/blog/web-scraping-de-feines-a-linkedin-utilitzant-puppeteer-i-rxjs/cover-image.png
---

El web scraping pot semblar una tasca senzilla, però hi ha molts reptes a superar. En aquest blog, ens endinsarem en com fer "scraping" a LinkedIn per extreure ofertes de feina. Per fer això, utilitzarem [Puppeteer](https://pptr.dev/) i [RxJS](https://rxjs.dev/). L'objectiu és assolir web scraping d'una manera declarativa, modular i escalable.
Rather than "endinsarem", I would say "aprendrem".
agreed
> Alguns dels fragments en aquest blog poden ometre parts per claredat. Podeu trobar el codi complet en aquest [repositori](https://github.com/llorenspujol/linkedin-jobs-scraper).

This translation maintains the meaning of the original English text in Catalan.
This should be removed, right?
yes
## Anar a la llista de llocs de treball de LinkedIn i extreure les dades

Aquesta és la part central d'aquest bloc, on ens submergim en el procés d'accés a les ofertes de feina de LinkedIn, analitzant el contingut HTML i recuperant les dades d'ofertes de feina en format JSON.
"bloc" or "blog"?
5348dee to 91afc88
Draft ahead of adding the blog post on how to scrape LinkedIn. The post still needs some improvements, and I also hope to find a better cover image.