Skip to content
This repository has been archived by the owner on Apr 17, 2024. It is now read-only.

Getting 999 when hitting public profile pages in linkedIn. #105

Open
ladlakaran2607 opened this issue May 7, 2018 · 13 comments
Open

Getting 999 when hitting public profile pages in linkedIn. #105

ladlakaran2607 opened this issue May 7, 2018 · 13 comments

Comments

@ladlakaran2607
Copy link

When I hit public profiles I get 999. I guess after 4 to 5 continous hits, linkedIn blocks the ip address. So changed the ip, but still 999 exists. Is it possible that they are able to get local machine ip address?
How to solve this error without login?

I'm stuck with this. Please help.

@adolfopeccin
Copy link

Same here

@rochenka
Copy link

@Karan-Daiict @adolfopeccin same for me until I found an API that avoids the 999, i make GET requests to that API which calls Linkedin and returns the data, the API is from proxycrawl.

@ladlakaran2607
Copy link
Author

ladlakaran2607 commented Oct 10, 2018 via email

@rochenka
Copy link

@Karan-Daiict I do not use Linkedin API, i use ProxyCrawl API which calls linkedin internally. https://proxycrawl.com/scraping-api-avoid-captchas-blocks

@ladlakaran2607
Copy link
Author

ladlakaran2607 commented Oct 10, 2018 via email

@rochenka
Copy link

@Karan-Daiict yes I do not want to manage headless browser infrastructure or whatever tricks comes along the way from Linkedin, especially that i am building a project for recruiting so I need data constantly, that is why I pay for it, the other option is to find proxies that work for Linkedin which is another difficult task to do and also I have to pay for it, so I went for an easy good solution. Do you use proxies or how do you get the data?

@ladlakaran2607
Copy link
Author

ladlakaran2607 commented Oct 10, 2018 via email

@rochenka
Copy link

I do not really know what they do internally, i just receive the html data that is public, i do not think they support private data crawling from linkedin though.

@ladlakaran2607
Copy link
Author

ladlakaran2607 commented Oct 10, 2018 via email

@adolfopeccin
Copy link

@rochenka how did you implemented the scraping for linkedin data using proxycrawl?

@rochenka
Copy link

rochenka commented Nov 30, 2018

@adolfopeccin Its basically simple, proxycrawl does the crawling with their API, you need to have a token. I think you can get one token for 1000 free requests. So, proxycrawl gives me the raw html data response and I then use linkedin-scraper to scrape the html content.

@abhishmitra
Copy link

@rochenka

Hi,
I spoke to proxycrawl but they have certain limits and stuff. Did they impose those recently? How are they working for you?

@Bilal815
Copy link

@rochenka

Hi, I spoke to proxycrawl but they have certain limits and stuff. Did they impose those recently? How are they working for you?

I recently tried Crawlbase which was ProxyCrawl previously. They say that with JS-enabled for dynamic content which is a need for LinkedIn, you need to pay them a little. I tried my luck and it got interesting here https://crawlbase.com/docs/crawling-api/scrapers/#linkedin.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants