Welcome, Guest: Register On Nairaland / LOGIN! / Trending / Recent / New
Stats: 3,158,188 members, 7,835,946 topics. Date: Tuesday, 21 May 2024 at 05:55 PM

How I Crawled Leads From The Most Difficult Nigerian Site - Programming - Nairaland

Nairaland Forum / Science/Technology / Programming / How I Crawled Leads From The Most Difficult Nigerian Site (176 Views)

How I Crawled Leads From Jiji / I Wrote My Most Difficult C++ Programme Yet,barely 2 weeks since I started(pics) / Learn How To Save Img In Mysql Database. Many Web Developers Find It Difficult.. (2) (3) (4)

(1) (Reply)

How I Crawled Leads From The Most Difficult Nigerian Site by DataMina: 6:58pm On Oct 24, 2023
Hello Nairalanders,

I want to share my experience in crawling leads from Jiji. As a web scraping enthusiast, I've tackled various sites, but this one proved to be difficult. This is because the phone numbers are in the product detail page and you have to be logged in to get it. Another issue I faced was that the website is loaded with java script and uses infinite scrolling.

I was able to circumvent these road blocks by using selenium to login to the site with my personal details after which I saved the cookies as a json file. As the cite uses infinite scrolling I scraped all the sellers url links to a csv file. I then read the url links so as to crawl the site using the links.

Remember my login details have been saved as cookies in a json file, In the process of crawling the site again using the URLs, I applied the cookies from the json file. The crawler visited each seller's detail page using the saved URL links and clicked the "Show Contact" button to extract the contact information.

Retailers specializing in second-hand items like laptops can leverage this approach coupled with applied analytics on the scraped the data to gain swift access to mouth-watering deals, faster than a regular visitor.

You can check out the code for executing the project in my github repo:
https://github.com/StephDAnalyst/JijiLeadScraping

1 Like

(1) (Reply)

Successful Deployment Of Machine Learning Models In C# For Tennis Analytics / Giveaway Price!!! / Looking To Buy Google Adsense... Hit Me Up If You Want To Sell

(Go Up)

Sections: politics (1) business autos (1) jobs (1) career education (1) romance computers phones travel sports fashion health
religion celebs tv-movies music-radio literature webmasters programming techmarket

Links: (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)

Nairaland - Copyright © 2005 - 2024 Oluwaseun Osewa. All rights reserved. See How To Advertise. 8
Disclaimer: Every Nairaland member is solely responsible for anything that he/she posts or uploads on Nairaland.