I need code that scrapes [url removed, login to view] and stores various fields in MongoDB. Specifically, it must be able to scrape the RSS feed linked below:
[url removed, login to view]
The RSS feed has numerous categories, but I am only interested in scraping the two below:
1. “All News Releases from PR Newswire”
2. “News for Investors”
The code must be able to scrape either of the two categories by changing a single variable, which is resolved through a look-up table. For example, if a user wants to scrape “News for Investors”, the look-up table must map the user-entered value to the proper RSS feed (in this example, [url removed, login to view]) so that the code scrapes that category. Include a section in the readme file that states where this variable resides.
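The look-up requirement above could be sketched roughly as follows. This is only an illustration: the feed URLs are placeholders (the real addresses are not visible in this posting), and the names `FEED_LOOKUP`, `CATEGORY` and `resolve_feed_url` are hypothetical.

```python
# Look-up table mapping a category name to its RSS feed URL.
# ASSUMPTION: both URLs are placeholders; the real ones are elided in the posting.
FEED_LOOKUP = {
    "all news releases from pr newswire": "https://example.com/rss/all-news.rss",
    "news for investors": "https://example.com/rss/news-for-investors.rss",
}

# The single variable a user edits to pick a category (document this in the readme).
CATEGORY = "News for Investors"


def resolve_feed_url(category: str) -> str:
    """Return the feed URL for a user-entered category name.

    Matching ignores case and surrounding whitespace, so "news for investors "
    and "News for Investors" resolve to the same feed.
    """
    key = category.strip().lower()
    try:
        return FEED_LOOKUP[key]
    except KeyError:
        known = ", ".join(sorted(FEED_LOOKUP))
        raise ValueError(f"Unknown category {category!r}; expected one of: {known}") from None


if __name__ == "__main__":
    print(resolve_feed_url(CATEGORY))
```

Normalising the key before the look-up keeps small typing differences from breaking the match, while an unknown category fails loudly instead of silently scraping the wrong feed.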
Only scrape articles that are in English, i.e. those marked en-us.
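One possible way to apply the English-only rule is sketched below. It assumes the feed is standard RSS 2.0 and declares its language either in the channel-level `<language>` element or in a per-item `dc:language` element; whether the actual feed does so is not confirmed here. The function name `english_items` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Dublin Core namespace, commonly used for per-item language tags in RSS.
DC_NS = "{http://purl.org/dc/elements/1.1/}"
WANTED_LANGUAGE = "en-us"


def english_items(rss_xml: str) -> list:
    """Return title/link dicts for items whose language is en-us.

    ASSUMPTION: an item's own dc:language wins; otherwise the channel-level
    <language> applies. Items with no language information are skipped.
    """
    channel = ET.fromstring(rss_xml).find("channel")
    if channel is None:
        return []
    channel_lang = channel.findtext("language")
    kept = []
    for item in channel.findall("item"):
        lang = item.findtext(DC_NS + "language") or channel_lang
        if lang and lang.strip().lower() == WANTED_LANGUAGE:
            kept.append({"title": item.findtext("title"), "link": item.findtext("link")})
    return kept
```

Skipping items that carry no language information is the conservative choice; if the real feed omits the tag entirely, this rule would need to be revisited.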
For both the link found directly in the main RSS feed and the link leading to the FULL article, the fields listed in the attached PDF MUST be stored in MongoDB. Allow NO duplicates: use an upsert or another method keyed on some parameter that is unique to each article. Refer to the attached PDF for details.
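The no-duplicates requirement could be met with an upsert along the lines below. This is a sketch under assumptions: the article link is taken as the unique key (the attached PDF may specify a different one), `collection` is expected to be a pymongo `Collection`, and the names `build_upsert`, `store_article` and the `first_seen` field are hypothetical.

```python
from datetime import datetime, timezone


def build_upsert(article: dict) -> tuple:
    """Build the (filter, update) pair for an upsert keyed on the article link.

    ASSUMPTION: 'link' is unique per article; swap in another key if the
    attached specification names one.
    """
    link = article.get("link")
    if not link:
        raise ValueError("article needs a 'link' to serve as the unique key")
    fields = {k: v for k, v in article.items() if k != "link"}
    # $setOnInsert only applies when the document is first created.
    update = {"$setOnInsert": {"first_seen": datetime.now(timezone.utc)}}
    if fields:
        # MongoDB rejects an empty $set, so add it only when there are fields.
        update["$set"] = fields
    return {"link": link}, update


def store_article(collection, article: dict) -> bool:
    """Upsert one article; return True if a new document was inserted."""
    flt, update = build_upsert(article)
    result = collection.update_one(flt, update, upsert=True)
    return result.upserted_id is not None
```

With pymongo, `collection` would come from something like `MongoClient()["news"]["articles"]` (database and collection names are placeholders). Creating a unique index with `collection.create_index("link", unique=True)` adds a second guard against duplicates if two scrapers run at once.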
18 freelancers are bidding on average $188 for this job
Hi sir, I am a scraping expert and have done many similar projects; please check my feedback and you will see. Can you give me more details? Then I will provide demo data for you. Thanks, Kimi
Hi, I have scraped RSS feeds before using Python, and I have used MongoDB as well. I even have the "MongoDB for Developers" certificate from 10gen. I can write a script that does what you need.