I'am looking for someone able to perform the following tasks :
1. Scan a web directory in order to (a) provide me with its exhaustive sitemap (list of all existing URL, csv/txt format), (b) extract as much information as possible on each website included in the directory (name, website URL, author, social accounts, RSS)
2. Scan each external website included in the directory (est. 5,000 websites) in order to (a) also provide me with exhaustive sitemaps, (b) extract additional contact informations and merge them with data extracted during task 1b.
All of this should be performed using a very simple query. This job include a first analysis as soon as possible (imply 60% milestone payment) and two updates for the upcoming months (20% each).