Data Extraction
$100-400 USD
Paid on delivery
Extractor needed for Craigslist consisting of the following:
? 1)? ? ? ? ? ? ? ? ? ? ? ? ? ? Email extracted from each separate category: community, housing, personals, for sale, jobs, discussion forums, services, and gigs. ? Compiled in separate txt files with category names as listed above.
2)? ? ? ? ? ? ? ? ? ? ? ? ? ? Data extracted from all sub categories, will also be compiled in separate txt files for all listed sub-categories named.
3)? ? ? ? ? ? ? ? ? ? ? ? ? ? Extractor to work on all listed cities / and countries.
4)? ? ? ? ? ? ? ? ? ? ? ? ? ? Manual input of cities and categories is ok, provided the programmer gives the code necessary to input for each city - category. ?
5)? ? ? ? ? ? ? ? ? ? ? ? ? ? A zip file that has limited function can be supplied so you can see what I have now. The file is full of bad code and does not work as I have outlined above.
6)? ? ? ? ? ? ? ? ? ? ? ? ? ? Software must remove all duplicate addresses.
7)? ? ? ? ? ? ? ? ? ? ? ? ? ? Software must not go back to a page it has already crawled.
8)? ? ? ? ? ? ? ? ? ? ? ? ? ? Software must have multiple keyword capability and allow me to manually input multiple keywords to search for in the classified txt of the post.
9)? ? ? ? ? ? ? ? ? ? ? ? ? ? Software must remove all @[login to view URL] addresses
10)? ? ? ? ? ? ? ? ? ? Software must work on Windows XP.
11)? ? ? ? ? ? ? ? ? ? Desktop application, NOT web based.
12)? ? ? ? ? ? ? ? ? ? Prefer Perl 5+ coding.
13)? ? ? ? ? ? ? ? ? ? Fixed price of $400.00 for a working program as outlined above. Additional payments for work with updates and other projects.
## Deliverables
QUESTIONS FOR PROGRAMER:
What projects of this type have you done before?
What is the estimated time frame to completion?
Bids will be close on 10.28.2008 and programmer will be chosen.
Project ID: #3245870