I need a web scraper written for the following url:
[login to view URL]
The "search" box under the State Search options will need to be clicked.
All information needed is available on the main page. The number of rows will vary.
EXCLUDE data if the following names are in the "Contact" column:
The output should be a pipe (|) delimited file with the following column mappings:
origin_city --> data located in the "Ship City" column
origin_state --> data located in the "Ship State" column
ship_date --> data located in the "Ship Date" column changed to the YYYY-MM-DD format, the time is not needed
destination_city --> data located in the "Destination City" column
destination_state --> data located in the "Destination State" column
receive_date --> leave blank
trailer_type --> data located in the "Trailer Type" column
load_size --> if the text "LTL-Less than truckload" is located in the "Trailer Type" column, add text "Partial" as the load_size;
if not partial, add text "Full" to this column
weight --> leave blank
length --> leave blank
width --> leave blank
height --> leave blank
trip_miles --> data located in the "Miles" column
pay_rate --> data lcoated in the "Payment" column
contact_phone --> data located in the "Phone" column
contact_name --> data located in the "Contact" column
tarp_required --> leave blank
comment --> leave blank
load_number --> data located in the "Order" column
commodity --> leave blank
The first line of the output should contain all of the column headers.
Any field that contain no data should be left blank.
Please do not use words like "null" or "blank" in blank columns.
Below is a sample output of the first 5 columns using sample data:
The deliverable will be a Perl .pl file that must run on
Ubuntu Linux and must use Modern::Perl. The Perl .pl file
should be called '[login to view URL]' and the output file should be
called '[login to view URL]'
It will be scheduled in cron to run unattended every 15 minutes.
Please specify what language/OS/modules you plan to use.
Also, please include the word "raccoon" in your bid so I know that
you read this description.