I need a web scraper written for the .xlsx file in the following directory:
[login to view URL]
The latest .xlsx file within that directory will need to be downloaded. The name of the file is subject to change daily and will need to be identified by the latest .xlsx extension.
All information needed is available on the main page. The number of rows will vary. If there is a row without an origin city, skip that row. Data will be listed in blocks with
different contact information for each block, contact information will be located above the block of data.
The output should be a pipe (|) delimited file with the following column mappings:
origin_city --> data located in the "Load Origin" column before the ,
origin_state --> data located in the "Load Origin" column after the ,
ship_date --> data located in the "Date" column, change to the YYYY-MM-DD format,
if the date column is blank or has multiple date use the current days date, also in the YYYY-MM-DD format
destination_city --> data located in the "Destination" column before the ,
destination_state --> data located in the "Destination" column after the ,
receive_date --> leave blank
trailer_type --> data is the abbreviation located in the "Type" column
load_size --> add the text "Full"
weight --> leave blank
length --> leave blank
width --> leave blank
height --> leave blank
trip_miles --> data located in the "Miles" column
pay_rate --> data located in the "Rate" column
contact_phone --> data located in the contact cell above each block of loads (ie: PH (812-823-4212)
contact_name --> data located in the contact cell above each block of loads, the contact name will be listed after the word contact
tarp_required --> leave blank
comment --> data located in the "Quantity/Notes" column
load_number --> leave blank
commodity --> leave blank
The first line of the output should contain all of the column headers.
Any field that contains no data should be left blank.
Please do not use words like "null" or "blank" in blank columns.
Below is a sample output of the first 5 columns using sample data:
The deliverable will be a Perl .pl file that must run on
Ubuntu Linux and must use Modern::Perl. The Perl .pl file
should be called '[login to view URL]' and the output file should be
called '[login to view URL]'
It will be scheduled in cron to run unattended every 15 minutes.
Please specific what language/OS/modules you plan to use.
Also, please include the word "raccoon" in your bid so I know that
you read this description.
I can provide you a Perl scraper that will use WWW::Mechanize and other modules. As a result of my work you'll receive same Perl scraping code as before.
13 фрилансеров(-а) в среднем готовы выполнить эту работу за $125
Hello How are you My name is Xu i have full time and I can start to work immediately Please contact me and do let us discuss about your project Thanks for your posting
CONTENT=HELLO I CAN START RIGHT NOW - I AM EXPERT IN and I BET YOU CANNOT FIND BETTER FREELANCER THAN ME ... pLEASE MESSEGE ME AND LETS DISCUSS THE THINGS THANKSPlease Reply
raccoon. Hi. Great app writer for your projects. I have writen scraping app for many years. I am ready to write your project. Thank you for visiting my profile