Find Jobs
Hire Freelancers

Webcrawler for PDFs - repost 2

$30-250 USD

Closed
Posted about 10 years ago

$30-250 USD

Paid on delivery
Need to create a web crawler that will crawl through .edu pages to find PDF files. These PDF files will contain textbook-class information lists. Looking to crawl for only one file. So the crawler should be able to find specifically the textbook list not ALL pdfs as most colleges have 1000's of PDF files on their sites. *Note: Untied States law requires universities to have pdf files with textbook, class, and courses offered data on their sites. Since this information is valuable and profitable universities tend to hide it (while keeping it "available to the public" ) on their websites. This crawler will crawl through a .edu doman and subsequent sub-domains to find PDF book-lists, course- lists or class-lists.
Project ID: 5499185

About the project

5 proposals
Remote project
Active 10 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
5 freelancers are bidding on average $200 USD for this job
User Avatar
Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi
$250 USD in 5 days
4.9 (92 reviews)
6.5
6.5
User Avatar
I am expert crawler maker. Does those books have isbn so that I can recognize that that is a text book or not ?
$188 USD in 5 days
5.0 (35 reviews)
5.6
5.6
User Avatar
Hello, I am experienced PHP programmer with over 3 years experience and I have experience making web crawlers, emails extractors, specific data extractors which is very close to your requirements in this project. Please provide me with an example university website and an example pdf so I can get more clear idea about the requirements and the complexity of the project. Thanks, John
$200 USD in 5 days
5.0 (44 reviews)
4.9
4.9
User Avatar
it is an easy job consider it done once you award the project to me as the project is already done just some modifications for your requirements as I'm an expert in webscrapping and java developer for 5 years.
$111 USD in 3 days
5.0 (4 reviews)
4.8
4.8
User Avatar
Hi, i wrote a couple of crawlers like this in the past. It will consist of a java program which follows all the links on the page and saves the data from it. If you want i can use a small database to save the page informations which would allow it to re-use the data for further searches. Best regards Sebastian
$250 USD in 10 days
5.0 (1 review)
3.9
3.9

About the client

Flag of UNITED STATES
Santa Clara, United States
5.0
2
Payment method verified
Member since Dec 13, 2013

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.