Find Jobs
Hire Freelancers

PHP Spider

$30-100 USD

Closed
Posted about 21 years ago

$30-100 USD

Paid on delivery
Hello, I need a Web Spider written in PHP 4.0. The Spider must read from a MySQL DB a list of Pending Sites to be Spidered. The spider must be able to access HTML pages (htm and html extensions), CGI, Perl, PHP, Cold Fusion, ASP, and each frame in Framesets. If a webpage uses a drop down list for links (common JavaScript feature), the Spider must be able to grab the links. Spider must recognize and ignore the following extensions MP3, GIF, PNG, JPG, SWF, MPG, AVI, WAV, and any other binary or non-text files. Spider must also be able to pull information and links out of tables. All links that the spider gets must be made into complete URLs, not relative links, and must include any querystring information. For each of the Pending URLS, the spider must 1. Get the title, Baseref, all meta tags, all links with their text (what the visitor sees as the link on the screen), all email addresses with their text(what the visitor sees as the link on the screen), and the text of the page, stripped of all. 2. This information must be put into the MySQL databasse in 4 tables. All page information, except links and email, will go into "SpideredSites" table. All links will go into "Pending URLs". All eamils will go into "SpideredEmails". And, all links will be added to "ReferralLinks". This last table will also contain the unique ID from "SpideredSites" for the site that was spidered to get that link. 3. Spider must update "Pending URLs" to indicate that the URL was spidered (this is a Yes/No column that will be set to Yes). 4. Spider must output to browser the ID from Pending URLs, the ID from SpideredSites, and the URL as a link, and on the following line the date and time. This is followed by two Carriage Return Line Feeds. 5. The spider should repeat steps 1-4 until all Pending URLs are spidered, or until a specific number of files have been spidered (a configuration file should be made to allow me to set the number of Pending URLs to do at one time). ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request. 3) Complete ownership and distribution copyrights to all work purchased. ## Platform PHP 4.0
Project ID: 2923261

About the project

4 proposals
Remote project
Active 21 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
4 freelancers are bidding on average $131 USD for this job
User Avatar
See private message.
$76.50 USD in 14 days
5.0 (99 reviews)
7.2
7.2
User Avatar
See private message.
$106.25 USD in 14 days
5.0 (1 review)
2.0
2.0
User Avatar
See private message.
$42.50 USD in 14 days
0.0 (1 review)
0.0
0.0
User Avatar
See private message.
$297.50 USD in 14 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
United States
4.5
4
Member since Jan 2, 2003

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.