Crawler spider
An application that captures data from the websites of publishers and bookshops.
The application must continuously crawl the publishers' websites, downloading data for new editions and updating existing data.
For instance, when reading data from the following website:
[login to view URL]
the application must extract the book code, title, author, publisher and price, and check whether these data already exist in our database.
The application will add or update the data in our database.
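
The add-or-update step could look roughly like the sketch below. It assumes a SQLite database and a `books` table keyed by book code; the table, its column names and the `init_db` / `upsert_book` helpers are all hypothetical, chosen only to illustrate the idea, and the real database may differ.

```python
import sqlite3


def init_db(conn):
    # Hypothetical schema: one row per book, identified by its book code.
    conn.execute(
        """CREATE TABLE IF NOT EXISTS books (
               code TEXT PRIMARY KEY,
               title TEXT,
               author TEXT,
               publisher TEXT,
               price REAL
           )"""
    )


def upsert_book(conn, book):
    """Add a new book or update an existing one.

    Returns 'added', 'updated' or 'unchanged' so the caller can report it.
    """
    new = (book["title"], book["author"], book["publisher"], book["price"])
    row = conn.execute(
        "SELECT title, author, publisher, price FROM books WHERE code = ?",
        (book["code"],),
    ).fetchone()
    if row is None:
        conn.execute(
            "INSERT INTO books (code, title, author, publisher, price) "
            "VALUES (?, ?, ?, ?, ?)",
            (book["code"], *new),
        )
        status = "added"
    elif tuple(row) != new:
        conn.execute(
            "UPDATE books SET title = ?, author = ?, publisher = ?, price = ? "
            "WHERE code = ?",
            (*new, book["code"]),
        )
        status = "updated"
    else:
        status = "unchanged"
    conn.commit()
    return status
```

Returning a status rather than nothing lets the web interface show how many records each run added or changed.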
The spider will then continue to search automatically for data on other valid pages:
[login to view URL]
[login to view URL]
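
One possible way to keep the spider moving through other valid pages is a breadth-first crawl that only follows links starting with a configured URL prefix. The sketch below assumes that "valid" means "shares that prefix", which the brief does not state explicitly. The `crawl` function, the `LinkCollector` class and the `fetch` callable (any function that returns the HTML of a URL) are hypothetical names.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, url_prefix, fetch, max_pages=100):
    """Visit pages breadth-first, following only links under url_prefix.

    Returns the list of visited URLs in visiting order.
    """
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        visited.append(url)
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.links:
            # Resolve relative links and drop "#fragment" parts.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute.startswith(url_prefix) and absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return visited
```

Passing `fetch` in as a parameter keeps the crawl logic independent of the HTTP library, so it can be tried against canned pages before it touches a real site. A continuously running spider would call `crawl` again on a schedule.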
The data to extract and add are: book code, title, author, publishing house and price.
The application must have a web interface to start and manage the search:
First part of the URL, from [login to view URL] to [login to view URL]
Text to search for, for example: class='title_page', class='publisher house', code and so on.
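
As a rough illustration of searching a page by class attribute, the sketch below maps configured class names to book fields and collects the text of the first element carrying each class. Only `title_page` comes from the example above. The other class names, the `ClassTextExtractor` class and the `extract_fields` helper are hypothetical, and real publisher pages may need site-specific rules.

```python
from html.parser import HTMLParser


class ClassTextExtractor(HTMLParser):
    """Collect the text of the first element carrying each configured class."""

    def __init__(self, class_to_field):
        super().__init__()
        self.class_to_field = class_to_field  # e.g. {"title_page": "title"}
        self.fields = {}
        self._field = None   # field currently being captured, if any
        self._tag = None     # tag name of the element being captured
        self._depth = 0      # nesting depth of same-named tags
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if self._field is not None:
            # Already inside a captured element: only track same-named nesting.
            if tag == self._tag:
                self._depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        for cls in classes:
            field = self.class_to_field.get(cls)
            if field and field not in self.fields:
                self._field, self._tag, self._depth = field, tag, 1
                self._chunks = []
                break

    def handle_endtag(self, tag):
        if self._field is not None and tag == self._tag:
            self._depth -= 1
            if self._depth == 0:
                # Join the collected text and normalise whitespace.
                text = " ".join("".join(self._chunks).split())
                self.fields[self._field] = text
                self._field = None

    def handle_data(self, data):
        if self._field is not None:
            self._chunks.append(data)


def extract_fields(html, class_to_field):
    parser = ClassTextExtractor(class_to_field)
    parser.feed(html)
    parser.close()
    return parser.fields
```

The class-to-field mapping is the part the web interface would let the user edit for each site, so the extraction code itself stays unchanged from one publisher to the next.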