Java web crawler and text extraction modules

Closed Posted Jun 12, 2013 Paid on delivery
Closed Paid on delivery

Part A ) Extract information from a given set of url's (BID URLs) which contain many PDF in Spanish and extract from the PDFs text using regular expressions.

Example:

The URL [login to view URL] should produce the following : Gerente de proyecto, Desarollador Java, Desarrollador PHP, Desarrollador Forms, Desarrollador .NET , Arquitecto de Software. This text is in page 47 of one of the files listed in the url. Keep in mind you have to parse all the docs in the URL.

Part B) After extracting the text the idea is to Store some of the text that matches certain criteria into a relational database (Mysql). With the above example the idea would be to store in a table with three fields:

| URL

| [login to view URL] | Gerente de Proyecto | Ingeniero de Sistemas

Un (1) año en Gerencia de proyectos informáticos | 1

Conditions:

1. Automatic replies that do not ask for especific information will be automatically discarded.

2. Deliverable MUST be configured as a working java maven project and does NOT have to be web.

3. Only one payment will be made when deliverables work and fully tested.

4. Project will be awarded to the first programmer to submit a working prototype of part A.

Java MySQL

Project ID: #4618081

About the project

15 proposals Remote project Active Jul 19, 2013

15 freelancers are bidding on average $655 for this job

fattahaabdul

Let an expert do it.

$488 USD in 10 days
(95 Reviews)
8.3
IMSeriousBidder

Hello Sir, I can do this project for you, and Part A ready please check your PM for more details Thanks Bing

$742 USD in 10 days
(109 Reviews)
7.5
dobreiiita

Hello, I can help you with this project, Thanks

$684 USD in 12 days
(426 Reviews)
7.5
shenchilang

Experienced java programmer.

$790 USD in 5 days
(82 Reviews)
6.5
rhkchathuranga

I have good experience in Java Web Scraping applications. Please check your P.M.B. sir.....

$555 USD in 4 days
(57 Reviews)
6.4
barundebnath

Hi, I am an expert web-scrapping application maker and also very comfortable with extracting text from pdf and regex. Please see private message for more details. Thanks

$833 USD in 10 days
(50 Reviews)
5.8
rajofficial2009

Hi, Please check your PM and hoping a early reply to discuss further.

$621 USD in 15 days
(8 Reviews)
5.2
thanhhungqb

Dear sir, I have experience about Java extracting. Please see pmb for more details. Thanks.

$600 USD in 10 days
(57 Reviews)
4.9
vickkey7

I am ready to start this project.....

$631 USD in 10 days
(11 Reviews)
4.3
jitendraparmar07

Expert scraper here. Please check your PMB.

$789 USD in 5 days
(11 Reviews)
4.3
arsingh1212

Hi Java experts here! Please see your PMB for details.

$631 USD in 21 days
(20 Reviews)
3.5
justforbusn

please check my private message!

$555 USD in 10 days
(5 Reviews)
3.7