Web crawling task B - only feasible with help from automated crawlingsuchen

Completed Posted Nov 17, 2015 Paid on delivery
Completed Paid on delivery

I have a list here with patent numbers in excel format.

Go to the website

[url removed, login to view]

paste the first patent number.

Then look at the resulting website.

Extract the text from the section

- "claims" (exists for all records)

- "Parent case text" (exists only for some records)

-"Current U.S. Class:"

-"Current CPC Class:"

-"Current International Class:"

and, for each record, paste the content for claims, parent case text, and the three classes into the excel cells following the patent number.

Note:

1) some of the claims may be very long, not fitting into a single cell. So you should have the possibility to split the content into two or three cells, if necessary. As there may be only very few cases, it would probably be easier to manually check for this by counting the number of words per cell and do the few cases, where content doesn't fit into a cell, manually.

2) not all fields exist/contain data, especially related to the references cited section

3) only automated crawling is feasible, no manual work extracting the whole data!

4) I don't work with milestone payments, I work with step by step invoices

5) The project budget is USD 20

Web Scraping

Project ID: #8919003

About the project

4 proposals Remote project Active Nov 17, 2015

Awarded to:

titukhan13

thanks for invite.i am ready for your work please award me .i will try to complete asap without any [login to view URL] for your [login to view URL]

$20 USD in 1 day
(30 Reviews)
4.3

4 freelancers are bidding on average $23 for this job

kushagrachadha

A proposal has not yet been provided

$25 USD in 1 day
(0 Reviews)
0.0
PepperClove

Greetings! Please open the communication in PMB so that we can look into the project in detail and furbish you an accurate/ precise and lowest possible bid, thanks ! Pepper Clove Software LLP is a fast growing off More

$25 USD in 1 day
(0 Reviews)
0.0