[login to view URL] url extractor

Completed Posted Jan 18, 2010 Paid on delivery
Completed Paid on delivery

We can download DMOZ database at http://rdf.dmoz.org. The database is in RDF/XML data which is very large, currently over 1,8 GB in size (260MB in a zipped file distributed by [url removed, login to view]). This file contains over 590,000 categories and 4,530,823 web links.

I want someone to extract the entire urls category wise in text files. After extracting the urls output should be in [url removed, login to view] but not the inner pages links like [url removed, login to view] .

PHP

Project ID: #592238

About the project

11 proposals Remote project Active Jan 18, 2010

Awarded to:

kcmakwana

Sir, I have gone through the site , huge amount of open data is available in compress format. I am ready to extract urls from this and make available in text format. Also ready to incorporate to your site in the way yo More

$150 USD in 3 days
(7 Reviews)
3.3

11 freelancers are bidding on average $171 for this job

soner

I can do that extration easyly. Please contact.

$240 USD in 0 days
(120 Reviews)
7.9
SigmaVisual

We can help in your project, please check PMB to see our related experience.

$225 USD in 4 days
(229 Reviews)
7.8
srinichal

I can deliver the script to you to automate the downlaod

$160 USD in 2 days
(81 Reviews)
6.9
lastguru

Greetings.. we have proven again and again our ability to develop quality products on-time and on-budget without sacrificing quality. there will not be any communication gap. Waiting excited to hear and interact wi More

$150 USD in 5 days
(12 Reviews)
5.5
zhukaster

I can do this extractor, cause have required experience. Feel free to contact me. Thank you.

$60 USD in 2 days
(22 Reviews)
4.6
rados

Interested!

$200 USD in 2 days
(2 Reviews)
2.2
wisit4template

i will finish you work in few time and in low bid. if you attended pls give me more detail. i can start now!!.

$200 USD in 1 day
(0 Reviews)
0.0
djfd

Hi! Ready to help with your xml extraction. Further detait are in pmb. Regrads

$200 USD in 3 days
(0 Reviews)
0.0
jingxianmoyang

Hello sir, I am familar with sort data by regex. Regards, Aking

$120 USD in 3 days
(0 Reviews)
0.0