
Hadoop project

$10-30 USD

Closed
Posted about 3 years ago

Paid on delivery
Phase 2: Implement MapReduce (MR) programs to solve unstructured data problems on the HDFS setup. (25 points) Due date: 4/17/2021.

In this phase you will implement the word co-occurrence MR algorithm discussed in Lin and Dyer's book. Select a data set of publications in any subject area you are familiar with, and prepare co-occurrence (co-author) information from those publications. The stripes method for co-occurrence may be better suited to this application.

The map step has to parse each publication and drop the extra text. Only the first author is needed as the key; the value is the rest of the authors, together with the number of times each one occurs in the given corpus.

Input: Many publications from an author.
Output: The author as the key; the value is an associative array whose entries are the co-authors along with their number of occurrences.

Mandatory requirement: Every team must have its own data set; teams cannot copy each other's.
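The stripes approach described above can be illustrated with a small in-memory sketch. This is not Hadoop code: it only imitates the map, shuffle, and reduce steps in plain Python so the shape of the keys and values is visible. The input format (one publication per line, authors separated by semicolons, extra text already dropped) and the names map_publication, reduce_stripes, and run_job are assumptions made for illustration, not part of the assignment.

```python
from collections import Counter, defaultdict


def map_publication(line):
    """Map step: emit (first_author, stripe) for one publication.

    Assumes `line` holds only the author list, separated by ';'.
    The stripe is a Counter of co-authors seen in this publication.
    """
    authors = [a.strip() for a in line.split(";") if a.strip()]
    if len(authors) < 2:
        return  # no co-authors, nothing to emit (an assumption)
    first, rest = authors[0], authors[1:]
    yield first, Counter(rest)


def reduce_stripes(first_author, stripes):
    """Reduce step: element-wise sum of all stripes for one key."""
    total = Counter()
    for stripe in stripes:
        total.update(stripe)
    return first_author, dict(total)


def run_job(lines):
    """Imitate the shuffle by grouping stripes by key, then reduce."""
    grouped = defaultdict(list)
    for line in lines:
        for key, stripe in map_publication(line):
            grouped[key].append(stripe)
    return dict(reduce_stripes(k, v) for k, v in sorted(grouped.items()))


if __name__ == "__main__":
    # Hypothetical author names, for illustration only.
    sample = [
        "Author A; Author B",
        "Author A; Author B; Author C",
        "Author B; Author A",
    ]
    for author, coauthors in run_job(sample).items():
        print(author, coauthors)
```

In a real Hadoop job the grouping done by run_job is performed by the framework's shuffle, and the same element-wise sum could plausibly also serve as a combiner, since adding stripes is associative and commutative.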
Project ID: 29921525

About the project

Remote project
Active 3 yrs ago

About the client

Buffalo, United States
0.0
0
Member since Apr 18, 2021

Client Verification

Other jobs from this client

hadoop map reduce project
$10-30 USD
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)