The program is small, it is a data preprocessing step, to be built using Spark (GraphX if it is optimal for the problem since the data is a graph) with Scala language and hadoop DFS.
Detailed description about the program processing/input/output attached in "description" file.
The project is completed by running the program in my computer.
6 freelancers are bidding on average $33 for this job
This project required knowledge in Scala and Hadoop, with is not easy skills. Those bid with $20 or $30 are not serious. I'm good at Hadoop, good enough to handle this project. Please award this to me.
I'm expert on spark and cost of this project is more than 30 USD. I do for two reason : 1- I'm new here and nobody give me project 2- you can reviews my profile after finish project