Find Jobs
Hire Freelancers

Analyze some Data using Spark

$30-250 USD

Completed
Posted about 7 years ago

$30-250 USD

Paid on delivery
I am looking for a freelancer to help me with my project. The skill required is Spark (PySpark, Spark SQL). Using Spark, we need to analyze two datasets(will be provided), compare the two and generate graphs and word clouds.
Project ID: 13406203

About the project

19 proposals
Remote project
Active 7 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
Awarded to:
User Avatar
Updated proposal: Script to process the datasets with PySpark. 2 types of the output should be implemented: 1) CSV file per each dashboard (for each type of analysis) 2) Graphical python output (with ggplot or similar lib) List of analytic queries:  Top N Tags based on item-tag/tags files (citeulike-a)  Camparision between Top N words used in both the data sets (citeulike-a and citeulike-t) (comparison word cloud) (using [login to view URL] and [login to view URL])  Top N words used in dataset citeulike- a and citeulike-t (individually) (commonality cloud)  Top N papers which have highest no. of citations (citeulike-a )  Top N articles which used highest number of words (analysis on [login to view URL]) (citeulike-a)  Top N users, whose items has maximum tags (citeulike-a)  Top N users, who have maximum items (citeulike-a)  Top N highly used items (citeulike-a)  Top N users whose items are most cited (citeulike-a)  Top N citations used (citeulike-a)  Top N articles based on tag-items (citeulike-t)  Comparison between Top N users, whose items has maximum tags in citeulike-a and citeulike-t  Comparison between Top N users, who have maximum items in citeulike-a and citeulike-t  Comparison between Top N highly used items in citeulike-a and citeulike-t  Comparison between Top N articles which used highest number of words in citeulike-a and citeulike-t ============================================================================= Hi, I have an experience with etl developmen
$66 USD in 8 days
5.0 (5 reviews)
4.7
4.7
19 freelancers are bidding on average $173 USD for this job
User Avatar
Greetings sir, i am an expert freelancer for this job and your 100% satisfaction is assured if you allow me to serve. Here is the reason. Why you should pick me? a) I am a very expert and have the same kind of experience of 5 years. b) I work very hard (16+ hours a day and 7 days a week) and also very fast so... it will be done very soon than most of the other providers c) And most important part is my policy: "I will give you (to my client) life time support (as long as you keep relation with me). And fix any bugs/problem without any cost. So, don't ever worry about me” Please sir, leave a reply ASAP, as I am waiting for your kind reply
$250 USD in 3 days
5.0 (278 reviews)
8.0
8.0
User Avatar
20+ years industry work experience in the area of IT, Finance & Banking. Also, I have 4+ years of experience in Economics, Business Analytics and Advanced statistics projects using software such as Python, R, SPSS, Minitab, Matlab, Big Data Analytics, Hadoop(Hive, PIG, SPARK, SCALA, ADAM, SSH/OpenSSH, Ansible, etc), Excel Data Pack, AIIMS, Tableau, Dashboard, PhotoShop,Illustrator, Sketch, Android, UI/UX, etc. and statistical tools and methods such as Multiple Regression Analysis, Correlation, Market Basket Analysis, Linear Programming, Monte Carlo Analysis, Principal Component Analysis, ANOVA/MANOVA/ANCOVA, Time Series, ARIMA, ARMA, ARCH, GARCH and financial analysis such as Financial Projections, Ratio analysis, Balance sheet and P&L Analysis, Cash Flow analysis, Fundamental analysis and Technical analysis. I also have good knowledge and experience of Project Management tools and body of knowledge such as scope and time management, budgeting, critical path, and network diagram, etc. I have performed economic analysis such as Price and Demand, Prices and Production, Demand and Supply, Free Trade, National Income, Balance of Payment, Exchange Rates and relative prices, Equilibrium in Forex Market, Money, Interest Rate and Exchange rate, Aggregate demand, etc I am B.S c(Electronics), PGHDSM (Systems Management, NIIT), GMTP(Financial Management, ICICI), PGDM (Marketing, AIMA), EPBABI(Business Analytics and Intelligence, IIM Ranchi). Digital Marketing Certification from NIESBUD
$250 USD in 4 days
4.9 (4 reviews)
4.0
4.0
User Avatar
Hello. How are you. I have read and understood the project. I have strong knowledge in Probability and Statistics. And I'm expert in R, Matlab, Python, Spark etc. I'm interested this project. So, firstly I want to discuss with you about this project. Then I'll be happy. I wait for your good reply. Thanks.
$200 USD in 5 days
5.0 (3 reviews)
3.6
3.6
User Avatar
Your project is a perfect macth for my skills. Looking forward to walk you through my portfolio during a short skype session. best regards, Joerg.
$199 USD in 7 days
5.0 (3 reviews)
3.2
3.2
User Avatar
Hi there, I'm a professional Big Data software developer and CS graduate. I'm having expertise in developing software using Hadoop, Spark, HBase, Scala, Cassandra etc. See some of the works in my portfolio. Ping me for discussion.
$250 USD in 3 days
4.9 (3 reviews)
3.2
3.2
User Avatar
Hey, I am data engineer working in an Online Advertisement firm. I have worked on many spark, pig projects I will be able to help you solve these problem. Please have a look at my profile and we can discuss more. Thanks
$133 USD in 2 days
4.9 (3 reviews)
3.2
3.2
User Avatar
Though I am new here but my team has 3 years of experience into Apache/Spark/Akka. Can very well execute this Project
$225 USD in 4 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I have strong 11+ years of experience in SQL and data analytics. I can be trusted to deliver a robust and sound solution for your requirement. Once I see your requirement I can provide you more details. Thanks Sovan
$222 USD in 4 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I have done couple of similar project using Apache Spark & Scala, implemented to import large transaction files and process in the spark and push the results to the Hive tables post processing for Visualization.
$222 USD in 7 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hello. I have prior experience with spark and PySpark. I have been working on analyzing massive datasets using Spark before. I also have experience with the requisite visualizations. If you can let me know the exact details of the project, I will describe the planned road-map. Feel free to contact me anytime you like. Thanks in advance.
$50 USD in 3 days
0.0 (0 reviews)
0.0
0.0
User Avatar
A proposal has not yet been provided
$155 USD in 5 days
0.0 (0 reviews)
0.0
0.0
User Avatar
currently I am working on same kind of requirement.
$55 USD in 2 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I am a spark except. can help you here.
$166 USD in 2 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
Manhattan, United States
5.0
1
Payment method verified
Member since Mar 15, 2017

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.