Twitter Web Scraping (need to search for embedded URLs in tweets for specific texts)
$30-250 USD
Completed
Posted over 9 years ago
$30-250 USD
Paid on delivery
You can see the attached WORD document:
Twitter Web Scraping Project
1. I will provide a file that contains the project number for 12,363 projects on [login to view URL]
2. You need to write a script that is able to extract the tweets posted on Twitter for each project. You will need to find all tweets that have a url link embedded in it with [login to view URL]<PROJECT NUMBER> as part of the URL link For example, for project number 45405, would must identify all tweets that have a URL link embedded in it with [login to view URL] as part of the URL link. These include the following tweets for example:
eurocentrique
Help make it happen for Briefcase Board Game on @indiegogo [login to view URL] -- Greek #Gaming #startup gets crowdfunded.
Reply Retweet Favorite Feb 28, 2012
benny275
I just funded Briefcase Board Game on @indiegogo. Fund it too! [login to view URL] …
Reply Retweet Favorite Feb 11, 2012
sternenfahrer
I just funded Briefcase Board Game on @indiegogo. Fund it too! [login to view URL] … Kickstarter für Europa,sehr schön ...
Reply Retweet Favorite Feb 15, 2012
kstylianopoulos
I just funded Briefcase Board Game on @indiegogo. Fund it too! [login to view URL] …
Reply Retweet Favorite Feb 12, 2012
However, simply searching for “[login to view URL]” in the twitter search webpage ([login to view URL]) will not show any results. I’m not sure how you should do this, but that is your task.
3. For each identified tweet, extract the tweet URL, tweet text, the timestamp of the tweet, the author twitter handle, the number of followers the tweet author, the number of retweets of the tweet, and the number of times it was made a favorite.
4. For tweets that have “?a=” following the [login to view URL]<PROJECT NUMBER> in the embedded link (see the examples above), extract the number following it. For example, for [login to view URL] extract 432950. This is the Indiegogo ID of the tweet author.
5. Please create an output file with the following columns for all tweets identified:
a. Indiegogo project number
b. Tweet URL
c. Indiegogo ID of the tweet author (if present)
d. Tweet text
e. Tweet timestamp
f. Author twitter handle
g. Number of followers the tweet author
h. The number of retweets of the tweet
i. The number of times it was made a favorite
Hi sir,
I am scraping expert, I have did too many similar projects, please check my feedback then you will know.
Can you tell me more details? then I will provide demo data for you.
Thanks,
Kimi
(Expert in Social Media development) Hello, I understand what exactly you need. Being a professional developer with more than 5 year of development experience I assure professional development with unlimited revision if any required. I’ve more than 5 year experience in desktop application (.Net, Java, C#, C++). Feel free to check my profile and rating. Looking forward to hear from you. Thanks & Regards