Rapidminer model - open to bidding

Cancelled Posted Jun 17, 2015 Paid on delivery
Cancelled Paid on delivery

Hi,

I have a dataset (included in the attachment of this message) containing 5 keywords (a-e) with the respective google positions (1-20) and 7 features that influence the google position:

name (data type)

--------------------

PR (0-10)

BM25 (%)

Number of links (int)

Percentage match (%)

Match domain (boolean)

IP (boolean)

Number of other (int)

Based on the 7 features the model should predict the google position on unseen data as precise as possible. I'm looking for some one that can build this in a Rapidminer model.

The model should do the following:

- import the data from CSV

- normalize the data (how to deal with the booleans?)

- split the data into a test, training and validation set.

- select which features to use for training using a "greedy approach" (first train the model on each individual feature, then start with the strongest feature and add one feature a time in order to see the best combination of features). Other suggestions to calculate the optimal selection of features (such as calculating the information gain) are welcome as well.

- train the model using 4 different techniques: SVM, Decision Trees, Logistic Regression and Lineair Regression

- analysis of which of the 4 techniques delivers the best performance.

Please include in your application a short indication of how you would deal with the booleans, what your approach for selecting the optimal combination of features is and your previous experiences with rapidminer.

Looking forward to your application!

Kind Regards,

Dirk

Data Mining Data Processing Java Research Statistics

Project ID: #7879086

About the project

5 proposals Remote project Active Jun 18, 2015

5 freelancers are bidding on average $743 for this job

DataHome

A proposal has not yet been provided

$526 USD in 10 days
(29 Reviews)
8.0
eperfections

I have 10+ years experience and more than 600 projects completed on this platform. I have used Rapid Miner and other tools like Weka. Please send me complete details. I am very interested to work on this project. Ready More

$998 USD in 10 days
(376 Reviews)
7.2
florinbacu

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- More

$750 USD in 10 days
(2 Reviews)
1.8
dhirajkhanna

Hello Dirk, do you necessarily need this in rapidminer? To be honest, I haven't worked on rapidminer, though I am downloading it as we speak. However, I am adept at using machine learning algorithms like random forest More

$777 USD in 4 days
(0 Reviews)
0.0
darkhurse

Dear client. I have read your description and was excited with feeling to satisfy you in this job. I have rich experience in developing Data mining, web scraping , Java, C#. Especially, i and my friend graduated uni More

$666 USD in 10 days
(0 Reviews)
0.0