Find Jobs
Hire Freelancers

487366 Data Mining

N/A

In Progress
Posted about 13 years ago

N/A

Paid on delivery
I need a program in JAVA that it will read from a directory the training set that it contains spam and legitimate [login to view URL] will read all the emails breaking them into words and put all the words in an one dimensional array with their frequency. then each email must have an array with the length of the previous array with the words that have the most frequency and each vector has to declare if the email is spam or not . The program will make use of a stop list that it will contains all the unnecessary words and symbols like and on (, . "") and it will remove them from the vectors. we need to decide which of the words we will keep. Output file format: The left column will have the names of the .txt testing files The right column will have the predictions ‘s' for spam ‘l' for legitimate The two column must be separated with a tab (\t) I will give you the training set
Project ID: 2233276

About the project

Remote project
Active 12 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

About the client

Flag of
5.0
4
Member since Mar 8, 2010

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.