I have an existing R script that's about 30 lines of code that I want rewritten into a simple Java app.
It reads an unknown csv file, detects the fields and outputs a text summary including: Common Value Counts (with Other and NA/Blank) and Min-Max Summary (uses string length for text fields). Summary data should be saved to a database.
My existing Java app schedules file downloads. I want to add a new tab to an existing Java app to track results in a grid. I also want a new checkbox so user can control whether to run the file summary or not.
Data Output Sample:
ObservationID,ColumnID,FieldName,Package,Measure,Value
1,21,Address,1,800 BOYLSTON ST,3257
2,21,Address,1,200 FANEUIL HALL MARKET PL,2584
3,21,Address,1,1 CITYWIDE ST,2080
4,21,Address,1,100-199 FANEUIL HALL MARKET PL,2030
5,21,Address,1,300 FANEUIL HALL MARKET PL,1850
6,21,Address,1,417 WASHINGTON ST,1828
7,21,Address,1,1 FANEUIL HALL MKT PL PL,1427
8,21,Address,1,2360 WASHINGTON ST,1348
9,21,Address,1,200 LOGAN AIRPORT TRMNL B,1321
10,21,Address,1,(OTHER),336111
11,21,Address,1,NA'S,125
12,21,Address,3,Min.,2.00
13,21,Address,3,1st Qu.,15.00
14,21,Address,3,Median,17.00
15,21,Address,3,Mean,17.07
16,21,Address,3,3rd Qu.,19.00
17,21,Address,3,Max.,32.00