Data scientist and big data subject matter expert having experience on solving analytical use cases. Have proficiency in Python, Machine Learning, Recommendation Engines, Big data , Hadoop, SQL and NoSQL DB’s.
Technical Skills -
• Languages: Python, C, Java, shell scripting, SQL, PL/SQL.
• Hadoop Ecosystem: Hdfs, Yarn, MapReduce, Hbase, Hive, Pig, Oozie, Sqoop, Zookeeper
• Hadoop Platform: Cloudera, MapR, Apache
• Kafka, Spark Streaming
• Software: PyCharm, Jupyter notebook
• Operating Systems: Linux, Windows.
• Cloud Service Platform: Amazon Web Services.
• Machine Learning, Text Analytics
• Algorithms: Linear Regression, Logistics Regression, K Nearest Neighbors, Support Vector Machines, K Means Clustering, Ensemble Methods: Bagging, Boosting, Random Forest, Naïve bays and decision tree
• Python- Numpy, Pandas, Matplotlib, Seaborn, Scikit Learn, Bokeh.
• Deep learning frameworks: Keras and TensorFlow