Technical Areas:
-
load csv data using pandas et visualize the data
-
data exploration and observe that the data is unbalance
-
text preprocessing
-
Simple classification model using Naive Bayes, Decision Tree, Logistic Regression and SVM
Tools:
-
Visual Studio
-
Pandas
-
sklearn
-
nltk
Soft skills:
-
Work with my team to discuss and debug together
-
check youtube channel for, data cleaning, ML and NLP course
Highlights:
-
load csv file , perform data exploration and data cleaning
-
Modeling the data with some basic classification algorithms
-
compute recall precision and f1 score for each algorithms
Next step
-
Need to gather more data
-
deal with unbalanced data
-
build the basic recommender system