Project on Language Classification using subwords in Natural Language Processing and Naive Bayes Classifier.