Spam Filtering using Text Categorization
-
02-07-2021 - |
Question
I am trying to build spam filtering application using text categorization. From where should I start learning the algorithms? Which algorithm should I implement first? Anyone can suggest any good material?
Solution
The easiest one to start with is Naive Bayes.
http://nlp.stanford.edu/IR-book/html/htmledition/naive-bayes-text-classification-1.html
OTHER TIPS
I found this article [PDF] that gives quite a good overview of available machine learning techniques and their performance for spam filtering.
Here you can find more information on the subject as well as training data!
I must say that there are volumes of material online, simple google search for "spam filtering machine learning techniques" is enough.
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow