๐คฆโโ๏ธ Naive Bayes
= a simple (โnaiveโ) ๐ฎโโ๏ธ Supervised machine learning method for ๐ท Classifier based on bayes rule
Formal definition
Assumptions
- Bag of words assumption: Word position doesnโt matter
- Conditional independence: Feature probabilities are independent of the class
Formula
Calculating
Using the formula above, can be calculated:
- ๐ Maximum likelihood estimation:
- = fraction of times word appears among all words of topic
Problems
- Zero-probabilities: โ smoothing (= add +1 to each word count)
- Unknown-words: โ ignore them
๐ Pros
- fast, low storage
- robust to irrelevant features
- good all-rounder (โBayes has never failed meโ)