🏷 Text classification
= the process of assigning texts (e.g. documents) to predefined categories
Formal definition
- Input:
  - a document d (the pattern to classify)
  - a fixed set of classes C = {c1, c2, ..., cJ}
- Output:
  - a predicted class c ∈ C
Methods
- 🌙 Symbolic (deep) processing:
  - → hand-coded rules
- 👮♂️ Supervised machine learning:
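Sketch of the hand-coded rules idea from the symbolic approach: each class is triggered by a few keywords, and the first matching rule wins. The rule table, the class names, and the function name `classify_by_rules` are hypothetical, chosen only for illustration.

```python
# Hypothetical hand-coded rules: each class is triggered by a few keywords.
RULES = {
    "spam": ["free money", "click here", "winner"],
    "sports": ["match", "goal", "league"],
}


def classify_by_rules(text, rules=RULES, default="other"):
    """Return the first class whose keyword list matches the text."""
    lowered = text.lower()
    for label, keywords in rules.items():
        # A rule fires if any of its keywords occurs as a substring.
        if any(keyword in lowered for keyword in keywords):
            return label
    # No rule fired: fall back to a catch-all class.
    return default


print(classify_by_rules("Click HERE to claim your prize"))
print(classify_by_rules("Meeting at noon"))
```

Rules like these are easy to read and debug, but each new class or wording needs a manual update, which is the usual motivation for the supervised approach.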
Multiple classes
- Build a binary classifier for each class (that class vs. all others)
- Compare the scores of the classifiers and choose the class with the highest score
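Minimal sketch of the two steps above (one classifier per class, then pick the highest score). The word-count scorer is a deliberately crude stand-in for a real binary classifier, and the toy documents and all function names are assumptions made for this example.

```python
from collections import Counter


def train_binary_scorer(docs, labels, target):
    """Build a crude scorer for one class: target vs. everything else."""
    pos, neg = Counter(), Counter()
    for doc, label in zip(docs, labels):
        (pos if label == target else neg).update(doc.lower().split())

    def score(doc):
        # Words seen with the target class add to the score, others subtract.
        return sum(pos[w] - neg[w] for w in doc.lower().split())

    return score


def train_one_vs_rest(docs, labels):
    """Train one binary scorer per class."""
    return {c: train_binary_scorer(docs, labels, c) for c in sorted(set(labels))}


def predict(scorers, doc):
    """Run every scorer on the document and return the best-scoring class."""
    return max(scorers, key=lambda c: scorers[c](doc))


# Toy training data, invented for the example.
docs = ["cheap pills online", "team wins final game", "election vote result"]
labels = ["spam", "sports", "politics"]

scorers = train_one_vs_rest(docs, labels)
print(predict(scorers, "the team wins again"))
```

Comparing scores only makes sense if the per-class classifiers produce values on a comparable scale, which is why real setups typically compare probabilities or calibrated scores.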
Evaluation
Measures
Averaging
- Macroaveraging: compute the performance measure for each class, then average over the classes
- Microaveraging: collect the decisions for all classes into a single contingency table, then evaluate on that table
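Sketch of the difference between the two averages, using precision as the measure. The gold and predicted labels are made up, and the helper names are assumptions for this example.

```python
def per_class_counts(gold, predicted, classes):
    """Return (true positives, false positives, false negatives) per class."""
    counts = {}
    for c in classes:
        tp = sum(1 for g, p in zip(gold, predicted) if g == c and p == c)
        fp = sum(1 for g, p in zip(gold, predicted) if g != c and p == c)
        fn = sum(1 for g, p in zip(gold, predicted) if g == c and p != c)
        counts[c] = (tp, fp, fn)
    return counts


def precision(tp, fp):
    # Defined as 0.0 here when the class was never predicted.
    return tp / (tp + fp) if tp + fp else 0.0


def macro_precision(counts):
    """Precision per class first, then the unweighted mean over classes."""
    return sum(precision(tp, fp) for tp, fp, _ in counts.values()) / len(counts)


def micro_precision(counts):
    """Pool the counts of all classes into one table, then compute precision."""
    total_tp = sum(tp for tp, _, _ in counts.values())
    total_fp = sum(fp for _, fp, _ in counts.values())
    return precision(total_tp, total_fp)


# Made-up labels: class "a" is frequent, "b" and "c" are rare.
gold = ["a", "a", "a", "a", "b", "c"]
predicted = ["a", "a", "a", "b", "b", "a"]

counts = per_class_counts(gold, predicted, ["a", "b", "c"])
print(macro_precision(counts), micro_precision(counts))
```

Macroaveraging gives every class the same weight, so rare classes count as much as frequent ones. Microaveraging is dominated by the frequent classes because it pools the individual decisions.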