Alex' Gardenアレックスの庭

🔮 Word Prediction

Jul 21, 2025, 1 min read

How?

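Determine the 🎲 Probability of each word with maximum likelihood estimation (MLE), i.e. relative frequencies counted in the training data
Use N-Grams to obtain the most likely follow-up word given the preceding words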
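
The two steps above can be sketched with a bigram model (N = 2). This is a minimal illustration, not a reference implementation: the function names `train_bigram_mle` and `predict_next`, the toy corpus, and whitespace tokenization are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def train_bigram_mle(tokens):
    """Estimate P(next | prev) by MLE: count(prev, next) / count(prev as context)."""
    pair_counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        pair_counts[prev][nxt] += 1

    model = {}
    for prev, followers in pair_counts.items():
        total = sum(followers.values())
        model[prev] = {word: count / total for word, count in followers.items()}
    return model


def predict_next(model, word):
    """Return the most likely follow-up word, or None if `word` was never a context."""
    followers = model.get(word)
    if not followers:
        return None
    return max(followers, key=followers.get)


# Toy corpus (hypothetical), tokenized by whitespace.
corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram_mle(corpus)

print(predict_next(model, "the"))  # "cat": follows "the" twice, "mat" only once
print(predict_next(model, "dog"))  # None: "dog" never appears in the training data
```

The `None` result for an unseen word is exactly the zero-probability case listed under Problems.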

Problems

Sparse data:
- Most long word sequences hardly ever occur in the training data, so their counts are unreliable
  - → Markov Assumption → Only condition on the last N-1 words, i.e. use N-Grams (N typically 2 or 3)
Zeroes:
- Some input words or word sequences are not in the training set → MLE estimates P(w) = 0
  - → Smoothing: assign small non-zero probabilities to events where MLE gives P(w) = 0
  - → Back-off: use lower-order N-Grams when higher-order ones aren’t available
Underflow:
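- Multiplying many small probabilities can result in an underflow → the product rounds to 0 and the information is lost
  - → Do all calculations in log space: add log probabilities instead of multiplying probabilities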
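
Two of the fixes above, smoothing and log space, can be sketched together. The example below assumes add-k (Laplace) smoothing over a closed vocabulary as one possible smoothing method; the note does not say which one is meant. The names `train_add_k_bigram` and `sequence_log_prob` and the toy corpus are hypothetical, and back-off is not shown.

```python
import math
from collections import Counter


def train_add_k_bigram(tokens, k=1.0):
    """Return a function P(next | prev) using add-k smoothing (an assumed choice)."""
    vocab_size = len(set(tokens))
    context_counts = Counter(tokens[:-1])
    pair_counts = Counter(zip(tokens, tokens[1:]))

    def prob(prev, nxt):
        # Every bigram gets k pseudo-counts, so no probability is exactly zero.
        return (pair_counts[(prev, nxt)] + k) / (context_counts[prev] + k * vocab_size)

    return prob


def sequence_log_prob(prob, tokens):
    """Score a sequence by summing log probabilities instead of multiplying."""
    return sum(math.log(prob(prev, nxt)) for prev, nxt in zip(tokens, tokens[1:]))


corpus = "the cat sat on the mat and the cat slept".split()
prob = train_add_k_bigram(corpus)

# "cat mat" never occurs in the corpus, yet its probability is non-zero.
print(prob("cat", "mat"))

# Multiplying ~2000 small probabilities underflows to 0.0 in float64 ...
long_seq = corpus * 200
naive = 1.0
for prev, nxt in zip(long_seq, long_seq[1:]):
    naive *= prob(prev, nxt)
print(naive)

# ... while the log-space score stays a finite negative number.
print(sequence_log_prob(prob, long_seq))
```

Comparing two candidate sequences then reduces to comparing their log scores, since the logarithm preserves ordering.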
