Natural language engineering 1
โ ๏ธ Important
- Assessment: in person exam (22.02.2022)
- Textbook: Jurafsky & Martin 3rd edition
Lecture
๐ VL01 NLE Overview
- What is ๐ฃ Natural Language Engineering?
- Whats the state of the art of NLE?
๐ VL02 Basic NLE Pipeline
- What is โ Natural Language Processing?
- What is ๐ง Linguistic analysis?
- Which ๐ช Linguistic analysis levels exist?
- How does the ๐ช GATE NLP pipeline work?
๐ VL03
โ public holiday
๐ VL04 RegEx & FSE
- What are ๐ก Regular Expressions?
- Whatโs a ๐ Formal Language and how does it relate to ๐ Formal grammar
- How to classify formal grammar?
- What are ๐ฐ Automata and which classes are there?
- How to define ๐ Finite state automata and which types are there?
- 1๏ธโฃ Deterministic finite automata Vs. ๐ข Non-deterministic finite automata
- How to implement RegEx using a FSA?
๐ VL05 Preprocessing
- What is ๐งฝ Preprocessing (NLP) and why is it important?
- ๐ฆพ State machines in detail
๐ VL06 Word Prediction
- Whatโs ๐ฒ Probability?
- How do ๐งฎ Frequentist Probability and ๐จโ๐ฆฑ Subjective Probability differ from each other?
- Whatโs the difference between ๐งโโ๏ธ Prior Probability and ๐ซ Conditional Probability?
- What is a ๐ช Trial and what does it consist of?
- What does the โ Chain Rule (Probability) state?
- What does the ๐ Bayes Theorem state?
- How to do ๐ฎ Word Prediction?
- Whatโs the ๐ Maximum likelihood estimation?
- What is a ๐ Language Model?
- What are some problems during word prediction?
- What does the ๐ญ Markov Assumption (Language) state?
๐ VL07 Text classification
- Whatโs ๐ท Classifier?
- How does ๐คฆโโ๏ธ Naive Bayes work?
- ๐ฏ Accuracy Vs. ๐น Precision Vs. ๐ Recall
- Whatโs the โ Balanced F measure?
๐ VL08 POS Tagging
- Which categories of POS tags are there? โ ๐งฉ Parts of Speech
- Which POS-tagging methods are there?
- How to do โ Hand coded POS-Tagging?
- Whatโs the ๐ถ Brill tagger algorithm
- What is a โฉ Markov model?
- How can the ๐ฅท Hidden Markov model be used for ๐ท POS-Tagging?
๐ VL09 Logistic Regression
- What are some ๐ท ML Classifiers?
- Which types of ๐ถ Logistic Regression are there?
- How does 1๏ธโฃ Binary logistic regression work?
- Whatโs the Cross-entropy loss function?
- Whatโs the Stochastic gradient descent?
- How to calculate the Z-Score?
- Whatโs the Logistic Sigmoid Function?
๐ VL10 Text embeddings 1
- What is a ๐ Lemma and โ Lexical semantics?
- Whatโs a ใฐ๏ธ Word embedding?
- ๐ฆ Sparse vector VS. ๐ Dense vector
- Whatโs ๐งฎ tf-idf? (WTF)
- Whatโs the Pointwise Mutual Information and PPMI?
- How to calculate the ๐ Vector length?
- How to calculate the โซ๏ธ Dot-product and ๐ Cosine similarity?
๐ VL11 Word2Vec
- How does ๐ word2vec work?
๐ VL 12 Formal grammars
- What is ๐ง BERT?
- ๐ Contextual embedding Vs. ๐คทโโ๏ธ Non-Contextual embedding
- Whatโs the ๐ Chomsky normal form?
- Whatโs a ๐ Formal generative grammar?
- What are some Phenomena?
๐ VL 13 Syntax & semantic analysis
- Whatโs ๐ณ Parsing?
- Whatโs ๐ฒ Probabilistic parsing?
- Whatโs the 1๏ธโฃ Predicate Calculus?
- What are the units of a formal grammar?
- Define ๐ฉ Syntax
- Whatโs โ Syntax-driven semantic analysis?
- What are the time complexities of different ๐ฐ Automata?
๐ VL 14 NLE Applications
๐ VL 15 Revision
- ๐ช GATE NLP pipeline
- ๐ก Regular Expressions, ๐ฐ Automata
- 1๏ธโฃ Type 1 error, 2๏ธโฃ Type 2 error
โน๏ธ Course topics
- Motivation
- Regular expressions
- Basic statistical natural language processing
- Part-of-speech tagging
- Text classification
- Lexical semantics (embeddings)
- Context-free grammars
- Parsing principles + Complexity
- Applications: E, IR, QA,