linguist.page@gmail.com
Architectures Relevant to NLP
Convolutional Neural Networks (CNN) — for text classification
Recurrent Neural Networks (RNN)
Vanishing / exploding gradient problem
Long Short-Term Memory (LSTM)
Gated Recurrent Unit (GRU)
Bidirectional RNNs
Sequence-to-Sequence (Seq2Seq) models
Encoder-Decoder architecture
Attention mechanism (Bahdanau, Luong)
Self-attention
Multi-head attention
Positional encoding
The Transformer architecture
BERT & variants (RoBERTa, ALBERT, DistilBERT)
GPT & autoregressive language models
T5, BART (seq2seq transformers)
Large Language Models (LLMs) — architecture level