Large language models (LLMs)

Overview

Language model implemented with huge Neural networks.

Neural language models perform the language modeling tasks by using a neural network rather than explicitly estimating the conditional probabilities, by learning good vector-space representations for words, sentences, etc.

Issues

Large language models are often called Stochastic parrot, given that they are simply models that predict the next (or missing) token one by one. One exemplary problem from this is that large language models often confidently lie.

They can suffer from Training data leakage in language models.

Topics

Models

Tutorials and reviews

From scratch

Text generation

Memes