Large language models (LLMs)
- Natural Language Processing
- Natural language understanding
- Capacities
- Capacity
- Similarity to humans - see also Personas
- Domain-specific
- Impact
- Models
- Tools
- Training
- Persona
- Visualization
Overview
A Language model implemented with very large Neural networks.
Neural language models perform the language modeling task with a neural network rather than by explicitly estimating conditional probabilities: the network learns good vector-space representations for words, sentences, etc., and produces the next-token distribution from them.
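A minimal sketch of this idea, with a hypothetical three-word vocabulary and hand-picked embedding weights standing in for learned ones: the model scores each candidate next word by a dot product between a context vector and that word's embedding, then a softmax turns the scores into conditional probabilities (no explicit probability table is estimated).

```python
import math

# Toy illustration -- the vocabulary, embeddings, and context vector below
# are made-up stand-ins for what a trained network would learn.
vocab = ["cat", "sat", "mat"]
embeddings = {"cat": [0.9, 0.1], "sat": [0.2, 0.8], "mat": [0.7, 0.3]}

def next_word_probs(context_vec):
    # Score each word by dot(context, embedding), then softmax the scores
    # into a probability distribution over the vocabulary.
    logits = [sum(c * e for c, e in zip(context_vec, embeddings[w]))
              for w in vocab]
    m = max(logits)                      # subtract max for numerical stability
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return {w: e / z for w, e in zip(vocab, exps)}

print(next_word_probs([1.0, 0.0]))
```

In a real model the context vector comes from the network's processing of the preceding tokens, and the embeddings are learned by gradient descent; only the softmax-over-scores step is shown faithfully here.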
Issues
Large language models are often called a Stochastic parrot, since they simply predict the next (or missing) token one at a time. One consequence is that they often confidently state falsehoods.
They can suffer from Training data leakage and memorization in language models.
Topics
Models
Tutorials and reviews
- Intro to Large Language Models by Andrej Karpathy
- Understanding Large Language Models by Sebastian Raschka
- Large language models, explained with a minimum of math and jargon
- Anti-hype LLM reading list
- Developing an LLM: Building, Training, Finetuning by Sebastian Raschka
- Generative AI from scratch by Graphics in 5 Minutes
From scratch
- https://github.com/rasbt/LLMs-from-scratch by Sebastian Raschka
- Let’s build GPT: from scratch, in code, spelled out by Andrej Karpathy
- https://github.com/karpathy/llm.c
- Create a Large Language Model from Scratch with Python by freeCodeCamp
Text generation
Memes
https://yyiki.s3.amazonaws.com/public/imgs/gromit_LLM_nextword.mp4