Large Language Models: A Short Introduction

What is an LLM

Example of prediction of the next word in a sentence. Image by author.
Different shapes of a linear regression. Image by author.
A basic flow diagram depicting various stages of LLMs from pre-training to prompting/utilization. Prompting LLMs to generate responses is possible at different training stages like pre-training, instruction-tuning, or alignment tuning. “RL” stands for reinforcement learning, “RM” represents reward-modeling, and “RLHF” represents reinforcement learning with human feedback. Image and caption taken from paper referenced in [2]

Why LLMs Matter

Example of a question I asked Gemini 🤣 Image by author.
Example of a question I asked Gemini 🤣 Image by author.

Conclusion

Chronological display of LLM releases: blue cards represent ‘pre-trained’ models, while orange cards correspond to ‘instruction-tuned’ models. Models on the upper half signify open-source availability, whereas those on the bottom are closed-source. The chart illustrates the increasing trend towards instruction-tuned and open-source models, highlighting the evolving landscape and trends in natural language processing research. Image and caption taken from paper referenced in [2]