Sebastian Raschka
My name is Sebastian Raschka, and I am a machine learning and AI researcher. Next to being a researcher, I also have a strong passion for education and am best known for my bestselling books on machine learning using open-source software.
After my PhD, I joined the University of Wisconsin-Madison as a professor in the Department of Statistics, where I focused deep learning and machine learning research until 2023.
Taking a yearlong break from academia, I joined Lightning AI in 2022, where I am now a Staff Research Engineer focusing on the intersection of AI research, software development, and large language models (LLMs).
If you are interested in learning more about me or my projects, please visit my website at https://sebastianraschka.com
Sessions
This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from the ground up in PyTorch. We will kick off this tutorial with an introduction to LLMs, recent milestones, and their use cases. Then, we will code a small GPT-like LLM, including its data input pipeline, core architecture components, and pretraining code ourselves. After understanding how everything fits together and how to pretrain an LLM, we will learn how to load pretrained weights and finetune LLMs using open-source libraries.