Sebastian's books: https://sebastianraschka.com/books/
This video explains what PyTorch buffers are, a concept that is particularly useful when dealing with GPU computations and implement large models like LLMs.
Code notebook: https://github.com/rasbt/LLMs-from-sc...
GitHub discussion about "triu" in the forward pass: https://github.com/rasbt/LLMs-from-sc...
Link to the Studio GPU environment to follow along: https://lightning.ai/seraschka/studio...
---
To support this channel, please consider purchasing a copy of my books: https://sebastianraschka.com/books/
---
/ rasbt
/ sebastianraschka
https://magazine.sebastianraschka.com
コメント