About
A guide to building your own working LLM, by Sebastian Raschka.
Features
- Complete Python + PyTorch implementation of GPT (multi-head attention, transformer blocks, etc.).
- Step-by-step building process from raw text preprocessing to model training and evaluation.
- Detailed explanation of key mathematical concepts (attention scores, cross-entropy, layer normalization).
- Comparison with large foundation models: scaling laws, data size, training compute trade-offs.
- Bonus chapters on fine-tuning for instruction following and text classification tasks.
Links
Categories
Reviews
0
Write a Review
Get new AI tools weekly
Subscribe to our newsletter and never miss a tool.
Get new AI tools weekly
Subscribe to our newsletter and never miss a tool.