Published onOctober 25, 2024Introduction to Transformers - Part 02 - Llama 3 - Building a LLM from scratchtransformernlpcodeLearn how to adapt the vanilla Transformer to build a state-of-the-art LLM like Meta's Llama 3, exploring key architectural changes and optimizations.
Published onOctober 13, 2024Introduction to Transformers - Part 01 - Implementation of a translation modeltransformernlpcodeA full implementation of the Transformer architecture in PyTorch, built entirely from scratch, designed specifically for use in a translation model.
Published onAugust 21, 2024Introduction to Transformers - Part 00 - ArchitecturetransformernlptheoryA short introduction to the architecture behind transformers how they can be utilized in NLP tasks.