The focus of this repo is the practical implementation and research understanding of modern language models (LMs): building, training, fine-tuning, and aligning models from scratch.
- Developing a good research foundation
- Learning and building things from First Principles ❤️
- Applying research thinking to solve real-world problems
- Training a full LLM is computationally expensive → focus is on Small Language Models (SLMs)
- Concepts scale directly from SLMs → LLMs
This work is deeply inspired by the incredible body of research and open contributions in the field of language modelling. I would like to acknowledge:
- Foundational research papers such as Attention Is All You Need, which introduced the Transformer architecture and revolutionized modern NLP
- Key advancements in large-scale language modelling, including works behind models like GPT and BERT
- Open-source communities, blogs, and educational resources that make complex concepts accessible and reproducible
This repository is an attempt to learn from these works, re-implement ideas from first principles, and build an intuitive as well as practical understanding of language models.