Skip to content

ruhaankumar2013-debug/Practical-AI-engineering

Repository files navigation

๐Ÿง  LLM Dev Curriculum

The open-source, community-driven roadmap to building with Large Language Models โ€”

from your first "Hello, GPT" to shipping production AI systems.

Stars Contributors License: MIT PRs Welcome Last Updated

โญ If this helped you, star it โ€” it helps others find it too.

๐Ÿš€ Start Learning ยท ๐Ÿ“š All Modules ยท ๐Ÿ—๏ธ Projects ยท ๐Ÿค Contribute ยท ๐Ÿ’ฌ Community


๐ŸŒŸ Why This Curriculum?

There are hundreds of LLM tutorials out there. Most are either:

  • ๐Ÿ”ด Too shallow โ€” "just call the API and you're done"
  • ๐Ÿ”ด Too academic โ€” dense math, no runnable code
  • ๐Ÿ”ด Too opinionated โ€” locked into one framework
  • ๐Ÿ”ด Outdated โ€” written in 2023 and never touched since

This curriculum is different. It's:

โœ… What you get
Beginner โ†’ Production Start from zero ML knowledge. End with a deployed, monitored AI system.
Framework-agnostic OpenAI, Anthropic, HuggingFace, open-weight models โ€” you learn all of them.
Hands-on first Every concept has code. Every phase has a capstone project you can put on your portfolio.
Community-maintained Updated by practitioners, not just educators. PRs welcome.
Clearly leveled Every module is tagged ๐ŸŸข Beginner ยท ๐ŸŸก Intermediate ยท ๐Ÿ”ด Advanced โ€” no surprises.

๐Ÿ—บ๏ธ The Big Picture

YOU ARE HERE
     โ”‚
     โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 0 ยท Foundations                    ๐ŸŸข No prerequisites โ”‚
โ”‚  What LLMs are, how transformers work, tokens & embeddings   โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                            โ”‚
                            โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 1 ยท Prompting                           ๐ŸŸข Beginner   โ”‚
โ”‚  Prompt engineering, CoT, few-shot, system prompts           โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                            โ”‚
                            โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 2 ยท APIs & Integrations                 ๐ŸŸข Beginner   โ”‚
โ”‚  Call LLMs from code. Build your first real app.             โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
            โ”‚                               โ”‚
            โ–ผ                               โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”       โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 3 ยท RAG  ๐ŸŸก    โ”‚       โ”‚  PHASE 4 ยท Fine-Tuning ๐ŸŸก โ”‚
โ”‚  Retrieval-Augmented  โ”‚       โ”‚  LoRA, QLoRA, RLHF,       โ”‚
โ”‚  Generation, vector   โ”‚       โ”‚  dataset prep             โ”‚
โ”‚  DBs, chunking        โ”‚       โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜                   โ”‚
            โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                            โ”‚
                            โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 5 ยท Agents                         ๐ŸŸก Intermediate    โ”‚
โ”‚  Tool use, ReAct, memory, multi-agent systems                โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                            โ”‚
                            โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 6 ยท Evaluation                     ๐ŸŸก Intermediate    โ”‚
โ”‚  Evals, benchmarks, LLM-as-judge, eval pipelines            โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                            โ”‚
                            โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 7 ยท Production                          ๐Ÿ”ด Advanced   โ”‚
โ”‚  Deploy, scale, monitor, optimize, guardrails                โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                            โ”‚
                            โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  PHASE 8 ยท Advanced Topics                     ๐Ÿ”ด Advanced   โ”‚
โ”‚  Train from scratch, multimodal, MoE, research papers        โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

๐Ÿ Quick Start

Not sure where to begin? Pick your entry point:

๐ŸŸข "I know nothing about LLMs or ML"

Start โ†’ 00-foundations_what_are_llms.md

๐ŸŸข "I know ML / data science, new to LLMs"

Start โ†’ 02-prompt_engineering.md

๐ŸŸข "I'm a developer, just want to build an app NOW"

Start โ†’ 01-apis_quickstart.md

๐ŸŸก "I want to connect LLMs to my own data"

Start โ†’ 03-rag_pipeline.md

๐ŸŸก "I want to customize / fine-tune a model"

Start โ†’ 04-finetuning_lora.md

๐Ÿ”ด "I need to ship this to production"

Start โ†’ 06-safety_guardrails.md

๐Ÿ“š Modules

Legend: ๐Ÿ“– Guide ยท ๐Ÿงช Notebook ยท ๐Ÿ’ป Code ยท ๐Ÿ—๏ธ Project ยท ๐ŸŸข Beginner ยท ๐ŸŸก Intermediate ยท ๐Ÿ”ด Advanced


๐ŸŸข Phase 0 โ€” Foundations

No prerequisites. If you're brand new, start here.

# Module Format Est. Time
0 What Are LLMs? ๐Ÿ“– 20 min

๐ŸŸข Phase 1 โ€” Prompting

Learn to communicate with LLMs effectively. The highest ROI skill in this entire curriculum.

# Module Format Est. Time
1 Prompt Engineering Techniques ๐Ÿ“– + examples 1 hr

๐ŸŸข Phase 2 โ€” APIs & Integrations

Call LLMs from your code. Build your first real application.

# Module Format Est. Time
2 OpenAI API Quickstart ๐Ÿ“– + ๐Ÿ’ป 1 hr

๐ŸŸก Phase 3 โ€” RAG (Retrieval-Augmented Generation)

Give your LLM a memory. Connect it to your own documents and data.

# Module Format Est. Time
3 Building a RAG Pipeline End-to-End ๐Ÿ“– + ๐Ÿงช 2 hr

๐ŸŸก Phase 4 โ€” Fine-Tuning

When prompting isn't enough โ€” adapt a model to your exact task.

# Module Format Est. Time
4 LoRA and QLoRA: Efficient Fine-Tuning ๐Ÿ“– + ๐Ÿงช 3 hr

๐Ÿ”ด Phase 5 โ€” Production

Ship it. Scale it. Keep it working. Don't go broke.

# Module Format Est. Time
5 Safety, Guardrails & Content Filtering ๐Ÿ“– + ๐Ÿ’ป 1.5 hr

๐Ÿ“– Additional Resources

Resource Link
๐Ÿ“š Glossary 05-glossary.md

๐Ÿ—๏ธ Projects Summary

Each phase ends with a capstone project you can actually put on your portfolio.

Phase Project Skills Demonstrated
0 Foundations LLM basics
1 Prompt Playground Prompt engineering
2 CLI Chatbot API calls, conversation history
3 Chat with Your Docs RAG, vector DB, embeddings
4 Custom Fine-Tune Dataset prep, LoRA, model training
6 Safety Implementation Guardrails, content filtering

๐Ÿ—‚๏ธ Repository Structure

Practical-AI-engineering/
โ”œโ”€โ”€ 00-foundations_what_are_llms.md
โ”œโ”€โ”€ 01-apis_quickstart.md
โ”œโ”€โ”€ 02-prompt_engineering.md
โ”œโ”€โ”€ 03-rag_pipeline.md
โ”œโ”€โ”€ 04-finetuning_lora.md
โ”œโ”€โ”€ 05-glossary.md
โ”œโ”€โ”€ 06-safety_guardrails.md
โ”œโ”€โ”€ README.md                 โ† you are here
โ”œโ”€โ”€ contributing.md
โ”œโ”€โ”€ roadmap.md
โ””โ”€โ”€ LICENSE

๐Ÿ”– Format Legend

Icon Meaning
๐Ÿ“– Written guide (Markdown)
๐Ÿงช Jupyter Notebook (runnable)
๐Ÿ’ป Standalone code / project
๐Ÿ—๏ธ Capstone project
๐ŸŸข Beginner โ€” no prior ML knowledge needed
๐ŸŸก Intermediate โ€” comfortable with Python and APIs
๐Ÿ”ด Advanced โ€” ML background helpful

๐Ÿค Contributing

This curriculum is community-powered. All contributions welcome โ€” big or small.

  • Typo / small fix โ†’ open a PR directly
  • Improve an explanation โ†’ open a PR with a brief note on what was unclear
  • Add a new module โ†’ open an issue first so we can align on scope
  • Translate to your language โ†’ see the Translations section

Please read contributing.md before submitting. Be kind.


๐ŸŒ Translations

Language Status Maintainer
๐Ÿ‡ฌ๐Ÿ‡ง English โœ… Complete Core Team
๐Ÿ‡ฎ๐Ÿ‡ณ Hindi ๐Ÿšง In Progress Contribute โ†’
๐Ÿ‡ช๐Ÿ‡ธ Spanish ๐Ÿšง In Progress Contribute โ†’
๐Ÿ‡ต๐Ÿ‡น Portuguese ๐Ÿ™‹ Needed Contribute โ†’
๐Ÿ‡จ๐Ÿ‡ณ Chinese (Simplified) ๐Ÿ™‹ Needed Contribute โ†’
๐Ÿ‡ฏ๐Ÿ‡ต Japanese ๐Ÿ™‹ Needed Contribute โ†’

๐Ÿ’ฌ Community

  • GitHub Discussions โ€” questions, study groups, project showcases
  • Issues โ€” bug reports, content errors, suggestions

๐Ÿ“œ License

MIT โ€” free to use, share, fork, and build on.
Attribution appreciated but not required.


Built with โค๏ธ by the community ยท Contribute ยท Star it โญ

This curriculum is updated regularly. Watch the repo to stay notified.

About

The complete open-source LLM curriculum โ€” prompt engineering, RAG, fine-tuning, agents, and production deployment. Beginner to advanced. Python. Free forever.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors