72 Techniques to Optimize LLMs in Production
...explained with usage.
A collection of 90 posts
...explained with usage.
...explained visually!
A case study on how Claude achieves 92% cache hit-rate.
A comprehensive guide to Opik, an open-source LLM evaluation and observability framework.
AI Agents Crash Course—Part 15 (with implementation).
...explained with usage.
...that actually prevents hallucinations (explained visually).
...explained step-by-step.
100% local.
...explained visually.
...explained visually and with code.
...explained step-by-step with code.
...explained visually.
...explained step-by-step with code.
..powered with MCP + Tools + Memory + Observability.
Understanding every little detail on vector databases and their utility in LLMs, along with a hands-on demo.
100% locally.
...explained with visuals.
...explained step-by-step.
Techniques used in DeepSeek, Llama 4, and Gemma.
A from-scratch implementation of Llama 4 LLM, a mixture-of-experts model, using PyTorch code.
AI Agents Crash Course—Part 14 (with implementation).
Step-by-step code walkthrough.
100% local, using open-source Graphiti.
AI Agents Crash Course—Part 13 (with implementation).
AI Agents Crash Course—Part 12 (with implementation).
AI Agents Crash Course—Part 11 (with implementation).
...explained visually and with code.
(it does not compete with MCPs).
AI Agents Crash Course—Part 10 (with implementation).