吴正龙’s machine-learning Bookmarks

27 OCT 2025
[Thinking Machines] On-Policy Distillation

The reliable performance of on-policy training with the cost-efficiency of a dense reward signal.

28 FEB 2025
Gradient Descent Explained

A practical breakdown of Gradient Descent, the backbone of ML optimization, with step-by-step examples and visualizations.

24 AUG 2024
[Chip Huyen] Machine Learning Interviews Book

Machine learning interview prep materials representing the collective wisdom of many people who have sat on both sides of the table, and who have spent a lot of time thinking about the hiring process.

30 JUL 2023
[Eugene Yan] Patterns for Building LLM-based Systems and Products

Practical patterns for integrating large language models (LLMs) into real systems and products. Overview of seven key patterns: evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.

21 MAY 2023
[Eugene Yan] Some Intuition on Attention and the Transformer

Different angle on attention mechanism to help build further intuition. Intended for people who have read the "Attention is All You Need" paper and have a basic understanding of how attention works.

07 JUN 2020
[Lili Jiang] A Visual Explanation of Gradient Descent Methods

Visual walkthrough of how various gradient descent methods work. Assumes basic familiarity of why and how gradient descent is used in machine learning.

04 NOV 2019
[Pratik Bhavsar] Transfer Learning in NLP

Overview of different types of transfer learning techniques, and how they can be used to transfer knowledge to a different task, language or domain.

14 MAY 2019
[Edward Z. Yang] PyTorch Internals

A whirlwind tour of PyTorch's internals, covering tensors, autograd, and the overall structure of the project. Meant for aspiring OSS contributors.

24 DEC 2018
[Jay Alammar] Illustrated BERT

BERT was a model that broke several records for how well models could handle language-based tasks.

21 MAR 2017
[Sebastian Ruder] Transfer Learning - Machine Learning's Next Frontier

Overview of transfer learning and discussion of practical applications and methods.