吴正龙’s machine-learning Bookmarks

27 OCT 2025
The reliable performance of on-policy training with the cost-efficiency of a dense reward signal.
28 FEB 2025
A practical breakdown of Gradient Descent, the backbone of ML optimization, with step-by-step examples and visualizations.
24 AUG 2024
Machine learning interview prep materials representing the collective wisdom of many people who have sat on both sides of the table, and who have spent a lot of time thinking about the hiring process.
30 JUL 2023
Practical patterns for integrating large language models (LLMs) into real systems and products. Overview of seven key patterns: evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.
21 MAY 2023
Different angle on attention mechanism to help build further intuition. Intended for people who have read the "Attention is All You Need" paper and have a basic understanding of how attention works.
07 JUN 2020
Visual walkthrough of how various gradient descent methods work. Assumes basic familiarity of why and how gradient descent is used in machine learning.
04 NOV 2019
Overview of different types of transfer learning techniques, and how they can be used to transfer knowledge to a different task, language or domain.
14 MAY 2019
A whirlwind tour of PyTorch's internals, covering tensors, autograd, and the overall structure of the project. Meant for aspiring OSS contributors.
24 DEC 2018
BERT was a model that broke several records for how well models could handle language-based tasks.
21 MAR 2017
Overview of transfer learning and discussion of practical applications and methods.