Markola - wuzhenglong's machine-learning Bookmarks

27 OCT 2025

[Thinking Machines] On-Policy Distillation

The reliable performance of on-policy training with the cost-efficiency of a dense reward signal.

28 FEB 2025

A practical breakdown of Gradient Descent, the backbone of ML optimization, with step-by-step examples and visualizations.

gradient-descent machine-learning

24 AUG 2024

[Chip Huyen] Machine Learning Interviews Book

Machine learning interview prep materials representing the collective wisdom of many people who have sat on both sides of the table, and who have spent a lot of time thinking about the hiring process.

machine-learning

30 JUL 2023

[Eugene Yan] Patterns for Building LLM-based Systems and Products

Practical patterns for integrating large language models (LLMs) into real systems and products. Overview of seven key patterns: evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.

large-language-models machine-learning

21 MAY 2023

[Eugene Yan] Some Intuition on Attention and the Transformer

Different angle on attention mechanism to help build further intuition. Intended for people who have read the "Attention is All You Need" paper and have a basic understanding of how attention works.

large-language-models machine-learning

07 JUN 2020

[Lili Jiang] A Visual Explanation of Gradient Descent Methods

Visual walkthrough of how various gradient descent methods work. Assumes basic familiarity of why and how gradient descent is used in machine learning.

gradient-descent machine-learning

04 NOV 2019

[Pratik Bhavsar] Transfer Learning in NLP

Overview of different types of transfer learning techniques, and how they can be used to transfer knowledge to a different task, language or domain.

machine-learning nlp

14 MAY 2019

[Edward Z. Yang] PyTorch Internals

A whirlwind tour of PyTorch's internals, covering tensors, autograd, and the overall structure of the project. Meant for aspiring OSS contributors.

machine-learning pytorch

24 DEC 2018

[Jay Alammar] Illustrated BERT

BERT was a model that broke several records for how well models could handle language-based tasks.

machine-learning

21 MAR 2017

[Sebastian Ruder] Transfer Learning - Machine Learning's Next Frontier

Overview of transfer learning and discussion of practical applications and methods.

machine-learning transfer-learning

吴正龙’s machine-learning Bookmarks