aidevblogs
⌘K
BlogsVideosTweets
AllLLMsComputer VisionMLOpsAgentsData EngineeringResearchSafety
Olmo 3 and the Open LLM Renaissance
Cameron Wolfe·cameronrwolfe.substack.com·11 days ago·LLMs
Group Relative Policy Optimization (GRPO)
Cameron Wolfe·cameronrwolfe.substack.com·about 1 month ago·LLMs
PPO for LLMs: A Guide for Normal People
Cameron Wolfe·cameronrwolfe.substack.com·about 2 months ago·LLMs
REINFORCE: Easy Online RL for LLMs
Cameron Wolfe·cameronrwolfe.substack.com·3 months ago·LLMs
Online versus Offline RL for LLMs
Cameron Wolfe·cameronrwolfe.substack.com·4 months ago·LLMs
GPT-oss from the Ground Up
Cameron Wolfe·cameronrwolfe.substack.com·4 months ago·LLMs
Direct Preference Optimization (DPO)
Cameron Wolfe·cameronrwolfe.substack.com·5 months ago·LLMs
Reward Models
Cameron Wolfe·cameronrwolfe.substack.com·6 months ago·LLMs
AI Agents from First Principles
Cameron Wolfe·cameronrwolfe.substack.com·7 months ago·LLMs
A Guide for Debugging LLM Training Data
Cameron Wolfe·cameronrwolfe.substack.com·7 months ago·LLMs
Llama 4: The Challenges of Creating a Frontier-Level LLM
Cameron Wolfe·cameronrwolfe.substack.com·8 months ago·LLMs
Vision Large Language Models (vLLMs)
Cameron Wolfe·cameronrwolfe.substack.com·9 months ago·LLMs
nanoMoE: Mixture-of-Experts (MoE) LLMs from Scratch in PyTorch
Cameron Wolfe·cameronrwolfe.substack.com·10 months ago·LLMs
Demystifying Reasoning Models
Cameron Wolfe·cameronrwolfe.substack.com·10 months ago·LLMs
Mixture-of-Experts (MoE) LLMs
Cameron Wolfe·cameronrwolfe.substack.com·11 months ago·LLMs
Scaling Laws for LLMs: From GPT-3 to o3
Cameron Wolfe·cameronrwolfe.substack.com·12 months ago·LLMs
Finetuning LLM Judges for Evaluation
Cameron Wolfe·cameronrwolfe.substack.com·about 1 year ago·LLMs
Automatic Prompt Optimization
Cameron Wolfe·cameronrwolfe.substack.com·about 1 year ago·LLMs
Model Merging: A Survey
Cameron Wolfe·cameronrwolfe.substack.com·over 1 year ago·LLMs
Using LLMs for Evaluation
Cameron Wolfe·cameronrwolfe.substack.com·over 1 year ago·LLMs