Tokens for Thoughts

Welcome to my blog! Here I share thoughts on AI research, machine learning, and technology trends.

Featured

The Central Dogma of Artificial Intelligence

📅 Published: February 7, 2026

Every mature science has its central dogma. Biology has DNA → RNA → Protein. What is ours? Intelligence is the compression of experience into generalization. This essay explores how compression theory explains pre-training, post-training, and alignment — and why understanding compression losses may be the key frontier in AI.

The RL Environment Field Guide

📅 Published: January 26, 2026

A practical guide to RL environments using Pokemon Red as a case study. Covers the agent-environment loop, observation and action spaces, reward design, trajectories, graders, and the credit assignment problem—with real code examples and lessons from training an LLM-based gameplay agent.

Post-training 101: A Hitchhiker's Guide to LLM Post-training

📅 Published: September 12, 2025

A comprehensive guide to post-training techniques for large language models, covering supervised fine-tuning, RLHF, reward models, and practical implementation details. This guide walks through the entire journey from pre-training to instruct-tuned models with hands-on examples and best practices.

PyTorch Basics & Tutorial

8 minute read

Published: August 23, 2025

PyTorch tutorial that takes you from basic tensor operations to advanced topics like attention mechanisms and mixed precision training.

Han Fang