Tokens for Thoughts

Welcome to my blog! Here I share thoughts on AI research, machine learning, and technology trends.

Post-training 101: A Hitchhiker's Guide to LLM Post-training

📅 Published:

A comprehensive guide to post-training techniques for large language models, covering supervised fine-tuning, RLHF, reward models, and practical implementation details. This guide walks through the entire journey from pre-training to instruct-tuned models with hands-on examples and best practices.


PyTorch Basics & Tutorial

8 minute read

Published:

PyTorch tutorial that takes you from basic tensor operations to advanced topics like attention mechanisms and mixed precision training.