about me

Han Fang is an AI Research Scientist at Meta’s Superintelligence Labs, working at the frontier of Self Improvement & Agents. He founded Meta AI’s production post-training team and led production post-training for Llama 2 and Llama 3. He launched Meta AI in 2023 and scaled it to 1 billion MAU — driving integrated training runs, core capabilities, tool use, and data flywheel. Most recently, he is a core contributor to Agents in Muse Spark, driving agentic tool use to SoTA on MCP-Atlas.

Han holds a PhD in Applied Mathematics & Machine Learning, published in top-tier venues with 12K+ citations. He is a recipient of the President’s Award to Distinguished Doctoral Students, the Woo-Jong Kim Dissertation Award, and the Excellence in Research Award.

Google Scholar / CV / Linkedin / Twitter

news

Autodata: Automatic Data Scientist

2026

AI agents that function as data scientists, iteratively building high-quality training and evaluation datasets. Agentic Self-Instruct converts inference compute into better data. Paper · Blog

Launched Agentic Tool Use in Muse Spark

2026

Core contributor to Agents in Muse Spark 🥑, driving agentic tool use to SoTA on MCP-Atlas. Natively multimodal reasoning with tool-use, and multi-agent orchestration.

Meta AI reached 1 billion MAU

2025

Improved Meta AI's multilinguality, enabled roll-out to 12 languages and 40+ countries. Blog · News

Earlier

  • 2024 Launched voice mode and photo editing in Meta AI · Blog
  • 2024 Launched Llama 3 on Meta AI · Mixture of Judges · Blog
  • 2023 Launched Meta AI with Llama 2 · Meta AI · Talk at Connect
  • 2021 Meta AI Few-Shot Learner (FSL) · Blog
  • 2020 Training AI to detect hate speech · Blog

View all news →

blog

The Central Dogma of Artificial Intelligence

February 2026

Every mature science has its central dogma. Biology has DNA → RNA → Protein. What is ours? Intelligence is the compression of experience into generalization.

The RL Environment Field Guide

January 2026

A practical guide to RL environments using Pokemon Red as a case study. Covers the agent-environment loop, observation spaces, reward design, and credit assignment.

Post-training 101: A Hitchhiker's Guide

September 2025

A comprehensive guide to post-training techniques for LLMs, covering supervised fine-tuning, RLHF, reward models, and practical implementation details.

View all posts →

featured papers

Generalized Parallel Scaling with Interdependent Generations

Harry Dong, David Brandfonbrener, Eryk Helenowski, Yun He, Mrinal Kumar, Han Fang, Yuejie Chi, Karthik Abinav Sankararaman · ICML 2026

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Zishun Yu, Tengyu Xu, Di Jin, Karthik Abinav Sankararaman, Yun He, Wenxuan Zhou, Zhouhao Zeng, Eryk Helenowski, Chen Zhu, Sinong Wang, Hao Ma, Han Fang · ICML 2025

The Perfect Blend: Redefining RLHF with Mixture of Judges

Tengyu Xu, Eryk Helenowski, Karthik Abinav Sankararaman, Di Jin, Kaiyan Peng, Eric Han, Shaoliang Nie, Chen Zhu, Hejia Zhang, Wenxuan Zhou, Zhouhao Zeng, Yun He, Karishma Mandyam, Arya Talabzadeh, Madian Khabsa, Gabriel Cohen, Yuandong Tian, Hao Ma, Sinong Wang, Han Fang · arXiv 2024

Linformer: Self-Attention with Linear Complexity

Sinong Wang, Belinda Z. Li, Madian Khabsa, Han Fang, Hao Ma · arXiv 2020

View all publications →