Writing

Sort entries by: publication date, title

Jul 2026 Giving a 3-bit GLM-5.2 Vision in llama.cpp Attaching a 943 MB vision projector to an unchanged 343 GB Q3 text model: native multimodal support in 44 lines across six files, with the patch, checksums, and an honest four-case evaluation.
Jun 2026 Scaling Laws for jaxchat: a Chinchilla IsoFLOP Sweep on One Node A Chinchilla-style IsoFLOP sweep run end to end on one 8×RTX 6000 node: 30 runs across four compute budgets recover textbook C^0.50 exponents on both axes.
Jun 2026 On-Policy Distillation Through a Distributional Lens SFT, RL, and on-policy distillation as three corners of one (α, λ, π_T) policy gradient: one trainer, eight experiments, and the per-token KL clip that separates collapse from the best recipe.
Jun 2026 Making Physics: an illustrated reference guide to physics through simulation A 20-chapter visual guide where every figure is a real render from a from-scratch C++ engine, from mechanics to a general-relativistic black-hole ray tracer.