Blog
Writing about AI, career lessons, and things I find interesting.
Failure Modes
Apr 30, 2026The Reliability Gap in AI Essay Grading
Why LLM-based essay graders score the same essay differently each run, what the MCAS rescoring incident reveals about the category, and the five engineering controls that turn a language model into a reliable scoring instrument.
22 min read·Technical·evaluation, reliability, responsible-ai, case-study
Grownomic: Starting and Shutting Down an AI Startup in a Year
Three pivots, paying customers, and a platform policy that ended the final bet. What I learned building and shutting down Grownomic AI.
9 min read·Career & Industry