How can I use AI agents to run ML experiments autonomously?

Set up a fixed-budget experiment loop: the human writes a strategy document (a Markdown file), the AI agent iterates on training code. Each experiment runs for exactly 5 minutes on a fixed metric, so every change is fairly compared. The agent commits improvements to a git branch and discards failures. 100+ experiments can run overnight without human involvement.

When to use this

When you have a training pipeline and want to explore hyperparameters, architectures, or optimizer settings at scale.

When to skip this

For one-off training runs or when you need human intuition for fundamental architecture decisions.

Advanced Single GPU, git, AI coding agent (Claude/GPT)

#AI Agents #AI Coding

By Andrej Karpathy

Source: Twitter →

Docs: Karpathy Autoresearch Repository →

Extracted: 2026-03-11 Verified: 2026-03-11