The way the world is formed, anything can be interesting enough to write at length about. Humans have a test-time-compute and general ability1 to do what large language models at the moment seemingly cannot, allowing for countless variations between subjects, ideas, footnotes, nested footnotes, alt text, writing, drawings, data, anything, really. Yet with finite time, one must pick and choose what one wishes to understand, and in the dawning-storm air of the 21st century, eyes increasingly turn toward LLMs as the most relevant item to understand.2 And if you doubt for a minute that they aren’t inherently interesting, only important—then you’ve both failed to understand the generalized rule of reality-depth and you haven’t met with the works of Theia Vogel.

The works of thebes

Behold! A gallery of some interesting parts, which will eventually be full of interesting commentary aimed at destroying thebes’ illegibility.

“from a conversation with sonnet 3.6 about model personality spaces”

Tweet by Gavin Leech, infamous PhD and YouTuber, commenting humorously on a METR eval chart itself commented by Toby Ord of the now-extinct Future of Humanity Institute of Oxford. The METR eval chart shows how for every marginal hour of work, different models + humans (represented by a gray line) improve their “average normalized score” at whatever this benchmark arena is. Gavin says, britishly, "this new grey model has crazy-good test-time scaling. Should keep an eye on it”. Meanwhile, rising twitter picocelebrity croissanthology dot com comments “how far along do you reckon it rises till it essentially plateaus". Gavin, inspired by a new streak of wit, parries with a single photo of 4 books in Robert A. Caro's magnum opus biography of Lyndon B. Johnson. The point here is that humans, given enough time, tenacity, intelligence, can do practically anything cf. e.g. David Deutsche: in this case, becoming president of the United States. Can large language models do the same? The world can only await with baited breath. It does not yet seem like this is the case.

See also, appropriately, a thebes tweet:

Image of a thebes-style character bent over on the ground looking at a small insect curiously, while a GIANT GLOWING VORTEX OF WONDER pattern-strewn with conveniently disposed importance-arrows vibrates behind them and they have a hand held up like “epepep shut up for a sec I'm concentrated”. Tweet caption is “procrastination of the call",

However, “finite time” is just the hypothetical upper-bound limit here. And one reason among many why large language models are the most obvious candidate for “most important development of our age” is that its outcome might ensure these kinds of massive magnum opera becomes possible, someday over our limited horizon.


  1. See Gavin Leech: 

  2. This isn’t to say that time is actually what’s limiting you. If you feel like you aren’t sinking deep enough into the curiosity-bogs reality offers you, on priors I would expect it’s a problem isomorphic to this one rather than a “finite time” problem. See Scott Alexander on time seldom being the bottleneck in his experience, or Gwern’s entire site, where he finds the time to talk about all sorts of ridiculous things and ALSO the most important development of our age.