
This speaks very much to the idea that LLMs are, in some sense, ridiculously effective (if somewhat lossy) compression algorithms applied to the whole internet.
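A back-of-the-envelope way to see the compression framing: a model that assigns probability p to the actual next token lets you encode that token in about -log2(p) bits (the basis of arithmetic coding), so better prediction means a smaller compressed corpus. A toy sketch, with made-up probabilities rather than output from any real model:

    # Toy sketch: a model's next-token probabilities imply a code
    # length of -log2(p) bits per token. Probabilities are invented
    # purely for illustration.
    import math

    # model's probability for each token that actually came next
    token_probs = [0.5, 0.25, 0.9, 0.01]

    bits = sum(-math.log2(p) for p in token_probs)
    print(f"{bits:.1f} bits to encode {len(token_probs)} tokens "
          f"({bits / len(token_probs):.2f} bits/token)")

The better the model predicts the corpus (probabilities near 1), the fewer bits per token it needs, which is exactly the sense in which pretraining is compression.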



It's a good way to frame base models that have only been pretrained.

However, modern frontier models have undergone rounds of fine-tuning, RLHF (reinforcement learning from human feedback), and RLVR (RL from verifiable rewards) that turn them into something else. The compressed internet is still in there, but it's wrapped in problem-solving and people-pleasing circuitry.
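The distinguishing feature of RLVR is that the reward comes from a programmatic check rather than a human rater. A minimal sketch of such a reward function, where the task and checker are hypothetical and not any lab's actual pipeline:

    # Hedged sketch of a "verifiable reward": the model's answer is
    # scored by an automatic checker (here, exact match on a toy
    # arithmetic problem), not by human preference.
    def verifiable_reward(problem: str, model_answer: str) -> float:
        # eval() is acceptable only for trusted toy arithmetic strings
        expected = str(eval(problem))
        return 1.0 if model_answer.strip() == expected else 0.0

    print(verifiable_reward("2 + 3 * 4", "14"))  # 1.0
    print(verifiable_reward("2 + 3 * 4", "20"))  # 0.0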


I've thought of them for a while as just a really complicated indexing strategy.

I mean, the transformer is basically a big query engine, and the model weights are the dataset plus some lookup logic.

It's kind of like that by definition, given how attention works: every token issues a query that gets matched against keys over the rest of the context.
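Concretely, a single attention head really is a soft key-value lookup: each query is scored against all keys, and the output is a probability-weighted mix of the values. A minimal NumPy sketch of scaled dot-product attention, with random matrices standing in for learned projections:

    # Minimal sketch of scaled dot-product attention as a soft lookup:
    # softmax(Q K^T / sqrt(d)) V. Shapes and values are illustrative.
    import numpy as np

    def attention(Q, K, V):
        d = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d)          # query-key similarity
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
        return weights @ V                     # weighted mix of values

    rng = np.random.default_rng(0)
    Q, K, V = (rng.normal(size=(4, 8)) for _ in range(3))
    print(attention(Q, K, V).shape)  # (4, 8): one mixed value per query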



