thtgrisdjdjdh 2 hours ago

This only works for verifiable rewards, since humans (thankfully) don't have a good theory of knowledge (epistemology).

These agents can only go so far.