2026, Year of Reinforcement Learning? aimlbling-about.ninerealmlabs.com 5 points by namnnumbr 5 hours ago
thtgrisdjdjdh 2 hours ago
Works only for verifiable rewards, since humans (thankfully) don't have a good theory of knowledge (epistemology).
There's only so far that these agents can go.