I’m skeptical this would work better than RLHF in production. If the agent makes a mistake, how is it supposed to recognize the error, understand what went wrong, and correct itself so it doesn’t happen again? It seems better to retry recursively until it finds a solution, the way a human would.