🤖 Gathering human feedback

system · May 5, 2026, 8:11pm

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

Source — openai-blog

Published: Thu, 03 Aug 2017 07:00:00 GMT

Category — AI
