Gathering human feedback

RL-Teacher is an open-source implementation of our interface for training AI systems with occasional human feedback instead of hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but it also applies to reinforcement learning problems where the reward is hard to specify.
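To make the idea concrete, here is a minimal sketch of learning a reward from human feedback rather than writing it by hand. It is not RL-Teacher's code or API. It assumes the feedback arrives as pairwise preferences between two behavior clips, that each clip is summarized by a small feature vector, and that the reward is linear in those features; the names `fit_reward_weights` and `simulated_human` are hypothetical, and the "human" is a stand-in function so the example can run on its own.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def fit_reward_weights(comparisons, n_features, lr=0.5, epochs=200):
    """Fit linear reward weights from pairwise preferences.

    Assumption: P(a preferred over b) = sigmoid(r(a) - r(b)), a
    Bradley-Terry style model, with r(x) = w . x.
    comparisons: list of (features_a, features_b, label), where label is
    1.0 if clip a was preferred and 0.0 if clip b was preferred.
    """
    w = [0.0] * n_features
    for _ in range(epochs):
        grad = [0.0] * n_features
        for fa, fb, label in comparisons:
            diff = [a - b for a, b in zip(fa, fb)]
            p_a = sigmoid(sum(wi * di for wi, di in zip(w, diff)))
            # Gradient of the cross-entropy loss with respect to w.
            for i, di in enumerate(diff):
                grad[i] += (p_a - label) * di
        w = [wi - lr * gi / len(comparisons) for wi, gi in zip(w, grad)]
    return w


def simulated_human(fa, fb, true_w=(1.0, -2.0)):
    """Hypothetical stand-in for a person: prefers the clip with the
    higher hidden reward. A real setup would ask a human instead."""
    ra = sum(t * x for t, x in zip(true_w, fa))
    rb = sum(t * x for t, x in zip(true_w, fb))
    return 1.0 if ra > rb else 0.0


rng = random.Random(0)
comparisons = []
for _ in range(200):
    fa = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    fb = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    comparisons.append((fa, fb, simulated_human(fa, fb)))

learned_w = fit_reward_weights(comparisons, n_features=2)


def agreement(w, data):
    """Fraction of comparisons where the learned reward ranks the
    clips the same way the feedback did."""
    hits = 0
    for fa, fb, label in data:
        score = sum(wi * (a - b) for wi, a, b in zip(w, fa, fb))
        hits += int((score > 0) == (label == 1.0))
    return hits / len(data)


print("learned weights:", learned_w)
print("agreement with feedback:", agreement(learned_w, comparisons))
```

In a full system, the learned reward would stand in for the hand-crafted one when training the agent, and feedback would be requested only occasionally rather than at every step.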
