Improving Model Safety Behavior with Rule-Based Rewards

We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.

In Openai IA, Research

We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.

GPT-4o mini: advancing cost-efficient intelligence

SearchGPT is a prototype of new AI search features

Related Posts

Introducing the Codex app

Navigating health questions with ChatGPT

The International 2018: Results

GPT-5.1 Instant and GPT-5.1 Thinking System Card Addendum