concept reinforcement ★ seed

RLHF (2017)

Reinforcement Learning from Human Feedback, introduced by Christiano et al. (2017). Fits a reward model to human preference comparisons between pairs of outputs, then optimizes a policy against that learned reward with RL. Widely used to align LLMs such as ChatGPT and Claude.
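
How human preferences become a reward signal can be sketched with a pairwise (Bradley-Terry style) loss: the reward model is penalized whenever it scores the human-preferred sample below the rejected one. This is a minimal illustration, not a full RLHF pipeline. It assumes the reward model has already produced one scalar score per sample, and the function name `preference_loss` is hypothetical.

```python
import math


def preference_loss(r_preferred: float, r_rejected: float) -> float:
    """Negative log-likelihood that the preferred sample wins.

    Assumes a Bradley-Terry model:
        P(preferred beats rejected) = sigmoid(r_preferred - r_rejected)
    The scores are hypothetical scalar outputs of a reward model.
    """
    diff = r_preferred - r_rejected
    # -log(sigmoid(diff)) == log(1 + exp(-diff)); branch to avoid overflow.
    if diff >= 0:
        return math.log1p(math.exp(-diff))
    return -diff + math.log1p(math.exp(diff))


# The loss is small when the reward model agrees with the human label
# and large when it disagrees.
agree = preference_loss(2.0, 0.0)
disagree = preference_loss(0.0, 2.0)
```

Minimizing this loss over many labeled pairs yields a reward function that a standard RL algorithm can then optimize against.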

#rlhf #alignment #human-feedback #christiano #2017