concept reinforcement seed

RLHF (2017)

Reinforcement Learning from Human Feedback, introduced by Christiano et al. (2017). Uses human preferences to define reward signals. Critical for aligning LLMs like ChatGPT and Claude.

#rlhf #alignment #human-feedback #christiano #2017