Stellantis interview question

Expliquez le Reinforcement Learning from Human Feedbac ?