What I Read: Reinforcement Learning from Human Feedback