Reinforcement Learning from Human Feedback (RLHF) in Notebooks