Apr 23, 2026 RLHF and DPO: Teaching Language Models to Be Helpful and Harmless (also dated Apr 02, 2026)