Reinforcement Learning from Human Feedback (rlhfbook.com)
dang 3 hours ago [-]
Related. Others?

RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)

verdverm 6 hours ago [-]
Last time I saw Nathan say something about the book, he was actively working on the next version and looking for feedback. Check his socials.
leggerss 5 hours ago [-]
You could say he's also learning from human feedback
klelatti 7 hours ago [-]
Web version with links, etc:

https://rlhfbook.com/

dang 3 hours ago [-]
Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.