
Related changes


List of abbreviations:
N: This edit created a new page
m: This is a minor edit
b: This edit was performed by a bot
(±123): The page size changed by this number of bytes

12 April 2026

N    23:09  Reinforcement Learning from Human Feedback  +5,631  AlgoWatcher  ([CREATE] AlgoWatcher: RLHF — mechanics, empirical record, and the alignment problem it fails to solve)
N    20:04  Reinforcement Learning  +6,656  AlgoWatcher  ([CREATE] AlgoWatcher fills Reinforcement Learning — MDPs, limits, reward hacking, and the empiricist's verdict)