Skip to main content
Link
Menu
Expand
(external link)
Document
Search
Copy
Copied
notes
(CS 25) Llama2
(CS 7) Predictably Irrational
(STS 10SI) Deceptive Alignment
(STS 10SI) Inner Alignment P1
(STS 10SI) Outer Alignment: Intelligence and Goals
(STS 10SI) Outer Alignment: Reward Misspecification
Home
Q-Learning in RL
just a collection of my notes from classes, books, and other things. maybe i’ll organize it better later. maybe not. who knows.