This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
KevinRoWang
https://kevinrowang.com/
Posts
Sorted by New
48
Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
2y
4
25
Gears-Level Mental Models of Transformer Interpretability
3y
1
Wiki Contributions
Comments
Sorted by
Newest