This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
technicalities
Posts
Sorted by New
100
Shallow review of live agendas in alignment & safety
1y
17
45
ActAdd: Steering Language Models without Optimization
1y
2
31
Announcing the Alignment of Complex Systems Research Group
2y
11
Wiki Contributions
Comments
Sorted by
Newest
Shallow review of live agendas in alignment & safety
technicalities
1y
1
0
I like this. It's like a structural version of control evaluations. Will think where to put it in
Reply
I like this. It's like a structural version of control evaluations. Will think where to put it in