This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
Jett Janiak
Posts
Sorted by New
29
Polysemantic Attention Head in a 4-Layer Transformer
1y
0
8
An adversarial example for Direct Logit Attribution: memory management in gelu-4l
1y
0
36
A circuit for Python docstrings in a 4-layer attention-only transformer
2y
3
Wiki Contributions
Comments
Sorted by
Newest