AI ALIGNMENT FORUM
Transformers
This page is a stub.
Posts tagged
Transformers
Most Relevant
1
61
How LLMs are and are not myopic
janus
2y
7
1
70
Modern Transformers are AGI, and Human-Level
Abram Demski
1y
30
2
33
Residual stream norms grow exponentially over the forward pass
Stefan Heimersheim, Alex Turner
2y
6
1
27
Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind
Cinera Verinia
2y
9
0
17
Concrete Steps to Get Started in Transformer Mechanistic Interpretability
Neel Nanda
2y
5
1
8
AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them
Roman Leventov
1y
0
1
143
Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai
11mo
4
1
28
Attention SAEs Scale to GPT-2 Small
Connor Kissane, Robert Krzyzanowski, Arthur Conmy, Neel Nanda
1y
0
1
20
Brief Notes on Transformers
Adam Jermyn
2y
2
1
21
Understanding mesa-optimization using toy models
tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, Alex Spies, Michael Ivanitskiy, Can
2y
0
1
17
Building a transformer from scratch - AI safety up-skilling challenge
Marius Hobbhahn
2y
0
0
13
Deconfusing In-Context Learning
Arjun Panickssery
1y
0
1
16
New Tool: the Residual Stream Viewer
Adam Yedidia
1y
1
1
9
The positional embedding matrix and previous-token heads: how do they actually work?
Adam Yedidia
2y
1