This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
AF
Login
miles
Posts
Sorted by New
13
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
2mo
0
17
Some Quick Follow-Up Experiments to “Taken out of context: On measuring situational awareness in LLMs”
8mo
0
19
Unfaithful Explanations in Chain-of-Thought Prompting
1y
0
Wiki Contributions
Comments