AI ALIGNMENT FORUM
AF

miles

Posts

Sorted by New

13Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

2mo

0

17Some Quick Follow-Up Experiments to “Taken out of context: On measuring situational awareness in LLMs”

8mo

0

19Unfaithful Explanations in Chain-of-Thought Prompting

1y

0

Wiki Contributions

Comments