AI ALIGNMENT FORUM
Chain-of-Thought Alignment
• Applied to LLMs Do Not Think Step-by-step In Implicit Reasoning by Bogdan Ionut Cirstea 6d ago
• Applied to Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts? by Bogdan Ionut Cirstea 8d ago
• Applied to A Little Depth Goes a Long Way: the Expressive Power of Log-Depth Transformers by Bogdan Ionut Cirstea 14d ago
• Applied to ~80 Interesting Questions about Foundation Model Agent Safety by Rohan Subramani 1mo ago
• Applied to Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models. by happy friday 1mo ago
• Applied to the case for CoT unfaithfulness is overstated by Rohan Subramani 1mo ago
• Applied to Thinking LLMs: General Instruction Following with Thought Generation by Bogdan Ionut Cirstea 2mo ago
• Applied to 5 ways to improve CoT faithfulness by CBiddulph 2mo ago
• Applied to To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning by Bogdan Ionut Cirstea 2mo ago
• Applied to Understanding Hidden Computations in Chain-of-Thought Reasoning by rokosbasilisk 3mo ago
• Applied to AI Alignment and the Quest for Artificial Wisdom by Madhusudhan Pathak 5mo ago
• Applied to Whirlwind Tour of Chain of Thought Literature Relevant to Automating Alignment Research. by sevdeawesome 5mo ago
• Applied to Language and Capabilities: Testing LLM Mathematical Abilities Across Languages by Ethan Edwards 8mo ago
• Applied to Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought by Miles Turpin 9mo ago