AI ALIGNMENT FORUM
Tags
AF

Simulator Theory

•

Applied to Interview with Robert Kralisch on Simulators by WillPetillo 4mo ago

•

Applied to Using ideologically-charged language to get gpt-3.5-turbo to disobey it's system prompt: a demo by Milan Weibel 5mo ago

•

Applied to Karpenchuk's Theory: Human Life as a Simulation for Consciousness Development by Karpenchuk Bohdan 5mo ago

•

Applied to How are Simulators and Agents related? by Robert Kralisch 8mo ago

•

Applied to A Review of In-Context Learning Hypotheses for Automated AI Alignment Research by Alfie Lamerton 8mo ago

•

Applied to Language and Capabilities: Testing LLM Mathematical Abilities Across Languages by Ethan Edwards 9mo ago

•

Applied to The case for more ambitious language model evals by Arun Jose 11mo ago

•

Applied to OpenAI Credit Account (2510$) by Emirhan BULUT 1y ago

•

Applied to Goodbye, Shoggoth: The Stage, its Animatronics, & the Puppeteer – a New Metaphor by Roger Dearnaley 1y ago

•

Applied to Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI? by Roger Dearnaley 1y ago

•

Applied to On the future of language models by Roger Dearnaley 1y ago

•

Applied to How to Control an LLM's Behavior (why my P(DOOM) went down) by Roger Dearnaley 1y ago

•

Applied to Is Interpretability All We Need? by Roger Dearnaley 1y ago

•

Applied to Impressions from base-GPT-4? by quila 1y ago

•

Applied to Revealing Intentionality In Language Models Through AdaVAE Guided Sampling by Katherine Crowson 1y ago