Research Agendas
Applied to RFC: a tool to create a ranked list of projects in explainable AI by eamag 23d ago
Applied to Synthetic Neuroscience by hpcfung 1mo ago
Applied to How far along Metr's law can AI start automating or helping with alignment research? by Christopher King 1mo ago
Applied to Give Neo a Chance by ank 2mo ago
Applied to Share AI Safety Ideas: Both Crazy and Not by ank 2mo ago
Applied to The Theoretical Reward Learning Research Agenda: Introduction and Motivation by Joar Skalse 2mo ago
Applied to How to Contribute to Theoretical Reward Learning Research by Joar Skalse 2mo ago
Applied to Unaligned AGI & Brief History of Inequality by ank 2mo ago
Applied to Intelligence–Agency Equivalence ≈ Mass–Energy Equivalence: On Static Nature of Intelligence & Physicalization of Ethics by ank 2mo ago
Applied to Notes on notes on virtues by David Gross 2mo ago
Applied to Human-AI Relationality is Already Here by Clark 2mo ago
Applied to Rational Effective Utopia & Narrow Way There: Multiversal AI Alignment, Place AI, New Ethicophysics... (Updated) by ank 2mo ago
Applied to The Road to Evil Is Paved with Good Objectives: Framework to Classify and Fix Misalignments. by Shivam Arora 3mo ago
Applied to False Positives in Entity-Level Hallucination Detection: A Technical Challenge by Max Kamachee 3mo ago
Applied to You should delay engineering-heavy research in light of R&D automation by Daniel Paleka 4mo ago
Applied to My AGI safety research—2024 review, ’25 plans by Steve Byrnes 4mo ago
Applied to Shallow review of technical AI safety, 2024 by Dakara 4mo ago
Applied to Shallow review of live agendas in alignment & safety by Dakara 4mo ago
Applied to Retrospective: PIBBSS Fellowship 2024 by DusanDNesic 4mo ago