An AGI with full power over the lightcone, if it is not to destroy most potential value from a human perspective, must have something sufficiently close to human values as its terminal value (goal). Further, seemingly small deviations could result in losing most of the value. Human values seem unlikely to spontaneously emerge in a generic optimization process.[1] A dependably safe AI would therefore have to be programmed explicitly with human values, or programmed with the ability (including the goal) of inferring human values.

[1] Though it's conceivable that empirical versions of moral realism could hold in practice.