New(ish) AI control ideas

Stuart_Armstrong

AI ALIGNMENT FORUM
AF

0 New(ish) AI control ideas

by Stuart_Armstrong

31st Oct 2017

4 min read

0

List of LinksAI

Personal Blog

New(ish) AI control ideas

0IAFF-User-111

New Comment

1 comment, sorted by

top scoring

Click to highlight new comments since: Today at 2:04 PM

[-]IAFF-User-11110y00

Thanks! I love having central repos.

A quick question / comment, RE: "I decided to try and attack as many of these ideas as I could, head on, and see if there was any way of turning these objections."

Q: What do you mean (or have in mind) in terms of "turning [...] objections"? I'm not very familiar with the phrase.

Comment: One trend I see is that technical safety proposals are often dismissed by appealing to one of the 7 responses you've given. Recently I've been thinking that we should be a bit less focused on finding airtight solutions, and more focused on thinking about which proposed techniques could be applied in various scenarios to significantly reduce risk. For example, boxing an agent (e.g. by limiting it's sensors/actuators) might significantly increase how long it takes to escape.

Moderation Log

0

New(ish) AI control ideas

0

0

New(ish) AI control ideas

0

Important background ideas: