This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Wikitags
AF
Login
Subscribe
Discussion
0
1
AI Misuse
Raymond Arnold
AI Misuse
Subscribe
Discussion
0
1
Written by
Raymond Arnold
last updated
1st May 2023
Summaries
Cancel
Submit
AI misuse.
Humans using AI in a way that harms humanity.
Posts tagged
AI Misuse
Most Relevant
2
31
Managing catastrophic misuse without robust AIs
Ryan Greenblatt
,
Buck Shlegeris
1y
4
1
14
Adversarial Robustness Could Help Prevent Catastrophic Misuse
ao
1y
15
0
53
Covert Malicious Finetuning
Tony Wang
,
dannyhalawi
9mo
3
0
11
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Soroush Pour
,
rusheb
,
Quentin Feuillade--Montixi
,
Arush Tagade
,
Stephen Casper
1y
1