AI ALIGNMENT FORUM

AI Misuse

Edited by Raemon. Last updated 1st May 2023.

AI misuse is the use of AI by humans in a way that harms humanity.

Posts tagged AI Misuse
Managing catastrophic misuse without robust AIs
ryan_greenblatt, Buck (31 karma, 2y, 4 comments)

Adversarial Robustness Could Help Prevent Catastrophic Misuse
aog (14 karma, 2y, 15 comments)

Covert Malicious Finetuning
Tony Wang, dannyhalawi (52 karma, 1y, 3 comments)

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush, scasper (11 karma, 2y, 1 comment)