AI ALIGNMENT FORUM
AF

John Hughes

Former MATS scholar working on scalable oversight and adversarial robustness.

Posts

Sorted by New

40Tips and Code for Empirical Research Workflows

2mo

2

31Best-of-N Jailbreaking

4mo

1

46Debating with More Persuasive LLMs Leads to More Truthful Answers

1y

7

Wikitag Contributions

Comments

Sorted by

Tips and Code for Empirical Research Workflows

2mo20

Thanks Neel! I'm glad you found it helpful. If you or your scholars recommend any other tools not mentioned in the post, I'd be interested to hear more.