AI ALIGNMENT FORUM

Machine Unlearning

Applied to The case for unlearning that removes information from LLM weights by Ebenezer Dukakis 2mo ago

Applied to Machine Unlearning in Large Language Models: A Comprehensive Survey with Empirical Insights from the Qwen 1.5 1.8B Model by Saketh Baddam 2mo ago

Applied to Gradient Routing: Masking Gradients to Localize Computation in Neural Networks by Alex Turner 4mo ago

Applied to Breaking Circuit Breakers by Nicky Pochinkov 8mo ago

Applied to Unlearning via RMU is mostly shallow by Nicky Pochinkov 8mo ago

Applied to Deep Forgetting & Unlearning for Safely-Scoped LLMs by Nicky Pochinkov 1y ago

Applied to LLM Modularity: The Separability of Capabilities in Large Language Models by Nicky Pochinkov 1y ago

Applied to Machine Unlearning Evaluations as Interpretability Benchmarks by Nicky Pochinkov 1y ago

Nicky Pochinkov v1.0.0 Oct 23rd 2023 GMT (+1330) LW2

Created by Nicky Pochinkov 1y ago