A technical note on bilinear layers for interpretability — AI Alignment Forum