@balesni
deception evals. reversal curse. latent reasoning. @apolloaisafety // best way to support 🇺🇦