Maksym Andriushchenko @ ICML'24
1 month
Releasing Nemotron-4-340B, a GPT-4-level model with permissive licensing, is a bold move from Nvidia. But how robust is this model to simple jailbreaking attacks? 🤔
Not very, despite the refusal training, red teaming, LLM scanners, etc. The universal prompt template from our