Using a single attack won’t do, unless you are in a Hollywood film. This post covers AutoAttack, the pioneer ensemble adversarial attack, and shows how to test the adversarial robustness of AI models more rigorously.
ChatGPT is excellent in extracting structured information from text. Can it evaluate our personality traits? This post describes our work on LLM personality assessment, accepted to the CAIHu workshop @ AAAI ’24.
Intuitive understanding of adversarial attacks is core for understanding AI security. This post aims to explain adversarial attacks with… Elves (instead of technical terminology).