Vulnerability of Diffusion Models to Adversarial Attacks

Summary

  • Conducted an extensive literature survey of diffusion models and their advantages and disadvantages.
  • Implemented an architecture which works with different pretrained DDPMs and classifiers.
  • Developed a novel adversarial attack using the Class-Activation Maps of classifiers and the predicted noise maps from the UNet model of a DDPM which gives a better attack in terms of ASR, Robust accuracy and FID.
  • Worked with datasets like CIFAR10, CelebaHQ, FFHQ etc.