초록

적대적 공격은 딥러닝 모델 입력 데이터에 미세한 노이즈를 추가함으로써 모델이 잘못된 판단을 내리도록 유도한다. 의료영상에서의 딥러닝은 높은 성능을 보이지만 적대적 공격에 취약하다는 위험성이 있다. 또한 이 중 분할 모델의 경우 적대적 공격의 방어가 더욱 까다롭지만 이에 관련된 보안연구는 주목받지 못하고 있었다. 본 연구에서는 뇌 종양 분할 모델에서 FGSM 공격을 수행한 후 이에 대한 방어로 혼합 입력 이미지 변환과 기울기 정규화를 수행한다. 제안한 JPEG 압축과 가우시안 필터의 적용은 원본 이미지에 대한 성능은 유지하면서 적대적 노이즈를 제거하였다. 또한 기존 강건성 확보를 위한 방법인 기울기 정규화 모델과 비교하여 입력 이미지 변환법은 더 높은 방어 성능을 보일　뿐 아니라 모델의 재훈련 없이 적용 가능하다는 장점이 있다. 이를 통해 의료 인공지능 분야에서의 보안의 취약함을 확인하고 모델의 전처리 단계에서 적용할 수 있는 강건성 확보를 제안한다.

Adversarial attacks induce the model to make misjudgments by adding fine noise to the deep learning model input data. Deep learning in medical images raises the expectations for computer-assisted diagnosis, but there is a risk of being vulnerable to adversarial attacks. In addition, in the case of the double segmentation model, the defense of adversarial attacks is more difficult, but security studies related to this topic have not received attention. In this study, we perform FGSM attacks ony brain tumor segmentation models and employ input image transformation and gradient regularization as defenses against these attacks. The proposed application of JPEG compression and Gaussian filters effectively removes adversarial noise while maintaining performance in the original images. Moreover, the input image transformation method, when compared to the conventional gradient regularization model for achieving robustness, not only exhibits a higher defense performance but also offers the advantage of being applicable without the need for model retraining. Through this research, we identify vulnerabilities in the security of medical artificial intelligence and propose ensuring robustness that can be applied in the preprocessing stage of the model.