ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Delving in the Loss Landscape to Embed Robust Watermarks into Neural Networks

Enzo Tartaglione, Marco Grangetto, Davide Cavagnino, Marco Botta

Auto-TLDR; Watermark Aware Training of Neural Networks

Abstract Slides Poster

In the last decade the use of artificial neural networks (ANNs) in many fields like image processing or speech recognition has become a common practice because of their effectiveness to solve complex tasks. However, in such a rush, very little attention has been paid to security aspects. In this work we explore the possibility to embed a watermark into the ANN parameters. We exploit model redundancy and adaptation capacity to lock a subset of its parameters to carry the watermark sequence. The watermark can be extracted in a simple way to claim copyright on models but can be very easily attacked with model fine-tuning. To tackle this culprit we devise a novel watermark aware training strategy. We aim at delving into the loss landscape to find an optimal configuration of the parameters such that we are robust to fine-tuning attacks towards the watermarked parameters. Our experimental results on classical ANN models trained on well-known MNIST and CIFAR-10 datasets show that the proposed approach makes the embedded watermark robust to fine-tuning and compression attacks.

Similar papers

Removing Backdoor-Based Watermarks in Neural Networks with Limited Data

Xuankai Liu, Fengting Li, Bihan Wen, Qi Li

Auto-TLDR; WILD: A backdoor-based watermark removal framework using limited data

Delving in the Loss Landscape to Embed Robust Watermarks into Neural Networks

Similar papers

Removing Backdoor-Based Watermarks in Neural Networks with Limited Data

Adaptive Noise Injection for Training Stochastic Student Networks from Deterministic Teachers

Beyond Cross-Entropy: Learning Highly Separable Feature Distributions for Robust and Accurate Classification

Adversarially Training for Audio Classifiers

Compression Strategies and Space-Conscious Representations for Deep Neural Networks

Exploiting Non-Linear Redundancy for Neural Model Compression

Large-Scale Historical Watermark Recognition: Dataset and a New Consistency-Based Approach

How Does DCNN Make Decisions?

Optimal Transport As a Defense against Adversarial Attacks

Learning Sparse Deep Neural Networks Using Efficient Structured Projections on Convex Constraints for Green AI

A Delayed Elastic-Net Approach for Performing Adversarial Attacks

Neuron-Based Network Pruning Based on Majority Voting

Generalization Comparison of Deep Neural Networks Via Output Sensitivity

Variational Inference with Latent Space Quantization for Adversarial Resilience

Speeding-Up Pruning for Artificial Neural Networks: Introducing Accelerated Iterative Magnitude Pruning

Rethinking of Deep Models Parameters with Respect to Data Distribution

Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

Fine-Tuning Convolutional Neural Networks: A Comprehensive Guide and Benchmark Analysis for Glaucoma Screening

Defense Mechanism against Adversarial Attacks Using Density-Based Representation of Images

Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

Verifying the Causes of Adversarial Examples

Norm Loss: An Efficient yet Effective Regularization Method for Deep Neural Networks

Joint Compressive Autoencoders for Full-Image-To-Image Hiding

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

Towards Explaining Adversarial Examples Phenomenon in Artificial Neural Networks

Efficient Online Subclass Knowledge Distillation for Image Classification

ESResNet: Environmental Sound Classification Based on Visual Domain Models

ResNet-Like Architecture with Low Hardware Requirements

MINT: Deep Network Compression Via Mutual Information-Based Neuron Trimming

Multimodal Side-Tuning for Document Classification

Video Face Manipulation Detection through Ensemble of CNNs

F-Mixup: Attack CNNs from Fourier Perspective

Activation Density Driven Efficient Pruning in Training

Learning to Prune in Training via Dynamic Channel Propagation

Improving Model Accuracy for Imbalanced Image Classification Tasks by Adding a Final Batch Normalization Layer: An Empirical Study

Is the Meta-Learning Idea Able to Improve the Generalization of Deep Neural Networks on the Standard Supervised Learning?

Task-based Focal Loss for Adversarially Robust Meta-Learning

Can Data Placement Be Effective for Neural Networks Classification Tasks? Introducing the Orthogonal Loss

Softer Pruning, Incremental Regularization

Revisiting the Training of Very Deep Neural Networks without Skip Connections

Improving Gravitational Wave Detection with 2D Convolutional Neural Networks

CCA: Exploring the Possibility of Contextual Camouflage Attack on Object Detection

Hcore-Init: Neural Network Initialization Based on Graph Degeneracy

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Accuracy-Perturbation Curves for Evaluation of Adversarial Attack and Defence Methods

Adversarial Training for Aspect-Based Sentiment Analysis with BERT

On the Use of Benford's Law to Detect GAN-Generated Images

Rethinking Experience Replay: A Bag of Tricks for Continual Learning