An Information Asymmetry Game for Trigger-based DNN Model Watermarking
By: Chaoyue Huang , Gejian Zhao , Hanzhou Wu and more
Potential Business Impact:
Protects computer brains from being copied.
As a valuable digital product, deep neural networks (DNNs) face increasingly severe threats to the intellectual property, making it necessary to develop effective technical measures to protect them. Trigger-based watermarking methods achieve copyright protection by embedding triggers into the host DNNs. However, the attacker may remove the watermark by pruning or fine-tuning. We model this interaction as a game under conditions of information asymmetry, namely, the defender embeds a secret watermark with private knowledge, while the attacker can only access the watermarked model and seek removal. We define strategies, costs, and utilities for both players, derive the attacker's optimal pruning budget, and establish an exponential lower bound on the accuracy of watermark detection after attack. Experimental results demonstrate the feasibility of the watermarked model, and indicate that sparse watermarking can resist removal with negligible accuracy loss. This study highlights the effectiveness of game-theoretic analysis in guiding the design of robust watermarking schemes for model copyright protection.
Similar Papers
Protecting Deep Neural Network Intellectual Property with Chaos-Based White-Box Watermarking
Cryptography and Security
Protects AI "brains" from being stolen.
ChainMarks: Securing DNN Watermark with Cryptographic Chain
Cryptography and Security
Protects computer brains from being copied.
DeepTracer: Tracing Stolen Model via Deep Coupled Watermarks
Cryptography and Security
Protects AI art from being stolen and copied.