Decreasing Model Stealing Querying for Black Box Adversarial Attacks


Abstract

A machine learning classifier can be tricked using adversarial attacks: attacks that alter images slightly to make the target model misclassify them. To create adversarial attacks on black-box classifiers, a substitute model can be trained via model stealing. The research question this report addresses is how to perform model stealing while minimizing the number of queries needed to train the substitute model. The solution used in this report is a variant of the ActiveThief algorithm, which uses active learning to determine which data points are queried. The paper experiments with different subset selection strategies to find the most informative data points. In addition, a seeding algorithm based on clustering is explored, and finally, a stopping criterion for the ActiveThief algorithm is proposed. These variations are evaluated on their accuracy and on the number of queries required to achieve that accuracy. This paper shows that cluster seeding is a viable alternative to random seeding in ActiveThief. It also presents several subset selection strategies that outperform the random sampling strategy. Finally, a stopping criterion based on entropy is introduced that halts the algorithm when an uncertainty threshold is reached.
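As a rough illustration only (the abstract does not specify the criterion's exact form), the Python sketch below shows one way an entropy-based stopping rule could work: querying halts once the substitute model's average predictive entropy over the unlabeled pool drops below a threshold. The function name `should_stop`, the `probs` input, and the threshold value are all hypothetical, not taken from the paper.

```python
import numpy as np

def should_stop(probs: np.ndarray, threshold: float = 0.1) -> bool:
    """Hypothetical entropy-based stopping criterion.

    probs: (n_samples, n_classes) array of the substitute model's
           softmax outputs on the remaining unlabeled pool.
    threshold: assumed tunable uncertainty level at which to stop.
    """
    eps = 1e-12  # guard against log(0)
    # Per-sample predictive entropy of the substitute model.
    entropy = -np.sum(probs * np.log(probs + eps), axis=1)
    # Stop querying once average uncertainty falls below the threshold.
    return float(entropy.mean()) < threshold
```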