Student surpasses teacher: Imitation attack for black-box NLP APIs

Qiongkai Xu, Xuanli He, Lingjuan Lyu, Lizhen Qu, Gholamreza Haffari

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

11 Citations (Scopus)

Abstract

Machine-learning-as-a-service (MLaaS) has attracted millions of users to their splendid large-scale models. Although published as black-box APIs, the valuable models behind these services are still vulnerable to imitation attacks. Recently, a series of works have demonstrated that attackers manage to steal or extract the victim models. Nonetheless, none of the previous stolen models can outperform the original black-box APIs. In this work, we conduct unsupervised domain adaptation and multi-victim ensemble to showing that attackers could potentially surpass victims, which is beyond previous understanding of model extraction. Extensive experiments on both benchmark datasets and real-world APIs validate that the imitators can succeed in outperforming the original black-box models on transferred domains. We consider our work as a milestone in the research of imitation attack, especially on NLP APIs, as the superior performance could influence the defense or even publishing strategy of API providers.

Original languageEnglish
Title of host publicationProceedings of the Main Conference - The 29th International Conference on Computational Linguistics
EditorsHansaem Kim, James Pustejovsky, Leo Wanner
Place of PublicationStroudsburg PA USA
Pages2849-2860
Number of pages12
Publication statusPublished - 2022
EventInternational Conference on Computational Linguistics 2022 - Gyeongju, Korea, South
Duration: 12 Oct 202217 Oct 2022
Conference number: 29th
https://coling2022.org/
https://aclanthology.org/volumes/2022.coling-1/ (Proceedings)

Publication series

NameThe 29th International Conference on Computational Linguistics
PublisherAssociation for Computational Linguistics (ACL)
Number1
Volume29
ISSN (Print)2951-2093

Conference

ConferenceInternational Conference on Computational Linguistics 2022
Abbreviated titleCOLING
Country/TerritoryKorea, South
CityGyeongju
Period12/10/2217/10/22
Internet address

Cite this