Abstract
Identifying vulnerabilities in the source code is essential to protect the software systems from cyber security attacks. It, however, is also a challenging step that requires specialized expertise in security and code representation. To this end, we aim to develop a general, practical, and programming language-independent model capable of running on various source codes and libraries without difficulty. Therefore, we consider vulnerability detection as an inductive text classification problem and propose ReGVD, a simple yet effective graph neural network-based model for the problem. In particular, ReGVD views each raw source code as a flat sequence of tokens to build a graph, wherein node features are initialized by only the token embedding layer of a pre-trained programming language (PL) model. ReGVD then leverages residual connection among GNN layers and examines a mixture of graph-level sum and max poolings to return a graph embedding for the source code. ReGVD outperforms the existing state-of-the-art models and obtains the highest accuracy on the real-world benchmark dataset from CodeXGLUE for vulnerability detection. Our code is available at: https://github.com/daiquocnguyen/GNN-ReGVD.
Original language | English |
---|---|
Title of host publication | Proceedings - 2022 ACM/IEEE 44th International Conference on Software Engineering |
Subtitle of host publication | Companion Proceedings, ICSE-Companion 2022 |
Editors | Matthew B. Dwyer |
Place of Publication | New York NY USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 178-182 |
Number of pages | 5 |
ISBN (Electronic) | 9781665495981 |
ISBN (Print) | 9781665495998 |
DOIs | |
Publication status | Published - 2022 |
Event | International Conference on Software Engineering 2022: Software Engineering in Society - Pittsburgh, United States of America Duration: 22 May 2022 → 27 May 2022 Conference number: 44th https://ieeexplore.ieee.org/xpl/conhome/9793840/proceeding (Proceedings) https://conf.researchr.org/home/icse-2022 (Website) |
Publication series
Name | Proceedings - International Conference on Software Engineering |
---|---|
Publisher | Association for Computing Machinery (ACM) |
ISSN (Print) | 0270-5257 |
Conference
Conference | International Conference on Software Engineering 2022 |
---|---|
Abbreviated title | ICSE-SEIS 2022 |
Country/Territory | United States of America |
City | Pittsburgh |
Period | 22/05/22 → 27/05/22 |
Internet address |
Keywords
- Graph Neural Networks
- Security
- Text Classification
- Vulnerability Detection