Abstract
In industrial enterprises, effective retrieval of three-dimensional (3-D) computer-aided design (CAD) models can greatly save time and cost in new product development and manufacturing, thus, many researchers have focused on it. Recently, many view-based 3D model retrieval methods have been proposed and have achieved state-of-the-art performance. However, most of these methods focus on extracting more discriminative view-level features and effectively aggregating the multi-view images of a 3D model, and the latent relationship among these multi-view images is not fully explored. Thus, we tackle this problem from the perspective of exploiting the relationships between patch features to capture long-range associations among multi-view images. To capture associations among views, in this work, we propose a novel patch convolutional neural network (PCNN ) for view-based 3D model retrieval. Specifically, we first employ a CNN to extract patch features of each view image separately. Second, a novel neural network module named PatchConv is designed to exploit intrinsic relationships between neighboring patches in the feature space to capture long-range associations among multi-view images. Then, an adaptive weighted view layer is further embedded into PCNN to automatically assign a weight to each view according to the similarity between each view feature and the view-pooling feature. Finally, a discrimination loss function is employed to extract the discriminative 3D model feature, which consists of softmax loss values generated by the fusion classifier and the specific classifier. Extensive experimental results on two public 3D model retrieval benchmarks, namely, the ModelNet40, and ModelNet10, demonstrate that our proposed PCNN can outperform state-of-the-art approaches, with mAP values of 93.67%, and 96.23%, respectively.
Original language | English |
---|---|
Title of host publication | Proceedings of the 29th ACM International Conference on Multimedia |
Editors | Liqiang Nie, Qianru Sun, Peng Cui |
Place of Publication | Need York NY USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 2699-2707 |
Number of pages | 9 |
ISBN (Electronic) | 9781450386517 |
DOIs | |
Publication status | Published - 2021 |
Event | ACM International Conference on Multimedia 2021 - Chengdu, China Duration: 20 Oct 2021 → 24 Oct 2021 Conference number: 29th https://dl.acm.org/doi/proceedings/10.1145/3474085 (Proceedings) https://2021.acmmm.org/ (Website) |
Conference
Conference | ACM International Conference on Multimedia 2021 |
---|---|
Abbreviated title | MM 2021 |
Country/Territory | China |
City | Chengdu |
Period | 20/10/21 → 24/10/21 |
Internet address |
|
Keywords
- 3D model retrieval
- adaptive weighted view layer
- discrimination loss
- patch convolutional neural network