A novel patch convolutional neural network for view-based 3D model retrieval

Zan Gao, Yuxiang Shao, Weili Guan, Meng Liu, Zhiyong Cheng, Shengyong Chen

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

6 Citations (Scopus)

Abstract

In industrial enterprises, effective retrieval of three-dimensional (3-D) computer-aided design (CAD) models can greatly save time and cost in new product development and manufacturing, thus, many researchers have focused on it. Recently, many view-based 3D model retrieval methods have been proposed and have achieved state-of-the-art performance. However, most of these methods focus on extracting more discriminative view-level features and effectively aggregating the multi-view images of a 3D model, and the latent relationship among these multi-view images is not fully explored. Thus, we tackle this problem from the perspective of exploiting the relationships between patch features to capture long-range associations among multi-view images. To capture associations among views, in this work, we propose a novel patch convolutional neural network (PCNN ) for view-based 3D model retrieval. Specifically, we first employ a CNN to extract patch features of each view image separately. Second, a novel neural network module named PatchConv is designed to exploit intrinsic relationships between neighboring patches in the feature space to capture long-range associations among multi-view images. Then, an adaptive weighted view layer is further embedded into PCNN to automatically assign a weight to each view according to the similarity between each view feature and the view-pooling feature. Finally, a discrimination loss function is employed to extract the discriminative 3D model feature, which consists of softmax loss values generated by the fusion classifier and the specific classifier. Extensive experimental results on two public 3D model retrieval benchmarks, namely, the ModelNet40, and ModelNet10, demonstrate that our proposed PCNN can outperform state-of-the-art approaches, with mAP values of 93.67%, and 96.23%, respectively.

Original languageEnglish
Title of host publicationProceedings of the 29th ACM International Conference on Multimedia
EditorsLiqiang Nie, Qianru Sun, Peng Cui
Place of PublicationNeed York NY USA
PublisherAssociation for Computing Machinery (ACM)
Pages2699-2707
Number of pages9
ISBN (Electronic)9781450386517
DOIs
Publication statusPublished - 2021
EventACM International Conference on Multimedia 2021 - Chengdu, China
Duration: 20 Oct 202124 Oct 2021
Conference number: 29th
https://dl.acm.org/doi/proceedings/10.1145/3474085 (Proceedings)
https://2021.acmmm.org/ (Website)

Conference

ConferenceACM International Conference on Multimedia 2021
Abbreviated titleMM 2021
Country/TerritoryChina
CityChengdu
Period20/10/2124/10/21
Internet address

Keywords

  • 3D model retrieval
  • adaptive weighted view layer
  • discrimination loss
  • patch convolutional neural network

Cite this