Memory Matching is Not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation

Jintu Zheng, Yun Liang, Yuqing Zhang, Wanchao Su

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research › peer-review

Abstract

Memory-based video object segmentation methods model multiple objects over long temporal-spatial spans by establishing a memory bank, and they achieve remarkable performance. However, they struggle to overcome false matching and are prone to losing critical information, which results in confusion among different objects. In this paper, we propose an effective approach that jointly improves the matching and decoding stages to alleviate the false matching issue. For the memory matching stage, we present a cost-aware mechanism that suppresses slight errors in short-term memory, and a shunted cross-scale matching for long-term memory that establishes a wide-field matching space for various object scales. For the readout decoding stage, we implement a compensatory mechanism that aims to recover essential information missed at the matching stage. Our approach achieves outstanding performance on several popular benchmarks: 92.4% and 88.1% on DAVIS 2016 and 2017 Val, 83.9% on DAVIS 2017 Test, and 84.8% and 84.6% on YouTubeVOS 2018 and 2019 Val.

Original language: English
Title of host publication: Pattern Recognition - 27th International Conference, ICPR 2024, Kolkata, India, December 1–5, 2024, Proceedings, Part XXII
Editors: Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal
Place of Publication: Cham, Switzerland
Publisher: Springer
Pages: 188-203
Number of pages: 16
ISBN (Electronic): 9783031783128
ISBN (Print): 9783031783111
Publication status: Published - 2025
Event: International Conference on Pattern Recognition 2024 - Kolkata, India
Duration: 1 Dec 2024 to 5 Dec 2024
Conference number: 27th
https://link.springer.com/book/10.1007/978-3-031-78354-8 (Proceedings)
https://icpr2024.org/ (Website)

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer
Volume: 15322
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: International Conference on Pattern Recognition 2024
Abbreviated title: ICPR 2024
Country/Territory: India
City: Kolkata
Period: 1/12/24 to 5/12/24

Keywords

  • Compensatory Decoding
  • False Matching Alleviation
  • Video Object Segmentation
