Abstract
We report on our system used in the TRECVID 2014 Multimedia Event Detection (MED) and Multimedia Event Recounting (MER) tasks. On the MED task, the CMU team achieved leading performance in the Semantic Query (SQ), 000Ex, 010Ex and 100Ex settings. Furthermore, SQ and 000Ex runs are significantly better than the submissions from the other teams. We attribute the good performance to 4 main components: 1) large-scale semantic concept detectors trained on video shots for SQ/000Ex systems, 2) better features such as improved trajectories and deep learning features for 010Ex/100Ex systems, 3) a novel Multistage Hybrid Late Fusion method for 010Ex/100Ex systems and 4) improved reranking methods for Pseudo Relevance Feedback for 000Ex/010Ex systems. On the MER task, our system utilizes a subset of features and detection results from the MED system from which the recounting is then generated. Recounting evidence is presented by selecting the most likely concepts detected in the salient shots of a video. Salient shots are detected by searching for shots which have high response when predicted by the video level event detector.
Original language | English |
---|---|
Title of host publication | TRECVID 2014 |
Editors | Paul Over |
Place of Publication | Saarbrücken/Wadern Germany |
Publisher | Schloss Dagstuhl |
Number of pages | 14 |
Publication status | Published - 2014 |
Externally published | Yes |
Event | TREC Video Retrieval Evaluation 2014 - Orlando, United States of America Duration: 10 Nov 2014 → 12 Nov 2014 https://dblp.org/db/conf/trecvid/trecvid2014.html (Proceedings) https://www-nlpir.nist.gov/projects/tv2014/index.html (Website) |
Conference
Conference | TREC Video Retrieval Evaluation 2014 |
---|---|
Abbreviated title | TRECVID 2014 |
Country/Territory | United States of America |
City | Orlando |
Period | 10/11/14 → 12/11/14 |
Internet address |