Skip to main navigation Skip to search Skip to main content

Probabilistic non-negative matrix factorization and its robust extensions for topic modeling

Minnan Luo, Feiping Nie, Xiaojun Chang, Yi Yang, Alexander Hauptmann, Qinghua Zheng

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Traditional topic model with maximum likelihood estimate inevitably suffers from the conditional independence of words given the documents topic distribution. In this paper, we follow the generative procedure of topic model and learn the topic-word distribution and topics distribution via directly approximating the word-document co-occurrence matrix with matrix decomposition technique. These methods include: (1) Approximating the normalized document-word conditional distribution with the documents probability matrix and words probability matrix based on probabilistic non-negative matrix factorization (NMF); (2) Since the standard NMF is well known to be non-robust to noises and outliers, we extended the probabilistic NMF of the topic model to its robust versions using ℓ2, 1 -norm and capped ℓ2, 1 -norm based loss functions, respectively. The proposed framework inherits the explicit probabilistic meaning of factors in topic models and simultaneously makes the conditional independence assumption on words unnecessary. Straightforward and efficient algorithms are exploited to solve the corresponding non-smooth and non-convex problems. Experimental results over several benchmark datasets illustrate the effectiveness and superiority of the proposed methods.

Original languageEnglish
Title of host publicationProceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17)
Subtitle of host publicationSan Francisco, California, USA — February 04 - 09, 2017
EditorsSatinder Singh, Shaul Markovitch
Place of PublicationPalo Alto CA USA
PublisherAssociation for the Advancement of Artificial Intelligence (AAAI)
Pages2308-2314
Number of pages7
Publication statusPublished - 2017
Externally publishedYes
EventAAAI Conference on Artificial Intelligence 2017 - Hilton San Francisco Union Square, San Francisco, United States of America
Duration: 4 Feb 201710 Feb 2017
Conference number: 31st
http://www.aaai.org/Conferences/AAAI/aaai17.php

Conference

ConferenceAAAI Conference on Artificial Intelligence 2017
Abbreviated titleAAAI 2017
Country/TerritoryUnited States of America
CitySan Francisco
Period4/02/1710/02/17
Internet address

Cite this