TY - JOUR
T1 - Neural Topic Modeling with Deep Mutual Information Estimation
AU - Xu, Kang
AU - Lu, Xiaoqiu
AU - Li, Yuan-fang
AU - Wu, Tongtong
AU - Qi, Guilin
AU - Ye, Ning
AU - Wang, Dong
AU - Zhou, Zheng
N1 - Funding Information:
We would like to thank the reviewers for their comments, which helped improve this paper considerably. This work was supported in part by the Young Scientists Fund of the National Natural Science Foundation of China (Grant No. 62202240 ), and in part by the Research Foundation for Advanced Talents of Nanjing University of Posts and Telecommunications under Grants NY218118 and NY219104 , in part by an open project of the State Key Laboratory of Smart Grid Protection and Control, Nari Group Corporation , under Grant SGNR0000KJJS2007626 , in part by the Jiangsu Project of Social Development under Grant BE2020713 and in part by National Key R&D Program of China : 2018YFB0505003 .
Publisher Copyright:
© 2022
PY - 2022/11/28
Y1 - 2022/11/28
N2 - The emerging neural topic models make topic modeling more easily adaptable and extendable in unsupervised text mining. However, the existing neural topic models are difficult to retain representative information of the documents within the learnt topic representation. Fortunately, Deep Mutual Information Estimation (DMIE), which maximizes the mutual information between input data and the hidden representations to learn a good representation of the input data. DMIE provides a new paradigm for neural topic modeling. In this paper, we propose a neural topic model which incorporates deep mutual information estimation, i.e., Neural Topic Modeling with Deep Mutual Information Estimation (NTM-DMIE). NTM-DMIE is a neural network method for topic learning which maximizes the mutual information between the input documents and their latent topic representation. To learn robust topic representation, we incorporate the discriminator to discriminate negative examples and positive examples via adversarial learning. Moreover, we use both global and local mutual information to preserve the rich information of the input documents in the topic representation. We evaluate NTM-DMIE on several metrics, including accuracy of text clustering, with topic representation, topic uniqueness and topic coherence. Compared to the existing methods, the experimental results show that NTM-DMIE can outperform in all the metrics on the four datasets.
AB - The emerging neural topic models make topic modeling more easily adaptable and extendable in unsupervised text mining. However, the existing neural topic models are difficult to retain representative information of the documents within the learnt topic representation. Fortunately, Deep Mutual Information Estimation (DMIE), which maximizes the mutual information between input data and the hidden representations to learn a good representation of the input data. DMIE provides a new paradigm for neural topic modeling. In this paper, we propose a neural topic model which incorporates deep mutual information estimation, i.e., Neural Topic Modeling with Deep Mutual Information Estimation (NTM-DMIE). NTM-DMIE is a neural network method for topic learning which maximizes the mutual information between the input documents and their latent topic representation. To learn robust topic representation, we incorporate the discriminator to discriminate negative examples and positive examples via adversarial learning. Moreover, we use both global and local mutual information to preserve the rich information of the input documents in the topic representation. We evaluate NTM-DMIE on several metrics, including accuracy of text clustering, with topic representation, topic uniqueness and topic coherence. Compared to the existing methods, the experimental results show that NTM-DMIE can outperform in all the metrics on the four datasets.
KW - Deep mutual information
KW - Neural network
KW - Neural topic modeling
KW - Topic discovery
UR - http://www.scopus.com/inward/record.url?scp=85138073185&partnerID=8YFLogxK
U2 - 10.1016/j.bdr.2022.100344
DO - 10.1016/j.bdr.2022.100344
M3 - Article
AN - SCOPUS:85138073185
SN - 2214-5796
VL - 30
JO - Big Data Research
JF - Big Data Research
M1 - 100344
ER -