Abstract
Human minds spontaneously integrate two inherited cognitive capabilities: perception and reasoning to accomplish cognitive tasks such as problem solving, imagination, and causation. It is observed in the primate brains that perception offers the assistance required for problem comprehension, whilst the reasoning elucidates upon the facts recovered during perception in order to make a decision. The field of artificial intelligence (AI) thus considers perception and reasoning as two complementary areas that are realized by machine learning and logic programming, respectively. In this work, we propose a generative model using a collaborative guessing game of the kind first introduced by David Lewis in his famous work called the Lewis signaling game that is synonymous with the "20 Questions'' game. Our proposed model, <u>G</u>uess-<u>I</u>t-<u>G</u>enerator (GIG) is a collaborative framework that engages two recurrent neural networks in a guessing game. GIG unifies perception and reasoning with a view to generating labeled images by capturing, (X, y), the underlying density of a data distribution, i.e. (X, y)-p(X, y). An encoder attends to a region of the input image and encodes that onto a latent variable that acts as a perception signal to a decoder. In contrast, the decoder leverages on the perception signals to guess the image and verifies the guess by reasoning with logical facts derived from the domain knowledge. Our experiments and comprehensive studies on seven datasets: PCAM, Chest-Xray-14, FIRE, HAM10000 from the medical domain, and CIFAR 10, LSUN, ImageNet, among standard benchmark datasets, show significant promise for the proposed method.
Original language | English |
---|---|
Title of host publication | Proceedings of the 30th ACM International Conference on Multimedia |
Editors | Marco Bertini, Klaus Schoeffmann |
Place of Publication | New York NY USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 6396-6405 |
Number of pages | 10 |
ISBN (Electronic) | 9781450392037 |
DOIs | |
Publication status | Published - 2022 |
Event | ACM International Conference on Multimedia 2022 - Lisbon, Portugal Duration: 10 Oct 2022 → 14 Oct 2022 Conference number: 30th https://dl.acm.org/doi/proceedings/10.1145/3503161 (Proceedings) https://2022.acmmm.org/ (Website) |
Conference
Conference | ACM International Conference on Multimedia 2022 |
---|---|
Abbreviated title | MM 2022 |
Country/Territory | Portugal |
City | Lisbon |
Period | 10/10/22 → 14/10/22 |
Internet address |
|
Keywords
- first order logic
- generative model
- lewis signaling guessing game
- medical labeled image generation