Abstract
Physical scene understanding is a fundamental human ability. Empowering artificial systems with such understanding is an important step towards flexible and adaptive behavior in the real world. As a step in this direction, we propose a novel approach to physical scene understanding in video. We train a deep neural network for video prediction which embeds the video sequence in a low-dimensional recurrent latent space representation. We optimize the total correlation of the latent dimensions within a variational recurrent auto-encoder framework. This encourages the representation to disentangle the latent physical factors of variation in the training data. To train and evaluate our approach, we use synthetic video sequences in three different physical scenarios with various degrees of difficulty. Our experiments demonstrate that our model can disentangle several appearance-related properties in the unsupervised case. If we add supervision signals for the latent code, our model can further improve the disentanglement of dynamics-related properties.
Original language | English (US) |
---|---|
Title of host publication | Pattern Recognition - 41st DAGM German Conference, DAGM GCPR 2019, Proceedings |
Editors | Gernot A. Fink, Simone Frintrop, Xiaoyi Jiang |
Publisher | Springer |
Pages | 595-608 |
Number of pages | 14 |
ISBN (Print) | 9783030336752 |
DOIs | |
State | Published - 2019 |
Event | 41st DAGM German Conference on Pattern Recognition, DAGM GCPR 2019 - Dortmund, Germany Duration: Sep 10 2019 → Sep 13 2019 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 11824 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 41st DAGM German Conference on Pattern Recognition, DAGM GCPR 2019 |
---|---|
Country/Territory | Germany |
City | Dortmund |
Period | 09/10/19 → 09/13/19 |
Bibliographical note
Funding Information:This work has been supported through Cyber Valley.
Publisher Copyright:
© Springer Nature Switzerland AG 2019.
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science