Improving disentangled representation learning with the Beta Bernoulli process

Prashnna Gyawali; Zhiyuan Li; Cameron Knight; Sandesh Ghimire; B. Milan Horacek; John Sapp; Linwei Wang

doi:10.1109/ICDM.2019.00127

Medicine

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

10 Citations (Scopus)

Abstract

To improve the ability of variational auto-encoders (VAEs) to disentangle factors in the latent space, existing works mostly focus on enforcing independence among the learned latent factors. However, the ability of these models to disentangle often decreases as the complexity of the generative factors increases. In this paper, we investigate the little-explored effect of the modeling capacity of the posterior density on the disentangling ability of the VAE. We note that the independence within the latent density and its complexity are two different properties that we constrain when regularizing the posterior density: while the former promotes the disentangling ability of the VAE, the latter, if overly limited, creates an unnecessary competition with the data reconstruction objective of the VAE. Therefore, if we preserve the independence but allow richer modeling capacity in the posterior density, we lift this competition and thereby allow improved independence and data reconstruction at the same time. We investigate this theoretical intuition with a VAE that uses a non-parametric latent factor model, the Indian Buffet Process (IBP), as a latent density that is able to grow with the complexity of the data. Across two widely used benchmark data sets (MNIST and dSprites) and two clinical data sets little explored for disentangled learning, we qualitatively and quantitatively demonstrate the improved disentangling performance of IBP-VAE over the state of the art. In the latter two clinical data sets, which are riddled with complex factors of variation, we further demonstrate that unsupervised disentangling of nuisance factors via IBP-VAE, when combined with a supervised objective, can not only improve task accuracy in comparison to relevant supervised deep architectures, but also facilitate knowledge discovery related to task decision-making.
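
As a rough illustration of the kind of latent density the abstract refers to, the sketch below samples from a truncated stick-breaking construction of the Indian Buffet Process (the Beta-Bernoulli process of the title). It is not the authors' implementation: the function names, the truncation level, the concentration parameter `alpha`, and the choice to multiply the binary activations element-wise with standard Gaussian weights are all assumptions made here for illustration, since the abstract does not specify them.

```python
import numpy as np


def sample_ibp_prior(num_samples, truncation, alpha, rng):
    """Sample binary feature activations from a truncated stick-breaking IBP.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    # Stick-breaking fractions: v_k ~ Beta(alpha, 1).
    v = rng.beta(alpha, 1.0, size=truncation)
    # Activation probabilities decay with k: pi_k = v_1 * v_2 * ... * v_k.
    pi = np.cumprod(v)
    # Each sample switches feature k on independently: z_nk ~ Bernoulli(pi_k).
    z = (rng.random((num_samples, truncation)) < pi).astype(np.int64)
    return pi, z


def sample_masked_latents(z, rng):
    """Assumed latent factor model: Gaussian weights gated by the binary mask."""
    a = rng.standard_normal(z.shape)
    return z * a


rng = np.random.default_rng(0)
pi, z = sample_ibp_prior(num_samples=5, truncation=10, alpha=3.0, rng=rng)
latents = sample_masked_latents(z, rng)
print("activation probabilities:", np.round(pi, 3))
print("active features per sample:", z.sum(axis=1))
```

Under this construction the expected number of active features grows with `alpha`, which is one way to read the abstract's description of a latent density that can grow with the complexity of the data while each factor is still switched on independently.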

Original language	English
Title of host publication	Proceedings - 19th IEEE International Conference on Data Mining, ICDM 2019
Editors	Jianyong Wang, Kyuseok Shim, Xindong Wu
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1078-1083
Number of pages	6
ISBN (Electronic)	9781728146034
DOIs	https://doi.org/10.1109/ICDM.2019.00127
Publication status	Published - Nov 2019
Event	19th IEEE International Conference on Data Mining, ICDM 2019, Beijing, China (Nov 8 2019 → Nov 11 2019)

Publication series

Name	Proceedings - IEEE International Conference on Data Mining, ICDM
Volume	2019-November
ISSN (Print)	1550-4786

Conference

Conference	19th IEEE International Conference on Data Mining, ICDM 2019
Country/Territory	China
City	Beijing
Period	11/8/19 → 11/11/19

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

ASJC Scopus Subject Areas

General Engineering

Access to Document

10.1109/ICDM.2019.00127

Cite this

Gyawali, P., Li, Z., Knight, C., Ghimire, S., Horacek, B. M., Sapp, J., & Wang, L. (2019). Improving disentangled representation learning with the Beta Bernoulli process. In J. Wang, K. Shim, & X. Wu (Eds.), Proceedings - 19th IEEE International Conference on Data Mining, ICDM 2019 (pp. 1078-1083). Article 8970693 (Proceedings - IEEE International Conference on Data Mining, ICDM; Vol. 2019-November). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICDM.2019.00127

@inproceedings{eec88cc17148441b8390e1f1c5ec0b61,

title = "Improving disentangled representation learning with the Beta Bernoulli process",

author = "Prashnna Gyawali and Zhiyuan Li and Cameron Knight and Sandesh Ghimire and Horacek, {B. Milan} and John Sapp and Linwei Wang",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 19th IEEE International Conference on Data Mining, ICDM 2019 ; Conference date: 08-11-2019 Through 11-11-2019",

year = "2019",

month = nov,

doi = "10.1109/ICDM.2019.00127",

language = "English",

series = "Proceedings - IEEE International Conference on Data Mining, ICDM",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1078--1083",

editor = "Jianyong Wang and Kyuseok Shim and Xindong Wu",

booktitle = "Proceedings - 19th IEEE International Conference on Data Mining, ICDM 2019",

address = "United States",

}

TY - GEN

T1 - Improving disentangled representation learning with the Beta Bernoulli process

AU - Gyawali, Prashnna

AU - Li, Zhiyuan

AU - Knight, Cameron

AU - Ghimire, Sandesh

AU - Horacek, B. Milan

AU - Sapp, John

AU - Wang, Linwei

PY - 2019/11

Y1 - 2019/11

UR - http://www.scopus.com/inward/record.url?scp=85078896334&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85078896334&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2019.00127

DO - 10.1109/ICDM.2019.00127

M3 - Conference contribution

AN - SCOPUS:85078896334

T3 - Proceedings - IEEE International Conference on Data Mining, ICDM

SP - 1078

EP - 1083

BT - Proceedings - 19th IEEE International Conference on Data Mining, ICDM 2019

A2 - Wang, Jianyong

A2 - Shim, Kyuseok

A2 - Wu, Xindong

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 19th IEEE International Conference on Data Mining, ICDM 2019

Y2 - 8 November 2019 through 11 November 2019

ER -
