TY - GEN
T1 - Estimating Total Correlation with Mutual Information Estimators
AU - Bai, Ke
AU - Cheng, Pengyu
AU - Hao, Weituo
AU - Henao, Ricardo
AU - Carin, Lawrence
N1 - KAUST Repository Item: Exported on 2023-07-28
PY - 2023/6/4
Y1 - 2023/6/4
N2 - Total correlation (TC) is a fundamental concept in information theory which measures statistical dependency among multiple random variables. Recently, TC has shown noticeable effectiveness as a regularizer in many learning tasks, where the correlation among multiple latent embeddings requires to be jointly minimized or maximized. However, calculating precise TC values is challenging, especially when the closed-form distributions of embedding variables are unknown. In this paper, we introduce a unified framework to estimate total correlation values with sample-based mutual information (MI) estimators. More specifically, we discover a relation between TC and MI and propose two types of calculation paths (tree-like and line-like) to decompose TC into MI terms. With each MI term being bounded, the TC values can be successfully estimated. Further, we provide theoretical analyses concerning the statistical consistency of the proposed TC estimators. Experiments are presented on both synthetic and real-world scenarios, where our TC estimators demonstrate effectiveness in all TC estimation, minimization, and maximization tasks.
AB - Total correlation (TC) is a fundamental concept in information theory which measures statistical dependency among multiple random variables. Recently, TC has shown noticeable effectiveness as a regularizer in many learning tasks, where the correlation among multiple latent embeddings requires to be jointly minimized or maximized. However, calculating precise TC values is challenging, especially when the closed-form distributions of embedding variables are unknown. In this paper, we introduce a unified framework to estimate total correlation values with sample-based mutual information (MI) estimators. More specifically, we discover a relation between TC and MI and propose two types of calculation paths (tree-like and line-like) to decompose TC into MI terms. With each MI term being bounded, the TC values can be successfully estimated. Further, we provide theoretical analyses concerning the statistical consistency of the proposed TC estimators. Experiments are presented on both synthetic and real-world scenarios, where our TC estimators demonstrate effectiveness in all TC estimation, minimization, and maximization tasks.
UR - http://hdl.handle.net/10754/693270
UR - https://proceedings.mlr.press/v206/bai23a.html
UR - http://www.scopus.com/inward/record.url?scp=85165219021&partnerID=8YFLogxK
M3 - Conference contribution
SP - 2147
EP - 2164
BT - 26th International Conference on Artificial Intelligence and Statistics, AISTATS 2023
PB - ML Research Press
ER -