The Effect of Evidence Transfer on Latent Feature Relevance for Clustering

Apply

The Effect of Evidence Transfer on Latent Feature Relevance for Clustering

Journal Title: Informatics - Year 2019, Vol 6, Issue 2

Abstract

Evidence transfer for clustering is a deep learning method that manipulates the latent representations of an autoencoder according to external categorical evidence with the effect of improving a clustering outcome. Evidence transfer’s application on clustering is designed to be robust when introduced with a low quality of evidence, while increasing the effectiveness of the clustering accuracy during relevant corresponding evidence. We interpret the effects of evidence transfer on the latent representation of an autoencoder by comparing our method to the information bottleneck method. Information bottleneck is an optimisation problem of finding the best tradeoff between maximising the mutual information of data representations and a task outcome while at the same time being effective in compressing the original data source. We posit that the evidence transfer method has essentially the same objective regarding the latent representations produced by an autoencoder. We verify our hypothesis using information theoretic metrics from feature selection in order to perform an empirical analysis over the information that is carried through the bottleneck of the latent space. We use the relevance metric to compare the overall mutual information between the latent representations and the ground truth labels before and after their incremental manipulation, as well as, to study the effects of evidence transfer regarding the significance of each latent feature.

Authors and Affiliations

Athanasios Davvetas, Iraklis A. Klampanos, Spiros Skiadopoulos and Vangelis Karkaletsis

Keywords

deep neural networks; evidence transfer; relevance; feature selection; information bottleneck; latent features; deep learning

Selective Wander Join: Fast Progressive Visualizations for Data Joins

Progressive visualization offers a great deal of promise for big data visualization; however, current progressive visualization systems do not allow for continuous interaction. What if users want to see more confident...

Big Data in the Era of Health Information Exchanges: Challenges and Opportunities for Public Health

Public health surveillance of communicable diseases depends on timely, complete, accurate, and useful data that are collected across a number of healthcare and public health systems. Health Information Exchanges (HIEs)...

Evaluating Awareness and Perception of Botnet Activity within Consumer Internet-of-Things (IoT) Networks

The growth of the Internet of Things (IoT), and demand for low-cost, easy-to-deploy devices, has led to the production of swathes of insecure Internet-connected devices. Many can be exploited and leveraged to perform l...

Identifying Opportunities to Integrate Digital Professionalism into Curriculum: A Comparison of Social Media Use by Health Profession Students at an Australian University in 2013 and 2016

Social media has become ubiquitous to modern life. Consequently, embedding digital professionalism into undergraduate health profession courses is now imperative and augmenting learning and teaching with mobile technol...

Acknowledgement to Reviewers of Informatics in 2016

The editors of Informatics would like to express their sincere gratitude to the following reviewers for assessing manuscripts in 2016. We greatly appreciate the contribution of expert reviewers, which is crucial to the...

EP ID EP44182
DOI https://doi.org/10.3390/informatics6020017
Views 285
Downloads 0