The Effect of Evidence Transfer on Latent Feature Relevance for Clustering
Journal Title: Informatics - Year 2019, Vol 6, Issue 2
Abstract
Evidence transfer for clustering is a deep learning method that manipulates the latent representations of an autoencoder according to external categorical evidence with the effect of improving a clustering outcome. Evidence transfer’s application on clustering is designed to be robust when introduced with a low quality of evidence, while increasing the effectiveness of the clustering accuracy during relevant corresponding evidence. We interpret the effects of evidence transfer on the latent representation of an autoencoder by comparing our method to the information bottleneck method. Information bottleneck is an optimisation problem of finding the best tradeoff between maximising the mutual information of data representations and a task outcome while at the same time being effective in compressing the original data source. We posit that the evidence transfer method has essentially the same objective regarding the latent representations produced by an autoencoder. We verify our hypothesis using information theoretic metrics from feature selection in order to perform an empirical analysis over the information that is carried through the bottleneck of the latent space. We use the relevance metric to compare the overall mutual information between the latent representations and the ground truth labels before and after their incremental manipulation, as well as, to study the effects of evidence transfer regarding the significance of each latent feature.
Authors and Affiliations
Athanasios Davvetas, Iraklis A. Klampanos, Spiros Skiadopoulos and Vangelis Karkaletsis
Preferences of Informal Carers on Technology Packages to Support Meal Production by People Living with Dementia, Elicited from Personalised AT and ICT Product Brochures
Assistive technology (AT) can help support the continued independence of people living with dementia, supported by informal carers. Opinions and preferences of informal carers towards a range of assistive and digital i...
Artery Segmentation in Ultrasound Images Based on an Evolutionary Scheme
Segmentation in ultrasound (US) images is a challenge in computer vision, due to the high signal noise, artifacts that produce discontinuities in the boundaries and shadows that hide part of the received signal. In thi...
Fitness Activity Recognition on Smartphones Using Doppler Measurements
Quantified Self has seen an increased interest in recent years, with devices including smartwatches, smartphones, or other wearables that allow you to monitor your fitness level. This is often combined with mobile apps...
Direct Visual Editing of Node Attributes in Graphs
There are many expressive visualization techniques for analyzing graphs. Yet, there is only little research on how existing visual representations can be employed to support data editing. An increasingly relevant task...
Constructing Interactive Visual Classification, Clustering and Dimension Reduction Models for n-D Data
The exploration of multidimensional datasets of all possible sizes and dimensions is a long-standing challenge in knowledge discovery, machine learning, and visualization. While multiple efficient visualization methods...