TitleInformation Loss Evaluation based on Fuzzy and Crisp Clustering of Graph Statistics
Publication TypeConference Paper
Year of Publication2012
AuthorsNettleton D.F.
Conference NameIEEE World Congress on Computational Intelligence (WCCI) 2012
Conference LocationBrisbane, Australia
Date Published11/06/2012
KeywordsData Privacy, fuzzy clustering, graphs
Abstract

In this paper we apply different types of clustering, fuzzy (fuzzy c-Means) and crisp (k-Means) to graph statistical data in order to evaluate information loss due to perturbation as part of the anonymization process for a data privacy application. We make special emphasis on two major node types: hubs, which are nodes with a high relative degree value, and bridges, which act as connecting nodes between different regions in the graph. By clustering the graph's statistical data before and after perturbation, we can measure the change in characteristics and therefore the information loss. We partition the nodes into three groups: hubs/global bridges, local bridges, and all other nodes. We suspect that the partitions of these nodes are best represented in the fuzzy form, especially in the case of nodes in frontier regions of the graphs which may have an ambiguous assignment.