Identifying Structure across Pre-partitioned Data

Marx, Zvika; Dagan, Ido; Shamir, Eli

Identifying Structure across Pre-partitioned Data

Zvika Marx, Ido Dagan, Eli Shamir

Advances in Neural Information Processing Systems 16 (NIPS 2003)

Abstract

We propose an information-theoretic clustering approach that incorporates a pre-known partition of the data, aiming to identify common clusters that cut across the given partition. In the standard clustering setting the formation of clusters is guided by a single source of feature information. The newly utilized pre-partition factor introduces an additional bias that counterbalances the impact of the features whenever they become correlated with this known partition. The resulting algorithmic framework was applied successfully to synthetic data, as well as to identifying text-based cross-religion correspondences.

Abstract

Name Change Policy