Clustering Sequece Data using Hiddem Markov Model Representation

Cen Li and Gautam Biswas

Department of Computer Science, Vanderbilt University
Box 1679, Station B,
Nashville, TN 37235

in SPIE'99 Conference on Data Mining and Knowledge Discovery: Theory, Tools, and Technology,
page 14-21, Orlando, FL, April 4-6,1999


This paper proposed a clustering method for sequence data using hidden Markov model(HMM) representation. The proposed methodology improves upon existing HMM based clustering methods in two ways:

The algorithm is presented in terms of four nested levels of searches: Preliminary results are given to support the proposed methodology.

Keywords: clustering, hidden Markov model, model selection, Bayesian Information Criterion (BIC), mutual information

Full Paper (PDF 163840 bytes)