Tuesday, January 31, 2017

Steps to Create Text Clusters

Create TFIDF Matrix: rows are documents, columns are normalized text tokens, N x T
Apply Sigular Value Decomposition(SVD) to reduce dimensions, N x V
Use Gaussian Mixture Model(GMM) to create clusters: N x S

No comments:

Post a Comment