k means - How can I cluster text data with multiple columns?

Question


1 Reply

深蓝 · Answer 1 · 2022-01-31T07:23:29+0000

You can vectorize each column separately and concatenate the results.

Just make sure you do a sparse concatenation.
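As a rough sketch of what that could look like, assuming scikit-learn and SciPy are available: the toy data, the column names `title` and `body`, and the choice of TF-IDF with two clusters are all made up for illustration and are not from the question.

```python
from scipy.sparse import hstack
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical toy data: two text columns, four rows.
columns = {
    "title": [
        "cheap flights to Rome",
        "flight deals Paris",
        "python pandas tutorial",
        "learn python basics",
    ],
    "body": [
        "book airline tickets online",
        "airline tickets and hotels",
        "dataframes and series in code",
        "variables loops and functions in code",
    ],
}

# Fit one vectorizer per column so each column gets its own vocabulary.
blocks = [TfidfVectorizer().fit_transform(texts) for texts in columns.values()]

# scipy.sparse.hstack joins the blocks column-wise and the result stays sparse;
# no dense array is ever created.
X = hstack(blocks).tocsr()

# KMeans accepts sparse input directly; k=2 is an arbitrary choice here.
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
```

Each row of `X` is the row's `title` features followed by its `body` features. If one column should count for more than the other, you could multiply its block by a weight before stacking.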

However, k-means does not work well for clustering text. K-means is very sensitive to outliers and noise, and text is full of noise. The fundamental assumptions of k-means (k signals, and i.i.d. Gaussian error) do not hold for text. Good luck...
