Grouping text documents by similarity using an unsupervised method is known as clustering.
This technique aggregates similar data points without requiring prior labels.
For text data, algorithms such as K-Means or Hierarchical Clustering place documents with related content into distinct clusters.
This capability is vital for automatically organizing extensive text datasets for applications like search engines, topic modeling, and recommendation systems.
By assigning similar documents to the same group, clustering enhances information retrieval and user access to relevant content.
It represents a core technique within Natural Language Processing (NLP) and unsupervised machine learning.