Data Clustering

Definition of Data Clustering

Data Clustering is the classification of a data set into a group of numbers, where members of each group surround a particular number.

More About Data Clustering

    In two-way clustering, data is clustered by using its properties and is represented in a matrix.

Example of Data Clustering

    The ages of 6 people participating in a swimming competition are 39, 34, 35, 34, 33, and 27. In this data, the cluster of ages 34, 35, 34, and 33 are close to the age 34.

Video Examples: Data Mining, Classification, Clustering, Association Rules, Regression, Deviation


Solved Example on Data Clustering

Ques: Identify the cluster in the given data set around the number 75.
56, 61, 71, 75, 77, 74, 76, 75, 75, and 60

    Choices:
    A. 75, 77, 74, 76, 75, and 75
    B. 71, 61, and 60
    C. 60 and 56
    D. 77 and 56
    Correct Answer: A

Solution:

    Step 1: The numbers which are close around a particular number in a given set of data are called cluster.
    Step 2: In the data set, the numbers 77, 74, and 76 are close around the number 75.
    Step 3: So, 75, 77, 74, 76, 75, and 75 from the cluster of the number 75.