"Clustering data"

mskhmskh MemberPosts:13Learner I
edited May 2019 inHelp
Hi,
I calculate standard deviation and average of my data set. I want to cluster my data set into 3 clusters which cluster_0 consists of data between 0 to average, cluster_1 between average and average+standard deviation and cluster_3 consists of data between 2*standard deviation to maximum value. which clustering technique i should use?
Thanks
Tagged:

Answers

  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,635Unicorn
    If you have discrete cluster identities in mind, then this really isn't an application of clustering. Clustering techniques are generally non-deterministic and unsupervised ML algorithms.
    But you can easily code your "clusters" manually in RapidMiner using some if/then logic within Generate Attributes, or Discretize by User Specification.
    Brian T.
    Lindon Ventures
    Data Science Consulting from Certified RapidMiner Experts
    sgenzer mskh robin
Sign InorRegisterto comment.