cluster analysis - Cut off point in k-means clustering in sas -


so want classify data clusters cut-off point in sas. method use k-means clustering. (i don't mind method, long as, gives me 3 groups.)

my code clustering:

proc fastclus data=maindat outseed=seeds1 maxcluster =3 maxiter=0; var value resid; run; 

i have problem output result. want cut-off point value include in output file. (i don't want cut-off point resid). there anyway in sas?

edit: joe point out, can't achieve i'm looking using k-mean clustering. there way? basically, want cut-off point can apply data set.

what have:

cluster  value      resid  1        34        11.7668  2        38.9      0.5328  3        42.625    -13.2364 

what want:

cluster  value      resid       cut-off value (interger)  1        34        11.7668     1-36  2        38.9      0.5328      36-40  3        42.625    -13.2364    40-44 

my data:

data maindat; input  value resid ; datalines; 44  -4.300511714 44  -9.646920963 44  -15.86956805 43  -16.14857235 43  -13.05797186 43  -13.80941206 42  -3.521394503 42  -1.102526302 42  -0.137573583 42  2.669238665 42  -9.540489193 42  -19.27474303 42  -3.527077011 41  1.676464068 41  -2.238822314 41  4.663079037 41  -5.346920963 40  -8.543723186 40  0.507460641 40  0.995302284 40  0.464194011 39  4.728791571 39  5.578685423 38  2.771297564 38  7.109159247 37  15.96059456 37  2.985292226 36  -4.301136971 35  5.854674875 35  5.797294021 34  4.393329025 33  -6.622580905 32  0.268500302 27  12.23062252 ; run; 

i don't think completely.

k-means clustering uses euclidean distance between of variables provide it. means it's not solely using value cluster observations: it's using resid well.

as such, it's possible row value seems should go cluster 2 should go cluster 3, if resid value closer there.

in example, if request out dataset, see true. proc freq of out dataset reveals cluster 1 has 3 rows, values 27, 37, , 38. cluster 2 has of rows - 7 in total - ranging 32 44. cluster 3 ranges 40 44.

as such, there's no reasonable way define clusters way ask method of clustering. clusters typically defined centroid, , that's outstat dataset; can determine cluster particular value should assigned based on this.


Comments

Popular posts from this blog

java - Could not locate OpenAL library -

c++ - Delete matches in OpenCV (Keypoints and descriptors) -

sorting - opencl Bitonic sort with 64 bits keys -