5.4 主成分聚类与主成分回归
5.4.1 变量聚类与样品分类
主成分分析可用于聚类:变量聚类与样品聚类。
变量聚类:由主成分系数的差异,可将变量聚类。例如例5.5中第2主成分中murder,rape, assult系数为负的, burglary,larceny, auto系数是正的。按系数正负可把7个变量分为两类:电磁兼容性 murder, rape, assult属于暴力程度严重的一类;道歉信怎么写才真诚burglary,larceny,auto属于暴力程度较轻的一类。按照这种方法,根据主成分系数的正负可以将变量聚类。
样品聚类:如果2个主成分能很好的概括随机向量的信息,计算每个样品的这两个主成分得分,把他们的散点图画出来,就能从图上将样品分类。
例5.5(续2) 按照第一、第二主成分得分,画出散点图
data crime; /*建立数据集crime*/
input state $ 1-15 murder rape robbery assult burglary larceny auto;
/*建立变量state murder rape robbery assult burglary larceny auto。state $ 1-15表示前15列存州名。murder rape robbery assult burglary larceny auto 表按摩手法大全7种罪的犯罪率*/
cards; /*以下为数据体*/
一起洗澡的老师 Albama 14.2 25.2 96.8 278.3 1135.5 1881.9 280.7
Alaska 10.8 51.6 96.8 284.0 1331.7 3369.8 753.3
唐朝官制
Arirona 9.5 34.2 138.2 312.3 2346.1 4467.4 439.5
Arkansas 8.8 34.2 138.2 312.3 2346.1 4467.4 439.5
Califonia 11.5 49.4 287.0 358.0 2139.4 3499.8 663.5
个人简历表格
Colorado 6.3 42.0 170.7 292.9 1935.2 3903.2 477.1
Conecticat 4.2 16.8 129.5 131.8 1346.0 2620.7 593.2
Delaware 6.0 24.9 157.0 194.2 1682.6 3678.4 467.0
Florida 10.2 39.6 187.9 449.1 1859.9 3840.5 351.4
Geogia 11.7 31.1 140.5 256.5 1351.1 2170.2 297.9新阴流
Hawaii 7.2 25.5 128.0 64.1 1911.5 3920.4 489.4
Idaho 5.5 19.4 39.6 172.5 1050.8 2599.6 237.6
Illinois 9.9 21.8 211.3 209.0 1085.0 2828.5 528.6
Indiana 7.4 26.5 123.2 153.5 1086.2 2498.7 377.4
Iowa 2.3 10.6 41.2 89.8 812.5 2685.1 219.9
Kansas 6.6 22.0 100.7 180.5 1270.4 2739.3 244.3
Kentaky 10.1 19.1 81.1 123.3 872.2 1662.1 245.4
Loisana 15.5 30.9 142.9 335.5 1165.5 2469.9 337.7
Maine 2.4 13.5 38.7 170.0 1253.1 2350.7 246.9
Maryland 8.0 34.8 292.1 358.9 1400.0 3177.7 428.5
Masschutts 3.1 20.8 169.1 231.6 1532.2 2311.3 1140.1
Michigan 9.3 38.9 261.9 274.6 1522.7 3159.0 545.5
Minnesota 2.7 19.5 85.9 85.8 1134.7 2559.3 343.1
Mississippi 14.3 19.6 65.7 189.1 915.6 1239.9 144.4
Missouri 9.6 28.3 189.0 233.5 1318.3 2424.2 378.4
Montana 5.4 16.7 39.2 156.8 804.9 2773.2 309.3
Nebraska 3.9 18.1 64.7 112.7 760.0 2316.1 249.1
Nevada 15.8 49.1 323.1 355.0 2453.1 4212.6 559.2
Mew Hampashare 3.2 10.7 23.2 76.0 1041.7 2343.9 293.4
New Jery 5.6 21.0 180.4 185.1 1435.8 2774.5 511.5
New Maxico 8.8 39.1 109.6 343.4 1418.7 3008.6 259.5
New York 10.7 29.4 472.6 319.1 1728.0 2782.0 745.8
North Carolina 10.6 17.0 61.3 318.3 1154.1 2037.8 192.1
North Dakoda 100.9 9.0 13.3 43.8 446.1 1843.0 144.7
Ohio 7.8 27.3 190.5 181.1 1216.0 2696.8 400.4
Oklahoma 8.6 29.2 73.8 205.0 1288.2 2228.1 326.8
Oregan 4.9 39.9 124.1 286.9 1636.4 3506.1 388.9
Pennsyvania 5.6 19.0 130.3 128.0 877.5 1624.1 333.2
Rhode Island 3.6 10.5 86.5 201.0 1849.5 2844.1 791.4
South Carolina 11.9 33.0 105.9 485.3 1613.6 2342.4 245.1
South Dakoda 2.0 13.5 17.9 155.7 570.5 1704.4 147.5
Tenne 10.1 29.7 145.8 203.9 1259.7 1776.5 314.0
Texas 13.3 33.8 152.4 208.2 1603.1 2988.7 397.6
三星A7000