R语言聚类分析

更新时间:2023-07-30 14:32:30 阅读：评论：0

R语⾔聚类分析基本过程

读数据

数据标准化

计算距离

分类

library('flexclust')# 导⼊数据集

data(nutrient)# 加载数据

row.names(nutrient)<- tolower(row.names(nutrient))# 把⾏索引⼩写

90后head(nutrient)

#由于能量变化范围⽐其他变量更⼤，缩放数据有利于均衡各变量的影响。

nutrient_scaled <- scale(nutrient)# 数据标准化

head(nutrient_scaled)

d <- dist(nutrient_scaled)# 计算距离默认欧式距离黑豆饭

fit_average <- hclust(d, method="average")# 聚类

plot(fit_average, hang =-1, main ="Average Linkage Clustering")

初一作文我的偶像

K-means聚类与PAM聚类

仁学

# K-means聚类，做之前⽤outlet去除异常值

library('flexclust')# 导⼊数据集

data(nutrient)

row.names(nutrient)<- tolower(row.names(nutrient))# 把⾏索引⼩写

library(factoextra)

nutrient_scaled <- scale(nutrient)# 数据标准化

d <- dist(nutrient_scaled)

fviz_cluster(kmeans(d,3), nutrient)

# 围绕中⼼点的划分（PAM）

library('flexclust')# 导⼊数据集

library(cluster)

可爱的狗种类data(nutrient)

row.names(nutrient)<- tolower(row.names(nutrient))# 把⾏索引⼩写

nu_pam <- pam(nutrient,3, metric="euclidean")

有熊氏百合花图片大全fviz_cluster(nu_pam, nutrient)

七五三

本文发布于:2023-07-30 14:32:30，感谢您对本站的认可！

标签：数据变量距离

留言与评论（共有 0 条评论）