文章/答案/技术大牛

发布

社区首页 >问答首页 >ggplot2:使用geom_polygon覆盖凸簇的问题

问ggplot2:使用geom_polygon覆盖凸簇的问题
EN

Stack Overflow用户

提问于 2019-12-11 04:11:21

回答 2查看 266关注 0票数 1

我想根据这些数据创建一个覆盖了凸簇和组(V1)颜色的图；

 str(stackoverflow)
 Classes ‘data.table’ and 'data.frame': 174 obs. of  4 variables:
 $ ID   : Factor w/ 277 levels "1001","1021",..: 1 2 4 5 6 7 8 9 10 11 ...
 $ UMAP1: num  -1.1313 -0.8176 0.1355 -0.0957 0.0724 ...
 $ UMAP2: num  0.219 0.48 -1.378 -0.95 -1.229 ...
 $ V1   : Factor w/ 3 levels "0","1","2": 3 1 1 1 1 1 1 1 1 1 ...

我计算了一个K-means聚类，如下所示；

km<-eclust(lipid.b.kmeans[,2:3],"kmeans", k=5, nboot = 2)

并且由此得到的km是；

    km
K-means clustering with 5 clusters of sizes 38, 15, 18, 42, 61

Cluster means:
       UMAP1     UMAP2
1 -5.3988979 -1.585529
2 -0.4963504  0.470514
3  4.9895693  3.208727
4  1.6177653  1.461844
5  0.8990981 -1.081347

Clustering vector:
  [1] 2 2 5 5 5 5 5 5 5 5 1 1 1 1 1 5 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 3 1 1 1 4 1 4 1 1 4 1 1 1 1 4 3 4 4 4 4 4 4 4 4 3 4 4 3 4 4 4
 [72] 4 3 4 4 4 4 4 3 2 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 3 3 3 4 3 3 4 3 3 3 3 5 3 2 4 3 3 2 2 2 4 2 5 5 5 2 5 5 5 2 5 5 5 2 5 5 5 5 5 5 5 5 5 2 5 5 5
[143] 5 5 5 5 5 5 5 5 5 5 5 5 5 2 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 2 5

Within cluster sum of squares by cluster:
[1] 13.897357  2.224761  2.208410 16.884947 59.421921
 (between_SS / total_SS =  95.8 %)

然后我生成了一个凸日期；

k<-fviz_cluster(km, data = stackoverflow[, 2:3], repel = T, ellipse.type = "convex") 
d<-k$data    
convex.data<-d %>% group_by(cluster) %>% slice(chull(x,y))

这就是；

str(convex.data)
Classes ‘grouped_df’, ‘tbl_df’, ‘tbl’ and 'data.frame': 46 obs. of  4 variables:
 $ name   : Factor w/ 174 levels "1","10","100",..: 46 118 101 116 108 119 123 30 1 99 ...
 $ x      : num  -1.54 -1.87 -1.89 -1.7 -1.67 ...
 $ y      : num  -1.36 -1.362 -1.089 -0.592 -0.56 ...
 $ cluster: Factor w/ 5 levels "1","2","3","4",..: 1 1 1 1 1 1 1 2 2 2 ...
 - attr(*, "groups")=Classes ‘tbl_df’, ‘tbl’ and 'data.frame':  5 obs. of  2 variables:
  ..$ cluster: Factor w/ 5 levels "1","2","3","4",..: 1 2 3 4 5
  ..$ .rows  :List of 5
  .. ..$ : int  1 2 3 4 5 6 7
  .. ..$ : int  8 9 10 11 12 13 14
  .. ..$ : int  15 16 17 18 19 20 21
  .. ..$ : int  22 23 24 25 26 27 28 29 30 31
  .. ..$ : int  32 33 34 35 36 37 38 39 40 41 ...
  ..- attr(*, ".drop")= logi TRUE

当我使用ggplot2绘制时，使用；

ggplot(convex.data, aes(x,y))+geom_point(data=stackoverflow, aes(x=UMAP1, y=UMAP2, color= V1))+geom_polygon(data=convex.data, alpha=0.5, aes(fill=cluster,linetype=cluster))

我得到了这个图表

多边形凸簇图像的x和y尺度比geom_point图像的x和y尺度小。我使用expand_limits来校正比例差异，但根本不起作用。在这一点上，我真的很感激任何建议或指示来解决这个问题。

ggplot2

回答 2

Stack Overflow用户

回答已采纳

发布于 2019-12-11 16:08:44

也许您可以决定使用ggalt::geom_encircle()来简化您的工作，这里是一个使用iris数据集的示例：

set.seed(1234)
# using only two variables
model <- kmeans(iris[,1:2],3)
# new dataset with clusters as factors
iris1 <- data.frame(iris[,1:2], clust = as.factor(model$cluster))

library(ggplot2)
library(ggalt)
ggplot(iris1, aes(x =Sepal.Length, y =Sepal.Width, color = clust)) + geom_point() +
  geom_encircle(aes(fill = clust), s_shape = 1, expand = 0,
                alpha = 0.2, color = "black", show.legend = FALSE)

票数 3

Stack Overflow用户

发布于 2019-12-13 02:11:40

我通过使用来自ggscatter (ggpubr)的stat_chull解决了这个问题，如下所示；

ggscatter(na.omit(stackoverflow), "UMAP1", "UMAP2", color = "V1")+stat_chull(aes(color=cluster, fill=cluster), alpha=0.1, geom = "polygon", na.rm = T)

这给了我一个我想要的图。

还有一个包ggConvexHull ("cmartin/ggConvexHull")可以工作。

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/59274662

复制

相似问题

问ggplot2:使用geom_polygon覆盖凸簇的问题
EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问ggplot2:使用geom_polygon覆盖凸簇的问题EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问ggplot2:使用geom_polygon覆盖凸簇的问题
EN