
Q: Using sample_n with if_else after group_by in dplyr

Stack Overflow user

Asked on 2020-11-30 12:23:24

Answers: 1 · Views: 92 · Followers: 0 · Votes: 0

Here is a test data frame:

test_df <- structure(list(plant_sp = c("plant_1", "plant_1", "plant_2", "plant_2", "plant_3",
                                       "plant_3", "plant_3", "plant_3", "plant_3", "plant_4", 
                                       "plant_4", "plant_4", "plant_4", "plant_4", "plant_4",
                                       "plant_5", "plant_5", "plant_5", "plant_5", "plant_5"), 
                          site = c("a", "a", "a", "a", "a",  
                                   "b", "b", "b", "b", "b",  
                                   "a", "a", "a", "a", "a",
                                   "b", "b", "b", "b", "b"),
                          sp_rich = c(5, 3, 5, 3, 5, 
                                      7, 8, 8, 8, 10,
                                      1, 4, 5, 6, 3, 
                                      7, 3, 12, 12,11)), 
                     row.names = c(NA, -20L), class = "data.frame", 
                     .Names = c("plant_sp", "site", "sp_rich"))

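I want to group_by plant_sp and, for any group with more than 3 rows, draw 3 random rows from it.

In other words: take each group, and if the group size is greater than 3, keep only 3 randomly chosen rows from that group.

I am trying to use if_else, but I cannot get it to work: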

test_df <- test_df %>% group_by(plant_sp) %>%
if_else(length(plant_sp) > 3, sample_n(size =3))

I think I am not using the length() function correctly.

Could you help me?

Thank you, Ito

dplyr

tidyr

1 Answer

Stack Overflow user

Accepted answer

Posted on 2020-12-01 00:39:34

If you are using dplyr 1.0.0 or later, you can use slice_sample. It keeps 3 randomly chosen rows in each group. If a group has fewer than 3 rows, all of its rows are kept.

library(dplyr)
test_df %>% group_by(plant_sp) %>% slice_sample(n = 3)

#  plant_sp site  sp_rich
#   <chr>    <chr>   <dbl>
# 1 plant_1  a           3
# 2 plant_1  a           5
# 3 plant_2  a           5
# 4 plant_2  a           3
# 5 plant_3  b           8
# 6 plant_3  b           8
# 7 plant_3  b           7
# 8 plant_4  b          10
# 9 plant_4  a           5
#10 plant_4  a           4
#11 plant_5  b           7
#12 plant_5  b          12
#13 plant_5  b           3
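
The condition the question asks for can also be written out explicitly, which may help on dplyr versions older than 1.0.0 where slice_sample is not available. The following is a minimal sketch rather than part of the original answer: the compact rebuild of test_df, the seed value, and the names sampled, before, and after are assumptions made for illustration. Inside a grouped slice(), n() is the size of the current group, so min(3, n()) never requests more rows than the group has.

```r
library(dplyr)

# Same data as test_df in the question, rebuilt compactly (illustrative).
test_df <- data.frame(
  plant_sp = rep(paste0("plant_", 1:5), times = c(2, 2, 5, 6, 5)),
  site     = rep(c("a", "b", "a", "b"), each = 5),
  sp_rich  = c(5, 3, 5, 3, 5, 7, 8, 8, 8, 10, 1, 4, 5, 6, 3, 7, 3, 12, 12, 11),
  stringsAsFactors = FALSE
)

set.seed(42)  # assumed seed, only to make the random draw repeatable

# Explicit form of "if the group has more than 3 rows, keep 3 at random":
# sample(n(), k) draws k distinct row positions from 1..n() within each group.
sampled <- test_df %>%
  group_by(plant_sp) %>%
  slice(sample(n(), min(3, n()))) %>%
  ungroup()

# Compare group sizes before and after sampling.
before <- count(test_df, plant_sp, name = "n_before")
after  <- count(sampled, plant_sp, name = "n_after")
left_join(before, after, by = "plant_sp")
```

Which rows are drawn depends on the seed, but the group sizes do not: plant_1 and plant_2 keep both of their rows, while plant_3, plant_4, and plant_5 are each reduced to 3, for 13 rows in total.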

Votes: 2

Original page content provided by Stack Overflow.

Original link:

https://stackoverflow.com/questions/65073527
