我正试着从一个页面中获取所有股票代码/缩写。但是当我擦拭页面时,会出现一些不想要的代码,它们基本上都以F或(space)结尾,例如"BDLL4F "或"QCOM34F"。我成功地使用gsub()和regex删除了它们。
stocktickers = c("PETR4", "VALE3", "MDNE3", "BDLL4F ", "QCOM34F", "SANB11", "USIM5")
stocktickers = gsub("(.*[ F]$)","NULL",stocktickers)
stocktickers = stocktickers[stocktickers!="NULL"]
> stocktickers
[1] "PETR4" "VALE3" "MDNE3" "SANB11" "USIM5" 如果值以指定的字符或整数开头或结尾,是否有任何函数会有条件地从数组中删除字符串?
发布于 2020-06-06 05:31:10
您可以使用regex删除不需要的代码。
例如,使用grep:
stocktickers = c("PETR4", "VALE3", "MDNE3","BDLL4F ","QCOM34F", "SANB11","USIM5")
grep("(F|\\s)$",stocktickers, value = TRUE, invert = TRUE)
#[1] "PETR4" "VALE3" "MDNE3" "SANB11" "USIM5" 这将从stocktickers中移除以"F"或空格结尾的值。
发布于 2020-06-06 18:27:03
endsWith在base R中的一个选项
stocktickers[!endsWith(trimws(stocktickers), "F")]
#[1] "PETR4" "VALE3" "MDNE3" "SANB11" "USIM5" 数据
stocktickers = c("PETR4", "VALE3", "MDNE3","BDLL4F ","QCOM34F", "SANB11","USIM5")https://stackoverflow.com/questions/62227520
复制相似问题