通过lapply和regex批量创建列到数据中的列,R表

au9on6nz  于 2023-01-06  发布在  其他
关注(0)|答案(1)|浏览(109)

我想在一些字符串后得到值,演示如下

  1. dt <- data.table(col.1 = c("a1, b2, c3, d4"))
  2. x <- c("a", "b", "c")
  3. dt[, (x) := lapply(FUN = str_match(string = .SD,
  4. pattern = paste0("(?<=", x, ")([\\d])"))[, 2],
  5. X = x),
  6. .SDcols = "col.1"]

理想的结果如下所示

  1. desirable <- data.table(col.1 = c("a1, b2, c3, d4"),
  2. a = c("1"),
  3. b = c("2"),
  4. c = c("3"))

我收到如下错误消息:

  • www.example.com中的错误match.fun(有趣):
  1. c("'str_match(string = .SD, pattern = paste0(\"(?<=\", x, \")([\\\\d])\"))[, ' is not a function, character or symbol", "' 2]' is not a function, character or symbol")*

但是我想不出怎么解决这个问题。谁能给予我点建议?

zrfyljdw

zrfyljdw1#

循环模式并使用str_match提取值

  1. library(data.table)
  2. library(stringr)
  3. dt[, (x) := lapply(paste0("(?<=", x, ")(\\d+)"),
  4. \(x) str_match(col.1, x)[, 2])]
  5. col.1 a b c
  6. 1: a1, b2, c3, d4 1 2 3

或使用strcapture

  1. pat <- paste0(sprintf("%s(\\d+)", x), collapse = ".*")
  2. cbind(dt, dt[, strcapture(pat, col.1, setNames(rep(list(integer()), 3), x))])
  3. col.1 a b c
  4. 1: a1, b2, c3, d4 1 2 3

相关问题