R语言 在循环中运行双因素混合模型ANOVA时出错,但在明确标识数据列时未出错

bz4sfanl  于 2022-12-20  发布在  其他
关注(0)|答案(1)|浏览(112)

这是产生错误的循环。
错误:!无法使用df_numeric[[column]]对列进行子集化。由于精度损失,无法从df_numeric[[column]]转换为。

df_numeric <- df[, sapply(df, is.numeric)]

for (column in names(df_numeric)) {
  res.aov <- anova_test(data = df, dv= df_numeric[[column]], wid = `Subject`, within = `Timepoint`, between = `Genotype`)
  get_anova_table(res.aov)
}

但是,当我拿出方差分析的代码,并专门从我的 Dataframe 中输入列时,它会生成正确的方差分析表结果。

res.aov <- anova_test(data = df, dv=  `Tregs CD127lo CD25+`, wid = `Subject`, within = `Timepoint`, between = `Genotype`)
get_anova_table(res.aov)

我试过使用df_numeric$column
Dataframe

library(rstatix)

 dput(df_numeric)
structure(list(`Tregs CD127lo CD25+` = c(2702, 2175, 2651, 1672.8, 
3762, 4264, 1975, 3208, 3285, 3457, 3383, 2619.9, 11872, 16101, 
13443, 3935, 1894, 2297, 7385, 8901, 9522, 7100, 8789, 9309, 
371, 379, 514), `Monocytes % of Live by Size` = c(1.38, 2.66, 
4.74, 5.83, 3.9, 5.06, 6.36, 3.45, 2.64, 6.33, 10.7, 9.41, 3.42, 
3.46, 2.73, 2.38, 3.12, 4.44, 5.31, 3.59, 4.91, 1.53, 6.54, 4.85, 
6.87, 3.66, 5.07), `NK cells` = c(90.62, 153.6, 159.8, 88, 118, 
159, 74, 82, 64, 30, 344, 73, 29, 198, 79, 145, 258, 307, 30, 
74.4, 0, 47.3, 32, 0, 52.6, 95.3, 51.7)), row.names = c(NA, -27L
), class = c("tbl_df", "tbl", "data.frame"))

> dput(df)
structure(list(Subject = c("ASCVD002", "ASCVD002", "ASCVD002", 
"ASCVD003", "ASCVD003", "ASCVD003", "ASCVD004", "ASCVD004", "ASCVD004", 
"ASCVD005", "ASCVD005", "ASCVD005", "ASCVD006", "ASCVD006", "ASCVD006", 
"ASCVD008", "ASCVD008", "ASCVD008", "ASCVD009", "ASCVD009", "ASCVD009", 
"ASCVD010", "ASCVD010", "ASCVD010", "ASCVD011", "ASCVD011", "ASCVD011"
), Timepoint = c("0", "0.25", "0.5", "0", "0.25", "0.5", "0", 
"0.25", "0.5", "0", "0.25", "0.5", "0", "0.25", "0.5", "0", "0.25", 
"0.5", "0", "0.25", "0.5", "0", "0.25", "0.5", "0", "0.25", "0.5"
), Genotype = c("Heterozygote", "Heterozygote", "Heterozygote", 
"Heterozygote", "Heterozygote", "Heterozygote", "Heterozygote", 
"Heterozygote", "Heterozygote", "GG", "GG", "GG", "AA", "AA", 
"AA", "GG", "GG", "GG", "AA", "AA", "AA", "AA", "AA", "AA", "GG", 
"GG", "GG"), `Tregs CD127lo CD25+` = c(2702, 2175, 2651, 1672.8, 
3762, 4264, 1975, 3208, 3285, 3457, 3383, 2619.9, 11872, 16101, 
13443, 3935, 1894, 2297, 7385, 8901, 9522, 7100, 8789, 9309, 
371, 379, 514), `Monocytes % of Live by Size` = c(1.38, 2.66, 
4.74, 5.83, 3.9, 5.06, 6.36, 3.45, 2.64, 6.33, 10.7, 9.41, 3.42, 
3.46, 2.73, 2.38, 3.12, 4.44, 5.31, 3.59, 4.91, 1.53, 6.54, 4.85, 
6.87, 3.66, 5.07), `NK cells` = c(90.62, 153.6, 159.8, 88, 118, 
159, 74, 82, 64, 30, 344, 73, 29, 198, 79, 145, 258, 307, 30, 
74.4, 0, 47.3, 32, 0, 52.6, 95.3, 51.7)), class = c("tbl_df", 
"tbl", "data.frame"), row.names = c(NA, -27L))
nhaq1z21

nhaq1z211#

感谢您提供dput和代码-您使用的是df数据集,因此实际上并不需要df_numeric数据集。要获取所有数值列的名称,可以使用以下代码:此外,您还为命令dv分配了一个值向量-它应该只是列的名称。
下面应该为你工作,它为我:

for (column in names(df)[unlist(lapply(df, is.numeric))]) {
  res.aov <- rstatix::anova_test(data = df, dv = column, 
                                 wid = `Subject`, within = `Timepoint`, between = `Genotype`)
  rstatix::get_anova_table(res.aov)
}

注意,在循环中,每次迭代都会覆盖res.aov,并且不会存储get_anova_table(res.aov)的结果-我建议将这些数据存储在列表中:

nnames <- names(df)[unlist(lapply(df, is.numeric))]
res.aov <- list()
aov_tab <- list()
for (column in nnames) {
  res.aov[[column]] <- rstatix::anova_test(data = df, dv = column, 
                                 wid = `Subject`, within = `Timepoint`, between = `Genotype`)
  aov_tab[[column]] <- rstatix::get_anova_table(res.aov[[column]])
}

相关问题