R语言 重新塑造数据以便与geeglm()一起使用

yqlxgs2m  于 2023-06-19  发布在  其他
关注(0)|答案(1)|浏览(135)

你能帮我弄清楚为什么我得到一个错误吗?
最初我的数据看起来像这样:

> attributes(compl)$names
 [1] "UserID"         "compl_bin"      "Sex.x"          "PHQ_base"       "PHQ_Surv1"      "PHQ_Surv2"      "PHQ_Surv3"    
 [8] "PHQ_Surv4"      "EFE"            "Neuro"          "Intervention.x" "depr0"          "error1_1.x"     "error1_2.x"   
[15] "error1_3.x"     "error1_4.x"     "stress0"        "stress1"        "stress2"        "stress3"        "stress4"      
[22] "hours1"         "hours2"         "hours3"         "hours4"         "subject"

首先,我重塑我的数据,为geeglm做准备:

compl$subject <- factor(rownames(compl))
nobs <- nrow(compl) 
compl_long <- reshape(compl, idvar = "subject",
                      varying = list(c("PHQ_Surv1", "PHQ_Surv2" ,
                                       "PHQ_Surv3", "PHQ_Surv4"), 
                                     c("error1_1.x", "error1_2.x",
                                       "error1_3.x", "error1_4.x"), 
                                     c("stress1", "stress2", "stress3",
                                       "stress4"), 
                                     c("hours1", "hours2", "hours3",
                                       "hours4")), 
                      v.names = c("PHQ", "error", "stress", "hours"),
                      times = c("1", "2", "3", "4"), direction = "long")
  • -(编者注:不确定下一个输出来自什么...)
[1] "UserID"         "compl_bin"      "Sex.x"          "PHQ_base"       "EFE"            "Neuro"          "Intervention.x"
 [8] "depr0"          "stress0"        "subject"        "time"           "PHQ"            "error"          "stress"       
[15] "hours"

然后我使用geeglm函数:

library(geepack)

geeSand=(geeglm(PHQ~as.factor(compl_bin) + Neuro+PHQ_base+as.factor(depr0) +
                    EFE+as.factor(Sex.x) + as.factor(error)+stress+hours,
                    family = poisson, data=compl_long,
                    id=subject, corst="exchangeable"))

我得到一个错误:

"Error in geese.fit(xx, yy, id, offset, soffset, w, waves = waves, zsca,  : 
  nrow(zsca) and length(y) not match"

如果我删除变量.factor(error)和hours,geeglm不会抱怨,我会得到输出。该函数不适用于错误和小时变量。我检查所有变量的长度,它们是相等的。你能帮我弄清楚是怎么回事吗?
非常感谢!

z9smfwbn

z9smfwbn1#

找到这个在:https://stat.ethz.ch/pipermail/r-help/2008-October/178337.html
我很确定这是鹅()中的一个bug,应该向
geepack的维护者问题在于失踪的处理
价值观。
如果看dim(na.omit(dat[,c("id","score","chem","time")]))一个
在geese.fit()中,zsca被设置为等于矩阵(1,N,1),其中N被设置为
等于length(id)。但是id的长度为46,而响应y的长度为
通过消除数据的任何行来削减到长度44,其中
所涉及的变量缺失。所以有问题。
问题的解决需要重写一些代码
Geepack的维护者。

相关问题