文章/答案/技术大牛

发布

社区首页 >问答首页 >SPIA软件包(R)在Illumina表达芯片数据中的应用

问SPIA软件包(R)在Illumina表达芯片数据中的应用
EN

Stack Overflow用户

提问于 2016-03-01 17:04:29

回答 2查看 174关注 0票数 1

我一直在试验用于表达式(MRNA)数据的替代注释。SPIA (信令路径集成分析)看起来很有趣，但它似乎为所有内容都提供了一条错误消息：

Error in spia(de = sigGenes, all = allGenes, organism = "hsa", plots = TRUE,  :  
de must be a vector of log2 fold changes. The names of de should >be     
included in the reference array!

输入需要一个log2折叠更改向量(我的向量命名为sigGenes)，其中Entrez为关联名称，以及微阵列(AllGenes)中包含的Entrez整数向量：

head(sigGenes)
6144 115286  23530  10776  83933   6232 
0.368  0.301  0.106  0.234 -0.214  0.591 
head(allGenes)
6144 115286  23530  10776  83933   6232

我已经删除了其EntrezID注释为NA的值。我还将我的数据从Illumina微阵列中的数据子集到了Affymetrix数组中的那些基因，使用下面的站点i列表中提供的例子。我还是会犯同样的错误。

以下是完整的R代码：

library(Biobase)
library(limma)
library(SPIA)

sigGenes <- subset(full_table, P.Value<0.01)$logFC
names(sigGenes) <- subset(full_table, P.Value<0.01)$EntrezID
sigGenes<-sigGenes[!is.na(names(sigGenes))] # remove NAs

allGenes <- unique(full_table$EntrezID[!is.na(full_table$EntrezID)])

spiaOut <- spia(de=sigGenes, all=allGenes, organism="hsa", plots=TRUE, data.dir="./")

还有什么我可以尝试的吗？抱歉，如果不谈话题(这里还是新的)。如果需要的话，我很乐意把这个问题转移到其他地方。

SPIA应用于Affymetrix平台数据的示例如下：http://www.gettinggeneticsdone.com/2012/03/pathway-analysis-for-high-throughput.html)

package

回答 2

Stack Overflow用户

回答已采纳

发布于 2016-03-02 16:06:47

根据我们的讨论，我建议删除sigGenes中的重复条目。如果没有更多的信息，就很难知道重复的内容可能来自何处，以及要删除哪些内容。

票数 0

Stack Overflow用户

发布于 2016-03-02 18:20:50

移除复本确实有帮助。作为一种解决办法，我在每一组重复项中选择了中值(仅因为值接近)，如下所示：

dups<-unique(names(sigGenes[which(duplicated(names(sigGenes)))])) # determine which are duplicates
dupID<-names(sigGenes) %in% dups # determine the laocation of all duplicates
sigGenes_dup<-vector(); j=0; # determine the median value for each duplicate
for (i in dups){j=j+1; sigGenes_dup[j]<- median(sigGenes[names(sigGenes)==i])  }
names(sigGenes_dup)<-dups
sigGenes<-sigGenes[!(names(sigGenes) %in% dups)] # remove duplicates from sigGenes
sigGenes<-c(sigGenes,sigGenes_dup) # append the median values of the duplicates

另一种选择是，只需删除副本就可以了：

dups<-unique(names(sigGenes[which(duplicated(names(sigGenes)))]))
sigGenes<-sigGenes[!(names(sigGenes) %in% dups)] # remove duplicates from sigGenes

票数 1

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/35728963

复制

相似问题

问SPIA软件包(R)在Illumina表达芯片数据中的应用
EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问SPIA软件包(R)在Illumina表达芯片数据中的应用EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问SPIA软件包(R)在Illumina表达芯片数据中的应用
EN