我正在为如何将一组fasta序列转换成一组XStrings或DNAstrings而苦苦挣扎。
我试图用read.fasta将我的fasta文件(包含几个序列)读入序列列表。如果我只处理一个序列,我将能够将它转换为DNAString (使用DNAString函数),但由于我有多个序列,我无法实现这一点。
举个例子:
my_sequences<-read.fasta(sequences.fasta)
my sequences
$`1:20000-20009`
[1] "agtcgctag"
attr(,"name")
[1] "1:20000-20009"
attr(,"Annot")
[1] ">1:20000-20009"
attr(,"class")
[1] "SeqFastadna"
$`1:30000-30010`
[1] "aggggggggggca"
attr(,"name")
[1] "1:30000-30010"
attr(,"Annot")
[1] "1:30000-30010"
attr(,"class")
[1] "SeqFastadna"
.
.
.
现在我无法将这个集合转换为一组XStrings或DNAStrings
发布于 2020-03-28 10:48:18
只需在生物字符串中使用readDNAStringSet
:
my_sequences<-readDNAStringSet(sequences.fasta)
例如:
fa = c(">1:20000-20009","agtcgctag",">1:30000-30010","aggggggggggca")
writeLines(fa,"test.fa")
library(Biostrings)
my_sequences<-readDNAStringSet("test.fa")
my_sequences
A DNAStringSet instance of length 2
width seq names
[1] 9 AGTCGCTAG 1:20000-20009
[2] 13 AGGGGGGGGGGCA 1:30000-30010
每个子集都是一个DNAString:
my_sequences[[1]]
9-letter "DNAString" instance
seq: AGTCGCTAG
https://stackoverflow.com/questions/60899896
复制相似问题