
Do you really need so many single-cell cell-type annotation tools?

A freshly published (October 2021) review in Computational and Structural Biotechnology Journal, "Automatic cell type identification methods for single-cell RNA sequencing", surveys the tools currently available for annotating single-cell subpopulations. Article link: https://www.sciencedirect.com/science/article/pii/S2001037021004499

  • Lazy learning methods include CELLBLAST, scmap-cell, CellFishing.jl, and CellAtlasSearch.
  • Eager learning methods account for the majority of the automatic methods, including scHPL, clustifyr, MARS, scPretrain, Superscan, Seurat, scLearn, scCapsNet, ACTINN, CaSTLe, CHETAH, SciBet, scID, scmap-cluster, scPred, SingleCellNet, SingleR, scVI, scMatch, scClassifR, and Garnett.
  • Marker learning methods include scTyper, DigitalCellSorter, SCINA, SCSA, CellAssign, and scCATCH, along with MarkerCount.
  • To facilitate automatic cell-type identification, scLearn, CELLBLAST, SciBet, SingleCellNet, scMatch, Superscan, and Garnett provide processed training datasets. Moreover, DigitalCellSorter, SCSA, scTyper, and scCATCH provide canonical cell markers for certain cell types.

The authors also developed a package that wraps many of these tools (AutomaticCellTypeIdentification), which groups them into three classes:

  • eagersupervised methods include ACTINN, CaSTLe, CHETAH, clustifyr, Garnett, Markercount, MARS, scClassifR, scHPL, SciBet, scID, scLearn, scmapcluster, scPred, scVI, Seurat, SingleCellNet and SingleR.
  • lazysupervised methods include CELLBLAST and scmapcell.
  • markersupervised methods include scTyper, Markercount, SCSA, DigitalCellSorter and SCINA.
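
To make the three-way grouping easier to query, here is a minimal Python sketch that stores the lists above in a dictionary and looks up which class or classes a tool falls into. The dictionary and the `categories_of` helper are illustrative names of my own, not part of the AutomaticCellTypeIdentification package.

```python
# Tool lists copied from the three classes above; the structure itself is
# only an illustration, not the package's real interface.
CATEGORIES = {
    "eagersupervised": [
        "ACTINN", "CaSTLe", "CHETAH", "clustifyr", "Garnett", "Markercount",
        "MARS", "scClassifR", "scHPL", "SciBet", "scID", "scLearn",
        "scmapcluster", "scPred", "scVI", "Seurat", "SingleCellNet", "SingleR",
    ],
    "lazysupervised": ["CELLBLAST", "scmapcell"],
    "markersupervised": [
        "scTyper", "Markercount", "SCSA", "DigitalCellSorter", "SCINA",
    ],
}


def categories_of(tool):
    """Return every class that lists the tool (case-insensitive match)."""
    key = tool.lower()
    return [
        category
        for category, tools in CATEGORIES.items()
        if key in (name.lower() for name in tools)
    ]


if __name__ == "__main__":
    for name in ("SingleR", "Markercount", "CellAssign"):
        print(name, categories_of(name))
```

Note that Markercount appears in two of the lists, so the lookup returns a list rather than a single class.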

That is quite a lot of work!

Still, the way the review benchmarks software tools and algorithms is worth studying:

  • Fig. 1. Workflow of the traditional and automatic cell-type identification methods.
  • Fig. 2. Performance of the automatic cell-type identification methods using the Tabula Muris datasets.
  • Fig. 3. Performance of the automatic cell-type identification methods using PBMC and tumor datasets.
  • Fig. 4. Speed of automatic cell-type identification methods.
  • Fig. 5. Summary of performance of the automatic cell-type identification methods. Bar graphs of the automatic cell-type identification methods with six evaluation criteria indicated.

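The review also points out that single-cell transcriptome datasets now typically combine multiple samples, which leaves two open problems ("Yet, for integrated datasets, there are still two issues to be solved."):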

  • The first is to try to avoid the influences of different sequencing technologies during the process of data integration, for example, by using MNN, CCA, LIGER, Scanorama, et al.
  • The second is to try to unify the currently inconsistent annotation levels in the training datasets, for example, by the joint usage of multiple training datasets, or by manual curation of each training dataset.
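
The second issue, inconsistent annotation levels, can be illustrated with a small sketch: map each dataset's labels onto one shared, coarser vocabulary before combining them. The mapping table and the label names below are hypothetical examples that I made up for illustration; they do not come from the review, and a real project would curate this table by hand.

```python
from collections import Counter

# Hypothetical mapping from dataset-specific labels to a shared coarse level.
TO_COARSE = {
    "CD4 T cell": "T cell",
    "CD8 T cell": "T cell",
    "T cell": "T cell",
    "naive B cell": "B cell",
    "B cell": "B cell",
}


def harmonize(labels, mapping=TO_COARSE, unknown="unassigned"):
    """Translate labels to the shared level; unmapped labels are flagged."""
    return [mapping.get(label, unknown) for label in labels]


# Two toy training datasets annotated at different levels of detail.
dataset_a = ["CD4 T cell", "CD8 T cell", "naive B cell"]
dataset_b = ["T cell", "B cell", "mast cell"]

merged = harmonize(dataset_a) + harmonize(dataset_b)
counts = Counter(merged)

if __name__ == "__main__":
    print(merged)
    print(counts)
```

Flagging unmapped labels instead of dropping them keeps the manual curation step visible, because every "unassigned" entry is a label that someone still has to review.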

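In practice, across the many tumor single-cell data analysis projects I have worked on, I do not use these automated annotation tools. I annotate by eye, which takes some background knowledge. For example, it helps to memorize a list like the one below of genes that are highly expressed in each cell population: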

# T cells (CD3D, CD3E, CD8A)
# B cells (CD19, CD79A, MS4A1 [CD20])
# Plasma cells (IGHG1, MZB1, SDC1, CD79A)
# Monocytes and macrophages (CD68, CD163, CD14)
# NK cells (FGFBP2, FCGR3A, CX3CR1)
# Photoreceptor cells (RCVRN)
# Fibroblasts (FGF7, MME)
# Endothelial cells (PECAM1, VWF)
# Epithelial or tumor cells (EPCAM, KRT19, PROM1, ALDH1A1, CD24)
# Broad compartments: immune (CD45+, PTPRC), epithelial/cancer (EpCAM+, EPCAM),
# stromal (CD10+, MME, fibroblasts; or CD31+, PECAM1, endothelial)
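
As a rough illustration of how such a marker list gets used, the sketch below scores one cluster against each marker set and picks the best match. It assumes you already have the cluster's mean expression per gene as a plain dictionary; the scoring rule (a simple average over the marker set) and the helper names are my own simplification, not the method of any tool listed in this post.

```python
# Marker sets taken from the list above. FCGR3A is the official symbol for CD16a.
MARKERS = {
    "T cells": ["CD3D", "CD3E", "CD8A"],
    "B cells": ["CD19", "CD79A", "MS4A1"],
    "Plasma cells": ["IGHG1", "MZB1", "SDC1", "CD79A"],
    "Monocytes/macrophages": ["CD68", "CD163", "CD14"],
    "NK cells": ["FGFBP2", "FCGR3A", "CX3CR1"],
    "Photoreceptor cells": ["RCVRN"],
    "Fibroblasts": ["FGF7", "MME"],
    "Endothelial cells": ["PECAM1", "VWF"],
    "Epithelial/tumor": ["EPCAM", "KRT19", "PROM1", "ALDH1A1", "CD24"],
}


def score_cluster(mean_expr, markers=MARKERS):
    """Average the cluster's mean expression over each marker set.

    Genes missing from mean_expr count as 0.
    """
    return {
        cell_type: sum(mean_expr.get(gene, 0.0) for gene in genes) / len(genes)
        for cell_type, genes in markers.items()
    }


def best_label(mean_expr, markers=MARKERS):
    """Return the top-scoring cell type, or 'unassigned' if no marker is expressed."""
    scores = score_cluster(mean_expr, markers)
    label = max(scores, key=scores.get)
    return label if scores[label] > 0 else "unassigned"


if __name__ == "__main__":
    # Toy mean expression values for one cluster (made-up numbers).
    cluster = {"CD3D": 2.1, "CD3E": 1.8, "CD8A": 0.9, "CD79A": 0.1}
    print(best_label(cluster))
```

A score like this is only a first pass. Shared markers such as CD79A (B cells and plasma cells) still call for checking the expression plots by eye.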

Finally, here are the GitHub links for each tool, copied from the table compiled in the review:

Name of method Version URL
CELLBLAST v0.3.8 https://github.com/gao-lab/Cell_BLAST
CellFishing.jl v0.3.2 https://github.com/bicycle1885/CellFishing.jl
scmap-cell v1.6.0 https://github.com/hemberg-lab/scmap
ACTINN master https://github.com/mafeiyang/ACTINN
CaSTLe v1.0.0.2 https://github.com/yuvallb/CaSTLe
CHETAH v1.2.0 https://github.com/jdekanter/CHETAH
Garnett v0.1.19 https://github.com/cole-trapnell-lab/garnett
SciBet v0.1.0 https://github.com/zwj-tina/scibetR
scID v2.1 https://github.com/BatadaLab/scID
scLearn v1.0 https://github.com/bm2-lab/scLearn
scmap-cluster v1.6.0 https://github.com/hemberg-lab/scmap
scPred v1.9.0 https://github.com/powellgenomicslab/scPred
scVI v0.4.1 https://github.com/YosefLab/scvi-tools
Seurat v3.2.2 https://github.com/satijalab/seurat
SingleCellNet v0.1.0 https://github.com/pcahan1/singleCellNet
SingleR v1.1.1 https://github.com/dviraran/SingleR
CellAssign v0.99.21 https://github.com/Irrationone/cellassign
DigitalCellSorter v1.1 https://github.com/sdomanskyi/DigitalCellSorter
SCINA v1.2.0 https://github.com/jcao89757/SCINA
SCSA master https://github.com/bioinfo-ibms-pumc/SCSA
scTyper v0.1.0 https://github.com/omicsCore/scTyper
scHPL V0.0.2 https://github.com/lcmmichielsen/scHPL
MARS master https://github.com/snap-stanford/mars
clustifyr v1.5.0 https://github.com/rnabioco/clustifyr
scClassifR v1.1.1 https://github.com/grisslab/scClassifR
MarkerCount master https://github.com/combio-dku/MarkerCount/tree/master

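Getting started with single-cell data processing takes some background knowledge; the 10 introductory lectures are also worth a look.

The most basic step is usually dimensionality reduction, clustering, and annotation of the resulting groups; see the earlier example: "Single-cell clustering and annotation that anyone can learn".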
