前往小程序,Get更优阅读体验!
立即前往
首页
学习
活动
专区
工具
TVP
发布
社区首页 >专栏 >RNA-seq上下游分析snakemake流程

RNA-seq上下游分析snakemake流程

原创
作者头像
生信探索
发布2023-05-23 09:00:41
3050
发布2023-05-23 09:00:41
举报
文章被收录于专栏:生信探索生信探索

学习完snakemake后写的第一个流程是RNA-seq上游定量和下游的质控和差异分析。

使用fastp处理fastq文件,在使用START比对到基因组同时得到raw count,使用非冗余外显子长度作为基因的长度计算FPKM、TPM,同时也生成了CPM的结果。

非冗余外显子长度计算可以参考之前的推文转录组实战02: 计算非冗余外显子长度之和

对定量结果质控使用生信技能树的三张图(PCA、树状图、热图)。

使用python版的DEseq2对组间做差异分析(火山图和MA图)。

流程代码在https://jihulab.com/BioQuest/SnakeMake-RNA-seqhttps://github.com/BioQuestX/SnakeMake-RNA-seq

A SnakeMake workflow for Bulk RNA-seq

Reads were mapped onto ensembl genome with STAR, and adapters were removed with fastp.

For nomalisztion, gtftools was used to calculate gene_length and bioninfokit was used to give TPM, FPKM and CPM results.

For quality control, PCA plot, dendrogram plot and heatmap were used to show differences among samples or groups.

PyDESeq2 was used to perform differential expression anlysis.

General settings

To configure this workflow, modify config/config.yaml according to your needs, following the explanations provided in the file.

Sample sheet

  • Add samples to config/samples.tsv. Only the column Sample is mandatory, but any additional columns can be added.
  • For each sample, add one or more sequencing units (runs, lanes or replicates) to the Unit column of config/samples.tsv.
  • For each sample, define Group column(experimental or clinical attribute).

Report

QC plot

differential expression anlysis

  • MA plot
  • volcano plot

原创声明:本文系作者授权腾讯云开发者社区发表,未经许可,不得转载。

如有侵权,请联系 cloudcommunity@tencent.com 删除。

原创声明:本文系作者授权腾讯云开发者社区发表,未经许可,不得转载。

如有侵权,请联系 cloudcommunity@tencent.com 删除。

评论
登录后参与评论
0 条评论
热度
最新
推荐阅读
目录
  • A SnakeMake workflow for Bulk RNA-seq
  • General settings
  • Sample sheet
  • Report
  • QC plot
  • differential expression anlysis
领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档