GCG, the old bioinformatics package, was so named because its authors kept high-fiving each other, shouting “good code guys!”. (GCG is a software package for the analysis of gene and protein sequences.)

Bowtie
is named so because “it is almost impossible to tie”, referring to code to avoid a “race condition” when using multiple processors.

TopHat
is named so because it was the first spliced RNA-Seq aligner, and when it worked first time, the authors shouted “Top that!”.

Velvet
is so named because @dzerbino wore velvet gloves when coding it (via @pathogenomenick)

Heng Li
writes all his code in x86 assembly language, and uses a C decompiler before releasing it. @lh3lh3 (via @torstenseemann)

SRA
(short read archive) is the best known of the archives, and not many people know or use the MRA (medium read archive), the KLRA (kinda long read archive) and the LRA (long read archive). (SRA: sequence read archive)

Illumina is short for Illuminati, the shadowy organisation that controls sequencing worldwide. (via @neilfws)

HMMer
package was so named when someone asked how it worked, and the developers said “Hmmmm… errr…”. (via @mgollery)

Hidden Markov Models
are like the recipe for Kentucky Fried Chicken. There are only three people in the world who understand small parts of how HMMs work, and only when they get together do they know the full picture.

HGAP
assembler is actually an elaborate front-end hiding three thousand slave laborers all running GAP4 (via @IanGoodhead)

Illumina machine
+ a bioinformatician running assemblies (via @gedankenstuecke)

pasting ORFs
into web BLAST (via @torstenseemann)

The p in p-value actually stands for p-otentially interesting! (via @jessenleon)

The e
in e-value stands for excellent, as in “that’s an excellent BLAST hit”

EBI
is an elaborate front-end to NCBI services. (EBI keeps getting better these days, and China now has more and better data platforms of its own.)

pubmed.com.

The number of replicates
needed for your RNA-seq experiment equals the impact factor of the journal you want to publish in (via @torstenseemann)

Altschul et al. have never read the paper. (The paper that published BLAST.)

protein
database for their own name.

ELVIS appears 35 times in human peps (GRCh38). ELVISLIVES appears 0 times. The King (Elvis Presley) has left the genome #slowday (via @rdemes)

RP11
accounts for 72 percent of the human reference genome (via CanGenom)

There are as many data formats as there are bioinformaticians (via @mgollery)

80
character line wrapping was invented to standardise data sharing using MS Word (via @IanGoodhead)

Excel (via @CIgenomics)

Nature journals because their papers are first rejected by GigaScience.

one
HiSeq but made to look like hundreds by a setup of mirrors, like that bit in Enter the Dragon (via @froggleston) (Nowadays we all use the BGI series.)

HiSeq
3 times, an Illumina staff member will show up holding the HiSeq X Ten system (via @nazeetafatima)

short
as before the development of BaseSpace they were delivered via Twitter (via @RoyChaudhuri)

Phred
scores in honour of Fred Sanger, who developed DNA sequencing. #101bioinfofunfacts (via @torstenseemann)

CriMap
was called CriMap because users do an awful lot of crying before they get a half-decent map. (via @dj_de_koning)

de-bugging tears
of a bioinformatician it is enough to fill an Olympic size swimming pool annually (via @paulhoskisson)

de Bruijn properly

100
most desirable jobs, bioinformatician was a close second to astronaut (via @dynomics)

cuddling (via @riccombeni)

NIH sequencing costs plot in talk/lecture you’re not a real bioinformatician (via @AliciaOshlack)