Heng Li's blog
Short RNA-seq read alignment with minimap2
Why is bwa-aln used for ancient DNA reads?
Where did BWA come from?
On the definition of pangenome
What high-performance language to learn?
Random open syncmers
A few suggestions for creating command line interfaces
Introducing dual assembly
Remapping an aligned BAM
Designing a command-line interface
An FM-index of 400k SARS-CoV-2 genomes
Concepts in phased assemblies
SNP vs SNV
Minigraph as a multi-assembly SV caller
Evaluating collapsed misassembly with asmgene
Base quality scores are essential to short read variant calling
Format, quality binning and file size
Fast high-level programming languages
auN: a new metric to measure assembly contiguity
On a reference pan-genome model (Part II)
On a reference pan-genome model
How much does developement time matter?
On maintaining bioinformatics software
SAM/BAM/samtools is 10 years old
On the definition of sequence identity
Seqtk: code walkthrough
On the MPEG-G alignment format
Minimap2 and the future of BWA
The history the MD tag and the CIGAR X operator
Immature thoughts on assembly De Bruijn graphs
Which human reference genome to use?
On NovaSeq Base Quality
Bioconda: a capable bio-software package manager
A reimplementation of symmetric DUST
A few comments on GraphMap
My thoughts on sharing genotype and phenotype data
A few hours with docker
The unary representation of variants
The problems with the VCF model
Correcting Illumina sequencing errors: extended background
The early history of the SAM/BAM format
BWA-MEM for long error-prone reads
On HiSeq X10 Base Quality
On the graphical representation of sequences
First update on GFA
Alternatives to PSMC
A proposal of the Grapical Fragment Assembly format
On the trend of disk-based algorithms
About static linking
Abreak: evaluating de novo assemblies
Random access to zlib compressed files
My blog