Counting a list of fastq files

As sequencing experiments generate batches of fastq file after demultiplexing, I wrote a script for getting the size of a list of fastq files in the path:

Usage:

./fastqSize <'$fastqpath/*fastq'>|<'$fastqpath/*fastq.gz'>

And this will output a very nice tab-deliminated format of file name on the first column and sequence count on the second.

The input can be gzip file as well, but is determined by the filename.

The code is deposited on github along with my other fastq manipulation tools.

*Compiling requires gzstream library

This work is licensed under a Creative Commons Attribution 4.0 International License. If you liked this post, you can share it with your followers or follow me on Twitter!

Douglas C. Wu

Counting a list of fastq files