/data/PwaniGenomics/fastq_data
# Create a symbolic link to the data in your user directory (run this from that directory)
ln -s /data/PwaniGenomics/fastq_data .
Note: FASTQ reads are in .gz format.
fna.amb – Contains info on ambiguous bases (e.g., N).
fna.ann – Contains metadata such as sequence names and lengths.
fna.bwt – Burrows-Wheeler transformed sequence (compressed format).
fna.pac – Packed sequence data.
fna.sa – Suffix array used to locate sequences.
FASTQ files (typically .fastq.gz) contain sequencing reads. Each read consists of four lines:
@SRR15369215.126490887 # Sequence identifier
GGACCTTCTGTCATTTCACT... # Nucleotide sequence
+ # Separator
AAFFFJJJJJJJJJJJJJJJJ... # Base quality scores
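Because every read occupies exactly four lines, the number of reads in a gzipped FASTQ file is its line count divided by four. The sketch below illustrates this on a tiny demo file it creates itself; the file name `demo.fastq.gz`, the two demo reads, and the helper `count_reads` are assumptions made for illustration and are not part of the course data.

```shell
# Build a tiny gzipped FASTQ file so the example runs anywhere (demo data only).
demo_dir=$(mktemp -d)
printf '%s\n' \
  '@read1' 'GGACCTTCTG' '+' 'AAFFFJJJJJ' \
  '@read2' 'TTGACCATGA' '+' 'AAFFFJJJJJ' \
  | gzip > "$demo_dir/demo.fastq.gz"

# count_reads FILE: decompress to stdout and divide the line count by four.
count_reads() {
  gzip -dc "$1" | awk 'END { print NR / 4 }'
}

# Show the first record (four lines) without writing an uncompressed copy.
gzip -dc "$demo_dir/demo.fastq.gz" | head -n 4

n_reads=$(count_reads "$demo_dir/demo.fastq.gz")
echo "reads: $n_reads"
```

To try it on real data, pass the path of one of the .fastq.gz files to `count_reads` in place of the demo file.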
SRR836370_1_subset.fastq.gz and SRR836370_2_subset.fastq.gz:
Adapter sequence: CTGTCTCTTATACACATCT
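A quick way to gauge adapter contamination before trimming is to count the reads whose sequence line contains the adapter. The following is a minimal sketch under stated assumptions: it runs on a small demo file it creates itself (`demo_1.fastq.gz` and both demo reads are made up), and the helper `count_adapter_reads` is hypothetical. It finds only exact, full-length matches, so it will generally undercount compared with a dedicated trimming tool that also handles partial or mismatched adapters.

```shell
adapter='CTGTCTCTTATACACATCT'

# Demo data only: two reads, one of which carries the adapter at its 3' end.
demo_dir=$(mktemp -d)
printf '%s\n' \
  '@read1' "ACGTACGTAC${adapter}" '+' 'JJJJJJJJJJJJJJJJJJJJJJJJJJJJJ' \
  '@read2' 'ACGTACGTACGTACGTACGT' '+' 'JJJJJJJJJJJJJJJJJJJJ' \
  | gzip > "$demo_dir/demo_1.fastq.gz"

# count_adapter_reads FILE ADAPTER: keep only sequence lines (line 2 of every
# four-line record), then count those containing the adapter as a substring.
count_adapter_reads() {
  gzip -dc "$1" | awk 'NR % 4 == 2' | grep -c -F -- "$2"
}

n_adapter=$(count_adapter_reads "$demo_dir/demo_1.fastq.gz" "$adapter")
echo "reads containing adapter: $n_adapter"
```

The same function can be pointed at SRR836370_1_subset.fastq.gz or SRR836370_2_subset.fastq.gz once the data directory is linked.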
Datasets are available at: https://tinyurl.com/Popgen2025PUZenodo