/data/PwaniGenomics/fastq_data
#create symbolic link to the data in your user directory
ln -s /data/PwaniGenomics/fastq_data
Note: FASTQ reads are in .gz format.
fna.amb – Contains info on ambiguous bases (e.g., N).fna.ann – Contains metadata such as sequence names and lengths.fna.bwt – Burrows-Wheeler transformed sequence (compressed format).fna.pac – Packed sequence data.fna.sa – Suffix array used to locate sequences.FASTQ files (typically .fastq.gz) contain sequencing reads. Each read consists of four lines:
@SRR15369215.126490887 # Sequence identifier
GGACCTTCTGTCATTTCACT... # Nucleotide sequence
+ # Separator
AAFFFJJJJJJJJJJJJJJJJ... # Base quality scores
SRR836370_1_subset.fastq.gz and SRR836370_2_subset.fastq.gz:SRR836370_1_subset.fastq.gz and SRR836370_2_subset.fastq.gz:SRR836370_1_subset.fastq.gz and SRR836370_2_subset.fastq.gz:SRR836370_1_subset.fastq.gz and SRR836370_2_subset.fastq.gz:SRR836370_1_subset.fastq.gz and SRR836370_2_subset.fastq.gz:Adapter sequence: CTGTCTCTTATACACATCT
Datasets are available at: https://tinyurl.com/Popgen2025PUZenodo