Basic Statistics
| Measure | Value |
|---|---|
| Filename | SRR837470.fastq.fasta.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 21220553 |
| Sequences flagged as poor quality | 0 |
| Sequence length (bp) | 50 |
| %GC | 57 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage (%) | Possible Source |
|---|---|---|---|
| TCCTGTACTGAGCTGCCCCGATGGAATTCTCGGGGGCCAAGGAACTCCAG | 260044 | 1.2254346057805374 | RNA PCR Primer, Index 1 (96% over 29bp) |
| TCCTGTACTGAGCTGCCCCGATGGAATTCTCGGGGGCCAAGGAACTCCCG | 25830 | 0.12172161583159496 | RNA PCR Primer, Index 1 (96% over 27bp) |
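
The percentage column above appears to be each sequence's count divided by Total Sequences from Basic Statistics, times 100. The sketch below recomputes it from the values in this report and lists the positions at which the two overrepresented sequences differ. The helper names `percentage` and `mismatch_positions` are illustrative assumptions and are not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 21220553

OVERREPRESENTED = [
    ("TCCTGTACTGAGCTGCCCCGATGGAATTCTCGGGGGCCAAGGAACTCCAG", 260044),
    ("TCCTGTACTGAGCTGCCCCGATGGAATTCTCGGGGGCCAAGGAACTCCCG", 25830),
]


def percentage(count, total):
    """Share of all reads accounted for by one sequence, in percent."""
    return 100.0 * count / total


def mismatch_positions(a, b):
    """1-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    for seq, count in OVERREPRESENTED:
        print(f"{seq}\t{count}\t{percentage(count, TOTAL_SEQUENCES):.4f}")
    first, second = OVERREPRESENTED[0][0], OVERREPRESENTED[1][0]
    print("mismatch positions:", mismatch_positions(first, second))
```

The recomputed percentages agree with the reported values, and the two sequences differ only at position 49 (A versus C), so the second entry may be a single-base variant of the first rather than an independent contaminant.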
Adapter Content
Kmer Content
| Sequence | Count | P-value | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TATTTGC | 30 | 1.7390994E-6 | 46.461807 | 9 |
| TATTTGA | 910 | 0.0 | 46.461807 | 14 |
| TCGCCCA | 20 | 6.010657E-4 | 46.461807 | 13 |
| ATTTACA | 65 | 0.0 | 46.461807 | 11 |
| TGTTTAC | 40 | 5.144102E-9 | 46.461807 | 13 |
| TATCACT | 90 | 0.0 | 46.461807 | 12 |
| AACCGAC | 30 | 1.7390994E-6 | 46.461807 | 10 |
| TACCCTC | 65 | 0.0 | 46.461807 | 12 |
| GGTACAA | 170 | 0.0 | 46.461807 | 10 |
| CAAACGC | 170 | 0.0 | 46.461807 | 11 |
| ACAAAAG | 45 | 2.8194336E-10 | 46.461807 | 11 |
| GATGTTT | 35 | 9.435644E-8 | 46.461807 | 11 |
| TCCGCTC | 65 | 0.0 | 46.461807 | 11 |
| CATATCA | 30 | 1.7390994E-6 | 46.461807 | 10 |
| TACCATA | 60 | 0.0 | 46.461807 | 8 |
| GTACAAC | 170 | 0.0 | 46.461807 | 11 |
| AGATAGC | 110 | 0.0 | 46.461807 | 11 |
| CGACGAC | 30 | 1.7390994E-6 | 46.461807 | 10 |
| TCATTCC | 120 | 0.0 | 46.461807 | 11 |
| CATGATA | 80 | 0.0 | 46.461807 | 12 |