Array 1 38-188 **** Predicted by CRISPRDetect 2.4 *** >NZ_WSHF01000156.1 Escherichia coli strain TzEc047 NODE_156_length_227_cov_10.38, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 38 29 100.0 32 ............................. GAGCCTGACGAGACTACTGAGGCCGTTCTGTC 99 29 96.6 32 .A........................... CGCCAGTAATCAAAGTCGGCGCGTATGCAATG 160 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================ ================== 3 29 98.9 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : AACCGCGCCAGTAATCAAAGTCGGCGCGTATGCAATGG # Right flank : GAGCCTGACGAGACTACTGAGGCCGTTCTGTCGAGTTCC # Questionable array : NO Score: 5.31 # Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.02, 8:0.4, 9:0.69, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-7.20,-5.60] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: NA [30.0-26.7]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.24,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 83443-84202 **** Predicted by CRISPRDetect 2.4 *** >NZ_WSHF01000002.1 Escherichia coli strain TzEc047 NODE_2_length_247203_cov_14.1418, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 83443 29 96.6 32 ...........................G. CGACTGACTGTCGCGATTGAATCCCGCGAAAT 83504 29 96.6 32 ...........................G. GGTTATTGTCTGATGGAACCATTTGTTGGCGG 83565 29 93.1 31 ...........................GC GGAGCTGGTGGTGGTGATTGGCAAGCAGGCG 83625 29 100.0 32 ............................. TATTCGTAAGTTTTATGGGTGGATTTTAAGTG 83686 29 100.0 32 ............................. AGCTTAATGGCGACCAGGTGAGGATGTTAAAC 83747 29 100.0 32 ............................. AACAGCCAGCCCCGGCAGTGCCGGAGGAAATG 83808 29 100.0 32 ............................. CGCGGGCCGTATGCCTTGCTTATGAGGGGTAA 83869 29 100.0 32 ............................. GGCGGCAGGCCGCACAATCCGCATCACCGAGA 83930 29 100.0 32 ............................. CAAATCGGGCGAAATTCAGGATACTTTATCCG 83991 29 100.0 32 ............................. CGCCAGTAATCAAAGTCGGCGCGTATGCAATG 84052 29 100.0 32 ............................. GAGCCTGACGAGACTACTGAGGCCGTTCTGTC 84113 29 96.6 32 .A........................... GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT 84174 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================ ================== 13 29 98.2 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTGCTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATGTTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT # Questionable array : NO Score: 6.17 # Score Detail : 1:0, 2:3, 3:0, 4:0.91, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [2-3] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [75.0-68.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.65,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 109752-110573 **** Predicted by CRISPRDetect 2.4 *** >NZ_WSHF01000002.1 Escherichia coli strain TzEc047 NODE_2_length_247203_cov_14.1418, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 109752 29 100.0 32 ............................. TTCAGCGGGTGATGATATCCGCGCCGCTATGA 109813 29 100.0 32 ............................. TGTTGTTTTCCGCCTGGTATTCTGGCAGATCA 109874 29 100.0 32 ............................. CGCCAGTGGGGGAGCCGTGAGGACCTGACCTG 109935 29 100.0 32 ............................. GAGTTTGAAAAATCGTGTGCGGATGCGGCAGC 109996 29 100.0 32 ............................. CTCAAAGGCAAAAAATACGATCTCGCCGGTGT 110057 29 96.6 32 ............T................ GGCCAACAAAACGTGTCAGAGTTCTGAAATTA 110118 29 100.0 32 ............................. AATAATGAAATTGAAACCGTATAGCATCACTC 110179 29 100.0 32 ............................. GTTTGATCAATATCACGCTGACAGATTACGGC 110240 29 100.0 32 ............................. TGCGGAGCCCGCGGTTTTTTGTGTTGCCAATG 110301 29 96.6 32 ..........................A.. GAACCGGGCATATCGCAGTGTGGGCGCTGATT 110362 29 100.0 32 ............................. CGCAGCGCCGACGAACTGGACGGCGCTATAAA 110423 29 96.6 32 ............G................ AGACACCAGAGGAAATAATAAAAATGATGGGG 110484 29 89.7 32 .T.........AT................ GGCGCACTGGATGCGATGATGGATATCACTTA 110545 28 75.9 0 ..A........C..T....A..-.C...C | T [110564] ========== ====== ====== ====== ============================= ================================ ================== 14 29 96.8 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCTGTTTCACTGGGAGATGCAGGCCATCGGAGTAGCTGAAATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTATATGTAGGTGATGTATCCGCAAAAATTCGTGAAATGATCTGGGAACAAATAGCTGGATTGGCGGAAGAAGGCAATGTAGTGATGGCATGGGCAACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACCCCGGTAGATTTGGATGGTTTAAGGTTGGTGTCTTTTTTACCTGTTTGAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGGTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA # Right flank : CAGCTCCCATTTTCAAACCCATCAAGACGCCTTCGCCAACTCCTTCACCAGAGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCTATTCCGTGACTGCGTGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAATTAAGTTATCGAGATTAATCACCAGCAGCGTATTTTTCTTTTCGGTGTCACTCATCCGCT # Questionable array : NO Score: 6.10 # Score Detail : 1:0, 2:3, 3:0, 4:0.84, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-13] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [68.3-48.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 157913-158176 **** Predicted by CRISPRDetect 2.4 *** >NZ_WSHF01000003.1 Escherichia coli strain TzEc047 NODE_3_length_192682_cov_11.8184, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================ ================================= ================== 157913 28 100.0 33 ............................ AACCTACCGTCTTGGCTAGCGGTTGCAGCGAAC 157974 28 100.0 33 ............................ AACCTACCGTCTTGGCTAGCGGTTGCAGCGAAC 158035 28 100.0 32 ............................ GGAACAATCTTGCAAAGGCTGTGAAAGTTGGC 158095 28 100.0 28 ............................ TTCACAGGTAACATACTCCACCCACCAT 158151 26 85.7 0 ................A...-A..-... | ========== ====== ====== ====== ============================ ================================= ================== 5 28 97.1 32 GTTCACTGCCGTACAGGCAGCTTAGAAA # Left flank : GATAAATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTCAAGTCCGTAATCTCGAAAGAGGTTGCGGACTTTTTATTTATGGGGTGGAGGTTCAGACCCTTTTTTTAATGATGATGGTAAGTTGTTGATAATTAGTGCTGCGGGAAGGTAAGGATAAAAAAGGGTGCTGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA # Right flank : ATGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTGCCAAACCTCATGGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAACTTGTGCAGTATATCTACATCGAGACAAGTTATGGACTTATACTTCCAAAGTACTTCATACATATCACAAAATAAAAAGGCCGGTTAAACCGACCTTTTACTCGTTCTTTCTCTTCGCCCATCAGGCGGTAAAACAATCAGCGACTACGGAAGACAATGCGGCCTTTGCTCAGGTCGTACGGGGTCAGTTCAACAGTCACTTTGTCGCCCGTCAGGATGCGGATGTAGTTTTTGCGCATTTTACCGGAGA # Questionable array : NO Score: 5.66 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.4, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [8,6] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-8.00,-7.70] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: NA [58.3-55.0]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.24,0 Confidence: HIGH] # Array family : I-F [Matched known repeat from this family], //