Array 1 80-1127 **** Predicted by CRISPRDetect 2.4 *** >NZ_VDKS01000255.1 Escherichia coli strain G33 NODE_255_length_1160_cov_3.98523, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================= ================== 80 29 100.0 32 ............................. GCGTGCCAACGGAATGCACCCTCATTGGTGAT 141 29 100.0 32 ............................. CCGCTGCGCTGGCGTCAGGCGTACCGGTGGTG 202 29 100.0 32 ............................. CTATTTTTACAGCGAGTTTTGCAACAAATTTT 263 29 100.0 32 ............................. AAATTAATTTTGAAACGGGTGTTGATGGTGAG 324 29 100.0 32 ............................. CGCCGCATTAATATTGATGACCAATATTCATT 385 29 100.0 32 ............................. ACAATCAGGGAACGATTGTTGACACTGTAAAA 446 29 96.6 32 ............................T TGATCCGCGACCGCCAGATGGCGCAGCCTGTC 507 29 100.0 32 ............................. GTGCGCCAGCTATAAAAAACTCACCATCAACA 568 29 86.2 12 ...............C.A.C.....G... CAAACAGAGCCG G [592] Deletion [609] 610 29 100.0 32 ............................. ATCCCCTGCGTCCTCTTTTGAATAATCGCGGC 671 29 100.0 32 ............................. CAAAAAAATAATATCCGGCAGTCTGTACGGTA 732 29 96.6 32 .......T..................... TCGGTTTGCGCGTATTGTTGTTCGGGGTCTAT 793 29 100.0 33 ............................. TAGGTAAATCACAGCTATTTGATAAGGGCGTGT 855 29 100.0 32 ............................. AGTCGAAATGAGGCGCGTGTTCTGGAGGATAT 916 29 96.6 32 ............................T GTGTTTGCGGCATTAACGCTCACCAGCATTTC 977 29 100.0 32 ............................. AATAGCAATAGTCCATAGATTTGCGAAAACAG 1038 29 100.0 32 ............................. GAGCCTGACAAGACTACTGAGGCCGTTCTGTC 1099 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================= ================== 18 29 98.3 31 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGAG # Right flank : CCATATAACCCGTTATCTCTTTCTCAAGTTTTT # Questionable array : NO Score: 6.07 # Score Detail : 1:0, 2:3, 3:0, 4:0.91, 5:0, 6:0.25, 7:-0.09, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-4] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [75.0-36.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 8917-9128 **** Predicted by CRISPRDetect 2.4 *** >NZ_VDKS01000037.1 Escherichia coli strain G33 NODE_37_length_48556_cov_112.795, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 8917 29 100.0 32 ............................. CGCTTCTGCCTAATATCAAAACCCCAGCTACC 8978 29 100.0 32 ............................. CCGAGGATATTACCCGCCTCCGTGCTGAACTC 9039 29 100.0 32 ............................. CTATTCCCCTGACCGGACATACCCTGGACAAT 9100 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================ ================== 4 29 100.0 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTCCTTGCCGCAGGTGAAATTGAACCGCCACAACCTGCGCCGGATATGTTACCGCCAGCCATCCCCGAACCTGAAACGTTGGGCGATAGCGGTCATCGAGGACGCGGCTGATGAGCATGGTCGTGGTTGTTACAGAAAATGTCCCGCCGCGCTTACGTGGACGGCTCGCAATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTTGGTGATACATCAAAACGTATTCGGGAGATGATCTGGCAGCAAATTACCCAACTGGCAGGTTGTGGAAATGTGGTCATGGCCTGGGCGACAAATACTGAGTCTGGTTTTGAGTTTCAGACCTGGGGCAAAAACAGACGTATTCCGGTGGATTTGGATGGGTTACGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAAGTTAGACGTTCTTTAAAAATAAGGAAATGTTTGAATTTAGTTGGTAGATTGTTGATGTGGAATAAATTTGTTTAAAAACAGATATGTATGCTTAGT # Right flank : GGGCGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAACGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCTATTCCGTGACTGCGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACA # Questionable array : NO Score: 5.86 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-1] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [73.3-43.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 2023-2234 **** Predicted by CRISPRDetect 2.4 *** >NZ_VDKS01000142.1 Escherichia coli strain G33 NODE_142_length_2371_cov_2.52092, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 2023 29 100.0 32 ............................. GGATATAGAGCGGGTACTCGAGCGAAGCGGGG 2084 29 100.0 32 ............................. CCAGGACAGGCCGTGACGGTTGCCATTGAGTC 2145 29 100.0 32 ............................. TTTTTGTTCTCTTCAAAACGCCGAACAACCAA 2206 29 93.1 0 ............T.....A.......... | ========== ====== ====== ====== ============================= ================================ ================== 4 29 98.3 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCCGTTTCTCTGGGAGATGCCGGACATCGGAGTAGCTGAGATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTATATGTAGGTGATGTATCCGCAAAAATTCGTGAAATGATCTGGGAACAAATAGCTGGACTGGCGGAAGAAGGCAATGTAGTGATGGCATGGGCAACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACCCCGGTAGATTTGGATGGTTTAAGGTTGGTATCTTTTTTACCTGTTTGAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA # Right flank : GACGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAACTCCCATTTTCATACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCAC # Questionable array : NO Score: 5.77 # Score Detail : 1:0, 2:3, 3:0, 4:0.91, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [70.0-41.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 1 17220-16947 **** Predicted by CRISPRDetect 2.4 *** >NZ_VDKS01000054.1 Escherichia coli strain G33 NODE_54_length_31022_cov_116.438, whole genome shotgun sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 17219 29 100.0 32 ............................. CCTGTTGCGCCTTATAATGCGTATACCGCGAA 17158 29 100.0 32 ............................. ACGCTGATTTGATGGGGGCTGGTGCAGGAGAT 17097 29 100.0 32 ............................. TTTCAGCAGTTCAGCGTAACACCGACGGTCAC 17036 29 100.0 32 ............................. CCCGGAATGCATTCTGAAGGTTTGCTGTATAT 16975 29 96.6 0 ............................A | ========== ====== ====== ====== ============================= ================================ ================== 5 29 99.3 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTATTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAATTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGAATTAAGTACTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTGTAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTACA # Right flank : CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT # Questionable array : NO Score: 6.02 # Score Detail : 1:0, 2:3, 3:0, 4:0.96, 5:0, 6:0.25, 7:0.01, 8:0.8, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [3,6] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.00,-13.50] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [1-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [68.3-75.0]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.65 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //