Array 1 1002576-1003337 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP070930.1 Escherichia coli strain 30COLEC chromosome, complete genome Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================= ================== 1002576 29 100.0 32 ............................. TTCCGCGACCCGGCGATAAGGGAAGATGGGTG 1002637 29 100.0 32 ............................. TAACGACAGAGGGATTCGGCAGCGAAGAGGAT 1002698 29 100.0 32 ............................. CGTAGTTTCGGCAGTCCAGTGCCTCGTTACGT 1002759 29 100.0 32 ............................. ATAGAACGGGACGAGATTTTTAAACAATGGCT 1002820 29 100.0 32 ............................. CAATCTGAGCCAGACGCGACGAATAAAAGCAT 1002881 29 100.0 33 ............................. TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC 1002943 29 100.0 32 ............................. CTCTGATTCATCGGCGGCGATACTGTCATCAC 1003004 29 100.0 32 ............................. GAAAAACAAATAGATGGATAGCTCGATATCAT 1003065 29 100.0 32 ............................. CGGCTTATTGCTCTTGCCGACGGATTACAGTG 1003126 29 100.0 32 ............................. GGCTGGTGGGTTCGGGTAACTGGTTTGCTGTC 1003187 29 100.0 32 ............................. AGCGCGCGCGGGCTACTGCACTCGGTGATAAC 1003248 29 100.0 32 ............................. CCGAGCATTATATCCTGTGCGTCGTTCATTTA 1003309 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================= ================== 13 29 100.0 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : CTGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGTATTGCGCGTAATTGGCGTTTGTCGATGCAAACCCATAAATATTTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATATTTTGTCGCCTCTGAAAAACCTCAATTTTGCCCATCCTGGACTAATCATTATCATTCTCTACAAATTCTGTGGCGTTAATTTTTCGTTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGCGTTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTGTAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : GCCAGAAAACATGAAAAAACTTTGGGAGGGGATGAGTTCCCATAAGCGCTAACTTAAGGGTTGAACCATCTGAAGAATGCGACGCCTCGGTGCCTCGTTAAGACGATGCCTCGCGTTCTTCAATTGCGTTTTGTAGGCTGTCAGGGATACTGTCCCACGAATGGCCACCTGTAAGCTCCAGATGACCATTTTTGTTATTCTCCACAACGAGTTAGTTCTTCTTTTCGGATCCGGCACTTCTGGGGGGGAAATCCAGCGATGGCTGGATTATGTCGTCAATTAAAAATGCGGCGAGTAGATTAGCAAATATCCACGCTTTCGCGAGTTCAGGTTCCTTTGCACGCAAAGCATCCAGGTGCAGCAAACTTTTGAGCCGCTTAAAAGCCAGTTCAATTTGCCATCGCAGACGGTAACAATCAGCCACTTGCTCTGCTGAATATTCATCTTCCGGTAATGATGTTAGCAATAGCACATGGCCCGCTGCTTCCAGCGTTTCCGCC # Questionable array : NO Score: 6.26 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: F [75.0-56.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.51,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 1004720-1005358 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP070930.1 Escherichia coli strain 30COLEC chromosome, complete genome Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 1004720 29 100.0 32 ............................. CTCTTCAGCAATGAAATCGTCAAACGAGATTA 1004781 29 100.0 32 ............................. ATTACGCCGCCTCGCGTTTTTAGTCATTTCTA 1004842 29 100.0 32 ............................. AGGAGTTTAATTTCCAGATTGAGCGCTGGATA 1004903 29 100.0 32 ............................. CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGG 1004964 29 100.0 32 ............................. CACGGCTGGCCATTTGAAATACCTGTTGCTCT 1005025 29 96.6 32 .T........................... AACAGCGAGCCAACTGGTTTCAGATTGCTGAA 1005086 29 96.6 32 .T........................... GCGATCTCGCGGAATACACCGACGAGGCGGGC 1005147 29 96.6 32 .T........................... TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC 1005208 29 96.6 32 ...C......................... GAGCCTGACGAGACTACTGAGGCCGTTCTGTC 1005269 29 100.0 32 ............................. GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT 1005330 29 96.6 0 ............................A | ========== ====== ====== ====== ============================= ================================ ================== 11 29 98.5 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TAGCTCAAAATCAGTGAACTGACAGGTATGAGGATCATATCCCATATGTAGTCGCCATTCAGCGCTGCCGCCCCCGGGCGCACTGATTGCTGTTCCATCGACAAGACGCAATCTCTTTCCGCTTGTACAACCCGTAACTGCGGCGCGTACAGCAAGTGTTTGTGCGGCAAGTATGCCAAACCAGTCGGCGGCATTCCGCAGCCGCTTCAGGAGAGCCACGTCAGATAATGTTGCAACGTCATGGAGCTGAGCCCATGCAGTGACTTCACGTAATGACATCCCCCCGGGGCCGTAAGCCAGCCCCAGACGTAGCAGAGTTGCAGCATCACGAATTTCGCGGCGGCGGGTTAGAGCCCCGGCATTACGTGCCGAAGTATCCAGTTCTTCGGGCTTACCAATATGGGCCAGAATTGCTGACCAGTTATCGTGAGAGTAATTCATCGGCACGTTAAATCATATCAGGCGTAATACCACAACCCTTAAGTTAGCGCTTATGGGAT # Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATCTGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT # Questionable array : NO Score: 6.19 # Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: R [56.7-68.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.65,0.27 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 3 1030908-1031119 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP070930.1 Escherichia coli strain 30COLEC chromosome, complete genome Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 1030908 29 100.0 32 ............................. CGCACTCAAAATAGTAAATTAATTTATGAATT 1030969 29 100.0 32 ............................. ATCGGACGATGGCGATCGCAATCGCGCGGGAA 1031030 29 100.0 32 ............................. TTTTTGTTCTCTTCAAAACGCCGAACAACCAA 1031091 29 93.1 0 ............T.....A.......... | ========== ====== ====== ====== ============================= ================================ ================== 4 29 98.3 32 GAGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCCGTTTCTCTGGGAGATGCCGGACATCGGAGTAGCTGAGATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTATATGTAGGTGATGTATCCGCAAAAATTCGTGAAATGATCTGGGAACAAATAGCTGGACTGGCGGAAGAAGGCAATGTAGTGATGGCATGGGCAACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACCCCGGTAGATTTGGATGGTTTAAGGTTGGTATCTTTTTTACCTGTTTGAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA # Right flank : GGACGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCCAACTCCCATTTTCATACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGCGGTAGCATTATCCGCATAACATCACGGCAGCGACGTTCTATTCTTCCTGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCAGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAATTTGTTGCTTCTACCGAAAGTACGGCAATACCGGCTTTGTCGAAAACTTCGGCGTCATTACAACAGCCAGTACCCTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCAATTCCATGACTACGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCACTGTTGAAATACAATTTATCGCCAACA # Questionable array : NO Score: 5.77 # Score Detail : 1:0, 2:3, 3:0, 4:0.91, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GAGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [6,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [70.0-41.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.92,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 4 3008737-3008534 **** Predicted by CRISPRDetect 2.4 *** >NZ_CP070930.1 Escherichia coli strain 30COLEC chromosome, complete genome Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================ ================================= ================== 3008736 28 100.0 33 ............................ AACCTACCGTCTTGGCTAGCGGTTGCAGCGAAC 3008675 28 100.0 32 ............................ GGAACAATCTTGCAAAGGCTGTGAAAGTTGGC 3008615 28 100.0 28 ............................ TTCACAGGTAACATACTCCACCCACCAT 3008559 26 85.7 0 ................A...A.-.-... | ========== ====== ====== ====== ============================ ================================= ================== 4 28 96.4 31 GTTCACTGCCGTACAGGCAGCTTAGAAA # Left flank : GATAAATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTCAAGTCCGTAATCTCGAAAGAGGTTGCGGACTTTTTATTTATGGGGTGGAGGTTCAGACCCTTTTTTTAATGATGATGGTAAGTTGTTGATAATTAGTGCTGCGGGAAGGTAAGGATAAAAAAGGGTGCTGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA # Right flank : TGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTGCCAAACCTCATGGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAACTTGTGCAGTATATCTACATCGAGACAAGTTATGGACTTATACTTCCAAAGTACTTCATACATATCACAAAATAAAAAGGCCGGTTAAACCGACCTTTTACTCGTTCTTTCTCTTCGCCCATCAGGCGGTAAAACAATCAGCGACTACGGAAGACAATGCGGCCTTTGCTCAGGTCGTACGGGGTCAGTTCAACAGTCACTTTGTCGCCCGTCAGGATGCGGATGTAGTTTTTGCGCATTTTACCGGAGAT # Questionable array : NO Score: 5.68 # Score Detail : 1:0, 2:3, 3:0, 4:0.82, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTCACTGCCGTACAGGCAGCTTAGAAA # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [6,8] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-7.70,-8.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: R [3-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [51.7-58.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0,5.65 Confidence: HIGH] # Array family : I-F [Matched known repeat from this family], //