Array 1 854-336    **** Predicted by CRISPRDetect 2.4 ***
>NZ_CXWS01000022.1 Escherichia coli isolate YS river water, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                    Insertion/Deletion
==========  ======  ======  ======  =============================  =================================  ==================
       853      29   100.0      32  .............................  TTCCGCGACCCGGCGATAAGGGAAGATGGGTG
       792      29   100.0      32  .............................  GGCGGCATTACTGCTGCGAGTATAACTACGAT
       731      29   100.0      32  .............................  CCGCCTGATGGCGACGCGTCTTTTAACCCCAT
       670      29   100.0      33  .............................  TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC
       608      29   100.0      32  .............................  CTCTGATTCATCGGCGGCGATACTGTCATCAC
       547      29   100.0      32  .............................  GAAAAACAAATAGATGGATAGCTCGATATCAT
       486      29   100.0      32  .............................  ATTATTAATTCTGGTGGCGCTGGTCGCCCTGG
       425      29   100.0      32  .............................  AGCGCGCGCGGGCTACTGCACTCGGTGATAAC
       364      29   100.0       0  .............................  |
==========  ======  ======  ======  =============================  =================================  ==================
         9      29   100.0      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   CTGGATGAACTACTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGTATTGCGCGTAATTGGCGTTTGTCGATGCAAACCCATAAATATTTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATATTTTGTCGCCTCTGAAAAACCTCAATTTTGCCCATCCTGGACTAATCATTATCATTCTCTACAAATTCTGTGGCGTTAATTTTTCGTTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGCGTTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTGTAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA
# Right flank :  GCCAGAAAACATGAAAAAACTTTGGGAGGGGATGAGTTCCCATAAGCGCTAACTTAAGGGTTGAACCATCTGAAGAATGCGACGCCTCGGTGCCTCGTTAAGACGATGCCTCGCGTTCTTCAATTGCGTTTTGTAGGCTGTCAGGGATACTGTCCCACGAATGGCCACCTGTAAGCTCCAGATGACCATTTTTGTTATTCTCCACAACGAGTTAGTTCTTCTTTTCGGATCCGGCACTTCTGGGGGGGAAATCCAGCGATGGCTGGATTATGTCGTCAATTAAAAATGCGGCGAGTAGATTAGCAAATATCCACGCTTTCGCGAGTTCAGGTTCCT

# Questionable array : NO    Score: 6.26
#   Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GAGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [3,6] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      NA [0-0] Score: 0/0.41
#   AT richness analysis in flanks prediction: R [56.7-75.0]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.51   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 21700-21497    **** Predicted by CRISPRDetect 2.4 ***
>NZ_CXWS01000008.1 Escherichia coli isolate YS river water, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence               Spacer_Sequence                    Insertion/Deletion
==========  ======  ======  ======  ============================  =================================  ==================
     21699      28   100.0      33  ............................  AACCTACCGTCTTGGCTAGCGGTTGCAGCGAAC
     21638      28   100.0      32  ............................  GGAACAATCTTGCAAAGGCTGTGAAAGTTGGC
     21578      28   100.0      28  ............................  TTCACAGGTAACATACTCCACCCACCAT
     21522      26    85.7       0  ................A...A.-.-...  |
==========  ======  ======  ======  ============================  =================================  ==================
         4      28    96.4      31  GTTCACTGCCGTACAGGCAGCTTAGAAA

# Left flank :   GATAAATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAACCGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTCAAGTCCGTAATCTCGAAAGAGGTTGCGGACTTTTTATTTATGGGGTGGAGGTTCAGACCCTTTTTTTAATGATGATGGTAAGTTGTTGATAATTAGTGCTGCGGGAAGGTAAGGATAAAAAAGGGTGCTGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGA
# Right flank :  TGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGCTGCCAAACCTCATGGGTGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAACTTGTGCAGTATATCTACATCGAGACAAGTTATGGACTTATACTTCCAAAGTACTTCATACATATCACAAAATAAAAAGGCCGGTTAAACCGACCTTTTACTCGTTCTTTCTCTTCGCCCATCAGGCGGTAAAACAATCAGCGACTACGGAAGACAATGCGGCCTTTGCTCAGGTCGTACGGGGTCAGTTCAACAGTCACTTTGTCGCCCGTCAGGATGCGGATGTAGTTTTTGCGCATTTTACCGGAGAT

# Questionable array : NO    Score: 5.68
#   Score Detail : 1:0, 2:3, 3:0, 4:0.82, 5:0, 6:0.25, 7:0.01, 8:0.6, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GTTCACTGCCGTACAGGCAGCTTAGAAA
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [6,8] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GTTCACTGCCGTACAGGCAGCTTAGAAA with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-7.70,-8.00] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [3-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: NA [51.7-58.3]%AT Score: 0/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.65   Confidence: HIGH]

# Array family : I-F [Matched known repeat from this family],
//

Array 1 18453-17936    **** Predicted by CRISPRDetect 2.4 ***
>NZ_CXWS01000037.1 Escherichia coli isolate YS river water, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
     18452      29   100.0      32  .............................  GGCAAAAACCATTTTTTTCCAGAGCGATTTAA
     18391      29   100.0      32  .............................  CCCAGCCAGAAAACATCCGAACGCGTTGTCTT
     18330      29   100.0      32  .............................  TTCAGCTCCAGCCATGAAATAGCCAAATGCCG
     18269      29   100.0      32  .............................  GGCTATTTGTTAAAATTGGCCTGGATGGTAAC
     18208      29   100.0      32  .............................  TGGCTGAGTACTTCGGCAAACGACACGGCGAT
     18147      29   100.0      32  .............................  AAATTGATCCCAGGGTGATTATCGTGGGGATC
     18086      29    93.1      32  .T..........C................  ATTTCTTTAATTATTTAGCTGATGCTTTTAAA
     18025      29    93.1      32  .T..........C................  GGTAAAAACACGGTCTGAACCGACATTCATGT
     17964      29    96.6       0  .T...........................  |
==========  ======  ======  ======  =============================  ================================  ==================
         9      29    98.1      32  GAGTTCCCCGCGTCAGCGGGGATAAACCG

# Left flank :   CGTGCTTGCTGCTGGAGAAATACAACCGCCGGCCCCACCTGAAGATGCACAGCCTGTTGCCATTCCGCTTCCTGTTTCACTGGGAGATGCAGGCCATCGGAGTAGCTGAAATGAGTATGTTGGTCGTGGTCACTGAAAATGTACCTCCGCGCTTACGAGGCAGATTAGCCATCTGGTTGTTGGAGGTACGTGCAGGGGTGTATGTAGGTGTTGTATCCGCAAGAATCCGTGAAATGATCTGGGAACAAATATCTGGACTGGCGGAAGAAGGCAATGTGGTGATGGCATGGGCTACGAATACGGAATCGGGATTTGAGTTCCAGACATTTGGGGTAAACAGGCGTACTCCGGTAGATTTGGATGGTTTAAGGTTGGTCTCTTTTTTACCTGTTTAAAAACAAAGAATTAGCTGATCTTTAATAATAAGGAAATGTTACATTAAGGTTGGTGGGTTGTTTTTATGGGAAAAAATGCTTTAAGAACAAATGTATACTTTTAGA
# Right flank :  GGCGCACTGGATGCGATGATGGATATCACTTAGAATTCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCAAACCCATCAAGACGCCTTCGCCAGCTCCTTCACCAGAGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGTTGTTGATTATCCAGTCTTACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCAGTGCCTTTCGGATAATTTTTATTCAAACCCGGATTGGTCGTTGCGGCTATTCCGTGACTGCGCGCAATTGCCAGCGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAACAA

# Questionable array : NO    Score: 6.16
#   Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GAGTTCCCCGCGTCAGCGGGGATAAACCG
#   Alternate repeat :   GTGTTCCCCGCGCCAGCGGGGATAAACCG

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [4,6] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GAGTTCCCCGCGTCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [1-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: R [43.3-70.0]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0,5.92   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 41708-40642    **** Predicted by CRISPRDetect 2.4 ***
>NZ_CXWS01000037.1 Escherichia coli isolate YS river water, whole genome shotgun sequence    Array_Orientation: Reverse

  Position  Repeat     %id  Spacer  Repeat_Sequence                Spacer_Sequence                   Insertion/Deletion
==========  ======  ======  ======  =============================  ================================  ==================
     41707      29   100.0      32  .............................  CTCTTCAGCAATGAAATCGTCAAACGAGATTA
     41646      29   100.0      32  .............................  GACCAGAGATGTCGTAGCCGTATTTCGCAGCC
     41585      29   100.0      32  .............................  TCACCGGGTCAGATACTGATGTTATGGCTTAT
     41524      29   100.0      32  .............................  AGGAGTTTAATTTCCAGATTGAGCGCTGGATA
     41463      29   100.0      32  .............................  GGCACAAAAAAACCCGCGCACGGCGGGTTAAT
     41402      29   100.0      32  .............................  GTGAGTCCGTCAGCGGTGCGCCGCTGCAACAC
     41341      29   100.0      32  .............................  CTCGATCAGGAAAATGAATTCCTGGAAAAAAA
     41280      29   100.0      32  .............................  AGGAGTTTAATTTCCAGATTGAGCGCTGGATA
     41219      29   100.0      32  .............................  CTCGATCAGGAAAATGAATTCCTGGAAAAAAA
     41158      29   100.0      32  .............................  CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGG
     41097      29   100.0      32  .............................  CACGGCTGGCCATTTGAAATACCTGTTGCTCT
     41036      29    96.6      32  .T...........................  AACAGCGAGCCAACTGGTTTCAGATTGCTGAA
     40975      29    96.6      32  .T...........................  GCGATCTCGCGGAATACACCGACGAGGCGGGC
     40914      29    96.6      32  .T...........................  TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC
     40853      29    96.6      32  ...C.........................  TTCTTGCGGGTGTTGCAAATATTCTTCACGTA
     40792      29    96.6      32  ...C.........................  GAGCCTGACGAGACTACTGAGGCCGTTCTGTC
     40731      29   100.0      32  .............................  GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT
     40670      29    96.6       0  ............................A  |
==========  ======  ======  ======  =============================  ================================  ==================
        18      29    98.9      32  GAGTTCCCCGCGCCAGCGGGGATAAACCG

# Left flank :   CCGCTTCAGGAGAGCCACGTCAGATAATGTTGCAACGTCATGGAGCTGAGCCCATGCAGTGACTTCACGTAATGACATCCCCCCGGGGCCGTAAGCCAGCCCCAGACGTAGCAGAGTTGCAGCATCACGAATTTCGCGGCGGCGGGTTAGAGCCCCGGCATTACGTGCCGAAGTATCCAGTTCTTCGGGCTTACCAATATGGGCCAGAATTGCTGACCAGTTATCGTGAGAGTAATTCATCGGCACGTTAAATCATATCAGGCGTAATACCACAACCCTTAAGTTAGCGCTTATGGGA
# Right flank :  CCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACACCTCATGTTCAAAATAGTTCTCCATGCCAGAGAGGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATATTGAATTAATGCCTGTTAATTTTTTCTTTAGAATAACAGTATATTTTTTAAGCTTGTTATTCATTGGTTAAGTAATAAATATGGAAGTTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATAACCGCCTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTTT

# Questionable array : NO    Score: 6.21
#   Score Detail : 1:0, 2:3, 3:0, 4:0.95, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#   Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),
#   Primary repeat :     GAGTTCCCCGCGCCAGCGGGGATAAACCG
#   Alternate repeat :   NA

# Directional analysis summary from each method:
#   Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#   A,T distribution in repeat prediction:     R [3,6] Score: 0.37/0.37
#   Reference repeat match prediction:         R [matched GAGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5
#   Secondary Structural analysis prediction:  R [-12.00,-13.50] Score: 0.37/0.37
#   Array degeneracy analysis prediction:      R [1-0] Score: 0.41/0.41
#   AT richness analysis in flanks prediction: F [68.3-56.7]%AT Score: 0.27/0.27
#   Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#   Final direction:         R [0.27,5.65   Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//