Array 1 201224-201984 **** Predicted by CRISPRDetect 2.4 *** >NZ_VOTC01000001.1 Escherichia coli strain WU1022 NODE_1_length_348881_cov_505.216205, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 201224 29 100.0 32 ............................. GCATTGACGCTTTAAACGACGACGGACGCCAC 201285 29 100.0 32 ............................. AAAACAGCCTTTAGATTAGTACCTGACGACCG 201346 29 100.0 32 ............................. TAAACGCACCTGGCGCGCCACTTTATCAACAA 201407 29 100.0 32 ............................. CGGCTTGTTTAATTGCGTGGAACGTCTCAATT 201468 29 100.0 32 ............................. ACGGCGTGGATTGAGGGACGGGTATTTGGTCC 201529 29 96.6 32 ............................T AGATCGCGCCACGAGGAAACGAATATGAACGG 201590 29 100.0 32 ............................. GTCTGTGATGGCCTGCTCGTGAGTCCGCGGCG 201651 29 100.0 32 ............................. TTTTGATTTCATTAACGGCGCTCCCCATATTT 201712 29 100.0 32 ............................. TGCGCCGTAGCGTGTCCACCTATTGTAGTAAA 201773 29 100.0 32 ............................. ATACAAACGCGGTGTTTATCAATATGAATTTT 201834 29 100.0 32 ............................. TATTACGCGCCAGCAATGCTGACAGCGGCAAA 201895 29 100.0 32 ............................. CGCGAGAGCCAGCAAAACGCCAGGGCACAAAA 201956 29 93.1 0 .A..........................A | ========== ====== ====== ====== ============================= ================================ ================== 13 29 99.2 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : TGGATGAACTATTGGCAACGCTGACCGATGATAAACCGCGAGTCATTGCACTGCAGCCGATTAGCCAAAAGGATGATGCCACACGTTTGTGCATTGAAACCTGCATTGCGCGTAATTGGCGTTTGTCGATGCAAACACATAAATATCTAAATATTGCCTGATTAAACATTTATAAGCGTTATAAATGGGTGGAACCTGTAAAGACTTCTACTCATTTATATTGTTTGTCGCCTCTGAAAACTCCTCCATTTTACCCATCCAGGGCTAATCATTAGCATTCTCTACAAATTCTGTGGCATTAATTTTTCGCTGGAGTGAAAATTATTGCGGTAAAGTTTGGTAGATTTTAGTTTGTATAGAGTTATTTTAAATATTTACCTTTTTAATCAATGGATTAAGTACTCTTTAACATAATGGATGTGTTGTTTGTGTGATACTATAAAGTTGGTAGATTGTGACTGGCTTAAAAAATCATTAATTAATAATAGGTTATGTTTAGA # Right flank : ACCATATAACCCGTTATCTCTTTCTCAAGTTTTTATATTAGCAGTACTTGTAATAAGCAACATATCCACGTAACCCCTCATGTTCAAAATAGTTCTCCATGCCAGAGAAGTTCACAATTATCGATACAAAAAATTAAATTTAATCAAAGTGTTATTTGTATGATTCTTAAATCGTTAAGAAATTTTAATCTATTATTTTTTTAATGTTGAATTAATATCTATTAATTTTTTCTTTAGGTTAATAGTTTGTTTTTTAAGCTTGTTATTCATTGATTAAGTAATAAATCTGAAAATTTGTCTTTGTTTTGAGGCTAATGAGTGGTTTTACATTACCGTTTCTATACGCTGTTGATGAATAGTTCTTATGAATAAAGATATCCAGTTCATACTTTAAGTGAAAATTGATAAAGTGCGATTCGTATTGTCTTTTATTCTAAAGACATCGAGTGTAGTTAATATTCCTTGTAAAAACAGGGATAAACCGAACTAGTTAAAGTTTT # Questionable array : NO Score: 6.22 # Score Detail : 1:0, 2:3, 3:0, 4:0.96, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [75.0-68.3]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.65,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 217486-218125 **** Predicted by CRISPRDetect 2.4 *** >NZ_VOTC01000001.1 Escherichia coli strain WU1022 NODE_1_length_348881_cov_505.216205, whole genome shotgun sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================= ================== 217486 29 100.0 32 ............................. CGACAAAATTCTCAAAACTCGATCAGGAAAAT 217547 29 100.0 32 ............................. CCACCGTTTTCGCCCACCAGGGCGCACAACCC 217608 29 100.0 32 ............................. GAAAAAGAGAAGGTAGAGAAAGCGGAATCTGG 217669 29 100.0 32 ............................. CAGGTCTATCGGGCGATCAATAAAATCGGTCA 217730 29 100.0 32 ............................. GCGCACCGTTGCGTCGAAAAGGCGCTGGAGAT 217791 29 100.0 32 ............................. TACGCTTACACAACGGGCGAATATTTTAACGG 217852 29 100.0 32 ............................. GAACCCAATAGTGAAATACAGCATCATTTTTT 217913 29 100.0 32 ............................. ACCTGGAGGCGAAAAAGGCGCTTCGACGTAAA 217974 29 100.0 33 ............................. GAGGCCTATATCTCTAACCGCATCGGGCTGCGC 218036 29 100.0 32 ............................. GGGCAAATATAAATTCCAGCGTGCTTCATGAA 218097 29 100.0 0 ............................. | ========== ====== ====== ====== ============================= ================================= ================== 11 29 100.0 32 GTGTTCCCCGCGCCAGCGGGGATAAACCG # Left flank : GTCCTTGCTGCAGGTGAAATTGAACCACCACAACCCGCGCCGGATATGTTACCGCCTGCCATCCCTGAACCTGAAACGCTGGGTGATAGTGGTCACCGGGGGCGCGGCGGATGAGCATGGTCGTGGTTGTTACAGAAAATGTCCCTCCGCGCTTACGTGGACGGCTCGCAATCTGGCTACTGGAAGTGCGTGCCGGTGTGTATGTTGGTGATACATCAAAACGTATTCGGGAGATGATCTGGCAACAAATTACCCAACTGGCTGGTTGCGGAAATGTGGTGATGGCCTGGGCGACCAATACCGAATCGGGTTTTGAATTTCAGACCTGGGGAGAAAACAGACGTATTCCGGTGGATTTGGATGGGTTACGTTTGGTTTCTTTTCTTCCTGTTGATAATCAATAGGTTATGTGTTCTTTAAAAATAAGGAAATGTTTGAATTTAGTTGGTAGATTGTTGATGTGGAATAAATTTGTTTAAAAACAGATATGTATGCTTAGT # Right flank : GGGCGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCCGCCCCTGCGGTAGAACTCCCAGCTCCCATTTTCCAACCCATCAAGACGCCTTCGCCAACTCTTTCACCAGAGGTAGCATTATCCGCATAACGTCACGGCAGCGACGTTCTATTCTTCCAGGAAGTGCCTTATCAATATGCTGTTGATTATCAAACCTGACGTCATGCCAGCTATTTCCCGCAGGGAATGCGGCTGTTTTTGCGCGTTGCTGATAACCATCCTTATTCCCAAGATTCCAGTTAGTCGCTTCCACCGAAAGTACAGCAATGCCCGCTTTGTCGAATATTTCTGCGTCATTACAACACCCGGTGCCTTTCGGATAGTTTTTATTCAAACCAGGATTGGTCGTCGCGGCTATTCCCTGACTGCGCGCAATTGCCAGTGCCCTGTCGCGCGTTAATTTCCTTACTGCTTCAGGGGTTTTTACACCGCTGTTGAAATACAATTTATCGCCAAC # Questionable array : NO Score: 6.26 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGTTCCCCGCGCCAGCGGGGATAAACCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGTTCCCCGCGCCAGCGGGGATAAACCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-13.50,-12.00] Score: 0.37/0.37 # Array degeneracy analysis prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: F [73.3-40.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.51,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //