Array 1 4-337          **** Predicted by CRISPRDetect 2.4 ***
>NZ_MOWP01000022.1 Frankia sp. CcI49 contig_21, whole genome shotgun sequence          Array_Orientation: Forward

  Position Repeat    %id Spacer Repeat_Sequence               Spacer_Sequence                   Insertion/Deletion
========== ====== ====== ====== ============================= ================================= ==================
         4     29  100.0     32 ............................. CCGAATCCGCAGCCGGATGGCGGATCCGATGA
        65     29  100.0     32 ............................. GCCGCGACCGCACCCACGGCGCGCTTGTGCGC
       126     29  100.0     32 ............................. AGGACTCACATGGATGGTACGACGGGCGTGCG
       187     28   96.6     32 .....................-....... CAACGCGAATACTCGGGTGCGCGATGGATTCT
       247     29  100.0     33 ............................. TGCACAATCGGACCGCAGCAGGCTGTCAACGTC
       309     29   89.7      0 ..T.....G..........A......... |
========== ====== ====== ====== ============================= ================================= ==================
         6     29   97.7     32 GTCCTCCCCACGCCCGTGGGGGTGATCCG

# Left flank :   GCCG
# Right flank :  GCGGGCGCAGTAGCGCGGCGAGAGTCTATATTCCGGCAGATTATAGGTTTGAGTTTGGTTGTGGGGGGCTGGAGGTCATTGGGTATTGGGGTGACGGCGACCTCCGTGTGGCTTGGATTCCTGGGCCCCGTTCGTGCGGTGGCGAACGTGCGGTGGTGAAAATTTTGGGACGGCTACGGGTTCCCCTGGCGGCCTTCAGGAGGTTCTGAGGTTGGCGAGTGAGCATCGACGGTTTTCGCGCTCCTGTGGTGTAGGTCTCGGGGAATATCTCGGGGATCGGTAGGCTGAGCGTCCATTGGTATGACGAGTGTTTGTTTGCGAAAAGGTTCCATCCGACGCCCGGGAGCGCTGGACATCTTGCAGTCAGACGGTCGGCGGATCGCGAGGTCAAGGAAATGGTGTGATCGCGCCTACCGCGGTCCGTCGGCGGTCCGTGTGTGTTGGAGTCGGTGACCGTGAGGGAGGAGGAGCAAGCCGCTCCGACAGAGTCTGCTGAGCTG

# Questionable array : NO     Score: 6.15
#     Score Detail : 1:0, 2:3, 3:0, 4:0.89, 5:0, 6:0.25, 7:0.01, 8:1, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTCCTCCCCACGCCCGTGGGGGTGATCCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [2,5] Score: 0.37/0.37
#     Reference repeat match prediction:         F [matched GTCCTCCCCACGCCCGTGGGGGTGATCCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  F [-11.50,-10.80] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      F [0-10] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: R [0.0-46.7]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         F [5.28,0.64 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 282089-281122          **** Predicted by CRISPRDetect 2.4 ***
>NZ_MOWP01000020.1 Frankia sp. CcI49 contig_19, whole genome shotgun sequence          Array_Orientation: Reverse

  Position Repeat    %id Spacer Repeat_Sequence               Spacer_Sequence                                                                                                       Insertion/Deletion
========== ====== ====== ====== ============================= ===================================================================================================================== ==================
    282088     29  100.0     32 ............................. ACATCCGAAACCGCGACGTTATCTGGATACGT
    282027     28   96.6     32 ...........-................. ACGTACGACGGCCGCGAGTGGGTCGTCCGCGC
    281967     29  100.0     32 ............................. CCTCATCGAGTGACGGTCTCTGTCTACGGCGC
    281906     29   96.6     32 ...........A................. AGGCTGGAGATCAGCGGGGATTTCGGCTCGCG
    281845     29   96.6     32 ............................T AGCCGGCCACTGTCGGCATGCTGTTGAACGGC
    281784     29  100.0     32 ............................. GCTCGGATCAGTGAGCAGACCAGAAGATCACA
    281723     29  100.0     32 ............................. GCTCGTTACCTCGGGAGTCTGGATACCACAGG
    281662     29   96.6     32 ..................A.......... AGAATGCCGCCGCGCAGGAATCCCCCACAACC
    281601     29  100.0     32 ............................. AGCTTCGCTTCGGTTTCGGTCATGGCGTTTCC
    281540     29  100.0     32 ............................. GTGACGGGCCAGGGCTGGAACGGCGCCGAGAC
    281479     29   93.1     32 .........G...........A....... CAGGTCGACCCGCAGTGATCTAGCGGGACCGG
    281418     29  100.0     32 ............................. AGGGCGACCCGCCGCGGGTCTGGAACAAGGTC
    281357     29   96.6     32 .C........................... GGCTGACCGGTTGGGGGTGGTTGCCTGCTTCG
    281296     29   93.1    117 ...T.........T............... CATTCGAACCGCTGGGACGGCCGGCGCTTCGCGGCCCCGCTGGGGGTCAGCTTTCAGGAAGCTGCGACACCGTGGGGGTGATCCTGTGCGCGTGCTGTTCCGGTGGGTTCCCTGGAC
    281150     29   96.6      0 ..............T.............. |
========== ====== ====== ====== ============================= ===================================================================================================================== ==================
        15     29   97.7     38 GTCCTCCCCACGCCCGTGGGGGTGATCCG

# Left flank :   GCAGTGGTGA
# Right flank :  TACGAGCGCTACATCATGGGCGGGACCTACGTGTCCTCGCCACACTCGCATCAACAGGCGATGAAACCGGACCCTTCATGACCACCCAGGATGACCCAGCGTCAAACGCGGAAGCGGAAACGTCACTGGCGAGGCAGTCCCTGCCTACCACCCTGCCTGGTCACGGCAGATCCATCCCAGGGCCGGGCGGCGTGATGTCCAGCGGAATCCAGCCATGACCATCCAGATCACCGACGTCGACCACGTCCGCCTGGCCAAACCCGATTGCTGCTTCTGCAGGGTTCGGGTCTGCCTCTGGCCAGCCGGGGGCGTGGGATTGGTTCGGAGAGGGCGCCGCGGGCTCCGGCGGGGTCCTTGAAGGCGATCCCGGCGTTTCCGCTGGTTGATCTGTTGTAGCTGGTGTCGCTTGAGGAGCATTTGACGGATCGCGGTGAACAGCAGCGGGCCCAGTTGCCGGAGATGGTACGGCGGGCGAAGTATTTCCACCGGATGGTGGTTCA

# Questionable array : NO     Score: 5.38
#     Score Detail : 1:0, 2:3, 3:0, 4:0.89, 5:0, 6:0.25, 7:-0.52, 8:1, 9:0.76,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTCCTCCCCACGCCCGTGGGGGTGATCCG
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     F [5,2] Score: 0.37/0.37
#     Reference repeat match prediction:         R [matched GTCCTCCCCACGCCCGTGGGGGTGATCCG with 100% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  R [-10.80,-11.50] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      F [6-10] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: F [38.3-6.7]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         R [1.05,4.87 Confidence: MEDIUM]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 29418-28052          **** Predicted by CRISPRDetect 2.4 ***
>NZ_MOWP01000013.1 Frankia sp. CcI49 contig_12, whole genome shotgun sequence          Array_Orientation: Reverse

  Position Repeat    %id Spacer Repeat_Sequence               Spacer_Sequence                  Insertion/Deletion
========== ====== ====== ====== ============================= ================================ ==================
     29417     29   96.6     32 .......G..................... CCTGCTCAGCATCGGCCTCAGCCCTGCCCGTA
     29356     29   96.6     32 ............................G CGGCAGGGTGATGACCGGTACGCCGAGGACCT
     29295     29  100.0     32 ............................. GCGCTGGTCACCCCGCTGGCTGAGCTCGCGAT
     29234     29   96.6     32 ............................T GCGATCGGCCAGATCGACACCAGCCAGGCCGC
     29173     29  100.0     32 ............................. CCTCCAGCCGGAGCGGTCAGTCCTGGTCGGCC
     29112     29  100.0     32 ............................. GTAAACAAGGCGAGGGTCACGGCCGACTCGGT
     29051     29   96.6     32 ............................G CGCCAGTTCACGACCGGCCTCGGCACCCTCGG
     28990     29  100.0     32 ............................. AGGTCGGGCAGGTCCGTGGTGGTGGTCCCGGC
     28929     29  100.0     32 ............................. CGAGCCCGCGGAAGATCTGCTTTTCCCCGCAG
     28868     29  100.0     32 ............................. ACGTCAGGCACATCAACGGCCAGTGCGCCGTC
     28807     29   96.6     32 ..............T.............. AGCGCGGCGGACACGGCGAGGATCCCGGCCTG
     28746     29  100.0     32 ............................. CGGGTGACGGTCGGCGGGCTGTCGGACGCCGG
     28685     29   93.1     32 ...........................TG ACGGGATCACCCCCATGGCGGGCGGCGGTTTC
     28624     29   96.6     32 ............................G CGATCGAGCCGAGAGTCAAGGAGAGGGAATCG
     28563     29   93.1     32 .................A..........T CCGTCCGCGGTCGGCGGGTCCGGCGTCGTCGG
     28502     29   96.6     32 ............T................ TGGATCAAGTTCGTGAAAAACGGGGCCAGCTG
     28441     29  100.0     32 ............................. GACTCCACCACCAGTCTGGCCACCAACGAGGG
     28380     29  100.0     32 ............................. CACTCAGGCGCGTCAGGCAGGCCACGGAGTTC
     28319     29  100.0     32 ............................. AGGTTGATCGCGCGGTCGAGCCCCTTCCCGAC
     28258     29   96.6     32 ..............G.............. TCGAAGGTGACCGACCAGCAGGCGCGCGAGGT
     28197     29  100.0     32 ............................. TATTGGGCTCACATCGGTCTCAGTAAGTCTCA
     28136     29   96.6     29 ..T.......................... GCCAACCTCAACGAGGGCCCGGCCCGCGC
     28078     27   79.3      0 .....T....G.........--...C..T |
========== ====== ====== ====== ============================= ================================ ==================
        23     29   97.2     32 GTCGTCCCCGCGCACGCGGGGGTCTTCCC

# Left flank :   CGCATCGTCCACGACATCAAGGACCTCCTCGTCGACGGAGACAGCTCCACTCCAGACGAGGATTGCCTGCACCTGTGGGACGAGGTCGACGGCGAGGTCCCCGGCGGCGTCAACTGGGCCGTTGATCTCGCGGACGGTTGGGACGACCTCATCAGCCTCGGAATCACCGGACCCGACCTTCCCCAGACCTCACCGCCGTTCTGATGACCGTCATCGTCCTCATCGCGGCTCCCGAAGGCCTGCGCGGTCACCTCACCCGCTGGATGATCGAAATCGCCGCCGGCGTCTACGTGGGCAACCCCGGCGCCCGCATCCGCGACCGCCTCTGGACCCTGCTCGCCCAACGCATCGGAGACGGCCAGGCAGTCATGATCGAACCCGCCAACAACGAACAAGGCTGGGCCGCCCGTACCGCCGGCCGCGACCGCTACCACCCCATCGACTACGACGGACTGATGCTATTCGCCCGCCCCCGCAGCTAAAACCCCAGCTCAGCAAGC
# Right flank :  GATCAGGAACACGGCTAATCCGACCGGTCCTACATGTCCAAGCCGGGCCCGTTGCCGGGGGTCGGTGAAGTGAGGTTCTCGGGCTCCGCCGGGGCCATGGCGCCGAGTCGATGCCGGCGGGGAACGATGTCGGCGGCGGCTTTGGCCGCTGCGCGGTCGGCGTCGGCGAAGATCGAGGTGTAGATGTCGCCGGTGAAGTGGACGGAGGCGTGTCCGAGGAGTTCGGAGACGACCTTGAGGTCTTTTACGGCGCGGTAGGTGAGGCTGGCGGCGGTATGGCGTAGATCGTGTAGACGGATTGGTGGCAGCGGGCCGAAGGTGAGCGCGGTGTCGATGGCCTCGGTGGGCATGAGGTGGCGGGCGGCGAGCTGGTCGACGGTCCACTGGTGTTCGGTGTGTTCGCGGCGGATGGTGGCGAAGCGGGTGATGAGCCGTTCGAAGCGTTGCGAGACGCCGTTGGGGGTGAGCCTGCTGCCGTCGGGGTGGGTGAACACCCGGCC

# Questionable array : NO     Score: 5.86
#     Score Detail : 1:0, 2:3, 3:0, 4:0.86, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.74,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTCGTCCCCGCGCACGCGGGGGTCTTCCC
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     F [5,1] Score: 0.37/0.37
#     Reference repeat match prediction:         R [matched GTCGTCCCCGCGCACGCGGGGGTGTTCCC with 97% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  R [-12.30,-13.00] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      R [7-1] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: NA [36.7-38.3]%AT Score: 0/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         R [0.37,5.28 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 2 39591-39863          **** Predicted by CRISPRDetect 2.4 ***
>NZ_MOWP01000013.1 Frankia sp. CcI49 contig_12, whole genome shotgun sequence          Array_Orientation: Forward

  Position Repeat    %id Spacer Repeat_Sequence               Spacer_Sequence                  Insertion/Deletion
========== ====== ====== ====== ============================= ================================ ==================
     39591     29   96.6     32 ............................T TCCCGGGCCTCCAGGCCGGACAGCATCGGCCG
     39652     29  100.0     32 ............................. CGGCAGATCCGCTGCAGCGCAGTCGATCCCGG
     39713     29   96.6     32 ............................G GAGCTGACCGGAGAGGACCCGGTGCGGGCCGA
     39774     29  100.0     32 ............................. CCGGCGAACGCGCTGGTCAGGGCCCGGCGTAG
     39835     29  100.0      0 ............................. |
========== ====== ====== ====== ============================= ================================ ==================
         5     29   98.6     32 GTCGTCCCCGCGCACGCGGGGGTCTTCCC

# Left flank :   ACCAGCTGCCGAATGATGCCACTGCCGGCCCTGCCGCACCGTCAGGCCCCGCCACCCCAAACCAGCCGCCTGTACCCGGGCAGCAAGACCCTCATCCTTCATCTGAAACGCCGGAGTCGCCTTACCCACATCATGCAGCCCACATAGCAACGCGAATAGTTTCCGGCCCCGCCCGTCGCTACAGGCATCCAGAGACCGTCGCACCGAAGAAGAAAGGTACCGATCGAAAACGAGCTCACCGACCGCCGCCGCGTCCAACAGATGCCCCAGGAGCAGATGCGCCGAACCCTGCCCGTTTTCCGGCTCCGACTTTCCCCACACAACACCCAGCGCCGTATCCTGCAAGGGACCCCCCAAAGTCGACCCAGCAACCCCACGATCCAACCATCATCAGGCCCCACCACCCCCAGTCCCGCCCATCACCGGGAACAAGATGATCAACCGGCAGGCCACTAATATTTTTTGGCACCGCTGAGCCGCAAACCCCCAGGTCAGCAAGT
# Right flank :  TCCGT

# Questionable array : NO     Score: 5.99
#     Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:0.8, 9:1,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTCGTCCCCGCGCACGCGGGGGTCTTCCC
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [1,5] Score: 0.37/0.37
#     Reference repeat match prediction:         F [matched GTCGTCCCCGCGCACGCGGGGGTGTTCCC with 97% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  F [-13.00,-12.30] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      NA [0-0] Score: 0/0.41
#     AT richness analysis in flanks prediction: F [43.3-3.3]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         F [5.14,0.37 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//

Array 1 31-1214          **** Predicted by CRISPRDetect 2.4 ***
>NZ_MOWP01000014.1 Frankia sp. CcI49 contig_13, whole genome shotgun sequence          Array_Orientation: Forward

  Position Repeat    %id Spacer Repeat_Sequence               Spacer_Sequence                   Insertion/Deletion
========== ====== ====== ====== ============================= ================================= ==================
        31     29   96.6     32 ............................G TCGATCACGTCCAGCGTGCGGTACCGCACCTT
        92     29  100.0     32 ............................. CGACATAGGCAACCGGTGGCTGCCAGTTCGTG
       153     29  100.0     32 ............................. GAGGTTGTCGGCCGCGACGTGCTCGAGGTCGG
       214     29   96.6     33 ............................T CCCCTCGAGGCCACCCGCGGCGCCCTCCCCTAG
       276     29   96.6     32 ...................C......... GCCAACTTGATCACGCCGACGCCCTGCGCCAG
       337     29  100.0     32 ............................. TCGCGGTGGGCCCGATCGGTGTGGCTGCGCAC
       398     29  100.0     32 ............................. CTCACCCCGGCGGTGGTCGCGAGCTGGGAGCG
       459     29  100.0     32 ............................. GTGGGCAGCGTCTTCGGCGGCATGATCGGCTA
       520     28   96.6     29 .....-....................... ACTGGGGCCAGTGGACGAGCAACTGCGCG
       577     29   89.7     32 .........A............C.G.... CTGCTCACCAACGCGGGCGGGATCATCATGGT
       638     29  100.0     32 ............................. ATCCGTCATGATCCGTCAATATCCGTCACTGA
       699     29   96.6     32 ..........A.................. GGGATCGACGCCTGGCGCTGGCGCTCCTCCCA
       760     29   96.6     32 ......................C...... GACGCCGCGGTCACCTCCATCTCCAACGTGCG
       821     29  100.0     32 ............................. GTCGCGCCGGAGCCATCTCCGGCCTCCCTATG
       882     29  100.0     32 ............................. TACGGCGTCATCTCGTTCGAAGCGCTCACGGC
       943     29   96.6     32 .....................A....... GCGGTCGTCTCGTCCGGGTCGTGGACGAACAC
      1004     29   93.1     32 .......................T....G CAGGCGAAGGACGGCTTCACGGTCACCACCCG
      1065     29   93.1     32 .............T..............G ACGTAGACGCCGGTGGTGTCGTGGGTGAACGT
      1126     29   96.6     32 ..................A.......... TGCGCCCACTGGGTGTTGAAGGAGCCGGCCCC
      1187     28   86.2      0 A..............A.-.....G..... |
========== ====== ====== ====== ============================= ================================= ==================
        20     29   96.7     32 GTCGTCCCCGCGCACGCGGGGGTCTTCCC

# Left flank :   TTTCGCATCCCCTCCCGACTGAGGAGGGCCG
# Right flank :  CCATACGATCGTCTACGAGCTGGGGTTGGAGGACCTACGAGCAGGCCCACCCGGACGTGGTCGCGGTGTAGGTCCGCGAAGCCAACACCGAGGGTGAGGGCGCGATCTCCGCGGCTGAGCTGGTGCACGCGGCGCTGCGGATGTTCCCGGACCGGGTGATCGTCGGCGAGGCGCGCGGGCCCGAGGTGATCCCGATGCTCAACGCGATGAGCCAGGGCAGCGACGGGTCCATGACCCCGCTGCTCAAGGTGGTCTCCGGACCACTCGGGCACGCCTCCATCCACTTCACCGGCGACATCGACACGTCGATTTTCGCCGACGCCAACCCCGCAGCGGCCAAAGCTGCCGCCGCCATCGTCCCCCGCCGGCATCGACCGGGCTCTACAGCACCGGCGCAGCCCAAGAGTCTCGCTTCGCCGACCCTCAATAACGGGCCCGGCCTGGACATGTGGGGACAACAAGAATCAGTTCCCGGCCGAACCCCTCACCACCTGCGACTA

# Questionable array : NO     Score: 5.70
#     Score Detail : 1:0, 2:3, 3:0, 4:0.84, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.60,
#     Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats),

# Primary repeat :     GTCGTCCCCGCGCACGCGGGGGTCTTCCC
# Alternate repeat :   NA

# Directional analysis summary from each method:
#     Motif ATTGAAA(N) match prediction:         NA Score: 0/4.5
#     A,T distribution in repeat prediction:     R [1,5] Score: 0.37/0.37
#     Reference repeat match prediction:         F [matched GTCGTCCCCGCGCACGCGGGGGTGTTCCC with 97% identity] Score: 4.5/4.5
#     Secondary Structural analysis prediction:  F [-13.00,-12.30] Score: 0.37/0.37
#     Array degeneracy analysis prediction:      F [1-9] Score: 0.41/0.41
#     AT richness analysis in flanks prediction: R [16.7-35.0]%AT Score: 0.27/0.27
#     Longer leader analysis prediction:         NA
# ----------------------------------------------------------------------------
#     Final direction:         F [5.28,0.64 Confidence: HIGH]

# Array family : I-E [Matched known repeat from this family],
//