Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

What is the problem for this code? 2

Status
Not open for further replies.

Everwood

Technical User
Jul 18, 2005
78
US
Hi all!

I asked a question about "how to handle two files
simultaneously" several days ago and a couple of
people gave me very useful tips. Thanks again!

Bascially, "short_sequence.txt" file holds 100 short
sequences (8 nucleotide long)and "long_sequence.txt"
file holds 100 long sequences (200 nucleotide long).
Every short sequence will be used to replace a substring
with the same width in the corresponding long sequences.
Then it will return a replaced file which holds 100 new
long sequences.

I have been testing my code and Duncdude's code for that
job. However, when I find is the substring in the replaced
txt file is not the same as the short sequences in the "short_sequences.txt".

What are your suggestions?

Thanks!


My code is :

CODE

#!/usr/bin/perl

use strict;
use warnings;

my (@short, @long, $offset); # the 'short' array will hold the short
#sequences while 'long' array the long sequences

open(SHORT, '<', "short_sequences.txt") || die "Can't open short_sequences.txt: $!\n";
while(<SHORT>){
chomp $_;
chop $_;
push(@short, $_);
}
close SHORT; #Close the file

open(LONG, '<', "long_sequences.txt") || die "Can't open long_sequences.txt: $!\n";
while(<LONG>){
chomp $_;
chop $_;
push(@long, $_);
}
close LONG; #Close the file

# replacement
for(my $i = 0; $i <= $#short; $i++){
$offset = int(rand(193));
print $offset."\n";
#print length($short[$i]);
substr($long[$i],$offset,length($short[$i]),$short[$i]);
print "\n", $long[$i], "\n";

}

Duncdude's code is:


CODE
#!/usr/bin/perl

open (SHORT, "< short.txt");
chomp (@short = <SHORT>);
close SHORT;

open (LONG, "< long.txt");
chomp (@long = <LONG>);
close LONG;

open (OUT_HTML, "> output.html");
print OUT_HTML "<pre>";
open (OUT_NORM, "> output.txt");

for ($x=0; $x<=$#short; $x++) {
$r=int(rand(length ($long[$x]) - length ($short[$x]) + 1));
print "### $r ###\n";

print "$long[$x]\n";

# this section is for visual purposes only
$output_norm = substr($long[$x], $r, length $short[$x]);
print " " x $r;
print "$output_norm\n";

substr($long[$x], $r, length $short[$x]) = "<font color=red><b>$short[$x]</b></font>";
print OUT_HTML "$long[$x]\n";

$long[$x] =~ s/<[^>]+>//g;
print OUT_NORM "$long[$x]\n";
}

close OUT_HTML;
close OUT_NORM;

The "short_sequences.txt" file is:

CAAGACAA
ACAGTTCA
CCGAATTC
GATACTAA
GTTTCACC
TCATCTCA
CCGCCGAT
TATTGTCA
ACGTAACG
TGAGGCTT
CAAATTAC
TCGACGGG
GGTCAGGT
ATAAGGCA
ACGGGCTG
CTCCTCGT
GTAATCAT
GACAAGAA
AACCGTGA
GTCCACGG
AAGCCGGG
CTTGTCCT
GTTCTTAC
AATGCCGG
TAGCAGGA
GTCTTGAG
AGATCTGA
ATTTGATG
TGTTGGCG
CACTGGTT
AGTGAAGG
CTATGATC
GCGATCTA
AATCCAAT
TGATCGCA
TTTGAGAA
TGGTATCT
TCTGGACG
AACCCTGA
TTCCACCT
CGCTCCTG
GACTAGGG
CCAAGCAT
CAACGTTA
TCCTTAAA
TCCCGCAT
TCGCCATT
AAGCTCTC
GTACCGTT
CCAATCCT
CAATCGTC
AGTCGCTA
TACACGAG
TTCGGTAT
ACCAATCG
TGGTGATA
ACCTATTG
TTCCGTAA
CGGCATCT
AAACATGA
CTTATAAT
CGAAGTGT
CCCCCATG
GACTAGAC
CAAAGTTA
AAGTAGTC
TTACGTCA
GACTGGAT
AATATTGG
CTGGGTAT
CTTCAGGT
TCTGTTGA
AAGTGCGT
ACCTTCGT
TAGAGACG
ACAGTCGA
AACGTCTT
GGCATGAT
TGGTTTAT
TTAACTAG
GTAATGGC
CTTGTCAT
TCTGCACA
ACTCAGAA
CTGACATG
ATTCAACC
TAGTCAAC
GACACGGT
AAGACTGC
ACGAGGCC
GTACGGTC
AGCCAGTG
CGTTGATC
CTTCAAAT
ACGTTAGA
CCAACGTC
TTCCGTGC
TTCGGTTC
TCCAATAC

The "long_sequences.txt" file is:

GAATCATATATTAGTCTCCACATACTCCGTTCGTGACCCATTACCCTTTCGGGAGAGCCACAGCAACTGTAGATCTCGAAGTTGACAGGGGCAACTAGAGGCCTCAGAATTCTCACTCTTGAGGAGAGAAGTCTAAGACCTACAGTATGGTCGGGTTAGTTTTTGTTCCGTCGAACCTTGGACTAACCACTGTCTGGATA
AGGATTACCCGCTGGACTTCAAACGCTCGTGAAGCATCGTATTGCGAGGCAACCGAGTCATAGCCCAGTCCGGGGGCCAACGCAGTGCCAGCATCTGCGTTGTTCATCGGTCCTCAGTCTCCCATCAACGTGGTCCACACCTAGCATCCTGGTTTTGCATCCGTAACAAAGGACGTTCGAAGTTTTTTGCCGGCGGGAAG
TGATAATTGGTGCAATATTCTCCATAACAGATCCTCGCCAATACGGATTTGAGGGATCCCTCTGCATTTCCACGAAGCGTGTCACCGATAGAGCAGAAATGCTTTACCGCCGCAGTGATTAGGCGGGTACAGTTGTCCAAACGCACACAACCGAAACCTCCCCATGCGTACTCGTTCGTTTAGTCGCGTACAGAGGGAAC
TCTAGCTTTCGTGAAGTCCCAGGCTCCTAGCCCTACAGCGACTTCTGGCCGGATCTGAGGGAAACTTGTCTAAACTTTTTTCCCCGCGAGACTCACTGTTATAAGAGCTGGAGTTTAGATGGGCATGGCATTCTATTGCTAGTATGTCTGAGAGGTGTTCTTTAGACCACGGGTTGATGCGGGTTCGAGTAGAGACACAC
CAGCGGGCGATCACCGTTAACGATGGTGGGTAACGGTCGGAATCGGAATAGGAGAAGAATATGAGATCGTCCATGGGACGTTTCAGACGAGACGTCCCAGCTAGCCGCTGCAATATGGTATTTGGACTCATTCTTCGAGACAGTAAGCGTGATTTTTATCAGGCCCGAGGCTGTTAAACACAGCTGCTCTAGGCCGAATG
ATGCACTTCCCCACGAACTCCTGTTAGGATGTACGTAAGAGTTCAAAATGCATTCCAATTCGTTTGACGTGCAAATTAGAACACCGATGGATTGTCCAGCTTTGCACTCTACGTGAGGGGAACTGTGGTAGGTTTTTTAATGAAGCCTGCCCCTAAGTCCTTAGCCTTTGATTCTGTTTTCCCACCTCCGGCATCCAACC
GTGAGCAAGTATAGCGTTCGTCCTCCGGAGATGGACCCACAATTCTCTGCCGCATAGTCGTATGCCAATTAGTTTAGTTAAAGCGTGACTTCGGGGGTATTAAAAGAGTGGGTAGAAGTAGCAATATTATGTTATAAGTGGATGACGCAGGTGATCATGTGCCTGTCCTCCTTCGCGATTTGCATTGATGGTGACTGTTA
CGTTTTAGGCTCGTACTTGTTAAGGTATATGTTCCAAGTTCAGCCCCGCTGGGTCCGTGGATTGATACATTAGTTTGAGACGCCTAAATATTATGGGTTACCTCGTAGCGGATATTAGTGCCACCTTGCGGTACGCGCTGTTCGACACTGGACTAAACCCTGAAGCTTCTGACAGCGTAACAAAAAAGCAGATGAATCAC
CAAACACGATTGTGCTTCGTGACGAACGGGTGTAGCACGTACCGTACCATGTGCCCCTAGTTCTGAAACGTGTCATCTTCTAACGCCTCACTCACCATAATGTTCACCTCGTGCTGTACACGCAGTGGCCTAGCGCTAGGAGCTCAATTGAGTGCGACATGCTCTGGACAGCTGTAGCAGCTATGCATTAGGCACCCGCC
AGATACGTTAAAGGACCCGTGTCGGTTCGGGATTGCTATAGGTTGTCCAGCCATCGGCTCAACCCCTGGGGCGCACGATTTCATGCATGCACATCGGCCAGTACGTTTGGCAAGGGCTCAATAGACAACCAATACTGATCCCGCGCCTACATTTTCGCTATCGCAATTATCAACATAGAATTCCAGACTCGACTTGCCAT
GTTGTAGGAGGACGCTGTGCCTGATGCCGGCAATAATTTCAGCGAGGCACTAGGAGGAAGTCCGGAAAAAGCAGCTTACCCCCAAGCCTTCCCCACATATTACTATGTGTTTATTACGGAAGACTGTGCGCGGCTTACGCATCAAGCACGTTAAACGCGTACCCTAGGTTCCATCCTGTACAGTTGTTCTTATAGTAGCG
GAAACGCGTTCGCGCACGACGGGTTATAAGTGGCCAACAGCTCTATGCTAATGGTGCACCGTTGGGGTCCTCGAAAACCTGTGAACAGGCACTCCTATCCAGACTACCTAAACGTTGAGCCCGAGGTCCGCCCGTGCTAATTAAGCCGGCTATAGGTTTCCGCAAGACGGAGTGCTTGTGCTGGCAAACAGGGGTTTAAG
GCTCGTACCAACAGGGATCTCTCCTATAAAAGGGGAATCACTTCATCGCGATCCCATTGGCCACGGCACGCTAGGCTGTCAGTGATGCAGCATGTGTATTTTTCGTCGCCCGTAACTGACACAAGGCCACATGAGTGGAGGCCCGGTATATGCAATTATCGGCGGTCGGGCTGTCTCGTATATTTCAATAGACAGGGGAC
GCTCGTACCAACAGGGATCTCTCCTATAAAAGGGGAATCACTTCATCGCGATCCCATTGGCCACGGCACGCTAGGCTGTCAGTGATGCAGCATGTGTATTTTTCGTCGCCCGTAACTGACACAAGGCCACATGAGTGGAGGCCCGGTATATGCAATTATCGGCGGTCGGGCTGTCTCGTATATTTCAATAGACAGGGGAC
TTCCCTTATCCGAAGGATTCACACAGACGTGCCAGTATTACGGACGGGGGGGCGTCTCGACCCGTTGGGGAGCGCCTGTAGCTGGGGTGCGCGACACTCAAGTTTGAGTAACGCAGGTTCGTAGCTAGTATGTTCAAAGACGTTCGCTGCACTCTCCGTCCCCATCAAGTTCTAGTTTATTAGTGGATCCGTGTATTAGC
AACTCAACCGCCAACCGAGAAGCCTAAACTTTTGATCTAAAACACCGAATACGGCGCCTGTATTCTGAGGGCAAGCGATGCTTCTGTCTAAATGCGTGTGCCAATCGCATCTCCGTTCTACGGCCACCGATCGCGCAACTTCCAATTAGTTGGAATGGCGTATGGACAGAACGTATGGGTCCGTGGTAAAGAAATTGTTG
ATTTGTCCCGGTTACGTACTCCACTGCCCCGTCGTGGTGATTAGAGTCATCTGACGTCCCCCACTTCCTTATCATCATACACATACAACGAAAACACTCCCACCGGCAAGTAATTCGGCTCCTCTTGTTAAATAGCCTTGGGCCTAGGCATTTACTTCACGTGAGTACAGGTTTCCATCATGAGTAGGAGTAACCCCGTA
CTGCTGTGTTTTTGCTCTAGAAACGCTCAGGGAGAATTTAAGTCGTGATCCTAATTAAAGATAGTTGCACTCGAAGATAGTATAATCACTCGCTGTATTGCGCCTTTAGGCGCGCGTCCCTGTCCAACTCCGAAAGTTAGTCCTTCTCGGGAGTGGTTCCAAAAATAAGCGAACCGTCAAGTTACCTTGGTATATAACCG
ACGCGCTTGAAGGACCCTGTCAATGATGGTGTAAATAAAGTCGGTATAGCTTGCACTTTAAGGCGGTAGCCCTAAGGAAGGTTTAGGACAGTTGTAGGCAGGTGATACGACTTGTTGTTGCTCTCTTAACGGGCTTTGAGGGTAAACCCGCATCGATAAACCGACGAGGATCATCGCGCATAAGGCAGGCATACCTTGTG
TTCGACGAGTAAAGACGGACGCGAGGTTCGCGCCAGTTCGTCCAGATTACCATGCGATGGTTTATCGTTCATCTAAGCAGCTACCCTTGCCGATTATGGAAAGATGTCATAAGTAGTCGCCAAGTGTTTAATGCTGCGAATCGAATAAACTACGCTGATTCCGGTCTAGAGGCTAGGCATCAGGTATGAAGCACTTAGGA
GATGCTTATACTTTAATGCCTTTGTGGTTTCACAGGGTCGACCTGGCGTAACGCTATCGGTTATCGCGGTTTCTTGGTAGCTCAGACGTTCGCTTTCGATGCTTATCCTGTCTCATCCCCATTCGCTCGCGGCCGCTACGCCGCGATAATCCATCCTTAATAGGGGTTCATACCTATTCCTCATTCAGGTTCTGGTAACG
ATACAGAATAATTTACCTGGGATTTGGCCGCGTATAATTTGACGGTTAGATAGGAACTCCCTTGCGCGCTACACGGGTTTACTACCAATTAGCTTAAATAGGATAAGTCCTGGGCCATGAAATGACGTTTAGCTTTCGAGTCTACGTGGGTATCCCTATGCTTGTTCAGTGATACGCGATACCCAGCCCGCCGTCTGACT
GGTCCTATAATGGCATCGTCAACCGAGCGTTCAGACTGATGGCGTCGATCAGTGTTAGATATAACTCGCGGCCATGCCTCCCCTCACAGTCGATTAGAGTACGGAGCGTATAGTCGCGTATTACCCCACCCACCTCACTTATAGAAAATACTGTCCCACGATAGCAAAGAAGGAGGCGTAAGGAGTCGCGCTTTCCTAGG
AACCTTACACCCGTACTCGGAGCGCACTATTCTCCCGCCCTGACCTGCATAGCTAGCCCTACTAATTCTACATAATGTGCAATCCCTAGGCATCGCTTCGGAGACCAATTGCGATATCTTGCGACCGTGTGGTTTCATAGATGATGATCCCACTTCTGGCCCATGATTGTGGGTCTGGGTGGAACGTCCACCTCAAAACC
CACGGTACGGCTCAACTGACCATGCTCGGCTGTTCCTCTATAAATCACGTGAGTGCGGGTCCAGGATGGGCTAAGAACATACATGATTCAACTGGCAACAGAAAGCGATTCTAGGCTTCATCTCATACTCGTGGCATCCTACAGTTGGGCGCCGCGTCAGTGGTGTCCGAGAACACCATTGCTAGACGCACTGAAAGATC
CATAGGTTGAGTTTTTATGTTCTGGGAACCGATAGAACGACTTGACGCATTCTATCCGACCTCAGCATTCGTTCTAATAAGAATGTGACACTTCCGGCGCCTGGATTTCCTTAAAAACACATCATTCCCTCGGATACAATGATCTGGCCAGTGCGGCAAACCACCCCTGGATCGTTCCAGACTACCGCTGCATTCACCTG
TTAATTCCTTGAGCTCATATCTACACCCAAAGTTTCGCAAGCGTCTGGTCCGGAGTAAGAACAGAGTGTCATTAACACTAACTACGTTAGTTCTGGGCAACCATCGGAACTCGCCCGTGCGACTGTGTTGTCCTCCGGCTGACCAGCCACCCTCTACTTTCCTAAACTCTTAGGGACGGTGGTTGAAACCAGCTTATCAT
GAGCCCACGCGGGATGGAGTATAGTGTCGCGTTTAAACACCGCGCCACCTGATTTAACAAAGATAAGGCTCTTTCGTATGGATAATCCTTACTTTTCTAGCGTATTCTTTGTAGCTGACCTGGATCTAGAGAGACTACAAGTGTTATGGTCGTCGGACATCCGGACGACTCACTGGAAGAGAAAGAGTTCTACAGACTAC
TAGTAACGGTGGTACGTCATGCGACGCGCTATGGGACCCTATTGTGAACCTGGCTCTTTCATGCCTGCAAACTATGTGCCCAACGGCACTTACGCTTTAGAGGGTCGCGATATAATGAATTGCTAAATGGGCTAGAAACCGACTGCCAAACCCTGCTGATCGAAGCTAGGTCATACGTTAAAGGGAGTGTGTATCGGAGT
ATTCTAGAGCGTTTCAGTGCTTTTCTCTCGTATTCCTGAAAACTTATCCGGTTGGTAAGTTACCAATTTGTTGGTCCCAAACCGTTGCCTCTATATCCGACCGTGATCGCCTAGCGCGGATTCAACTCTGTTGAAACGCTGATCACCCACATAGCACCTTCTAGCTCGGTGTTTTTGGCTTGACACAACGGTGGTTACCC
TTAACAGGGTTCGCGGCTGTATCCATCGTATCCCGTGAAATAGGATTCAGTCACGCGGTTCAAGGGCTATGTCGATCCGACGGAGTAACATATGTCTAAAGGATCCAACAATGATGATGGACAATCGTGTTGCTCACTAAATCTGTTGGCCGAAGCCTGGGTCTAGCAAAATAGACCGGCAGAACTTCCTAAATGCATTA
ATGTTTCACTTAGAATTAATAATTTTGATTCGCAAAGGGGAGAGATCGTGGTTACTTCACGTACTTTTTATGGAGTCCCTTCAGTACTCCCTACTCCGGTGATGCAGTCTATCCAAGTCATGCGCGATGCCCTATGCCGAGTTCATCCTACCACTCATTTCGTCTAATCCCTTAAATATACACCGGTATTCTCTCTGGTC
ATCAGTGCTACAATACGTGGGGGGCTCGATTACCGGTCGCAGTACTATCGGGGCTGGGTTTTTGATTTACCTGTTCAAATCTAGCGTTTTGGCATTGAGCCATCCCCGGCGATCTTACGAAAACCTGGGTCCCTCGGCCACTCGTCCCGTGGGCACGATCAATAACGATGAAAAACTTCAATCGACATTACTAAAAGGTG
ATATACTAGACTGGGCCCTTGTACCAAGTGGCATGTGGATTAGCGGAGTAATTCGTCAAGTTGAAGTATCCAGTTACCGTTAATTGCCCTCATACGAACTCCTAGTCACCTCCGAGGTAGGGGGTCCAAATTTCCAGCTGCAAACGCGTCCGTCGCGGTGATAGTCTTCTACACCTGGTATTTACTCATAAGAGCCACTA
TCCGCGGTACTGGCCTAGCCGGCCAGTTACGCCTCTCGCCGTCACACGTCAAGCTGTTATAGACCAGAACAGCAGCTTTCGAATAACTAGTAGATCAACACGGGTCATCACAATTACACTGCCAAGTGAGAGCACTCTGAACATATGCCTTCTGTTGCAGATAAGCCGTTATCGTTGAAAACCTTCTGTGGGCTTTGGAG
TTGCAGACTCAAGGACTACCCGCTCATCCAGGCTCGGCCGAGTTCGGACCCCGCTATGTGAGTTCTGGAGTTAAACAAAGGATTGAAGGGTGTCCTTGTTAATAGCAAAGCACCGATTTAGTAAAAGCCATCTTCGCGGACGATCAGACGAGCTAGCGCTATTGTTCGAGAGACCACCAGATCCGCTGAAGTATCAAATC
TGCAGTACACCAAAGAGTACCAGCTACGATGAGGGTCTGCCCAAAAGATCGATGTACACATCAATGCCCAACGGTGTTTCCTTCTGGCTTATACATAGGTATAAATATAGGTCATAGTCAAGTGCAGATGAACACCTGTGTCAAATGGTGAGTTAACGGAGCTTCTACTTGCTAGGCGCCATTTCCAAGAAGCATCGGTG
TTGACTGAGTTGCGCGCTTTCTTCCATGTTTTCTTTGTTACTTCGTCCTCCGTCGTTCACACTCCCAGATGAAACTACCCTATTCAAACTTATATAGGAATCCGGCAGATGTGTAAAAGTGGTCCAAGGTATCTGTGGAACTTGTACTGCCGGGTTTAGTGAAGACTTCGAATCCAGGCAATTAGATACCGTGCGGAATT
ACAAGCAGAAATTCCGTGGAGCCGCAATAATGCTTCTCAACGACTACTTGCCTAACCGCGATGATCAGTTTATCATAATCAACTCGATGTATCCGTTAACATAGGAGGATTCGATATTTGAACGCGGAAGTCTTATCATGCTCACCACCTCGTCCCGGCAGTGTCCTTGAGAGGGGGCCGTCCTCCTACCTTCTCTAGTA
AATAACCAGTAAAGACGTCGTCGGCAGGTAGGGCCCCAGGGCACTCGCTCTCGTGAGTCCACTAGGTTCCGTGAGAAGGTTAGGTTCTGCAAAGCCCTAACAACCCCCGGAATACTTTATGGGCTTGGTTCATAGACGAATTGGACGCCCGCAGGCCTTGCGGGCCTACGCTGCTGTTTTGGGGTCCGAATGCGAGAGTA
TCTGGGGAGACGCCCTATAATCCAAGCGTATCAATGTTGCTACTGGGTAGCCATAAAACGTTGGTAGACGCAAACTAGTTCAACTTACAGTTCATAGAAAATACTGAGACGCTTGTTTTGTTCAGCAGTTGCGCGAATTAGACGCAATCATTCGTTCAAGGTACGGTCCAGGACGCTGCGGGCGAGGTGGAACTAGTGTC
GTAGCACCAGGAATATAAATAATATAGTTTAGCTACCCACCTTAGGTACCAACATCCCTTCCTCTGCGACTCGTTCGCACTCTTTCTAAGACGAAAATCCATAATGGCCAGGGAACATACATGATGACCAGCCTCATTTACGTAGGTCGCTTGGGGAGAAATGAGGGTGGTGCCCGCTCTCTCCGAGTAGCCTCGTTCTC
ATATCATGAAACCTAGCGCTAAGCGGACGAAAAGAAGTTGCCCATCCGACTCGGAATAGGCTCATCACTGCTTCCTAGGCTCCAGCCGACTGACGATTCCCTCAGGAAACCTCCCTCGCCCTTGAGGTAGGATCGTAGACTTTACCCTCACGAAACAGTCGCTCATATAGCATCCATCGGCGCGGCCCTGCAGAGGTACT
GCTGGGGCAGCTCTCGTGAACAGTTATTGAAGCGCGGATGTAAACAGCGATGCTGAGCAAAATACTAACAGGTAGGCCGAGCCTGATCCTGTTAACTTTCGATCACATTCGGGCAACACGTACGTAGCCCTCCATGGCAAAGTCAGTTTTGCAAGCACGCCATTCACTCTGATTCGATTATTACAGTGGCAAGACGAGGC
CGTACGAGGTGAAGGTATAGCCTTTTGTCAGCCCCCTTAACATAGTGGCCTAATGCATGCAAACTTGCAGGCCCGCACCCAAAGAGATCAAAAGTTACAACCCAAAATTGAGTTGGTTATGGATCAGGGCCATAAGGTAGCTATAGGTTACCAACAGCAGGACCTGGGTAAGTCTGCAGCATTTTAACTGATAAGACTAA
AGCCGACTTGCGTTCCCATAGTAGTCTATTATTAATTCCGATAATGGTTGACGAAATGCTGCGCCGGGTGGATTAGCGGAACGCACTCTCAAGCGTTAAGCCTTGTGGCGGTGCTCACGGCACATATTATGCCTGCCATTAAACGTCCTGTAGGCAGAGGATGGGAGATTACACGGATGAGGTGAACATCGGGAGTATGA
TAAGTCCCAGCTTCCCTCTCCTTGTAAAATCAGGGAGCCTCCTTACGCTCCCGTTCTCTTTTTAAAGGATGAATCCCTTGTTTTCGTTGTTTTGTCAGAAACACTCGCGAAGCTGGAATTTCAGGGTAATTCTCGCGCCATACTCAACCCACGGATGCCTTCGCTATCGGAAAGCGTTTCTACTACGCCACGTACGGCCG
GCTAATTTCCCACTATCGTCCCCGCCCCTTGGATGGCTGATCAACTTAGCTCGTCCGGGCAGTGTAATTTTCACGCTGCCAAGGCACACTTATAGTTCTTAAAACCCGGAGAGTGCTACAAACTCCGCTGACACGGCTTTTTACTATAACCTAAGACCGAGGTTCCGACAAGTTCCGTTTCCGGCACCAAGCCCTTCAAA
TGAGTGCAAAGGGATTACGATATGTTAGTGGACTTGGTCACTCACTGTCCATGGCCCAATATACTAATTCAATAAGAAGACGTCACGGCATAAGCGACGCATCTCGTTCCACTCGCGGCATGCCAACACAAGTATCATGAACTCATCAGGCAAAGTAGAGATCTAAACTCGGTGATAGAGAGAAGTCATAATTCCGCAAT
GAGCGTGGAATTGCGCAGGAGGGTTATCTATGTTATCCGGACACCACAGAATTCGGCGCCCTCGCAGACCTTTTCATATTGCTCCCTTCCCTAGATACCTACCCCTCCTACCTCTAGACTATGAAGTGGCGACACCTCAAACGGGTTACTCGGGAACCGCGCCAGCCCCTTGTTGCCCTGTAGGCTAGCTCCGATCATAT
GCGTCAGGACAGACGGCGGGGACACCACAGACGTGCACTAGACGTGAGGCGCGGCCTCCCCAATAATTCTGAGACTATAAGTAGCTCCCTTTTAGAATGGAAGCTGGCCTACCGTATAAATATCGAAGATCGAACGCTCGTACCGTGTGTAACCTACTACGTCTAAGCTGTGAGGACAACTAGTGATTAGCCCAGCGCGT
GTCTTACGGCTTCGACTCGTACCAATTGGCATTCACCCGTATCAGTAGACGCTAGAGCATGGCTTAGGTATGCAAGTGGTACAACAGTCCTTAATGTCGAGTGGGCCTTAAACTGCTCGCACTACATCGGGGGGTCTGTGTAGCGTACACACACGGTCATTGGCAGAGCAGATAAGCATTTGGTTGCCTGCCTGAATTGT
CAGAGGAGTGACGGAGCTGACCCGTATCGTTATAAAGTAGATTCAAACGACGCCTTTTAGAATCCACAAATTGGTAACCTTTGTTGCACCGAATTGAGAGCGCTATCGTCATCAGACTTCTTCTTAAGGATTTTAGCGAACCTGACCCGATGGGGTCCCCAGCGAGAGCAGCGGCGGCTTCGCTGCAGACCACCCACTTA
GGTCCCTGGGCAGCAGTTCGGGGTAGAGGCTCGGTGAGCGGACTTGGCCACCGTAGGTCAGGACTTGGGTCACTCTCACCCGCACACAGGGCTGATGCATCGTCGCTTCTGGCGTAAAGAATACTCGAAAAGGCACGTTACTCCCTGCTCTACTCACTCAAAAGGGATCTAGTGGAGTCGTGAGCAGCCGGGACGGGACA
TCCGGTCAGCCCGAGGGGTGGATGAGGGGTTAGATACGGATAACTACGTTGTGCGGAAAATTAGCGTGATCCCCAGCACCGTTAGTACGTATGTCCGCCTTTTGAAACCAATGTCCTACCAACTGAGCGCCCTGATGGCCATGCCAGCTCTAAGACCCGCAGTAGTTAGGTGCAAAATGAGTCTTCTGCCTACGTGGATG
TAAACCACGCTCATTCCTAGTATCCTTCAAGTACGTGTCAGTGAGACAAAAAACTACTATAAATGACCACGCGCGGCAAAATTCAGGCAGTCGGTAGTAGCCCCACACAATCGCTCCACCATATCACGCCTATCGGTAGGTAAACAGTCTAGCACATGTTATAGTTAGTTAACCTAATTTAAGATGGATACTAGTGGTGC
ACGTTGCTAAGGTACAAGGGGGTTACACACGAGCAAATCTGGATTGGGTCATAACAGTGGGTACTGCATGGAAATTGTACGCACCCCCAGCCAATGGAGGAGGCGCCGGATGAGTCGACGGGGGCGGGCTCATTTACTTCAATATCAATTGACCTCAGTTAGCCCCTTCCCTCCTACCCAGGCGTATCAGCAGGACCGTA
CGGGAAGTATATCACTTGCCTCACGAGTTGAAAAGGATATCTTCCCCCAACGCACTCGGTAAGCGGATGTATTAATCTATCTTCGCTTTTGGGACTTATCCGTAAAATAGCTCGAACGGAGCGTTTGACCTCGTACTCACCATATGCCTAACGTGAATTACAACCTACAGGGCACTTACAACAAGCCTGGCCGATCTCAT
GAGTCAGTCCGGGGAGTTCCATTCACGTTGACTACGCAGATGAAATCATAAGTCAAGCGTGAGTTGTTTCTCCCGCTCCATGCATGTACGTCCGGTGCGTCAACAACTAATGTAGTTCTGTTTTCTCGACGTGTATGTACGGTAATAATTTATAGAAGGGACAGGAGGGTGTTAGCGCCGGCGCGAAACTAGAACAAAAG
TCTGCGTATTTAGGACTTGAGCTCTCATCGCGTTCCGGCCTCATGATGAATATTCGGCCGGCCAACCCGATCGGGCTCTGATACAGCGGCCCGTGAGGTTTGGCTGGGTGAGGTGGCCTAATGATTGACGATGAGGTCATAGCCCTCTGTGGGAATGGTCTCCTCAGCAAACAGGATGCTGTAATCGGAGGACTAGGAGG
TACAACCGTGGCGTCGTATTCATGTTACATACAAGTGGGCCTTAACGCCAGGCTGTAGCACACTCGTTTGACGTGCGTTGCGGTCTAGGATGGCTGAGCTGTCGAGGACCAGAGTCGACGCGGCGTGACTTGATTGCACTCTCAACGATATTCCCAGGCCTCTGGGGCGAAAGCGCATCTGTTCAGGAAAGACACCATAG
CCGGGCGGTGAGCCTGCGCACCAAAGGCTTCGCCCGCGTGGGTGGACCCATCGCAGAGTCCCTGCGTTATAAGCAATGTGTGTAACGTTTCCCATAATTAAGTCAGTGACTGGGTTTACGACCAATGGAACCCTGAAGATAATACAACTGGTGCAAGCATATATACGGGATGTCACGCGAACCTCCTATCGAGTACGGTA
GCAGCCGCCAGAGAAGGATTACAGTGTCGGGTACCTGCGGAGTACATCTCAACTAGTAACTCCGGACATAGCAGTTCACCGAATATCATCCTGAAGTCGGCAGCTTCGAAGCCCTCCGGTGGCTGCAGTGGGGAGGTTCACTGGGGACACGCGACTGGGCAACTCGCATGTCACATGCATCCTCGGACCACGCTCTCCCG
GGCATCCGCTTCTGCGTCCAGAGTTTGTTCTGAAATTGACAGCCAAGGGCTCAGGAGAGTCCGCATACACCGTATTGGTACCGCGAAAGATTATGATGCCAGAAGACGTTCACCAAAAGAATGCTTACTGCTACGGACCACGGGATGAGGCGGGTGACACGTTTCTCGAGAACTGGAACATGCTGGGCCTACATAGTATC
GCCGATTAAGCAGTGAATGCCATCTAACTGTTATTGGCCGACCTTCCTTTGTAATAATTTCATACCAGCGACAAGGGTAGGCGGCCGTCTCTTCGCGCCCAGTAGGTCTAATTCGCGGGCCCAGCACAACAAGAGTAAACGTCGAAATAACACGTAACTGTCGGCACGTGCGCGGCGTGCAGTAGGAGAATCGCTCAGAT
GCCTGGGCAGGCGGATCGATGGCACGCACTCAGTTACACTAATAATTTATCTACCTAACTCGCCGGCATTCATTGTGCCCTTTTAGCGGTACACATGCGAAGGTATTACAAACACAGTACCACTTCGGGAACGGTGTACCTAAAAACGCGGGCCACGGCCCCTTGTATCATAAACTCCACTCTTTGTTTCAGGTCTCCCT
AACGATCTTTCCCTATGAGTCTTACAGCAGACCGGCCTGTCCGTTTAGACCGCATGATAATTTTACGAAAGCGGCGCCAGGAGCACAACTACCCACCGATCGGAGATTGAGTCTTACTGCATGCCGGGTTGCATCTCTGGCAGTCTACTATTGTCGCAGGTCCGTTTCCACGTAATTATATACCATATCGGTTAGAGCCA
CTGTGCTCCAACCCTTGAAGTCCTACTTCCGTGCGACCACTCACGTCTTGGTGAGTACATTGAAGCATAGCCATATCCGTTGGCGGGACGTCCCTCGTCACGAGCGGCGATCGTCGACTCACCTTGACCTCTTGTACTGCTCGCGATCGCACTCCTCGTGCCATTCCAACGGGGTCCTTACCTAGCTAACAAGTAGATTT
TCTATCCCACACGTACTGGTAGCATGTGAGCATAGTCTGATAATAAGAGTCGGGCACTGATTCAGGCCAAGAGGAATCATATTGGTAGGGGAGGTCATCATTTCCTTTCCTGCGCATAAGCCGGCGTACTACTTCCCTTTCCGGGTTCGTTATAGGCACAATAACAGAATGTTCAAAATTGTTGTCAGGCATTTGGATGT
TCTCATTTGCTGGTTGTACAAAAGACCACACAAACCAGTACGATAACACCACCCCCCGTAATCACGCCGCTCGGCGAAGTAGCATCTGTTACATCGGGTCAGGAGTCGATGCACTTCCTGGACGGTTATGATTGCGGATATGGCATGTGAGCTATAACTCTTACGTTCAACTTGGAGGTCTCGTACGCGTGAGTTGCGTG
TGACAGTTATCCTTGGCAACGATCTCTGTCTTGTAATTGGGTGTGGCCAAACTTACAATCACTTACTTACGAACATCCTAGCACGACGTCGGGAAGCATTGGTTGGGCTACGTTCAGGGTTGCCTGACATTTATAGTTGTTAAGACCTGTCGCTAAGTGGTATAGTTGGCGTCGTATCTGAGATCGAAATCGGGCACCGC
CACCAATGTCGACCATGACGGAGCACGTCCGCTGGACCCAACCTTGATTGCAAGCTTTACGCATGGACTCTCCAAGTCCAGAAACGTACAACACGACATCTGAGTCGCAATATATCTAGTAAATAAAATGTCACGCGTACAATATGTTGTCTCGCACGAACCAAGCGTAGCCAAGACCAAGTGCTATCGATTCAATCCTC
AATTATTCAATGATGATTCACACAAGCTCTAGCAGTGATCCAGACGGAGCAACACCGATTAAGTGGGACTAATGCCCGCGGCATTCGCCCACATCGGTAGTGGACTTTCGGGTGCCGGTTATGTACGCCCCTCCGACGTCAACCCGGCAACCACTCGTATTCATGGTGTGGCGATACAACCTTGGGTTTTTCCGATTTAT
GGTCCTCCGTGCGTACACATCTATCGACCGCCCGCTCTAAGTTAAACCACCGCTCGCCAACAGGTTGTAATTATATTCGGTATGGTGTCAGCAAAATAGGTATTACGATTCGCATGATTATCGTCATTGTGGGCCTCACCCAGAGCTCCTAAGTTCTGCAGCAGAATTCCTCCCCAATAGTTGTTCCACACCGGGGAGGT
GACACTTTTGTCGGAATGAATATGCTTGATGATGCCGGGTAGGTTTTAGAACCGACAACATGATTGAGTAAGAAAGTGGACGGCGGGGGCGCGACCCCAGGGCCGCAAAGCATGTGGCACAGCTAGCTAGCTTACCGATAAGTACTTTCCGGGGGCCCTACAATGTAGAAGAGAGGGAGCGAACCCCCTCAACACCCTCC
CGTGACAATGATGTAACCGTCACGGACCCCTACTCTGCACATCGTTGCTCGCGTCTAACTGTATTCGGAGAACAGTATTGACGGAGTGGACCACTGGGTAAAGTCTTGTAGCATGAATGGTCAAGCGCGAGGGAATGAGTGTCCAGCAACCAAGCACCAACTACAGACAGGATTGAGTAACGAGCATGAAAGGAAAAATG
TAGCGTTGAACTGGGCCCGTAACACCCTCTAGCGCAGGTAGTAAACGCGGGAAATATCTGACCAAGATGTCATCGACGGAAGACTTATGAAGTATTGATCGGCAGAGTGGGAATGAGAGAATACAGCACTGTAGGTAAACCAGCAGGTAATTTCCATGACTTGCGTTAACGAACCGCGTGTGAGTAACAAACTGCTACTA
AAGAAGGAATCTAAGGCTTTACTTTCGGTATTCATACGCGATCCCGTCCTCCGGGTTGGTAGGGCAGAGAATGCCGGGATCAAGTGGATAATTATGTTTCTTTGGTCTATACCTCCTACCTTGGCGAGCTATGTGCGCACATTTCCATTCATCAGGGTGATGTAGCCAGCAACCACCGCGTTAAATATTGTGATCCCTGC
AATCAGGCAATAGCATTCCAAAACACATCTCAGGACCAGTTCTCCTCGCGTGCGCAATCTAGTCCGCCCGCGCGGGTTGCGCAGCTCTTCAATGCTGAAAGGCAGTGATCAGACCGACGGAGAGCGAGGGTACTTGGGTGCAGATCGTTATGGATTCCTAGATTAGTCCCGCGCCTAGTGTGGAAGTCCTAGCGATCGAC
GTCCCTGCCCTCGGCGGCCCTGCCGTCCGACTATTTCTCACACAACTTCCAGATACCCGTATTCTCATCGAACGGTATATTTGAAAGTTTCTGTATGCAATCCATTAAATGAGCCCTAAGGGCAATGCCGCCAACTACATACCAGGACAACATTCTATTTTGACTTGTTTTGAATACTCTCGACGCCCCGTATGGAGGTT
GTTCTGTTAGTCACCAGTACCCTTTCCTCAGAGCCTCGAAGAAGTTTATGGGAGGCGTAACTAAATATGCTTCGATAAAGATCGTCAGTGACTTTCCTTACTATTATACGAATTTTGCATCCTAACATTACCGCGACACGACTAATACCGTCTCGTATCGCGAGCGCCAATACCATCTTAAGGAGGCTGTTCAAAAGGAC
CGAAAGGGCCTCAATCCTAACCCAGCATATAATAGGCTATCCATAAACAGAAAACTGTCCCCGACGCACTACAAGTTCGGTTATCAAGATAATGCTGCCACGAGGTCGATGGTCTTAAAGCTCGTTGGGCTCTGTTTGCCGCAGGCTCCTTTTACCAAAGTAATTGTCTAATCTGTCGGCGTACTTGAACGTATTATCTC
CCCTAACACATTCGTGGTAGAGCCTGAAATAATAACCATAATACGACTTTTACATATTATTAATTTGCCACCCGTCAATATTCCTCCTAGGCCGGAACGTTGAAACTCTAAATTAGCAACCCTGCTAAAGGACGTACATTGTAAATCCTCCACACCCCGTTAACAATTGTGACTAACCACCGCTTAGGAGACTTCACCCA
GTCGAAGGTCCAATGAGTAAGATCTGATATGTATAACTCGCATCTCAAGCCGCCAAGTTATCTGCTCGAAAGCGAAAGTCCGACGTAAAAATCAACGCACAATATTTTCTGATAATGTAGTCTATTGTCTCATCGATAGCGCAACATCTTCCACAACTGGGCTCATGAAACAACCATTTGCGCAGTAAATGAGCTAGACG
TAGCCAGCTTTTGCCCGCGCGGTCGGGCGAAGTGATATGGGTCATGTTTGGGCAACCCAGCGGGGTAAACGTGGACCAATGTTACTTATTATGACGCCCTGCTCAAAGGTACCCCCATCCACTGTTGGGTGCTTTGTTGGATTGGATTACCTACAGCTTATTATAGCTCTAGCTGAACGGACACGTAAAACACCTTGGTA
TAACACCGAAGGGGCTCATCGAACATCAGGGGGGAAACGCCATCTTCTGGATAATTGCGTATCGGTACACCGACGTCCCATCGCCATCCAGCCGCAGACCTCAATTGTGAGACGAGTGAGCATAATTTGTACGAGGCGCCTTCCGACTGGTGCTTGTACCGTCAACTTTTGAAAGTTACAGTTGCTTTAATCCCGCGTAT
AGCCCCCTCGGCGCTTCGAAGAAATAGGTGTGGCTGTCCTGTACACCTGGGTTTAACCGGGACGACGAATACTGCCGTTACGAGTTCGATATGGAAGTAGCATCTCAGGAAAGTAAAGACGTATCAGTATGGTGATCGTGCACCTACCACAATCCCTGACCCCGACTTTTAGCAGGGTAAGCGAGAAGCTACACAACGAA
GCACTACAAAAGTGCAACAGACCGCATCTAGCATGCGCAAGTTACTCTGCTCACGTATATTCGCCAGAAAAGGGTGTTGAGGATTGCCACTATTGATAGGCTGCGTTAGGAGAAGCGTTTTCCATGCCTGAGTCGCATGGCTGTCCTCCGCAGTTGAAAAACACCGTCATCAATCATCCTCGTGACGTAGCTGGGGGTTT
AAGAGTGAATGCTGCTGGCCCGTCAACGAATATTCCTTCCGTCTGGTTGCCGTTGCGTATAGTGGGGTCACGACTCCGAATATGATGAGTGAGCAACAGTCATATCGTCAAGTATCGCCCCACCTATGTTACCCAGCGGTATACCGACTGCGTTTTCGATATGTCATATAAATTATTCAGTGAGCTAACCCTCAATACGG
GGCCCGTTTGTATATACAACGTATGGCAACTCTACTGGCAATCGAATGGTTACACTATTAGTTCAGTACTGTCTTCCCGATGGTCATACAAGAACGTGCCTTCTAGCGGATGATTGACATTACGCTTACGGACTTTCTCTCCCGATCGCGGGCTAGTCGGGCCATGGCTTATTTGGGCGGATTTCTTTCCAACAGTACTA
GTGGTCTTTCCTAGTTCAAGGAGTACCAAACCGAAGGGCTGTCATACAGGAGATGTAATTTACTTTTACGAAAACCTCAGCACGAGCGATGACCCTCATTGACTTAATAACCTCCACTGAGGTGATGGTTCTGGGTCCTGACGTTTACCAAACAACTCCTGAACTTAAGATACTTGAACTGTTACTTAAAATTTGTCCTA
TCAATGTACCCGGGCAAACAACTATGTGAACAAGTATCCCCACCGCTGTCGCCACAAACGAGTGTACTGCTGGCAATTCCGCTGTCGTTATAATAGCTCGTGAGCCATAATCGCTCAGTGCTCCTTACAGTTAGTTTGCGTCTACTGGTTGGAGGGCTTCCGTTGCTATCATCTGCACTTACCAGGCGGTACATTGCTGC
TGTTTAGCGAGCCTCACCGCGTGCCTGCGAATGCCCTAGAGAACAACCCCGCATCTCGCCGGCCTGTTGCACGCACATCTCACTCCTGTCCAATCGGGGTAATCCTTGCGGGCTCAGAGCACCACTCTGGTAAGATTTAAGCCGTTACTGGAGGAGAGTTAACTTGCGTCTAGGGTTAACGCCCGGTCCGGTCATCCATA
GGGCCCTTGGCCAAAACGAAAGTGAGATAGGGAAAATCCAGCACTGTACTAGACTTCAACGCTTTGTAGACCAGTCTTAGTCGCCAAATTTACGGAGGAATTGACCAGGGTTAAATGTAGTTGTGTGGACCTAGCCCAGATGAGGGGTCGGAAGCGTAGTACCGCATCGCAATCTTTGGCGGTTCAGATACTCCGTAACA
GGTCGGCTGAGAAGTAGGCCTCGCGGTTGTACGGCTAGATCGGGTCGTAGCCCGCCACGCTCGTGCCTAGCGCTGCGATGGACCACAGTGAGCGTAATCAATCGAGGGTGAAACAAGCGGTCTTAATCCACAGATATTGCACCTCTCTGGAGACCTCTATGTGTTAGAACGTGTGTACTTAGGAAAACGAAAAAACATAA
CCTATGGTATTACACGCTAAATCGAGCAAGGGACACCGAAGTGAGACTGGGTACAGTCTTAGGGTAGAGGTAAGCAGATGGAATCCGCTTCCAGGCGCACACCGACCTCAGAGTCCGACGAAAAGCGTGGTTGGAATAGGTCGATGTGGGATCTACGATGGGGTAGGAACTGGACCGCCAAAAACGTGATGCACGTCGTA
TCACGGATTAGCTTATGATATGTGGCCACCAAGGTAGGATCATGATGCTGAGAAGGGAGGGAGCCGATAAAAATTCCCTGGGCCGATTAGGGCTAGCTCCTCGTGGCGTGTAAATATGTACATAGGCAAGCCCCCGGTATGGGCGAGGCTACGGGTTTAGTTTGGCGAAGCCTATTGTGACCGTTCCTATGATGCAGACC
GGGCGTTAGGGAGTTCGGTGGAAAGGGGGTTTAACACTGCTGCACAGGTGTGGCCGACCTCATGATGATATCGTATCCGCAACGATTAGGATCATGCTGCGAACGAGCCACAAAGGTTTTTAAAGTAAGTTGGAGTAGTGTGGTCTAATACCATACACGGGGGTCGTTCAAGCACCGGTGGGATACCGATTTCTAGATAG
TTTAAGAATTTCTCGGCGGATCGTGGCAACAGTGATACTGCGTCACAGCGATTAACACACATGACACTTACAGCGTCCAAATGTCACCCGGAGTTCGTAAACCTTGGAGAGCGGTTGTCTGAAGGGGTCAAAACGTCAAACCCAATGTTCCGTATGATAAGGACGGAGCGAGACCCAGGGATCCTGTCCTTCCAGAAATA
GCAGGTTAATATCTATATTTAGCATTCCCGATCCTATATCTGGACGGCAGCGTCGACTCATCTAGCCATATCCGTGTCATAGAGATTGCCTTGTTGTTCTCCTTGCTAGGGGAAAGTGTCGAACTTCACGGCCTGGATTACATCCGAAGTGTGGAGATAAATATCGAGTTCTGCTGACTCTCAAATGAAACAACTTAACT



 
Not sure if this is any better but looks to me to work:
Code:
#!/usr/bin/perl -w
use strict;

my @short = `cat short_sequence.txt`;
my @long  = `cat long_sequence.txt`;
chomp @short;

for(my $i=0; $i<@short; $i++) {
  substr($long[$i], int(rand(193)), 8) = $short[$i];
  print $long[$i];
}

Try it and let me know.


Trojan.
 
Result of my test run is:
GAATCATATATTAGTCTCCACATACTCCGTTCGTGACCCATTACCCTTTCGGGAGAGCCACAGCAACTGTAGATCTCGAAGTTGACAGGGGCAACTAGAGGCCTCAGAATTCTCACTCTTGAGGAGAGAAGTCTAAGACCTACAGTATGGTCGGGTTAGTTTTTGTTCCGCAAGACAATGGACTAACCACTGTCTGGATA
AGGATTACCCGCTGGACTTCAAACGCTCGTGACAGTTCATATTGCGAGGCAACCGAGTCATAGCCCAGTCCGGGGGCCAACGCAGTGCCAGCATCTGCGTTGTTCATCGGTCCTCAGTCTCCCATCAACGTGGTCCACACCTAGCATCCTGGTTTTGCATCCGTAACAAAGGACGTTCGAAGTTTTTTGCCGGCGGGAAG
TGATAATTGGTGCAATATTCTCCATAACAGATCCTCGCCAATACGGATTTGAGGGATCCCTCTGCATTTCCACGAAGCGTGTCACCCGAATTCCAGAAATGCTTTACCGCCGCAGTGATTAGGCGGGTACAGTTGTCCAAACGCACACAACCGAAACCTCCCCATGCGTACTCGTTCGTTTAGTCGCGTACAGAGGGAAC
TCTAGCTTTCGTGAAGTCCCAGGCTCCTAGCCCTACAGCGACTTCTGGCCGGATCTGAGGGAAACTTGTCTAAACTTTTTTCCCCGCGAGACTCACTGTTATAAGAGCTGGAGTTTAGATACTAATGGCATTCTATTGCTAGTATGTCTGAGAGGTGTTCTTTAGACCACGGGTTGATGCGGGTTCGAGTAGAGACACAC
CAGCGGGCGATCACCGTTGTTTCACCTGGGTAACGGTCGGAATCGGAATAGGAGAAGAATATGAGATCGTCCATGGGACGTTTCAGACGAGACGTCCCAGCTAGCCGCTGCAATATGGTATTTGGACTCATTCTTCGAGACAGTAAGCGTGATTTTTATCAGGCCCGAGGCTGTTAAACACAGCTGCTCTAGGCCGAATG
ATGCACTTCCCCACGAACTCCTGTTAGGATGTACGTAAGAGTTCAAAATGCATTCCAATTCGTTTGACGTGCAAATTAGAACACCGATGGATTGTTCATCTCAGCACTCTACGTGAGGGGAACTGTGGTAGGTTTTTTAATGAAGCCTGCCCCTAAGTCCTTAGCCTTTGATTCTGTTTTCCCACCTCCGGCATCCAACC
GTGAGCAAGTATAGCGTTCGTCCTCCGGAGATGGACCCACAATTCTCTGCCGCATAGTCGTCCGCCGATTAGTTTAGTTAAAGCGTGACTTCGGGGGTATTAAAAGAGTGGGTAGAAGTAGCAATATTATGTTATAAGTGGATGACGCAGGTGATCATGTGCCTGTCCTCCTTCGCGATTTGCATTGATGGTGACTGTTA
CGTTTTAGGCTCGTACTTGTTAAGGTATATGTTCCAAGTTCAGCCCCGCTGGGTCCGTGGATTGATACATTAGTTTGAGACGCCTAAATATTATGGGTATTGTCATAGCGGATATTAGTGCCACCTTGCGGTACGCGCTGTTCGACACTGGACTAAACCCTGAAGCTTCTGACAGCGTAACAAAAAAGCAGATGAATCAC
CAAACACGATTGTGCTTCGTGACGAACGGGTGACGTAACGACCGTACCATGTGCCCCTAGTTCTGAAACGTGTCATCTTCTAACGCCTCACTCACCATAATGTTCACCTCGTGCTGTACACGCAGTGGCCTAGCGCTAGGAGCTCAATTGAGTGCGACATGCTCTGGACAGCTGTAGCAGCTATGCATTAGGCACCCGCC
AGATACGTTAAAGGACCCGTGTCGGTTCGGGATTGCTATAGGTTGTCCAGCCATCGGCTCAACCCCTGGGGCGCACGATTTCATGCATGCACATCGGCCAGTACGTTGAGGCTTGGCTCAATAGACAACCAATACTGATCCCGCGCCTACATTTTCGCTATCGCAATTATCAACATAGAATTCCAGACTCGACTTGCCAT
GTTGTAGGAGGACGCTGTGCCTGATGCCGGCAATAATTTCAGCGAGGCACAAATTACAAGTCCGGAAAAAGCAGCTTACCCCCAAGCCTTCCCCACATATTACTATGTGTTTATTACGGAAGACTGTGCGCGGCTTACGCATCAAGCACGTTAAACGCGTACCCTAGGTTCCATCCTGTACAGTTGTTCTTATAGTAGCG
GAAACGCGTTCGACGGGGACGGGTTATAAGTGGCCAACAGCTCTATGCTAATGGTGCACCGTTGGGGTCCTCGAAAACCTGTGAACAGGCACTCCTATCCAGACTACCTAAACGTTGAGCCCGAGGTCCGCCCGTGCTAATTAAGCCGGCTATAGGTTTCCGCAAGACGGAGTGCTTGTGCTGGCAAACAGGGGTTTAAG
GCTCGTACCAACAGGGATCTCTCCTATAAAAGGGGAATCACTTCATCGCGATCCCATTGGCCACGGCACGCTAGGCTGTCAGTGATGCAGCATGTGTATTTTTCGTCGCCCGTAACTGACACAAGGCCACATGAGTGGAGGGTCAGGTTATGCAATTATCGGCGGTCGGGCTGTCTCGTATATTTCAATAGACAGGGGAC
GCTCGTACCAACAGGGATCTCTCCTATAAAAGGGGAATCACTTCATCGCGATCCCATTGGCCACGGCACGCTAGGCTGTCAGTGATATAAGGCATGTATTTTTCGTCGCCCGTAACTGACACAAGGCCACATGAGTGGAGGCCCGGTATATGCAATTATCGGCGGTCGGGCTGTCTCGTATATTTCAATAGACAGGGGAC
TTCCCTTATCCGAAGGATTCACACAGACGTGCCAGTATTACGGACGGGGGGGCGTCTCGACCCGTTGGGGAGCGCCTGTAGCTGGGGTGCGCGACACTCAAGTTTGAGTAACGCAGGTTCGTAGCTAGTATGTTCAAAGACGTTCGCTGCACTCTCCGTCACGGGCTGGTTCTAGTTTATTAGTGGATCCGTGTATTAGC
AACTCAACCGCCAACCGAGAAGCCTAAACTTTTGATCTAACTCCTCGTATACGGCGCCTGTATTCTGAGGGCAAGCGATGCTTCTGTCTAAATGCGTGTGCCAATCGCATCTCCGTTCTACGGCCACCGATCGCGCAACTTCCAATTAGTTGGAATGGCGTATGGACAGAACGTATGGGTCCGTGGTAAAGAAATTGTTG
ATTTGTCCCGGTTACGTACTCCACTGCCCCGTCGTGGTGTAATCATTCATCTGACGTCCCCCACTTCCTTATCATCATACACATACAACGAAAACACTCCCACCGGCAAGTAATTCGGCTCCTCTTGTTAAATAGCCTTGGGCCTAGGCATTTACTTCACGTGAGTACAGGTTTCCATCATGAGTAGGAGTAACCCCGTA
CTGCTGTGTTTTTGCTCTAGAAACGCTCAGGGAGAATTTAAGTCGTGATCCTAATTAAAGATAGTTGCGACAAGAAATAGTATAATCACTCGCTGTATTGCGCCTTTAGGCGCGCGTCCCTGTCCAACTCCGAAAGTTAGTCCTTCTCGGGAGTGGTTCCAAAAATAAGCGAACCGTCAAGTTACCTTGGTATATAACCG
ACGCGCTTGAAGGACCCTGTCAATGATGGTGTAAATAAAGTCGGTATAGCTTGCACTTTAAGGCGGTAGCCCTAAGGAAGGTTTAGGACAGTTGTAGGCAGGTGATACGACTTGTTGTTGCTCTCTTAACGGGCTTTGAGGGTAAACCCGCATCGATAAACCGACGAGGATCATCGCAACCGTGACAGGCATACCTTGTG
TTCGACGAGTAAAGACGGACGCGAGGTGTCCACGGGTTCGTCCAGATTACCATGCGATGGTTTATCGTTCATCTAAGCAGCTACCCTTGCCGATTATGGAAAGATGTCATAAGTAGTCGCCAAGTGTTTAATGCTGCGAATCGAATAAACTACGCTGATTCCGGTCTAGAGGCTAGGCATCAGGTATGAAGCACTTAGGA
GATGCTTATACTTTAATGCCTTTGTGGTTTCACAGGGTCGACCTGGCGTAACGCTATCGGTTATCGCGGTTTCTTGGTAGCTCAGACGTTCGCTTTCGATGCTTATCCTGTCTCATAAGCCGGGGCTCGCGGCCGCTACGCCGCGATAATCCATCCTTAATAGGGGTTCATACCTATTCCTCATTCAGGTTCTGGTAACG
ATACAGAATAATTTACCTGGGATTTGGCCGCGTATAATTTGACGGTTAGATAGGAACTCCCTTGCGCGCTACACGGGTTTACTACCAATTAGCTTAAATAGGATAAGTCCTGGGCCATGAAATGACGTTTAGCTTTCGAGTCTACGTGCTTGTCCTTATGCTTGTTCAGTGATACGCGATACCCAGCCCGCCGTCTGACT
GGTCCTATAATGGCATCGTCAACCGAGCGTTCAGACTGATGGCGTCGATCAGTGTTAGATATAACTCGCGGCCATGCCTCCCCGTTCTTACGATTAGAGTACGGAGCGTATAGTCGCGTATTACCCCACCCACCTCACTTATAGAAAATACTGTCCCACGATAGCAAAGAAGGAGGCGTAAGGAGTCGCGCTTTCCTAGG
AACCTTACACCCGTACTCGGAGCGCACTATTCTCCCGCCCTGACCTGCATAGCTAGCCCTACTAATTCTACATAATGTGCAATCCCTAGGCATCGCTTCGGAGACCAAATGCCGGATCTTGCGACCGTGTGGTTTCATAGATGATGATCCCACTTCTGGCCCATGATTGTGGGTCTGGGTGGAACGTCCACCTCAAAACC
CACGGTACGGCTCAACTGACCATGCTCGGCTGTTCCTCTATAAATCACGTGAGTGCGGGTCCAGGATGGGCTAAGAACATACATGATTAGCAGGACAACAGAAAGCGATTCTAGGCTTCATCTCATACTCGTGGCATCCTACAGTTGGGCGCCGCGTCAGTGGTGTCCGAGAACACCATTGCTAGACGCACTGAAAGATC
CATAGGTTGAGTTTTTATGTTCTGGGAACCGATAGAACGACTTGAGTCTTGAGATCCGACCTCAGCATTCGTTCTAATAAGAATGTGACACTTCCGGCGCCTGGATTTCCTTAAAAACACATCATTCCCTCGGATACAATGATCTGGCCAGTGCGGCAAACCACCCCTGGATCGTTCCAGACTACCGCTGCATTCACCTG
TTAATTCCTTGAGCTCATATCTACACCCAAAGTTTCGCAAGCGTCTGGTCCGGAGTAAGAACAGAGTGTCATTAACACTAACTACGTTAGTTCTGGGCAACCATCGGAACTCGCCCGTGCGACTGTGTTGTCCTCCGGCTGACCAGCCACCCTCTACTTTCCTAAACTAGATCTGACGGTGGTTGAAACCAGCTTATCAT
GAGCCCACGCGGGATGGAGTATAGTGTCGCGTTTAAACACCGCGCCACCTGATTTAACAAAGATAAGGCTCTTTCGTATGGATAATCCTTACTTTTCTAGCGTATTCTTTGTAGCTGACCTGGATATTTGATGACTACAAGTGTTATGGTCGTCGGACATCCGGACGACTCACTGGAAGAGAAAGAGTTCTACAGACTAC
TTGTTGGCGTGGTACGTCATGCGACGCGCTATGGGACCCTATTGTGAACCTGGCTCTTTCATGCCTGCAAACTATGTGCCCAACGGCACTTACGCTTTAGAGGGTCGCGATATAATGAATTGCTAAATGGGCTAGAAACCGACTGCCAAACCCTGCTGATCGAAGCTAGGTCATACGTTAAAGGGAGTGTGTATCGGAGT
ATTCTAGAGCGTTTCAGTGCTTTTCTCTCGTATTCCTGAAAACTTATCCGGTTGGTAAGTTACCAATTTGTTGGTCCCAAACCGTTGCCTCTATATCCGACCGTGATCGCCTAGCGCGGATTCAACTCTGTTGAAACGCTGATCACCCACACACTGGTTTCTAGCTCGGTGTTTTTGGCTTGACACAACGGTGGTTACCC
TTAACAGGGTTCGCGGCTGTATCCATCGTATCCCGTGAAATAGGATTCAGTCACGCGGTTCAAGGGCTATGTCGATAGTGAAGGGTAACATATGTCTAAAGGATCCAACAATGATGATGGACAATCGTGTTGCTCACTAAATCTGTTGGCCGAAGCCTGGGTCTAGCAAAATAGACCGGCAGAACTTCCTAAATGCATTA
ATGTTTCACTTAGAATTAATAATTTTGATTCGCAAAGGGGAGAGATCGTGGTTACTTCACGTACTTTTTATGGAGTCCCTTCAGTACTCCCTACTCCGGTGATGCAGTCTATCCAAGTCATGCGCGATGCCCTATGCCGAGTCTATGATCCCACTCATTTCGTCTAATCCCTTAAATATACACCGGTATTCTCTCTGGTC
ATCAGTGCTACAATACGTGGGGGGCTCGATTACCGGTCGCAGTACTATCGGGGCTGCGATCTAGATTTACCTGTTCAAATCTAGCGTTTTGGCATTGAGCCATCCCCGGCGATCTTACGAAAACCTGGGTCCCTCGGCCACTCGTCCCGTGGGCACGATCAATAACGATGAAAAACTTCAATCGACATTACTAAAAGGTG
ATATACTAGACTGGGCCCTTGTACCAAGTGGCATGTGGATTAGCGGAGTAATTCGTCAAGTTGAAGTATCCAGTTACCGTTAATTGCCCTCATACGAACTCCTAGTCACAATCCAATTAGGGGGTCCAAATTTCCAGCTGCAAACGCGTCCGTCGCGGTGATAGTCTTCTACACCTGGTATTTACTCATAAGAGCCACTA
TCCGCGGTACTGGCCTAGCCGGCCAGTTACGCCTCTCGCCGTCACACGTCAAGCTGTTATAGACCAGAACAGCAGCTTTCGAATAACTAGTAGATCAACACGGGTCATCACAATTACACTGCCAAGTGAGAGTGATCGCAACATATGCCTTCTGTTGCAGATAAGCCGTTATCGTTGAAAACCTTCTGTGGGCTTTGGAG
TTGCAGACTCAAGGACTACCCGCTCATCCAGGCTCGGCCGAGTTCGGACCCCGCTATGTGAGTTCTGGAGTTAAACAAAGGATTGAAGGGTGTCCTTGTTAATAGCAAAGCACCGATTTAGTAAAAGCCATCTTCGCGGACGATCAGACGAGCTAGCGCTATTGTTCGAGAGACCACCAGATTTGAGAAAGTATCAAATC
TGCAGTACACCAAAGAGTACCAGCTACGATGAGGGTCTGCCCAAAAGATCGATGTACACATCAATGCCCAACGGTGTTTCCTTCTGGCTTATACATAGGTGGTATCTTAGGTCATAGTCAAGTGCAGATGAACACCTGTGTCAAATGGTGAGTTAACGGAGCTTCTACTTGCTAGGCGCCATTTCCAAGAAGCATCGGTG
TTGACTGAGTTGCGCGCTTTCTTCTCTGGACGCTTTGTTACTTCGTCCTCCGTCGTTCACACTCCCAGATGAAACTACCCTATTCAAACTTATATAGGAATCCGGCAGATGTGTAAAAGTGGTCCAAGGTATCTGTGGAACTTGTACTGCCGGGTTTAGTGAAGACTTCGAATCCAGGCAATTAGATACCGTGCGGAATT
ACAAGCAGAAATTCCGTGGAGCCGCAACCCTGATTCTCAACGACTACTTGCCTAACCGCGATGATCAGTTTATCATAATCAACTCGATGTATCCGTTAACATAGGAGGATTCGATATTTGAACGCGGAAGTCTTATCATGCTCACCACCTCGTCCCGGCAGTGTCCTTGAGAGGGGGCCGTCCTCCTACCTTCTCTAGTA
AATAACCAGTAAAGATTCCACCTGCAGGTAGGGCCCCAGGGCACTCGCTCTCGTGAGTCCACTAGGTTCCGTGAGAAGGTTAGGTTCTGCAAAGCCCTAACAACCCCCGGAATACTTTATGGGCTTGGTTCATAGACGAATTGGACGCCCGCAGGCCTTGCGGGCCTACGCTGCTGTTTTGGGGTCCGAATGCGAGAGTA
TCTGGGGAGACGCCCTATAATCCAAGCGTATCAATGTTGCTACTGGGTAGCCATAAAACGTTGGTAGACGCAAACTAGTTCAACTTACAGTTCATAGAAAACGCTCCTGCGCTTGTTTTGTTCAGCAGTTGCGCGAATTAGACGCAATCATTCGTTCAAGGTACGGTCCAGGACGCTGCGGGCGAGGTGGAACTAGTGTC
GTAGCACCAGGAATATAAATAATATAGTTTAGCTACCCACCTTAGGTAGACTAGGGCCTTCCTCTGCGACTCGTTCGCACTCTTTCTAAGACGAAAATCCATAATGGCCAGGGAACATACATGATGACCAGCCTCATTTACGTAGGTCGCTTGGGGAGAAATGAGGGTGGTGCCCGCTCTCTCCGAGTAGCCTCGTTCTC
ATATCATGAAACCTAGCGCTAAGCGGACGAAAAGAAGTTGCCCATCCGACTCGGAATAGGCTCCCAAGCATTTCCTAGGCTCCAGCCGACTGACGATTCCCTCAGGAAACCTCCCTCGCCCTTGAGGTAGGATCGTAGACTTTACCCTCACGAAACAGTCGCTCATATAGCATCCATCGGCGCGGCCCTGCAGAGGTACT
GCTGGGGCAGCTCTCGTGAACAGTTATTGAAGCGCGGATGTAAACAGCGATGCTGAGCAAAATACTAACAGGTAGGCCGAGCCTGATCCTGTTAACTTTCGATCACATTCGGGCAACACGTACGTAGCCCTCCATGGCAAAGTCAGTTTTGCAACAACGTTATTCACTCTGATTCGATTATTACAGTGGCAAGACGAGGC
CGTACGAGGTGAAGGTATAGCCTTTTGTCAGCCCCCTTAACATAGTGGCCTAATGCATGCAAACTTGCAGGCCCGCACCCAAAGAGATCAAAAGTTACAACCCAAAATTGAGTTGGTTATGGATCTCCTTAAAAAGGTAGCTATAGGTTACCAACAGCAGGACCTGGGTAAGTCTGCAGCATTTTAACTGATAAGACTAA
AGCCGACTTGCGTTCCCATAGTAGTCTATTATTAATTCCGATAATGGTTGACGAAATGCTGCGCCGGGTGGATTAGCGGAACGCACTCTCAAGCGTTAAGCCTTGTGGCGGTGCTCACGGCACATTCCCGCATTGCCATTAAACGTCCTGTAGGCAGAGGATGGGAGATTACACGGATGAGGTGAACATCGGGAGTATGA
TAAGTCCCAGCTTCCCTCTCCTTGTAAAATCAGGGAGCCTCCTTACGCTCCCGTTCTCTTTTTAAAGGATGAATCCCTTGTTTTCGTTGTTTTGTCAGAAACACTCGCGAAGCTGGAATTTCAGTCGCCATTTCGCGCCATACTCAACCCACGGATGCCTTCGCTATCGGAAAGCGTTTCTACTACGCCACGTACGGCCG
GCTAATTTCCCACTATCGTCCCCGCCCCTTGGATGGCTGATCAACTTAGCTCGTCCGGGCAGTGTAATTTTCACGCTGCCAAGGCACACTTATAGTTCTTAAAACCCGGAGAGTGCTACAAACTCCGCTGACACGAAGCTCTCCTATAACCTAAGACCGAGGTTCCGACAAGTTCCGTTTCCGGCACCAAGCCCTTCAAA
TGAGTGCAAAGGGATTACGATATGTTAGTGGACTTGGTCACTCACTGTCCATGGCCCAATATACTAATTCAATAAGAAGACGTCACGGCATAAGCGACGCATCTCGTTCCACTCGCGGCATGCCAACACAAGTATCATGAACTCATCAGGCAAAGTAGAGATGTACCGTTGGTGATAGAGAGAAGTCATAATTCCGCAAT
GAGCGTGGAATTGCGCAGGAGGGTTATCTATGTTATCCGGACACCACAGAATTCGGCGCCCTCGCAGCCAATCCTATATTGCTCCCTTCCCTAGATACCTACCCCTCCTACCTCTAGACTATGAAGTGGCGACACCTCAAACGGGTTACTCGGGAACCGCGCCAGCCCCTTGTTGCCCTGTAGGCTAGCTCCGATCATAT
GCGTCAGGACAGACGGCGGGGACACCACAGACGTGCACTAGACGTGAGGCGCGGCCTCCCCAATAATTCTGAGACTATAAGTAGCTCCCTTTTAGAATGGAAGCTGGCCTACCGTATAAATATCGAAGATCGAACGCTCGTACCGTGTGTAACCTACTACGTCTAAGCTGTGCAATCGTCTAGTGATTAGCCCAGCGCGT
GTCTTACGGCTTCGACTCGTACCAATTGGCATTCACCCGTATAGTCGCTAGCTAGAGCATGGCTTAGGTATGCAAGTGGTACAACAGTCCTTAATGTCGAGTGGGCCTTAAACTGCTCGCACTACATCGGGGGGTCTGTGTAGCGTACACACACGGTCATTGGCAGAGCAGATAAGCATTTGGTTGCCTGCCTGAATTGT
CAGAGGAGTGACGGAGCTGACCCGTATCGTTATAAAGTAGATTCAAACGACGCCTTTTAGAATCCACAAATTGGTAACCTTACACGAGCCGAATTGAGAGCGCTATCGTCATCAGACTTCTTCTTAAGGATTTTAGCGAACCTGACCCGATGGGGTCCCCAGCGAGAGCAGCGGCGGCTTCGCTGCAGACCACCCACTTA
GGTCCCTGGGCAGCAGTTCGGGGTAGAGGCTCGGTGAGCGGACTTGGCCACCGTAGGTCAGGACTTGGGTCACTCTCACCCGCACACAGGGCTGATGCATCGTCGCTTCTGGCGTAAAGAATACTCGAAAAGGCACGTTACTCCCTGCTCTACTCACTCAAAAGGGATTTCGGTATGTCGTGAGCAGCCGGGACGGGACA
TCCGGTCAGCCCGAGGGGTGGATGAGGGGTTAGATACGGATAACTACGTTGTGCGGAAAATTAGCGTGATCCCCAGCACCGTTAGTACGTATGTCCGCCTTTTGAAACCAATGTCCTACCAACTGAGCGCCCTGATGGCCATGCCAGCTCTAAGACCCGCAGTAGTTAGGTGCAAAATACCAATCGTGCCTACGTGGATG
TAAACCACGCTCATTCCTAGTATCCTTCAAGTACGTGTCAGTGAGACAAAAAACTACTATAAATGACCACGCGCGGCAAAATTCAGGCAGTCGGTAGTAGCCCCACACAATCGCTCCACCATATCATGGTGATAGGTAGGTAAACAGTCTAGCACATGTTATAGTTAGTTAACCTAATTTAAGATGGATACTAGTGGTGC
ACGTTGCTAAGGTACAAGGGGGACCTATTGGAGCAAATCTGGATTGGGTCATAACAGTGGGTACTGCATGGAAATTGTACGCACCCCCAGCCAATGGAGGAGGCGCCGGATGAGTCGACGGGGGCGGGCTCATTTACTTCAATATCAATTGACCTCAGTTAGCCCCTTCCCTCCTACCCAGGCGTATCAGCAGGACCGTA
CGGGAAGTATATCACTTGCCTCACGAGTTGAAAAGGATATCTTCCCCCAACGCACTCGGTAAGCGGATGTATTAATCTATCTTCGCTTTTGGGACTTATCCGTAAAATAGCTCGAACTTCCGTAATGACCTCGTACTCACCATATGCCTAACGTGAATTACAACCTACAGGGCACTTACAACAAGCCTGGCCGATCTCAT
GAGTCAGTCCGGGGAGTTCCATTCACGTTGACTACGCAGATGAAATCATAAGTCAAGCGTGAGTTGTTTCTCCCGCTCCATGCATGTACGTCCGGTGCGTCAACAACTAATGTACGGCATCTTTCTCGACGTGTATGTACGGTAATAATTTATAGAAGGGACAGGAGGGTGTTAGCGCCGGCGCGAAACTAGAACAAAAG
TCTGCGTATTTAGGACTTGAGCTCTCATCGCGTTCCGGCCTCATGATGAATATTCGGCCGGCCAACCCGATCGGGCTCTGATACAGCGGCCCGTGAGGTTTGGCTGGGTGAGGTGGCCTAATGATTGACGATGAGGTCATAGCCCTCTGTGGGAATGGTCTCCTCAGCAAACAGGATGCTGTAATCAAACATGAAGGAGG
TACAACCGTGGCGTCGTATTCATGTTACATACAAGTGGGCCTTAACGCCAGGCTGTAGCACACTCGTTTGACGTGCGTTGCGGTCTAGGATGGCTGAGCTGTCGAGGACCAGAGTCGACGCGGCGTGACTTGATTGCACTCTCAACGATATTCCCAGGCCTCTGGGGCGAAAGCGCATCTGTTCAGGCTTATAATCATAG
CCGGGCGGTGAGCCTGCGCACCAAAGGCTTCGCCCGCGTGGGTGGACCCATCGCAGAGTCCCTGCGTTATAAGCAATGTGTGTAACGTTTCCCATAATTAAGTCAGTGACTGGGTTTACGACCAATGCGAAGTGTAAGATAATACAACTGGTGCAAGCATATATACGGGATGTCACGCGAACCTCCTATCGAGTACGGTA
GCAGCCGCCAGAGAAGGATTACAGTGTCGGGTACCTGCGGAGTACATCTCAACTAGTAACTCCGGACATAGCAGTTCACCGAATATCATCCTGAAGTCGGCAGCTTCGAAGCCCCCCCATGGCTGCAGTGGGGAGGTTCACTGGGGACACGCGACTGGGCAACTCGCATGTCACATGCATCCTCGGACCACGCTCTCCCG
GGCATCCGCTTCTGCGTCCAGAGTTTGTTCTGAAATTGACAGCCAAGGGCTCAGGAGAGTCCGCATACACCGTATTGGTACCGCGAAAGATTATGATGCCAGAAGACGTTCACCAAAGACTAGACTACTGCTACGGACCACGGGATGAGGCGGGTGACACGTTTCTCGAGAACTGGAACATGCTGGGCCTACATAGTATC
GCCGATTAAGCAGTGAATGCCATCTAACTGTTATTGGCCGACCTTCCTTTGTAATAATTTCATACCAGCGACAAGGGTAGGCGGCCGTCTCTTCGCGCCCAGTAGGTCTAATTCGCGGGCCCAGCACAACAAGAGTAAACGTCGAAATAACACGTAACTCAAAGTTAGTGCGCGGCGTGCAGTAGGAGAATCGCTCAGAT
GCCTGGGCAGGCGGATCGATGGCACGCACTCAGTTACACTAATAATTTATCTACCTAACTCGCCGGCATTCATTGTGCCCTTTAAGTAGTCCACATGCGAAGGTATTACAAACACAGTACCACTTCGGGAACGGTGTACCTAAAAACGCGGGCCACGGCCCCTTGTATCATAAACTCCACTCTTTGTTTCAGGTCTCCCT
AACGATCTTTCCCTATGAGTCTTACAGCAGACCGGCCTGTCCGTTTAGACCGCATGATAATTTTACGAAAGCGGCGCTTACGTCACAACTACCCACCGATCGGAGATTGAGTCTTACTGCATGCCGGGTTGCATCTCTGGCAGTCTACTATTGTCGCAGGTCCGTTTCCACGTAATTATATACCATATCGGTTAGAGCCA
CTGTGCTCCAACCCTTGAAGTCCTACTTCCGTGCGACCACTCACGTCTTGGTGAGTACATTGAAGCATAGCCATATCCGTTGGCGGGACGTCCCTCGTCACGAGCGGCGATCGTCGACTCACCTTGACCTCTTGTACTGCTCGCGACTGGATTCCTCGTGCCATTCCAACGGGGTCCTTACCTAGCTAACAAGTAGATTT
TCTATCCCACACGTACTGGTAGCATGTGAGCATAGTCTGATAATAAGAGTCGGGCACTGATTCAGGCCAAGAGGAATCATATTGGTAGGGGAGGTCATCATTTCCTTTCCTGCGCATAAGCCGGCGTACTACTTCCCTTTCCGGGTTCGTTATAGAATATTGGACAGAATGTTCAAAATTGTTGTCAGGCATTTGGATGT
TCTCATTTGCTGGTTGTACAAAAGACCACACAAACCAGTACGATAACCTGGGTATCCGTAATCACGCCGCTCGGCGAAGTAGCATCTGTTACATCGGGTCAGGAGTCGATGCACTTCCTGGACGGTTATGATTGCGGATATGGCATGTGAGCTATAACTCTTACGTTCAACTTGGAGGTCTCGTACGCGTGAGTTGCGTG
TGACAGTTATCCTTGGCAACGATCTCTGTCTTGTAATTGGGTGTCTTCAGGTTTACAATCACTTACTTACGAACATCCTAGCACGACGTCGGGAAGCATTGGTTGGGCTACGTTCAGGGTTGCCTGACATTTATAGTTGTTAAGACCTGTCGCTAAGTGGTATAGTTGGCGTCGTATCTGAGATCGAAATCGGGCACCGC
CACCAATGTCGACCATGACGGAGCACGTCCGCTGGACCCAACCTTGATTGCAAGCTTTACGCATGGACTCTCCAAGTCCAGAAACGTACAACACGACATCTGAGTCGCAATATATCTAGTAAATAAAATGTCACGCGTACAATATGTTGTCTCGCACGAACCAAGCGTAGCCAAGACCTCTGTTGATCGATTCAATCCTC
AAAGTGCGTATGATGATTCACACAAGCTCTAGCAGTGATCCAGACGGAGCAACACCGATTAAGTGGGACTAATGCCCGCGGCATTCGCCCACATCGGTAGTGGACTTTCGGGTGCCGGTTATGTACGCCCCTCCGACGTCAACCCGGCAACCACTCGTATTCATGGTGTGGCGATACAACCTTGGGTTTTTCCGATTTAT
GGTCCTCCGTGCGTACACATCTATCGACCGACCTTCGTAAGTTAAACCACCGCTCGCCAACAGGTTGTAATTATATTCGGTATGGTGTCAGCAAAATAGGTATTACGATTCGCATGATTATCGTCATTGTGGGCCTCACCCAGAGCTCCTAAGTTCTGCAGCAGAATTCCTCCCCAATAGTTGTTCCACACCGGGGAGGT
GACACTTTTGTCGGAATGATAGAGACGGATGATGCCGGGTAGGTTTTAGAACCGACAACATGATTGAGTAAGAAAGTGGACGGCGGGGGCGCGACCCCAGGGCCGCAAAGCATGTGGCACAGCTAGCTAGCTTACCGATAAGTACTTTCCGGGGGCCCTACAATGTAGAAGAGAGGGAGCGAACCCCCTCAACACCCTCC
CGTGACAATGATGTAACCGTCACGGACCCCTACTCTGCACATCGTTGCTCGCGTCTAACTGTATTCGGAGAACAGTATTGACGGAGTGGACCACTGGGTAAAGTCTTGTAGCATGAATGGTCAAGCGCGAGGGAATGAGACAGTCGAAACCAAGCACCAACTACAGACAGGATTGAGTAACGAGCATGAAAGGAAAAATG
TAGCGTTGAACTGGGCCCGTAACACCCTCTAGCGCAGGTAGTAAACGCGGGAAATATCTGACCAAGATGTCATCGACGGAAGACTTATGAAGTATTGATCGGCAGAGTGGGAATGAGAGAATACAGCACTGTAGAACGTCTTGCAGGTAATTTCCATGACTTGCGTTAACGAACCGCGTGTGAGTAACAAACTGCTACTA
AAGAAGGAATCTAAGGCTTTACTTTCGGTATTCATACGCGATCCCGTCCTCCGGGTTGGTAGGGCAGAGAATGCCGGGATCAAGTGGATAGGCATGATTCTTTGGTCTATACCTCCTACCTTGGCGAGCTATGTGCGCACATTTCCATTCATCAGGGTGATGTAGCCAGCAACCACCGCGTTAAATATTGTGATCCCTGC
AATCAGGCAATAGCATTCCAAAACACATCTCAGGACCAGTTCTCCTCGCGTGCGCAATCTAGTCCGCCCGCGCGGGTTGCGCAGCTCTTCAATGCTGAAAGGCAGTGATCAGACCGACGGAGAGCGAGGGTACTTGGGTGGTTTATGTTATGGATTCCTAGATTAGTCCCGCGCCTAGTGTGGAAGTCCTAGCGATCGAC
GTCCCTGCCCTCGGCGGCCCTGCCGTCCGACTATTTCTCACACAACTTCCAGATACCCGTATTCTCATCGAACGGTATATTTGAAAGTTTCTGTATGCAATCCATTAAATGAGCCCTAAGGGCAATGCCGCCAACTACATACCAGGACAACATTCTATTTTGACTTGTTTTGAATACTCTCGACGCCCCGTATTAACTAG
GTTCTGTTAGTCACCAGTACCCTTTCCTCAGAGCCGTAATGGCGTTTATGGGAGGCGTAACTAAATATGCTTCGATAAAGATCGTCAGTGACTTTCCTTACTATTATACGAATTTTGCATCCTAACATTACCGCGACACGACTAATACCGTCTCGTATCGCGAGCGCCAATACCATCTTAAGGAGGCTGTTCAAAAGGAC
CGAAAGGGCCTCAATCCTAACCCAGCATATAATAGGCTATCCATAAACAGAAAACTGTCCCCGACGCACTACAAGTTCGGTTATCAAGATAATGCTGCCACGAGGTCGATGCTTGTCATGCTCGTTGGGCTCTGTTTGCCGCAGGCTCCTTTTACCAAAGTAATTGTCTAATCTGTCGGCGTACTTGAACGTATTATCTC
CCCTAACACATTCGTGGTAGAGCCTGAAATAATAACCATAATACGACTTTTACATATTATTAATTTGCCACCCGTCAATATTCCTCCTAGGCCGTCTGCACAAAACTCTAAATTAGCAACCCTGCTAAAGGACGTACATTGTAAATCCTCCACACCCCGTTAACAATTGTGACTAACCACCGCTTAGGAGACTTCACCCA
GTCGAAGGTCCAATGAGTAAGATCTGATATGTACTCAGAACATCTCAAGCCGCCAAGTTATCTGCTCGAAAGCGAAAGTCCGACGTAAAAATCAACGCACAATATTTTCTGATAATGTAGTCTATTGTCTCATCGATAGCGCAACATCTTCCACAACTGGGCTCATGAAACAACCATTTGCGCAGTAAATGAGCTAGACG
TAGCCAGCTTTTGCCCGCGCGGTCGGGCGAAGTGATATGGGTCATGTTTGGGCAACCCAGCGGGGTAAACGTGGACCAATGTTACTTATTATGACGCCCTGCTCAAAGGTACCCCCATCCACTGTTGGGTGCTTTGTTGGATTGGATCTGACATGGCTTATTATAGCTCTAGCTGAACGGACACGTAAAACACCTTGGTA
TAACACCGAAGGGGCTCATCGAACATCAGATTCAACCCGCCATCTTCTGGATAATTGCGTATCGGTACACCGACGTCCCATCGCCATCCAGCCGCAGACCTCAATTGTGAGACGAGTGAGCATAATTTGTACGAGGCGCCTTCCGACTGGTGCTTGTACCGTCAACTTTTGAAAGTTACAGTTGCTTTAATCCCGCGTAT
AGCCCCCTCGGCGCTTCGAAGAAATAGGTGTGGCTGTCCTGTACACCTGGGTTTAACCGGGACGACGAATACTGCCGTTACGAGTTCGATATGGAAGTAGCATCTCAGGAAAGTAAAGACGTATCAGTATGGTGATCGTGCACCTACCACAATCCCTGACCCCGACTTTTAGCAGGGTAAGCGTAGTCAACCACAACGAA
GCACTACAAAAGTGCAACAGACCGCATCTAGCATGCGCAAGTTACTCTGCTCACGTATATTCGCCAGAAAAGGGTGTTGAGGATGACACGGTTTGATAGGCTGCGTTAGGAGAAGCGTTTTCCATGCCTGAGTCGCATGGCTGTCCTCCGCAGTTGAAAAACACCGTCATCAATCATCCTCGTGACGTAGCTGGGGGTTT
AAGAGTGAATGCTGCTGGCCCGTCAACGAATATTCCTTCCGTCTGGTTGCCGTTGCGTATAGTGGGGTCACGACTCCGAATATGATGAGTGAGCAACAGTCATATCGTCAAGTATCGCCCCACCTATGTTACCCAGCGGTATACCGACTGCGTTTAAGACTGCTCATATAAATTATTCAGTGAGCTAACCCTCAATACGG
GGCCCGTTTGTATATACAACGTATGGCAACTCTACTGGCAATCGAATGGTTAACGAGGCCGTTCAGTACTGTCTTCCCGATGGTCATACAAGAACGTGCCTTCTAGCGGATGATTGACATTACGCTTACGGACTTTCTCTCCCGATCGCGGGCTAGTCGGGCCATGGCTTATTTGGGCGGATTTCTTTCCAACAGTACTA
GTGTACGGTCCTAGTTCAAGGAGTACCAAACCGAAGGGCTGTCATACAGGAGATGTAATTTACTTTTACGAAAACCTCAGCACGAGCGATGACCCTCATTGACTTAATAACCTCCACTGAGGTGATGGTTCTGGGTCCTGACGTTTACCAAACAACTCCTGAACTTAAGATACTTGAACTGTTACTTAAAATTTGTCCTA
TCAATGTACCCGGGCAAACAACTATGTGAAAGCCAGTGCCCACCGCTGTCGCCACAAACGAGTGTACTGCTGGCAATTCCGCTGTCGTTATAATAGCTCGTGAGCCATAATCGCTCAGTGCTCCTTACAGTTAGTTTGCGTCTACTGGTTGGAGGGCTTCCGTTGCTATCATCTGCACTTACCAGGCGGTACATTGCTGC
TGTTTAGCGAGCCTCACCGCGTGCCTGCGAATGCCCTAGAGAACAACCCCGCATCTCGCCGGCCTGTTGCACGCACATCTCACTCCTGTCCAATCGGGGTAATCCTTGCGGCGTTGATCCACCACTCTGGTAAGATTTAAGCCGTTACTGGAGGAGAGTTAACTTGCGTCTAGGGTTAACGCCCGGTCCGGTCATCCATA
GGGCCCTCTTCAAATACGAAAGTGAGATAGGGAAAATCCAGCACTGTACTAGACTTCAACGCTTTGTAGACCAGTCTTAGTCGCCAAATTTACGGAGGAATTGACCAGGGTTAAATGTAGTTGTGTGGACCTAGCCCAGATGAGGGGTCGGAAGCGTAGTACCGCATCGCAATCTTTGGCGGTTCAGATACTCCGTAACA
GGTCGGCTGAGAAGTAGGCCTCGCGGTTGTACGGCTAGATCGGGTCGTAGCCCGCCACGCTCGTGCCTAGCGCTGCGATGGACCACAGTGAGCGTAATCAATCGAGGGTGAAACAAGCGGTCTTAATCCACAGATATTGCACCTCTCTGGAGACCTCTATGTGTTAGAACGTGTGTACTACGTTAGACGAAAAAACATAA
CCTATGGTATTACACGCTAAATCGAGCAAGGGACACCGAAGTGAGACTGGGTACAGTCTTAGGGTAGAGGTAAGCAGATGGAATCCGCTTCCAGGCGCACACCGACCTCAGAGTCCGACGAAAAGCGTGGTTGGAATAGGTCGATGTGGGATCTACGATGGGGTACCAACGTCACCGCCAAAAACGTGATGCACGTCGTA
TCACGGATTAGCTTATGATATGTGGCCACCAAGGTAGGATCATGATGCTGAGAAGGGAGGGAGCCGATAAAAATTCCCTGGGCCGATTAGGGCTAGCTCTTCCGTGCGTGTAAATATGTACATAGGCAAGCCCCCGGTATGGGCGAGGCTACGGGTTTAGTTTGGCGAAGCCTATTGTGACCGTTCCTATGATGCAGACC
GGGCGTTAGGGAGTTCGGTGGAAAGGGGGTTTAACACTGCTGCACAGGTGTGGCCGACCTCATTTCGGTTCCGTATCCGCAACGATTAGGATCATGCTGCGAACGAGCCACAAAGGTTTTTAAAGTAAGTTGGAGTAGTGTGGTCTAATACCATACACGGGGGTCGTTCAAGCACCGGTGGGATACCGATTTCTAGATAG
TTTAAGAATTTCTCGGCGGATCGTGGCAACAGTGATACTGCGTCACAGCGATTAACACACATGACACTTACAGCGTCCAAATGTCACCCTCCAATACTAAACCTTGGAGAGCGGTTGTCTGAAGGGGTCAAAACGTCAAACCCAATGTTCCGTATGATAAGGACGGAGCGAGACCCAGGGATCCTGTCCTTCCAGAAATA
GCAGGTTAATATCTATATTTAGCATTCCCGATCCCGCTCCTGGACGGCAGCGTCGACTCATCTAGCCATATCCGTGTCATAGAGATTGCCTTGTTGTTCTCCTTGCTAGGGGAAAGTGTCGAACTTCACGGCCTGGATTACATCCGAAGTGTGGAGATAAATATCGAGTTCTGCTGACTCTCAAATGAAACAACTTAACT




Trojan.
 
a quick test of duncans code using 10 short and long sequences shows no problems:

Code:
my @data = <DATA>;
chomp(@data);
my @short = @data[0..9];
my @long = @data[10..19];

for (my $x=0; $x<=$#short; $x++) {
  my $r = int(rand(length ($long[$x]) - length ($short[$x]) + 1));
  print "\nreplacement string = $short[$x] , replacement position = $r\n";
  my $output_norm = substr($long[$x], $r, length $short[$x]);
  print " " x $r;
  print "$output_norm\n";
  substr($long[$x], $r, length $short[$x]) = "<font color=red><b>$short[$x]</b></font>";
  print "$long[$x]\n";
}
__DATA__
CAAGACAA
ACAGTTCA
CCGAATTC
GATACTAA
GTTTCACC
TCATCTCA
CCGCCGAT
TATTGTCA
ACGTAACG
TGAGGCTT
GAATCATATATTAGTCTCCACATACTCCGTTCGTGACCCATTACCCTTTCGGGAGAGCCACAGCAACTGTAGATCTCGAAGTTGACAGGGGCAACTAGAGGCCTCAGAATTCTCACTCTTGAGGAGAGAAGTCTAAGACCTACAGTATGGTCGGGTTAGTTTTTGTTCCGTCGAACCTTGGACTAACCACTGTCTGGATA
AGGATTACCCGCTGGACTTCAAACGCTCGTGAAGCATCGTATTGCGAGGCAACCGAGTCATAGCCCAGTCCGGGGGCCAACGCAGTGCCAGCATCTGCGTTGTTCATCGGTCCTCAGTCTCCCATCAACGTGGTCCACACCTAGCATCCTGGTTTTGCATCCGTAACAAAGGACGTTCGAAGTTTTTTGCCGGCGGGAAG
TGATAATTGGTGCAATATTCTCCATAACAGATCCTCGCCAATACGGATTTGAGGGATCCCTCTGCATTTCCACGAAGCGTGTCACCGATAGAGCAGAAATGCTTTACCGCCGCAGTGATTAGGCGGGTACAGTTGTCCAAACGCACACAACCGAAACCTCCCCATGCGTACTCGTTCGTTTAGTCGCGTACAGAGGGAAC
TCTAGCTTTCGTGAAGTCCCAGGCTCCTAGCCCTACAGCGACTTCTGGCCGGATCTGAGGGAAACTTGTCTAAACTTTTTTCCCCGCGAGACTCACTGTTATAAGAGCTGGAGTTTAGATGGGCATGGCATTCTATTGCTAGTATGTCTGAGAGGTGTTCTTTAGACCACGGGTTGATGCGGGTTCGAGTAGAGACACAC
CAGCGGGCGATCACCGTTAACGATGGTGGGTAACGGTCGGAATCGGAATAGGAGAAGAATATGAGATCGTCCATGGGACGTTTCAGACGAGACGTCCCAGCTAGCCGCTGCAATATGGTATTTGGACTCATTCTTCGAGACAGTAAGCGTGATTTTTATCAGGCCCGAGGCTGTTAAACACAGCTGCTCTAGGCCGAATG
ATGCACTTCCCCACGAACTCCTGTTAGGATGTACGTAAGAGTTCAAAATGCATTCCAATTCGTTTGACGTGCAAATTAGAACACCGATGGATTGTCCAGCTTTGCACTCTACGTGAGGGGAACTGTGGTAGGTTTTTTAATGAAGCCTGCCCCTAAGTCCTTAGCCTTTGATTCTGTTTTCCCACCTCCGGCATCCAACC
GTGAGCAAGTATAGCGTTCGTCCTCCGGAGATGGACCCACAATTCTCTGCCGCATAGTCGTATGCCAATTAGTTTAGTTAAAGCGTGACTTCGGGGGTATTAAAAGAGTGGGTAGAAGTAGCAATATTATGTTATAAGTGGATGACGCAGGTGATCATGTGCCTGTCCTCCTTCGCGATTTGCATTGATGGTGACTGTTA
CGTTTTAGGCTCGTACTTGTTAAGGTATATGTTCCAAGTTCAGCCCCGCTGGGTCCGTGGATTGATACATTAGTTTGAGACGCCTAAATATTATGGGTTACCTCGTAGCGGATATTAGTGCCACCTTGCGGTACGCGCTGTTCGACACTGGACTAAACCCTGAAGCTTCTGACAGCGTAACAAAAAAGCAGATGAATCAC
CAAACACGATTGTGCTTCGTGACGAACGGGTGTAGCACGTACCGTACCATGTGCCCCTAGTTCTGAAACGTGTCATCTTCTAACGCCTCACTCACCATAATGTTCACCTCGTGCTGTACACGCAGTGGCCTAGCGCTAGGAGCTCAATTGAGTGCGACATGCTCTGGACAGCTGTAGCAGCTATGCATTAGGCACCCGCC
AGATACGTTAAAGGACCCGTGTCGGTTCGGGATTGCTATAGGTTGTCCAGCCATCGGCTCAACCCCTGGGGCGCACGATTTCATGCATGCACATCGGCCAGTACGTTTGGCAAGGGCTCAATAGACAACCAATACTGATCCCGCGCCTACATTTTCGCTATCGCAATTATCAACATAGAATTCCAGACTCGACTTGCCAT

output is as expected.
 
Hi Kevin and Trojan,

Thanks! Both of the codes are right.
I used the wrong input file. My mistake
:(

 
Hi Everwood

I'm glad your all sorted, and that Kevin & Trojan have have helped you out. Otherwise this site will soon start growing arms & legs... ;-)


Kind Regards
Duncan
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top