jizosaves
Technical User
- Apr 7, 2009
- 1
Hi I have this dilemma.
I have an input file created with the form of
gnl|ti|1302443149
gnl|ti|1302443148
gnl|ti|1302443142
lets call it INPUT
I need to retrieve these records from another file, lets call it REF, in the form of :
...
ATAAGCAATCAACATTCCAAGCTTTGTTCTTAACTTTACCCCATCCCCTTCCACCACCGATTTTCTTGTT
ACATGTGTAATGTATTACCTATCAACTATTTCCTGTATGTGCTCTCTACCTTTACTTTCATGTTTCGTTT
AGTAAAATAAGT
>gnl|ti|1302443140 1101127226940
GTAAGCTGCAGGATTTATTGTTGTTAATCATAGAATATTTTGAGTACTCACCTCCGATATCTATCGGTGG
CGCGTTCATGCCACACTGGCCGGTGCCGTAATGTGGGGCAAGAAACATTCCCAATGAGTTTAAAAACGTC
ATCCGCGGCATACAAGAGCACAAGACGCGCCCTAAGTCGGATACCCAGCACTTTGGAGCACCCAACTCGC
ACACCCAAGTGACCCAACGCGGTAGCGGCGCCCCAAAGAGCACGTCGCCGACATTTCTATCCACTCAGCA
TAGCCGCGGAACCTGATGCTTGCTAGATGGCGCCGCAGTGTTCAAAATGCCGCTACCCAAGATAAAATGG
TCTATACTCGTGGAGGGCTCGCTACCGCAGGTGCTGCCATGCAGCTCTTGCTCGTGGTGGCGCCACCGCC
ATCACCTGTGCTCCCTGTGATCCCAGTCGAAGTGGTGACAGGGATTCTGGGAAGGCTCGTTGCCTCACGT
GCTGCTACGGAGCCCTGGCTCGTTAGGAGTGCTGTTGCTACTGCCAAAGCTTGTGCTCTTTGTCGAAACG
GCGACTGGAATCCGGAGACGGTTCGCTGCCGCAGCCGGTCGGAAAGTTCCGCGCGTTCGGTCGTAGCCCT
TCTTGGGAATTCGTTCCCTCCATGTTGAGGCTCTAGCAGGAGAAATCCACTGCTGACGTGTTTGTTACTT
GTCACCATGGAACTCAATCCGCAGAGGGCCATGGCCACGTGATCACACTTCTGAAATGTTAGAACCCTAC
TGAAAAAATTAACAAGGCGGTAAGTTGGGCTAGTTGGTTTAACATCGTCAAACGTGAAACAGCGCAAAAA
TTGAACAAAGGACAGTGAAAGGAGTGCACGGACAAGCGCTGTTAAAGCTGGACACGCGCATCGAACACGC
GACACTTGCATACAATCGGGCATCGGTTATCGGGCAGCGTACGGCT
>gnl|ti|1302443141 1101127226941
AATAGATAACAGAGGTGCAGATATGATGGGGCAGAACGGTTGTCCGGTCGGCGAATCTCAACTGGACTAA
AGGCCGATCACGACTGCAGCAACTGCAGCATGGATGTTTGGGAGTCGGCTCGTTTTCCCCAAGTCCCTAG
GTAGGGAATTCGAAGCCGCAGTTGGAAACCAGCAAGCCCCGCCTCTGTTCCATTCGATACACACATATTC
GCTCCTGCAAAGCCGCGCGAAAGCTCTGCCGTCAATCGAAAAGTAAAGACGGCGCCGGGGAGACAAGGAG
TAGTGGGCGCCTTTCCTAAAATATGTCCCGCCACCCTAAGTTGAAACGGCATTGTATACAAATAAATGCC
TACGGCGTCGGCTTGAGGACCCCGTGTAAGCAGCCTCCGGCCCTTAGAGTGCTCCTACCGTTTATCTTTC
TTTTATTAGCTTCCCGCCATGAGAAGTCGTACCGCAGGGTATGCCCCT...
this is DNA shotgun sequencing raw trace data.
I need to extract the records listed in INPUT from REF and compile the dna sequences in OUTPUT file.
i am pretty new to this sort of stuff but managed to use GREP to get my input file from sopme raw data.
Any help in this would greatly indebt me to you
Thanks
jizosaves
I have an input file created with the form of
gnl|ti|1302443149
gnl|ti|1302443148
gnl|ti|1302443142
lets call it INPUT
I need to retrieve these records from another file, lets call it REF, in the form of :
...
ATAAGCAATCAACATTCCAAGCTTTGTTCTTAACTTTACCCCATCCCCTTCCACCACCGATTTTCTTGTT
ACATGTGTAATGTATTACCTATCAACTATTTCCTGTATGTGCTCTCTACCTTTACTTTCATGTTTCGTTT
AGTAAAATAAGT
>gnl|ti|1302443140 1101127226940
GTAAGCTGCAGGATTTATTGTTGTTAATCATAGAATATTTTGAGTACTCACCTCCGATATCTATCGGTGG
CGCGTTCATGCCACACTGGCCGGTGCCGTAATGTGGGGCAAGAAACATTCCCAATGAGTTTAAAAACGTC
ATCCGCGGCATACAAGAGCACAAGACGCGCCCTAAGTCGGATACCCAGCACTTTGGAGCACCCAACTCGC
ACACCCAAGTGACCCAACGCGGTAGCGGCGCCCCAAAGAGCACGTCGCCGACATTTCTATCCACTCAGCA
TAGCCGCGGAACCTGATGCTTGCTAGATGGCGCCGCAGTGTTCAAAATGCCGCTACCCAAGATAAAATGG
TCTATACTCGTGGAGGGCTCGCTACCGCAGGTGCTGCCATGCAGCTCTTGCTCGTGGTGGCGCCACCGCC
ATCACCTGTGCTCCCTGTGATCCCAGTCGAAGTGGTGACAGGGATTCTGGGAAGGCTCGTTGCCTCACGT
GCTGCTACGGAGCCCTGGCTCGTTAGGAGTGCTGTTGCTACTGCCAAAGCTTGTGCTCTTTGTCGAAACG
GCGACTGGAATCCGGAGACGGTTCGCTGCCGCAGCCGGTCGGAAAGTTCCGCGCGTTCGGTCGTAGCCCT
TCTTGGGAATTCGTTCCCTCCATGTTGAGGCTCTAGCAGGAGAAATCCACTGCTGACGTGTTTGTTACTT
GTCACCATGGAACTCAATCCGCAGAGGGCCATGGCCACGTGATCACACTTCTGAAATGTTAGAACCCTAC
TGAAAAAATTAACAAGGCGGTAAGTTGGGCTAGTTGGTTTAACATCGTCAAACGTGAAACAGCGCAAAAA
TTGAACAAAGGACAGTGAAAGGAGTGCACGGACAAGCGCTGTTAAAGCTGGACACGCGCATCGAACACGC
GACACTTGCATACAATCGGGCATCGGTTATCGGGCAGCGTACGGCT
>gnl|ti|1302443141 1101127226941
AATAGATAACAGAGGTGCAGATATGATGGGGCAGAACGGTTGTCCGGTCGGCGAATCTCAACTGGACTAA
AGGCCGATCACGACTGCAGCAACTGCAGCATGGATGTTTGGGAGTCGGCTCGTTTTCCCCAAGTCCCTAG
GTAGGGAATTCGAAGCCGCAGTTGGAAACCAGCAAGCCCCGCCTCTGTTCCATTCGATACACACATATTC
GCTCCTGCAAAGCCGCGCGAAAGCTCTGCCGTCAATCGAAAAGTAAAGACGGCGCCGGGGAGACAAGGAG
TAGTGGGCGCCTTTCCTAAAATATGTCCCGCCACCCTAAGTTGAAACGGCATTGTATACAAATAAATGCC
TACGGCGTCGGCTTGAGGACCCCGTGTAAGCAGCCTCCGGCCCTTAGAGTGCTCCTACCGTTTATCTTTC
TTTTATTAGCTTCCCGCCATGAGAAGTCGTACCGCAGGGTATGCCCCT...
this is DNA shotgun sequencing raw trace data.
I need to extract the records listed in INPUT from REF and compile the dna sequences in OUTPUT file.
i am pretty new to this sort of stuff but managed to use GREP to get my input file from sopme raw data.
Any help in this would greatly indebt me to you
Thanks
jizosaves