I want to parse a FASTA-like file into a hash (without using bioperl). The file I'm working with is similar to...
>TTHERM_01213980
ATGGAGTAGATTTAATAAGACTAATAAGGATTGATTTCATGGTACTGTTATAATGCATAA
>TTHERM_00697570
ATGAATAAATATACTCTAATTACTTTAGGAGTTTGTATGCTTATAGTTAATGGGTTTTTG
AATAAGCATACCTTTTAATTATCTAACCACTAAACTGGCTTTGATTTATCTCTGTGTGCC
but the sequences are very long, and there are 40+ of them. I want the key to be the name (i.e. TTHERM_01213980) and the content to be the sequence, all of them in one hash.
I've come up with some ways of trying them, but I won't even try writing them because they're long, and don't work.
I appreciate any help I can get.
Thanks
>TTHERM_01213980
ATGGAGTAGATTTAATAAGACTAATAAGGATTGATTTCATGGTACTGTTATAATGCATAA
>TTHERM_00697570
ATGAATAAATATACTCTAATTACTTTAGGAGTTTGTATGCTTATAGTTAATGGGTTTTTG
AATAAGCATACCTTTTAATTATCTAACCACTAAACTGGCTTTGATTTATCTCTGTGTGCC
but the sequences are very long, and there are 40+ of them. I want the key to be the name (i.e. TTHERM_01213980) and the content to be the sequence, all of them in one hash.
I've come up with some ways of trying them, but I won't even try writing them because they're long, and don't work.
I appreciate any help I can get.
Thanks