Hi there,
I have a similar problem to that described in thread271-1497302, but unfortunately I haven't found a solution by now.
There is a very huge file to process consisting of 300000 columns and 1500 rows. About 20000 columns shall be deleted from that file. So it is clear, that I can't do this by writing down all the columns in an awk command like $1, $x etc. As the columns are not next to each other I can't define a range as well.
The (distinct) identifieres of the columns that shall be removed are in a text-file containing one column with 20000 identifiers (corresponding to the identifiers in the header/first line in file to process).
An equivalent of this question is of course how to extract columns(instead of delete) according to a list of identifiers. But I didn't figure out how to do this as well.
The task is probably much easier to do after columns have been transposed to rows. This did unfortunately not work due to performance issues (file size about 1GB).
Can anyone give me a hint ho to do this with awk or a shell script?
I'd appreciate any kind of help very much!
Best regards, Felix
I have a similar problem to that described in thread271-1497302, but unfortunately I haven't found a solution by now.
There is a very huge file to process consisting of 300000 columns and 1500 rows. About 20000 columns shall be deleted from that file. So it is clear, that I can't do this by writing down all the columns in an awk command like $1, $x etc. As the columns are not next to each other I can't define a range as well.
The (distinct) identifieres of the columns that shall be removed are in a text-file containing one column with 20000 identifiers (corresponding to the identifiers in the header/first line in file to process).
An equivalent of this question is of course how to extract columns(instead of delete) according to a list of identifiers. But I didn't figure out how to do this as well.
The task is probably much easier to do after columns have been transposed to rows. This did unfortunately not work due to performance issues (file size about 1GB).
Can anyone give me a hint ho to do this with awk or a shell script?
I'd appreciate any kind of help very much!
Best regards, Felix