I am trying to split an ASCII file with over 5 million records (total file size is 1 GB) into equal-sized pieces. However, the UNIX split command is too slow. Has anybody come across a different/faster way of splitting a large file?
I don't know how fast this would be, but it should work:
dd if=bigfile of=smallfile1 ibs=200000000 count=1
dd if=bigfile of=smallfile2 ibs=200000000 count=1 skip=1
dd if=bigfile of=smallfile3 ibs=200000000 count=1 skip=2
dd if=bigfile of=smallfile4 ibs=200000000 count=1 skip=3
dd if=bigfile of=smallfile5 ibs=200000000 count=1 skip=4
This would create 200-million-character files; you could use any other number you wanted. Note that if your records are not all the same length, you will wind up splitting records across files.
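If the records are newline-delimited and variable length, splitting by record count rather than byte count avoids cutting records in half; the standard split -l option does exactly that. A minimal sketch (using a small stand-in file here instead of the real 5-million-record one; the names bigfile and part_ are just placeholders):

```shell
# Build a tiny sample file standing in for bigfile
# (assumption: one record per line).
printf 'rec%d\n' $(seq 1 10) > bigfile

# Split into chunks of 3 records each; output files are
# named part_aa, part_ab, part_ac, ...
split -l 3 bigfile part_

# Every chunk contains only whole records, so nothing is
# split mid-record.
wc -l part_*
```

With the real file you would use something like split -l 1000000 to get five roughly equal pieces, at the cost of the chunks differing slightly in byte size when record lengths vary.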
What do you mean by too slow? (5 minutes, 5 hours, 5 days ...)
On which platform (*nix flavor, CPU speed, ...) are you working?
What options are you passing to the split command?
Are the records fixed-length or variable-length?
If you want a good answer, ask a good question.