Data Scrubbing 30 Gigs of Flat Files

Status
Not open for further replies.

underwun (MIS)
Nov 19, 2002
I'm looking for a product that can help clean up 30 GB of archived flat files and pull out the data that matters. Any suggestions?
 
Hi underwun,

If the files are on UNIX or a similar platform, try developing scripts for scrubbing the flat files with AWK, driven from a shell script. AWK processes files line by line, so it is fast on large inputs, and the language is pretty simple. Good luck.
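As a rough illustration of the line-at-a-time approach, here is a minimal sketch in Python rather than AWK. The pipe delimiter, the three-field record layout, and the names `scrub_lines` and `scrub_stream` are all assumptions made up for the example; the real rules depend on what the files actually contain.

```python
import io
from typing import Iterable, Iterator, TextIO


def scrub_lines(lines: Iterable[str], delimiter: str = "|",
                expected_fields: int = 3) -> Iterator[str]:
    """Yield cleaned records one at a time so memory use stays flat."""
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # drop blank lines
        fields = [f.strip() for f in line.split(delimiter)]
        if len(fields) != expected_fields:
            continue  # drop malformed records (hypothetical rule)
        yield delimiter.join(fields)


def scrub_stream(src: TextIO, dst: TextIO, **kwargs) -> int:
    """Copy cleaned records from src to dst and return how many were kept."""
    kept = 0
    for record in scrub_lines(src, **kwargs):
        dst.write(record + "\n")
        kept += 1
    return kept


if __name__ == "__main__":
    # Tiny in-memory sample; a real run would pass open file handles instead.
    sample = io.StringIO("1 | alice | NY\n\nbad line\n2|bob |  CA \n")
    out = io.StringIO()
    print(scrub_stream(sample, out), "records kept")
    print(out.getvalue(), end="")
```

Because the input is read one line at a time, the same loop works on a 30 GB file without loading it into memory. In AWK the equivalent filter would be a field-count pattern such as `NF == 3` with the delimiter set through `-F`.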

Thanks and Regards,
Srinath M.K
 
If it is a one-time job, then you are probably better off writing C, Perl, or awk scripts.

If it is a recurring task that has to run as a batch job, then check out Ab Initio. The tool is expensive but is tailor-made for complex transforms and very large data volumes. It can process 30 GB in 10 to 30 minutes, depending on the cleaning required.
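To put that figure in perspective, the implied sustained throughput can be worked out with a few lines of Python. This is only back-of-the-envelope arithmetic, assuming 1 GB = 1000 MB; the helper name `throughput_mb_per_s` is made up for the example.

```python
def throughput_mb_per_s(size_gb: float, minutes: float) -> float:
    """Average throughput in MB/s, assuming 1 GB = 1000 MB."""
    return size_gb * 1000 / (minutes * 60)


if __name__ == "__main__":
    for minutes in (10, 30):
        rate = throughput_mb_per_s(30, minutes)
        print(f"30 GB in {minutes} min is about {rate:.1f} MB/s")
```

So the quoted range corresponds to roughly 17 to 50 MB/s, which is a useful yardstick when timing a hand-written script against a sample of the data.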

Be Diligent!
DWTECH
 
Ab Initio is like using a sledgehammer to swat a fly if you're just trying to clean up a 30 GB file. Many people use ETL tools to manage data quality, just like many people use a hammer instead of a wrench to loosen a bolt. It'll get the job done, but...

Greg Leman
Metagenix, Inc.
 