Question for vgersh99

ranjit · Apr 24, 2003

You recently posted scripts to carryout file comparisons. Why is the array "arrA" in the below BEGIN statement indexed by $1 i.e. arrA[$1]=$0, however...in the second example the array is indexed with an incrementer i.e. arr[++lineA]=$0

I know the scripts perform different tasks but I can't understand the logic behind the choice to
index the array - clearly it makes a difference.

=========================================
Subject: Comparing two files

nawk -v fn=fileA -f comm.awk fileB

#------------------- comm.awk
BEGIN {
if (!fn) fn = "fileA"
fileAprime= fn ".prime"
while((getline < fn) > 0)
arrA[$1] = $0
}

FNR == 1 { fileBprime= FILENAME ".prime" }

{
if ( !( $1 in arrA))
print $0 >> fileBprime;
else
delete arrA[$1];
}
END {
for (i in arrA)
print arrA >> fileAprime;
}

======================================

Subject: Can Two input files be processed at awk..?

BEGIN {
if (!fn) fn = "fileA"
lineA=0
while((getline < fn) > 0)
arr[++lineA] = $0
}

========================================

I would appreciate it if you could clear up the confusion.

Thanks

Baraka69 · Apr 24, 2003

I just recently started dealing with arrays within awk and found out that all awk arrays are associative arrays, making it possible to use strings and numerics to index the array.

Advantage is that you can test for membership by using strings as index. In my opinion that is what was used in the 1st example. The reason being that you could test for membership in the array and act accordingly.

Drawback is sorting within the array and especially looping through parts of the array. Thats where you would rather use numeric indices.

e.g.

myWeek_array[Mon]
myWeek_array[Tue]
myWeek_array[Wed]
myWeek_array[Thu]
myWeek_array[Fri]
myWeek_array[Sat]
myWeek_array[Sun]

vs.

myWeek_array[1]
myWeek_array[2]
myWeek_array[3]
myWeek_array[4]
myWeek_array[5]
myWeek_array[6]
myWeek_array[7]

Try running through you array for all weekdays. In the 1st example you will have some difficulty (you will need to explicitly list all occurrences), but in the 2nd example you can write a loop from 1-5.

I hope that helps.

Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

Question for vgersh99

ranjit

Technical User

Baraka69

Programmer

Similar threads

Part and Inventory Search

Sponsor