memcmp versus strcmp, why is memcmp faster?

SkiLift · Jan 16, 2007

Why is memcmp faster than strcmp?

cpjust · Jan 16, 2007

Maybe because it isn't checking to see if it hits a NULL? I don't see anything else that it would be doing differently.

Diancecht · Jan 17, 2007

Have you checked that it's faster or you read it somewhere?

Cheers,
Dian

ArkM · Jan 17, 2007

Compare real memcmp/strcmp core loops:

Code:

/* MS VC++ RTL memcmp core loops: */
while ( --count && *(char *)buf1 == *(char *)buf2 ) {
  buf1 = (char *)buf1 + 1;
  buf2 = (char *)buf2 + 1;
  }
...
/* MS VC++ RTL strcmp core loop: */
while( ! (ret = *(unsigned char *)src - *(unsigned char *)dst) && *dst)
     ++src, ++dst;
...

The strcmp has one extra compare (and memory access) op...

SkiLift · Jan 17, 2007

During a code review, I was told to change my strcmp to a memcmp, because strcmp is too expensive. I'm pretty sure he is correct, but I was wondering why. The loop gets moderate use. Here is the old code:

Code:

if (!(lWordIndex == 0 &&			
      strcmp(str1, "Z") == 0 ||
      strcmp(str1, "X") == 0 ) &&
      lPhoneticName == &lWord[lWordIndex][0]))
{

I cant use a char compare, like str1[0] == 'Z', because I dont know how long str1 is. Here is the new if. Notice I moved the fourth condition to the second which is a great time saving idea, but I'm not sure how much the memcmp pays off because I had to add a strlen operation.

Code:

if (!(lWordIndex == 0 &&
      lPhoneticName == &lWord[lWordIndex][0] &&
      strlen(str1) == 1  &&
      (memcmp(str1, "Z", 1) == 0 ||
       memcmp(str1, "X", 1) == 0    )))
{

SkiLift · Jan 17, 2007

Ooops...missing a open paren before the first strcmp in the first code block.

mingis · Jan 17, 2007

No need for strlen():

Code:

if (!(lWordIndex == 0 &&
      lPhoneticName == &lWord[lWordIndex][0] &&
      (memcmp(str1, "Z", [COLOR=red]2[/color]) == 0 ||
       memcmp(str1, "X", [COLOR=red]2[/color]) == 0    )))
{

Diancecht · Jan 17, 2007

If you use strncmp instead of strmcp you get the same loop as memcmp. I don't think there will be noticeable differences.

Cheers,
Dian

SkiLift · Jan 17, 2007

memcmp(str1, "Z", 2) is a great idea to get rid of the strlen, but is harder to read or understand what's going on and why (for maintainability).

I like the strcmp because of the readability. I would use the memcmp if I did not have to add either the strlen or the memcmp(str1, "Z", 2) trick.

memcmp is good for comparing structures and the like.

It does not appear that there is significant time savings. That's my opinion.

Great input by everyone.

Salem · Jan 17, 2007

> During a code review, I was told to change my strcmp to a memcmp, because strcmp is too expensive.
How about looking at what the profiler is telling you is really expensive, rather than on someone's ideas as to what might be expensive.

Sacrificing readability for minor performance gains is rarely worthwhile IMO.

Besides, for such short strings, why not just save a function call and unroll it yourself into
[tt]str[0] == 'Z' && str[1] == '\0'[/tt]

> memcmp is good for comparing structures and the like.
No it isn't.
Whilst it seems like it should, you have to remember that structures can have padding, and that the padding is always(*) undefined. All well and good except that memcmp will try and compare one structs padding with another structs padding.

Unless you initialise each struct with a memset, then I don't think you can be sure of the state of any padding.

Further, using memcmp() to compare structs containing pointers is most definitely the wrong thing to be doing.

--
If you dance barefoot on the broken glass of undefined behaviour, you've got to expect the occasional cut.

SkiLift · Jan 17, 2007

You're right about unrolling it yourself!

My bad about memcmp and structs...I actually do know better than that

xwb · Jan 17, 2007

Unless it is a time critical piece of code, I wouldn't change it. If the code is only going to be executed once, say, on initialization, does it make a lot of difference whether the system starts up 2ms slower?

If, on the other hand, it is executed millions of times, then it is worth changing.

Diancecht · Jan 17, 2007

Assuming there's an actual gain. Did anyone run a test to measure that?

And even if there's one, I bet there are a lot of simpler changes that will give you a bigger performace gain.

Cheers,
Dian

counselour1 · Feb 1, 2007

It all has to do with memory alignment. strcmp does a byte compare if either of the arguments is not aligned to a word boundary. memcmp returns non-zero if either are unaligned; otherwise is does block compares (4 bytes at a time on a 32 bit architecture).

Salem · Feb 1, 2007

> memcmp returns non-zero if either are unaligned
Rubbish - the alignment of the two things being compared does not affect the result. It might affect the internal implementation, but that's a completely different issue.

--
If you dance barefoot on the broken glass of undefined behaviour, you've got to expect the occasional cut.

mingis · Feb 5, 2007

> memcmp() does block compares (4 bytes at a time on a 32 bit architecture)

I thought about this case at first time as well, but it is faster for big endians only. For little endians it is not so simple:

Code:

union
{
char chars[4];
int word;
} first, second;

first.word < second.word

does not mean

first.chars < second.chars

Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

memcmp versus strcmp, why is memcmp faster?

SkiLift

Programmer

cpjust

Programmer

Diancecht

Programmer

ArkM

IS-IT--Management

SkiLift

Programmer

SkiLift

Programmer

mingis

Programmer

Diancecht

Programmer

SkiLift

Programmer

Salem

Programmer

SkiLift

Programmer

xwb

Programmer

Diancecht

Programmer

counselour1

Programmer

Salem

Programmer

mingis

Programmer

Similar threads

Part and Inventory Search

Sponsor