Query to delete almost-duplicate records from database?

GWH · May 15, 2000

I've got a table in SQL Server 7 that has almost-duplicate records: they're identical except for a keynumber. I don't know how they got there, but they must go! I've built a query that joins the table to itself and selects out all the records that have duplicates, and their duplicates. (query edited for brevity)     select distinct a.* from thetable a     full outer join thetable b     on ((a.productname=b.productname) and a.Location=b.Location))     where a.KeyNumber <> b.KeyNumber     order by a.productname; If I could get a result set of only the duplicates, I could use it to purge the table.  Using the results of this query would purge ALL of the records that have dupes. Any ideas on how to get this?

carp · May 15, 2000

Well, I'm an Oracle puke but I THINK this ought to work in SQLServer: DELETE FROM thetable where KeyNumber IN   ( select distinct a.KeyNumber from thetable a                               full outer join thetable b                               on ((a.productname=b.productname) and a.Location=b.Location))                               where a.KeyNumber > b.KeyNumber); I BELIEVE this will leave the row with the lowest KeyNumber value.  I would DEFINITELY "trust but verify" on this one.

Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

Query to delete almost-duplicate records from database?

GWH

Programmer

carp

MIS

Similar threads

Part and Inventory Search

Sponsor