Remove duplicated row

Dieuz · Jun 11, 2010

Hey there,

I have a code to remove the duplicates in one of my table but I cant seem to be able to ORDER the way mysql select the values.

Here is my initial table:

Code:

mysql_query("CREATE TABLE new_table as SELECT * FROM test GROUP BY url") 
mysql_query("DROP TABLE test") 
mysql_query("RENAME TABLE new_table TO test")

I want to remove the duplicated URL and make sure that it keeps only the value with the higher PR. I tried using the ORDER BY attribute but it doesnt seem to work. The first 3 values with a PR of 0 are always selected and placed in the new table...

The good result should be that the 3 values (a,b,c) with a PR of 6 are found in the final table.

I tried this and it doesnt work:

Code:

mysql_query("CREATE TABLE new_table as SELECT * FROM test GROUP BY url ORDER BY pr DESC")

I am looking for a fast way to achieve this because my table have over 200k entries.

Thanks!

towerbase · Jun 12, 2010

I think you need to use a Correlated Subquery. Try

Code:

SELECT DISTINCT
  url
, pr
FROM
  test as t
WHERE
  pr = (
    SELECT
      MAX(pr)
    FROM
      test
    WHERE
      url=t.url
  )

I didn't really understand Correlated Subqueries until I read Rudy Limeback's excellent book 'Simply SQL'.

Andrew
Hampshire, UK

Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

Remove duplicated row

Dieuz

Programmer

towerbase

Programmer

Similar threads

Part and Inventory Search

Sponsor