Find a value from a max count in a sub query

twifosp · Oct 22, 2009

I have some data that looks like this:

Code:

region_code city zip 
AAA     Houston 77076
AAA     Houston 77024
AAB     Houston 77024

The region_code is entered manually by a multitude of different people. It should always be the same thing for each city, but some times it gets messed up.

I am trying to write a query that will let me find the cities that have more than one region code per city, and then update them based on which region code appears more than the other region code.

So far I have:

Code:

select city, count(*) as n
  from (
    select distinct pid, city, region_code
    from  customer_dim
   ) a
  group by city
  having count(*) > 1
  order by count(*) desc

Which finds all the cities that have more than one region code assigned to them per city.

The below code is my attempt to find the max value for each city and return its region code, but of course it doesn't work because I am not doing the sub queries right. Can someone help?

Code:

select  region_code, city
from  (
     select  region_code, city, count(*) as n
  from  customer_dim
  where city = 'HOUSTON'
  group by region_code, city) a
join  (
  select max(n) as maxn
  from  (
  select  region_code, city, count(*) as n
  from  customer_dim
  group by region_code, city) aa
  ) b
on a.city = b.city
and b.maxn = a.n

carp · Oct 22, 2009

I haven't tested this, but I think it ought to give you what you want:

Code:

SELECT cd.region_code, cd.city, v.code_count
  FROM customer_dim cd
 INNER JOIN (SELECT city, max(n) AS code_count
               FROM (SELECT region_code, city, count(*) AS n
                       FROM customer_dim
                      GROUP BY region_code, city) 
             )v
ON cd.city = v.city;

carp · Oct 22, 2009

Correction to the above:

Code:

SELECT cd.region_code, cd.city, v.code_count
  FROM customer_dim cd
 INNER JOIN (SELECT city, max(n) AS code_count
               FROM (SELECT region_code, city, count(*) AS n
                       FROM customer_dim
                      GROUP BY region_code, city) 
               GROUP BY city) v
ON cd.city = v.city;

Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

Find a value from a max count in a sub query

twifosp

Programmer

carp

MIS

carp

MIS

Similar threads

Part and Inventory Search

Sponsor