Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Sql to Find every 15th row in each group, sql server

Status
Not open for further replies.
Mar 6, 2008
56
US
Hi guys,

I am Using sql server 2005 and I need to find the values at every 15th date in each group of a table. Below is the Table "Orders". I am grouping on PatientID

PatientID, AdmitDate, treatment
-------------------------------
1, Date1, Treatment1
1, Date2, Treatment2
....
....
1, Date15, Treatment15
1, Date16, Treatment16
...
2, Date19, Treatment19
2, Date1, Treatment1
....
2, Date15, Treatment15
2, Date3, Treatment3
...
-------------------------------
please note that the date column is not always ascending, its ordered randomly like in patientID = 2 in the above.

Thanks


 
And how do you know which is 15th date?
Check ROW_NUMBER() function in BOL, esp OVER(PARTITION BY ) clause.
Then use it as a derived table and get every 15th row number

Borislav Borissov
VFP9 SP2, SQL Server 2000/2005.
 
Which field would indicate the order within the group? You can not rely on the order records are inserted into the table.

Anyway, here is the idea
Code:
;with cte as (select PatientID, AdmitDate, Treatment, row_number() over (partition by PatientID order by ?(Field to order)) as RowNum from Orders)

select * from cte where RowNum = 15
 
Thanks for comments.

@bborissov, @markros I got to order by Admitdate in Asc within each group PatientID and get the row at 15th Admitdate.

We havent used the server specific analytical functions or temp tables yet in our queries, we have always been writing old style sql queries that return data in one final select. Would it be more efficient to use analytical functions instead of standard sql?

It would be helpful if anyone could provide the standard sql style query too.

Thanks!
 
That is standard query for 2005 and above :)
Other way is to use a table variable with identity field, get the min and max values and the calculate 15th row.
But using ROW_NUMBER() is much easier.

Borislav Borissov
VFP9 SP2, SQL Server 2000/2005.
 
Adam Haines solution from the mentioned thread:

Code:
--2000 solution
-- Adam Haines [URL unfurl="true"]http://social.msdn.microsoft.com/Forums/en-US/transactsql/thread/8664e2df-67f1-4e36-b37e-7656b26b3ad8[/URL]
SELECT t.empid,t.[surname],t.[department],[t].[salary],(MAX([dept_cnt].emp_cnt) - COUNT(*))+1 AS seq
FROM @t t
INNER JOIN @t t2 ON t2.[department] =t.[department] AND [t2].[salary] <= [t].[salary]
INNER JOIN(SELECT [department], COUNT(*) AS emp_cnt FROM @t GROUP BY Department) AS dept_cnt
	ON [dept_cnt].[department] = t.[department]
GROUP BY t.empid,t.[surname], t.[department], t.[salary]
HAVING (MAX([dept_cnt].emp_cnt) - COUNT(*))+1 <=3
ORDER BY t.[department], (MAX([dept_cnt].emp_cnt) - COUNT(*))+1 ASC

Easy to apply for 15 instead of 3. Yet row_number() solution is so much easier.
 
Sorry, with the table creation script the above solution:

Code:
DECLARE @t table(empid numeric(10), department VARCHAR(25),surname varchar(10),salary MONEY)

insert into @t values(1,'IS','emp1',10)
insert into @t values(4,'IS','emp2',15)
insert into @t values(5,'IS','emp3',20)
insert into @t values(8,'IS','emp4',40)
insert into @t values(7,'IS','emp5',25)
insert into @t values(2,'IS','emp6',13)

insert into @t values(10,'MKTG','emp7',5)
insert into @t values(11,'MKTG','emp8',24)
insert into @t values(12,'MKTG','emp9',35)
insert into @t values(13,'MKTG','emp10',55)
insert into @t values(14,'MKTG','emp11',67)
insert into @t values(15,'MKTG','emp12',20)

--2000 solution
-- Adam Haines [URL unfurl="true"]http://social.msdn.microsoft.com/Forums/en-US/transactsql/thread/8664e2df-67f1-4e36-b37e-7656b26b3ad8[/URL]
SELECT t.empid,t.[surname],t.[department],[t].[salary],(MAX([dept_cnt].emp_cnt) - COUNT(*))+1 AS seq
FROM @t t
INNER JOIN @t t2 ON t2.[department] =t.[department] AND [t2].[salary] <= [t].[salary]
INNER JOIN(SELECT [department], COUNT(*) AS emp_cnt FROM @t GROUP BY Department) AS dept_cnt
	ON [dept_cnt].[department] = t.[department]
GROUP BY t.empid,t.[surname], t.[department], t.[salary]
HAVING (MAX([dept_cnt].emp_cnt) - COUNT(*))+1 <=3
ORDER BY t.[department], (MAX([dept_cnt].emp_cnt) - COUNT(*))+1 ASC
 
Hi,

Here's a way you can do it in one statement. This example returns the 2nd row in each group.

Code:
declare @temp table (id int, dt datetime, treat char)
insert into @temp values (1, getdate(),   'a')
insert into @temp values (1, getdate()-1, 'b')
insert into @temp values (1, getdate()-2, 'c')
insert into @temp values (2, getdate(),   'd')
insert into @temp values (2, getdate()-1, 'e')
insert into @temp values (2, getdate()-2, 'f')
insert into @temp values (3, getdate(),   'g')
insert into @temp values (3, getdate()-1, 'h')


select id, dt, treat from 
(
  select id, dt, treat,rank() over (partition by id order by dt asc) as my_rank
  from @temp
) as t
where my_rank = 2

Ryan
 
Ryan,

Did you miss the new requirement to do it for SQL Server 2000?
 
Thanks for comments. @bborissov, @markros I got to order by Admitdate in Asc within each group PatientID and get the row at 15th Admitdate. We havent used the server specific analytical functions or temp tables yet in our queries, we have always been writing old style sql queries that return data in one final select. Would it be more efficient to use analytical functions instead of standard sql?It would be helpful if anyone could provide the standard sql style query too.Thanks!

No, it would not be efficient and it would be much more complex code similar to the one I posted.
 
I mean using windowing functions is more efficient than using any other kind of code.
 
Using analytical funtions is much cleaner IMO and the OP is using 2005 so it shouldn't be a problem. But using standard sql you could do the following?

Code:
declare @temp table (id int, dt datetime, treat char)
insert into @temp values (1, getdate(),   'a')
insert into @temp values (1, getdate()-1, 'b')
insert into @temp values (1, getdate()-2, 'c')
insert into @temp values (2, getdate(),   'd')
insert into @temp values (2, getdate()-1, 'e')
insert into @temp values (2, getdate()-2, 'f')
insert into @temp values (3, getdate(),   'g')
insert into @temp values (3, getdate()-1, 'h')

select distinct t1.id, t2.*
from @temp t1
inner join @temp t2 on t2.id = t1.id 
and t2.dt = (
 select max(t3.dt) 
 from (
   select top 2 dt
   from @temp t4
   where t4.id = t1.id
   order by dt asc
 ) as t3 )

Ryan
 
Thanks for the responses!
The solution using rank() and partition looks clean. so will go with that. Though the above solution (@RyanEK) for 2000 works fine too.

Thx
 
I need help on rownum() function. I am trying to create a view (see example below) but I get error:
"The OVER SQL construct or statement is not supported."

I can get the query to work fine but how can I use this query or a simiar one to work as a view?

SELECT CustomerID,
acct_nbr, App_Description,
row_number() over (partition by LEFT(CustomerID,12) order by acct_nbr) as Acct_Count
FROM cfk
 
The problem is.... the view builder wizard thing you are using does not support the OVER statement. What you should do is...

Using SQL Server Management Studio (or Query Analyzer), open a new window.

Copy/paste this....

Code:
Create View dbo.[!]AnyViewName[/!]
As
SELECT CustomerID,
acct_nbr, App_Description,
row_number() over (partition by LEFT(CustomerID,12) order by acct_nbr) as Acct_Count
FROM cfk

Change the part in red to give it a meaning name.

Run this code by pressing F5 on your keyboard.

-George

"The great things about standards is that there are so many to choose from." - Fortune Cookie Wisdom
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top