Quantcast
Channel: SQL – TAR Solutions
Viewing all articles
Browse latest Browse all 10

SQL Calculating the row number

$
0
0

To count the row number in SQL there’s a useful little in built SQL function you need to use, row_number()

This enables you to number all rows and restart the row numbering part way a dataset. Note this is different to the SQL rank function.

This is a very useful thing is you want to select the top X rows by specified criteria. For example the top 10 selling products per country or the top 5 sales people per region.

The syntax of this is ROW_NUMBER ( )     OVER ( [ <partition_by_clause> ] <order_by_clause> ), which for me is not super helpful without some examples. (For more details you can see the row_number function on the Microsoft website here: http://msdn.microsoft.com/en-us/library/ms186734.aspx)

Here is some demo data to run through some common scenarios. You can run this code in SQL Server Management Studio (SSMS) to create the test data.

CREATE TABLE #Sales
    (
    Country varchar(10)
    ,Person varchar(2)
    ,Sales decimal (18,2)
    )

insert into #Sales values (‘UK’,’AW’,8329)
insert into #Sales values (‘UK’,’KS’,542)
insert into #Sales values (‘UK’,’NE’,942)
insert into #Sales values (‘UK’,’LT’,7358)
insert into #Sales values (‘UK’,’AP’,4262)
insert into #Sales values (‘France’,’ME’,952)
insert into #Sales values (‘France’,’DD’,8802)
insert into #Sales values (‘France’,’WQ’,2331)
insert into #Sales values (‘France’,’PD’,544)
insert into #Sales values (‘France’,’ML’,2418)
insert into #Sales values (‘Italy’,’MN’,5426)
insert into #Sales values (‘Italy’,’DN’,7982)
insert into #Sales values (‘Italy’,’IA’,499)
insert into #Sales values (‘Italy’,’EY’,1034)
insert into #Sales values (‘Italy’,’LA’,5429)

Example 1: Number each row with the highest selling person at the top

This is quite simple, you just need to order by sales in the Over part of the clause.

select Person
    ,Country
    ,Sales
    ,ROW_NUMBER() over (order by sales desc) as RowNumber
from #Sales

Example 2: Number each row with the highest selling person at the top doing this for each country

Slightly more complex, this is where we introduce the Partition By into to Over part of the clause

select Country
    ,Person
    ,Sales
    ,ROW_NUMBER() over (partition by country order by sales desc) as RowNumber
from #Sales

Example 3: Only select the top 3 sales people from each country in term of sales amount

Here we also need to use the same code as Example 2 and we just query that result set

select Country
    ,Person
    ,Sales
    ,RowNumber
from
    (
    select Country
        ,Person
        ,Sales
        ,ROW_NUMBER() over (partition by country order by sales desc) as RowNumber
    from #Sales
    )a
where RowNumber < 4
order by Country, RowNumber

Quite straightforward once you understand the syntax. Note the Row_number() function can’t be used in the WHERE clause or HAVING hence I had to query a query in Example 3.

The post SQL Calculating the row number appeared first on Business Analytics.


Viewing all articles
Browse latest Browse all 10

Trending Articles