sql server—显示scd类型2维度表的时间趋势的sql查询

6mw9ycah  于 2021-07-24  发布在  Java
关注(0)|答案(3)|浏览(353)

下表:

pk,EmployeeNo,Building,ValidFrom,ValidTo. 
1, 1, a, 2000-01-01, 2008-06-01
2, 1, b, 2008-06-01, 2010-06-01
3, 1, c, 2010-06-01, 2011-08-01
4, 2, a, 2000-01-01, 2008-06-01
5, 2, b, 2008-06-01, 2010-06-01
6, 2, c, 2010-06-01, 2011-08-01

我是一个相对新手的sql开发人员,我使用的是mssqlserver。我想设计一个sql查询,它可以使用上面使用类型2缓慢变化的维度(validfrom date和validto date)构建的表,并生成一个随时间推移的每日趋势客户计数表。
输出如下:

Date, Employee Count
1/1/2000, 2
1/2/2000, 2
aelbi1ox

aelbi1ox1#

您可以使用聚合和累计和—在取消激活数据之后:

with d as (
      select validfrom as dte, 1 as inc
      from t
      union all
      select validto, -1
      from t
     )
select dte, sum(sum(inc)) over (order by dte)
from d
group by dte
order by dte;

注意:这假设 validto 不包括日期。如果你想把它包括在内,那就加上“1”。
编辑:
如果您只需要这两个日期的结果,请使用:

select v.dte,
       (select count(*) 
        from t
        where t.validfrom <= v.dte and t.validto >= t.validto
       ) as cnt
from (values ('2020-01-01'), ('2020-01-02')) v(dte);
gg58donl

gg58donl2#

下面的sql似乎可以工作。本人假设该员工的有效期为有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期至(但不包括)有效期;如果此假设不正确,则需要修改内部连接逻辑。

--Get the min and max dates in the employee table
DECLARE @todate date, @fromdate date
SELECT @fromdate= min(ValidFrom) from [POC].[dbo].[EmployeeSCD2];
SELECT @todate= max(validto) from [POC].[dbo].[EmployeeSCD2];

-- Generate a list of all the dates between the two dates
WITH calendar (FromDate) AS (
    SELECT @fromdate AS FromDate
    UNION ALL
    SELECT DATEADD(day, 1, FromDate)
    FROM Calendar
    WHERE FromDate < @todate
)
-- Join the list of dates to the employee table - gives a record per date per employee active on that date
SELECT CAL.FromDate, COUNT(EMP.EMPLOYEENO) 'Employee Count' 
FROM CALENDAR CAL
INNER JOIN [POC].[dbo].[EmployeeSCD2] EMP ON CAL.FromDate >= EMP.ValidFrom AND CAL.FromDate < EMP.ValidTo
group by cal.FromDate
OPTION (MAXRECURSION 0) -- Without this parameter the recursion stops after 100 loops
;
qxsslcnc

qxsslcnc3#

一个简单的方法是建立一个日期表,然后计算在特定日期有多少员工(或客户)。

;with t as (   -- this is your table/Dimension
    select pk,EmployeeNo,Building,ValidFrom,ValidTo 
    from (values (1, 1, 'a', '2000-01-01', '2008-06-01'),
                 (2, 1, 'b', '2008-06-01', '2010-06-01'),
                 (3, 1, 'c', '2010-06-01', '2011-08-01'),
                 (4, 2, 'a', '2000-01-01', '2008-06-01'),
                 (5, 2, 'b', '2008-06-01', '2010-06-01'),
                 (6, 2, 'c', '2010-06-01', '2011-08-01')
        )t (pk,EmployeeNo,Building,ValidFrom,ValidTo)
)
, dates as ( -- this is a recursive query, building a date table from 2000-01-01 up to today
    select convert(date, '2000-01-01') as d
    union all
    select DATEADD(d,1,d) 
    from dates 
    where d < getdate()-1
)
select d.d, count(1) as [Employe count]  
from dates d
inner join t on d.d between t.ValidFrom and t.ValidTo -- will join only valid employee
group by d.d 
option (maxrecursion 0)  -- since our dates table recurse over more than 100 times and the dataset is fairly small, you need this option

相关问题