Sql-server – Find time between status changes for each unit on each incident

sql serversql-server-2008

I have a table called History that holds unit status changes when they are assigned to incidents. Each unit can change status from DP to AK to ER to AR. Statuses can be skipped.

The table is called history. The fields are Unit, Datetimestamp, Status, Incident.

I want to find the average time between each of the statuses for every unit for an incident. e.g. for the UNIT E04 on INCIDENT F141000001 I want to know the time between DP to AK, AK to ER, ER to AR. Then use this to derive the average time between each of these status changes for all the units in the table.

Best Answer

OK, try this. I made some assumptions.

All statuses start as DP
Status changes are only one way as specified above and don't revert the other direction.
If a status change is listed twice, use the first earliest instance
If a status change is skipped, use the previous status time stamp to calculate the change
INCIDENT is not desired as part of the AVG status change report for UNIT
Averages in the final query are based off the actual number of status changes that occurred and is variable by UNIT/INCIDENT

With those assumptions in mind I present the following solution. LAG() isn't an option in 2008, so you are left with making a bunch of datasets to realize the output you want. I'm using CTEs to generate the lists of UNITS by INCIDENT and then creating temp tables to work with the timestamps.

I could have done a bunch of nested CTEs but then they'd be executed multiple times; same with doing this with sub queries. Finally, when reading the output for the status changes, if a change is preceeded by a NULL value, it is because the status was skipped. In the case of E14, based off available sample data, there were no status changes from DP.

Hopefully this is what you need or at least enough to get you going.

GENERATE SAMPLE DATA

create table history 
(
      UNIT CHAR(3)
    , DATETIMESTAMP CHAR(16)
    , STATUS CHAR(2)
    , INCIDENT CHAR(10)
)

INSERT history
values ('E04','20140617101703ED','DP','F141000001')
     , ('L24','20140617101703ED','DP','F141000001')
     , ('E04','20140617101845ED','ER','F141000001')
     , ('L24','20140617101848ED','ER','F141000001')
     , ('E07','20140617101955ED','DP','F141000002')
     , ('L17','20140617101955ED','DP','F141000002')
     , ('E04','20140617102029ED','AR','F141000001')
     , ('L24','20140617102038ED','AR','F141000001')
     , ('E07','20140617102235ED','ER','F141000002')
     , ('L17','20140617102238ED','ER','F141000002')
     , ('E14','20140617102501ED','DP','F141000003')
     , ('D03','20140617102626ED','DP','F141000002')
     , ('E07','20140617102712ED','ER','F141000002')
     , ('L17','20140617102717ED','ER','F141000002')
     , ('D03','20140617102740ED','ER','F141000002')
     , ('D03','20140617102744ED','aR','F141000002')

--DROP TABLE history

CODE

--GATHER DP AND AK STATUSES AND COMPARE
with 
cte_dp as
(
    select UNIT, min(convert(datetime,left(DATETIMESTAMP,4)+'-'+SUBSTRING(DATETIMESTAMP,5,2)+'-'+SUBSTRING(DATETIMESTAMP,7,2)+' '+SUBSTRING(DATETIMESTAMP,9,2)+':'+SUBSTRING(DATETIMESTAMP,11,2)+':'+SUBSTRING(DATETIMESTAMP,13,2))) DATETIMESTAMP, STATUS, INCIDENT
    from history
    where STATUS = 'DP'
    group by UNIT, STATUS, INCIDENT
),
cte_ak as 
(
    select UNIT, min(convert(datetime,left(DATETIMESTAMP,4)+'-'+SUBSTRING(DATETIMESTAMP,5,2)+'-'+SUBSTRING(DATETIMESTAMP,7,2)+' '+SUBSTRING(DATETIMESTAMP,9,2)+':'+SUBSTRING(DATETIMESTAMP,11,2)+':'+SUBSTRING(DATETIMESTAMP,13,2))) DATETIMESTAMP, STATUS, INCIDENT
    from history
    where STATUS = 'AK'
    group by UNIT, STATUS, INCIDENT
)
select dp.UNIT, coalesce(ak.DATETIMESTAMP, dp.DATETIMESTAMP) PreviousTimeStamp, DATEDIFF(second,dp.DATETIMESTAMP,ak.DATETIMESTAMP) DiffInSecs, dp.INCIDENT, case when ak.UNIT is null then 0 else 1 end as StatusCount
into #dk
from cte_dp dp
    left join cte_ak ak on ak.UNIT = dp.UNIT
        and ak.INCIDENT = dp.INCIDENT
        and ak.DATETIMESTAMP > dp.DATETIMESTAMP;

--GATHER ER STATUS AND COMPARE TO PREVIOUS STATUS CHANGE
with
cte_er as 
(
    select UNIT, min(convert(datetime,left(DATETIMESTAMP,4)+'-'+SUBSTRING(DATETIMESTAMP,5,2)+'-'+SUBSTRING(DATETIMESTAMP,7,2)+' '+SUBSTRING(DATETIMESTAMP,9,2)+':'+SUBSTRING(DATETIMESTAMP,11,2)+':'+SUBSTRING(DATETIMESTAMP,13,2))) DATETIMESTAMP, STATUS, INCIDENT
    from history
    where STATUS = 'ER'
    group by UNIT, STATUS, INCIDENT
)
select dk.UNIT, coalesce(er.DATETIMESTAMP, dk.PreviousTimeStamp) PreviousTimeStamp, DATEDIFF(second,dk.PreviousTimeStamp,er.DATETIMESTAMP) DiffInSecs, dk.INCIDENT, case when er.UNIT is null then 0 else 1 end as StatusCount
into #kr
from #dk dk
    left join cte_er er on er.UNIT = dk.UNIT
            and er.INCIDENT = dk.INCIDENT
            and er.DATETIMESTAMP > dk.PreviousTimeStamp;

--GATHER AR STATUS AND COMPARE WITH PREVIOUS STATUS CHANGE
with
cte_ar as
(
    select UNIT, min(convert(datetime,left(DATETIMESTAMP,4)+'-'+SUBSTRING(DATETIMESTAMP,5,2)+'-'+SUBSTRING(DATETIMESTAMP,7,2)+' '+SUBSTRING(DATETIMESTAMP,9,2)+':'+SUBSTRING(DATETIMESTAMP,11,2)+':'+SUBSTRING(DATETIMESTAMP,13,2))) DATETIMESTAMP, STATUS, INCIDENT
    from history
    where STATUS = 'AR'
    group by UNIT, STATUS, INCIDENT
)
select kr.UNIT, DATEDIFF(second,kr.PreviousTimeStamp,ar.DATETIMESTAMP) DiffInSecs, kr.INCIDENT, case when ar.UNIT is null then 0 else 1 end as StatusCount
into #rr
from #kr kr
    left join cte_ar ar on ar.UNIT = kr.UNIT
            and ar.INCIDENT = kr.INCIDENT
            and ar.DATETIMESTAMP > kr.PreviousTimeStamp;

--REPORT THE STATUS CHANGE TIMES IN SECONDS FOR EACH STATUS CHANGE, PER UNIT, PER INCIDENT
select dk.UNIT, dk.DiffInSecs 'DP-AK', kr.DiffInSecs 'AK-ER', rr.DiffInSecs 'ER-AR', dk.INCIDENT
from #dk dk
    left join #kr kr on kr.UNIT = dk.UNIT and kr.INCIDENT = dk.INCIDENT
    left join #rr rr on rr.UNIT = dk.UNIT and rr.INCIDENT = dk.INCIDENT

--REPORT THE AVG TIME THE STATUS CHANGED FOR EACH UNIT
select dk.UNIT, (isnull(dk.DiffInSecs,0)+isnull(kr.DiffInSecs,0)+isnull(rr.DiffInSecs,0))/(case when (dk.StatusCount+kr.StatusCount+rr.StatusCount) = 0 then 1 else (dk.StatusCount+kr.StatusCount+rr.StatusCount) end) AverageSeconds_Between_StatusChange
from #dk dk
    left join #kr kr on kr.UNIT = dk.UNIT
    left join #rr rr on rr.UNIT = dk.UNIT

drop table #dk
drop table #kr
drop table #rr

Related Solutions

Sql-server – Design concerns for using 3x bit or one char(1) or one integer in table for holding status of item

In my experience, trying to encode multiple data points into a single column always ends up being more trouble than it's worth. Sure, it seems cool and clever to use BITWISE operators, but there are many things that go wrong and it won't always be efficient to test those bits without cumbersome and unintuitive workarounds. It's the same reason we stay away from storing comma-separated lists, JSON strings etc. in a single column - eventually you care about viewing or filtering on those distinct bits which you now have to extract, sometimes expensively.

With the information I have, my vote is for three separate BIT columns. They will still collapse to similar storage patterns as a single column with the three bits on/off, and can be made more efficient individually and across the board in several ways, including:

data compression
sparse columns
filtered indexes (e.g. WHERE allow_returns = 1)

Someone else advocated for three CHAR(1) columns. These do not benefit from storage collapse and also require a check constraint, making them less than ideal in my mind.

Now, my answer might change if you say, "well, what if I might add 15 other attributes in the future?" I certainly don't think it's wise to build the columns this way if they're not relatively static - changing the schema (and therefore all of the code and interfaces to it) for every new or changed attribute is going to be a royal pain. So in that case you might want to consider EAV - where the attributes are not part of the metadata but part of the data. There are a lot of objections to EAV, mostly around performance and the difficulty in enforcing constraints (in this case unlikely to be an issue if all of these attributes are either on or off), but it worked quite well for us at my previous job. You might model it like this:

CREATE TABLE dbo.Attributes
(
  AttributeID TINYINT PRIMARY KEY,
  Name VARCHAR(32) NOT NULL UNIQUE
);

CREATE TABLE dbo.ItemAttributes
(
  ItemID INT NOT NULL 
    FOREIGN KEY REFERENCES dbo.Items(ItemID),
  AttributeID TINYINT NOT NULL 
    FOREIGN KEY REFERENCES dbo.Attributes(AttributeID),
  Status BIT NOT NULL,
  PRIMARY KEY(ItemID, AttributeID)
);

And again, you can have filtered indexes here to make certain queries much more efficient, such as (imagine the AttributeID for "allow returns" is 10):

CREATE INDEX optAllowReturns ON dbo.ItemAttributes(ItemID)
  WHERE AttributeID = 10 AND Status = 1;

If you have certain attributes that are not on/off (for example, three states of manufacture or shipping), you can change the Status column to:

Value TINYINT NOT NULL

This can double as an on/off value for attributes that are represented that way, and as tri- or more-state value for attributes that require more than simple on/off. You can also reflect which type is which in the metadata of the dbo.Attributes table.

Sql-server – Getting each status change in a table

This type of requirement comes under the banner of "gaps and islands". A popular approach is

WITH T
     AS (SELECT *,
                DENSE_RANK() OVER (PARTITION BY ItemId ORDER BY DateOfChange) - 
                DENSE_RANK() OVER (PARTITION BY ItemId, Status ORDER BY DateOfChange) AS Grp
         FROM   ItemTable)
SELECT ItemId,
       Status,
       MIN(DateOfChange) AS Start,
       MAX(DateOfChange) AS Finish
FROM   T
GROUP  BY ItemId,
          Status,
          Grp
ORDER  BY Start

Best Answer

Related Solutions

Sql-server – Design concerns for using 3x bit or one char(1) or one integer in table for holding status of item

Sql-server – Getting each status change in a table

Related Question