shlogg · Early preview
Judy @esproc_spl

Finding Missing Records In Time Series Data With EsProc

Missing records found by comparing expected & actual data. Script p1.dfx calculates differences using esProc, integrating with Java for missing record detection.

Problem description & analysis
Below is part of the data in CSV file csv.csv:
2021-02-02 00:00:00
2021-02-02 01:00:00
2021-02-02 02:00:00
2021-02-02 03:00:00
2021-02-02 04:00:00
2021-02-02 05:00:00
2021-02-02 06:00:00
2021-02-02 07:00:00
2021-02-02 08:00:00
2021-02-02 10:00:00
...
The whole data is from 2021-02-02 00:00:00 to 2021-03-03 23:00:00 and has been ordered at an interval of one hour. There are altogether 30 days of data and, in theory, 24 records each day. Actually, there are certain records are left out. Our task is to find those missing records. Below is the desired result:
2021-02...