SPPCheck: Interpolate multiple missing values #116

NielsKorschinsky · 2022-08-23T18:17:05Z

Current state:

Only a maximum of one data point is interpolated when preparing the data.
This has the reasoning that an assumption of multiple datapoints can have a negative effect when summarizing, especially with counting values.
this is because a zero-count is not permitted and internally handled as NA, interpolating would bloat up the summarized count.
However - how realistic are zero-counts in a production system?

The major issue with not interpolating these intermediate values is, that they can greatly affect the summarized prediction.
If a single multiple systems fail, the trend for this period is lower than before and after, showing the prediction function there was an increase/decrease in the trend.
It might be wise to interpolate any point, as long as there is a following data point (or multiple? hard to implement).

The individual prediction is not affected, as they are interpolated as a safety measurement before, resulting in a error spam in the log messages (note #113 )

Except of Masters thesis:

Due to an unknown reason, some of the vSnaps associated with the testing system were unreachable. 
Therefore, no data could be collected by SPPMon and the data points are missing. 
According to the guidelines of the predictor,when preparing data, a maximum of one data point can be interpolated in a row (see Section4.2.3). 
This guideline results in missing data from four of seven vSnaps from 04.05.2022 until 07.06.2022. 
These missing values influence the total trend, though it is negligible in the long-term trend.
The missing values are not directly apparent in Grafana because it automatically connects the remaining data points, though when inspecting the data, the missing values become apparent.

The text was updated successfully, but these errors were encountered:

NielsKorschinsky added python Pull requests that update Python code SPPCheck Issues affecting SPPCheck labels Aug 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPPCheck: Interpolate multiple missing values #116

SPPCheck: Interpolate multiple missing values #116

NielsKorschinsky commented Aug 23, 2022

SPPCheck: Interpolate multiple missing values #116

SPPCheck: Interpolate multiple missing values #116

Comments

NielsKorschinsky commented Aug 23, 2022