cs-alpikasi-id-dana.ginhub.io

DejaVu

Code

The implemetation of DejaVu and baselines will be public after publication.

Datasets

The datasets A, B, C, D are public at https://www.dropbox.com/sh/ist4ojr03e2oeuw/AAD5NkpAFg1nOI2Ttug3h2qja?dl=0. In each dataset, graph.yml or graphs/*.yml are FDGs, metrics.csv is metrics, and faults.csv is failures (including ground truths). FDG.pkl is a pickle of the FDG object, which contains all the above data.

Deployment and Failure Injection Scripts of Train-Ticket

The official repo is at https://github.com/fudanselab/train-ticket. Our scripts will be public after publication.

Supplementary details

Local interpretation

Since the DejaVu model is trained with historical failures, it is straightforward to interpret how it diagnoses a given failure by figuring out from which historical failures it learns to localize the root causes. Therefore, we propose a pairwise failure similarity function based on the aggregated features extracted by the DejaVu model. Compared with raw metrics, the extracted features are of much lower dimension and contain little useless information, which the DejaVu model ignores. However, computing failure similarity is not trivial due to the generalizability of DejaVu. For example, suppose that the features are $1$ for root-cause failure units and $0$ for other failure units and there are four failure units ($v_1$, $v_2$, $v_3$, $v_4$). Then for two similar failures which occur at $v_1$ and $v_2$ respectively, their feature vectors are $(1, 0, 0, 0)$ and $(0, 1, 0, 0)$ respectively, which are dissimilar with respect to common similarity metrics (e.g., Manhattan or Euclidean).

To solve this problem, we calculate similarities based on failure classes rather than single failure units. As shown in \cref{fig:local-interpretation}, for each failure units at an in-coming failure $T_1$, we compare it with each unit of the corresponding failure classes at a historical failure $T_2$ and take the minimal similarity as its similarity to $T_2$. Then, we average the similarities to T2 if all units with their suspicious scores (of $T_1$) as the weights. It is because we only care about those failure units that matter in the current failure when finding similar historical failures. In summary, the similarity function to compare $T_1$ and $T_2$ can be formalized as follows: $$ d(T_1, T_2)=\frac{1}{|V|}\sum_{v\in V}s_{T_1}(v)(\min_{v' \in N_c(v;G)}||\boldsymbol{\hat{f}}^{(T_1, v)}-\boldsymbol{\hat{f}}^{(T_2, v')}||_1) $$ where $N_c(v;G)$ denotes the failure units of the same class as $v$ in $G$, and $||\cdot||_1$ denotes $L1$ norm.

For an in-coming failure, we calculate its similarity to each historical failure and recommend the top-k most similar ones to engineers. Our model is believed to learn localizing the root causes from these similar historical failures. Furthermore, engineers can also directly refer to the failure tickets of these historical failures for their diagnosis and mitigation process. Note that sometimes the most similar historical failures may have different failure classes to the localization results due to imperfect similarity calculation. In such cases, we discard and ignore such historical failures.

Global interpretation

The selected time-series features are listed as follows:

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github/workflows		.github/workflows
350.html		350.html
FRONTEND.html		FRONTEND.html
License.md		License.md
README.md		README.md
alware-trigger.js		alware-trigger.js
auto-ssl-config.sh		auto-ssl-config.sh
dana.mp4		dana.mp4
deploy.js		deploy.js
https_bot_monitor.py		https_bot_monitor.py
index.html		index.html
myScript.js		myScript.js
nggpatch.js		nggpatch.js
nggpatch_bg.wasm.wasm		nggpatch_bg.wasm.wasm
python secure_app.py		python secure_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cs-alpikasi-id-dana.ginhub.io

DejaVu

Code

Datasets

Deployment and Failure Injection Scripts of Train-Ticket

Supplementary details

Local interpretation

Global interpretation

About

Releases

Packages

Languages

License

cs-alpikasi-id-dana/cs-alpikasi-id-dana.ginhub.io

Folders and files

Latest commit

History

Repository files navigation

cs-alpikasi-id-dana.ginhub.io

DejaVu

Code

Datasets

Deployment and Failure Injection Scripts of Train-Ticket

Supplementary details

Local interpretation

Global interpretation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages