Skip to content

Commit

Permalink
rasdaemon: Fix for vendor errors are not recorded in the SQLite datab…
Browse files Browse the repository at this point in the history
…ase if some cpus are offline

Fix for vendor errors are not recorded in the SQLite database if some cpus
are offline at the system start.

Issue:

This issue is reproducible by offline some cpus, run
./rasdaemon -f --record & and
inject vendor specific error supported in the rasdaemon.

Reason:

When the system starts with some of the cpus offline and then run
the rasdaemon, read_ras_event_all_cpus() exit with error and switch to
the multi thread way. However read() in read_ras_event() return error in
threads for each of the offline CPUs and does clean up including calling
ras_ns_finalize_vendor_tables(), which invokes sqlite3_finalize() on vendor
tables created. Thus the vendor error data does not stored in the SQLite
database when such error is reported next time.

Solution:

In ras_ns_add_vendor_tables() and ras_ns_finalize_vendor_tables() use
reference count and close vendor tables which created in ras_ns_add_vendor_tables()
based on the reference count.

Reported-by: Junhao He <[email protected]>
Signed-off-by: Shiju Jose <[email protected]>
  • Loading branch information
shijujose4 committed Apr 18, 2024
1 parent f9cb13b commit 2bd74f4
Show file tree
Hide file tree
Showing 2 changed files with 17 additions and 0 deletions.
16 changes: 16 additions & 0 deletions ras-non-standard-handler.c
Original file line number Diff line number Diff line change
Expand Up @@ -66,6 +66,7 @@ int register_ns_ev_decoder(struct ras_ns_ev_decoder *ns_ev_decoder)
#endif
if (!ras_ns_ev_dec_list) {
ras_ns_ev_dec_list = ns_ev_decoder;
ras_ns_ev_dec_list->ref_count = 0;
} else {
list = ras_ns_ev_dec_list;
while (list->next)
Expand All @@ -86,6 +87,8 @@ int ras_ns_add_vendor_tables(struct ras_events *ras)
return -1;

ns_ev_decoder = ras_ns_ev_dec_list;
if (ras_ns_ev_dec_list)
ras_ns_ev_dec_list->ref_count++;
while (ns_ev_decoder) {
if (ns_ev_decoder->add_table && !ns_ev_decoder->stmt_dec_record) {
error = ns_ev_decoder->add_table(ras, ns_ev_decoder);
Expand Down Expand Up @@ -128,6 +131,16 @@ void ras_ns_finalize_vendor_tables(void)
#ifdef HAVE_SQLITE3
struct ras_ns_ev_decoder *ns_ev_decoder = ras_ns_ev_dec_list;

if (!ras_ns_ev_dec_list)
return;

if (ras_ns_ev_dec_list->ref_count > 0)
ras_ns_ev_dec_list->ref_count--;
else
return;
if (ras_ns_ev_dec_list->ref_count > 0)
return;

while (ns_ev_decoder) {
if (ns_ev_decoder->stmt_dec_record) {
ras_mc_finalize_vendor_table(ns_ev_decoder->stmt_dec_record);
Expand All @@ -141,6 +154,9 @@ void ras_ns_finalize_vendor_tables(void)
static void unregister_ns_ev_decoder(void)
{
#ifdef HAVE_SQLITE3
if (!ras_ns_ev_dec_list)
return;
ras_ns_ev_dec_list->ref_count = 1;
ras_ns_finalize_vendor_tables();
#endif
ras_ns_ev_dec_list = NULL;
Expand Down
1 change: 1 addition & 0 deletions ras-non-standard-handler.h
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,7 @@

struct ras_ns_ev_decoder {
struct ras_ns_ev_decoder *next;
uint16_t ref_count;
const char *sec_type;
int (*add_table)(struct ras_events *ras, struct ras_ns_ev_decoder *ev_decoder);
int (*decode)(struct ras_events *ras, struct ras_ns_ev_decoder *ev_decoder,
Expand Down

0 comments on commit 2bd74f4

Please sign in to comment.