Partially fix unsound join for `top` with empty environment in `ApronDomain` #1354

jerhard · 2024-02-06T11:44:07Z

The join in the ApronDomain behaves incorrectly when given the "Top" element for the empty environment: In that case, it returns the other element. This is because the join checks for is_bot, which checks for equality with bot, which is defined as the top element for the empty environment.
In the given problematic example, the analysis is context-insensitive and the auto-tuner selected only a subset of the variables to be tracked by the relational analysis. There, such a Top-element in an empty environment would arise, leading to an unsound analysis.

This PR changes the Apron Domain to fix this issue.

is_bot returned true for some elements for which is_top_env returned true, leading to too small values being returned.

jerhard · 2024-02-06T17:42:29Z

When changing the ApronDomain such that bot and top are actually bot and top values of Apron, respectively, another issue appears: To maintain the precision of the multithreaded relational analysis, it seems to be required that the startstate (now RD.top ()) behaves nicely with the join, i.e., is not the absorbing element. Previously, the join: join (bot ()) x yielded x, where bot () was the Apron top of the empty environment, and was used as the startstate.
Using RD.bot () as the startstate in the relational analysis no longer works, as this results in Deadcode to be detected basically everywhere.

Maybe it would be necessary to separate the abstraction for "uninitialized state" and "unreachable state".

michael-schwarz · 2024-02-18T14:33:09Z

Wit the help of @jerhard we looked at it a bit on Friday and were able to narrow it down a bit.

I have dug a bit further now, the problem seems to be the following:

The unknown [V.mutex_inits] has a value of T (which is then incorporated everywhere which causes the problem)
The side-effect is triggered during a sync transfer function with reason Join
- At this point, a relation value only containing global variables from st is created
- In the examples, the list of these variables is empty, and a top T an empty environment ([]) is side-effected

This problem does not surface for other privatizations such mutex-meet-tid-*, as they do not cause side-effects on Join as they require a path-sensitive multithreading analysis and thus never need to side-effect to mutex_inits because one branch may be multi-threaded while the other is not.

michael-schwarz · 2024-02-18T14:38:12Z

The remaining tests that fail now are

46/12 (to be investigated)
All affeq tests: The modified analysis seems to call RD.top () which was not to be called according to the documentation, as there is not environment attached. This immediately raises an excpetion.

michael-schwarz · 2024-02-18T15:24:12Z

The problem for 46/12 was similar:

A local variable x that has escaped is never added to mutex_init (as this happens as a side-effect when going multi-threaded) and the escape is later
Therefore, when looking up the value for this x in [mutex_inits], we get a T in an empty environment ([])

The fix is to not consult mutex_init for variables not in the environment, and do a blind write in escape to avoid the state becoming bot from reading bot from [x].

michael-schwarz · 2024-02-18T15:30:21Z

If one adds a top () = {d = Some (Matrix.empty ()); env = Environment.make [||] [||]} to affeq, all tests pass. Not sure how much sense that makes though...

michael-schwarz · 2024-02-18T15:31:46Z

@jerhard It seems like I cannot request your review as you opened the PR, but it would be great if you can also have a look!

michael-schwarz · 2024-02-18T16:02:38Z

The gobview failure is unrelated, the relational analyses are not even used with gobview.

CC @stilscher

tests/regression/46-apron2/79-context-insens.c

src/cdomains/apron/apronDomain.apron.ml

src/cdomains/apron/affineEqualityDomain.apron.ml

src/analyses/apron/relationAnalysis.apron.ml

sim642 · 2024-02-22T08:51:43Z

src/analyses/apron/relationPriv.apron.ml

+        (* If no globals are contained here, none need to be published *)
+        (* https://github.com/goblint/analyzer/pull/1354 *)
+        if g_vars <> [] then sideg V.mutex_inits rel_side;


This sounds like some bottom-top confusion again: if there are no globals (i.e. the environment is empty), then don't side-effect (i.e. side-effect bottom) instead of side-effecting top.
Conditionally flipping a top to bottom seems inconsistent with all other values.

I still think it's conceptually the right thing to do and not just a hack.
The thing is that we abuse the presence of globals in the relation as a no-overhead way to track whether one of the threads being joined possibly was in single-threaded mode.
If none of these threads was single-threaded (g_vars = []), all initial values are guaranteed to have been published and no side -effect is needed.

Does this make sense? If yes, I can write a comment detailing this reasoning.

The thing is that we abuse the presence of globals in the relation as a no-overhead way to track whether one of the threads being joined possibly was in single-threaded mode.

How so? We explicitly check the MustBeSingleThreaded query right before this. Prior to this, nothing is conditional on g_vars = [].

This is the case of branched thread creation.

Case 0: Both ST (single-threaded), nothing to do.

Case 1: One thread is ST, and the other one is not. After the join, Q.MustBeSingleThreaded is false, and the globals from the single-threaded side of the join need need to be published. In this case, potentially, g_vars <> []. If g_vars = [] there are no globals, and not causing a side-effect is ok.

Case 2: Both MT, nothing should be done. In this case g_vars = []. Earlier $$\bot$$ was side-effected, now $$\top$$ would be side-effected which we need to protect against.

Being multi-threaded doesn't imply g_vars = [], though.

Special-casing [] to flip the lattice is not a natural extension of the behavior on all other environments. I'm afraid we're setting ourselves up for a similar issue about top-bottom confusion that we're trying to fix here.

Suppose the global unknown has a non-bottom relation $R_{\{x\}}$ (on the environment $\{x\}$) and we're side-effecting $\top_\emptyset$ (on the empty environment). What is the join of these two relations?

$R_{\{x\}} \sqsubseteq R_{\{x\}}$ by reflexivity.

$\top_\emptyset \sqsubseteq R_{\{x\}}$ because $\emptyset \subseteq \{x\}$ and $\top_\emptyset \sqsubseteq R_{\{x\}}\vert_\emptyset = \top_\emptyset$ by the definition of heterogeneous $\sqsubseteq$ (e.g. see the unassume paper) and reflexivity.

It therefore follows that $R_{\{x\}}$ itself is an upper bound of $R_{\{x\}}$ and $\top_\emptyset$. Since it's equal to one of the arguments, it's obviously also the least upper bound of the two.

So side-effecting $\top_\emptyset$ should be a no-op (and this special case is not needed) if we have the right lattice. If it isn't, then we still don't have the lattice fully right. And that is completely independent from whatever thread-modular analysis we might be doing: it's just a relational domain property.

Principled fix is delayed to #1380.

sim642 · 2024-02-22T08:55:15Z

src/analyses/apron/relationPriv.apron.ml

@@ -489,7 +489,11 @@ struct
    in
    let get_mutex_inits = getg V.mutex_inits in
    let get_mutex_inits' = RD.keep_vars get_mutex_inits [g_var] in
-    RD.join get_mutex_global_g get_mutex_inits'
+    if not (RD.mem_var get_mutex_inits' g_var) then
+      (* This is an escaped variable whose value was never side-effected to get_mutex_inits' *)


Is this not just a workaround for the actual bug? Seems like escapes should also contribute to mutex_inits.

BasePriv.PerMutexPrivBase and BasePriv.PerMutexMeetTIDPriv side-effect escapes to mutex_inits at least.

Yes, in the base priv this is all not problematic, as it is non-relational. Here, this all becomes fragile, and I think for now we are wise to avoid the set of variables at global unknowns not only shrinking (due to decreasing protection) but also increasing in non-obvious matters due to escapes also being tracked associated with mutex_inits which is supposed to abstract the state at program start. Maybe this is also something for #1380.

stilscher · 2024-02-26T10:38:21Z

The gobview failure is unrelated, the relational analyses are not even used with gobview.

CC @stilscher

This is because with goblint/gobview#41 changes were made to GobView that rely on the respective changes in Goblint (#1360). Both PRs were already merged, but this PR is based on an older version that does not include the Goblint changes yet. Merging with the master branch should be sufficient to fix it.

michael-schwarz · 2024-03-03T11:20:17Z

Merging with the master branch should be sufficient to fix it.

It does not.

michael-schwarz · 2024-03-03T12:37:37Z

@jerhard @sim642: Anyone opposed to merging this as a stop-gap to at least be sound for the problematic cases reported here and enable experiments for my thesis until we have time to do #1380?

@stilscher: With the gobview things, I have no idea how to debug but it should be completely orthogonal: Nothing is touched here that is even used in JS builds.

sim642 · 2024-03-04T12:19:57Z

With the gobview things, I have no idea how to debug but it should be completely orthogonal: Nothing is touched here that is even used in JS builds.

I haven't looked, but it might just be crashing during some initialization. For example let is_top = equal (top ()) would evaluate top () at initialization time, even if is_top is never called, but there's no Apron JS stubs. I think we don't explicitly disable Apron in JSOO builds, it works as long no calls to Apron C/JS stubs are made.

michael-schwarz · 2024-03-04T13:07:04Z

For example let is_top = equal (top ())

Right on the money! Together with @stilscher we were able to fix it and it was exactly this. I didn't think that the eta expansion was needed, as it was also not done for bot () but was somehow unproblematic there.

sim642 · 2024-03-04T14:09:10Z

I didn't think that the eta expansion was needed, as it was also not done for bot () but was somehow unproblematic there.

That's odd indeed. By that logic it should've already been failing before. So maybe there's something else going on, like one of them actually getting called from elsewhere during initialization and the other one not.

michael-schwarz · 2024-03-04T15:09:05Z

I think it could also be an artifact of compilation order or something the like. Given that it works now, I'd suggest to go ahead and merge it.

sim642 · 2024-03-04T16:06:40Z

Since this isn't the principled fix but still leaves us with a domain that clearly violates basic lattice properties, I think this should go through an sv-benchmarks run. It's not exactly obvious that this cannot expose some issue elsewhere by fixing this one particular sv-benchmark with some special conf.
Independently, this branch can still be used for some other benchmarks that it's needed for.

michael-schwarz · 2024-03-04T16:16:28Z

I think this should go through an sv-benchmarks run.

Do you currently have capacity in Tartu? Our machines are quite loaded with SOAP benchmarks for the past few weeks.

sim642 · 2024-03-04T19:25:12Z

Do you currently have capacity in Tartu? Our machines are quite loaded with SOAP benchmarks for the past few weeks.

I don't have immediately because I'm running SV-COMP 2024 tools on Concrat with 2h time limit, which is going to take a while...
SOAP deadline is in two days though, so it's not a big delay.

michael-schwarz · 2024-03-07T13:07:16Z

I started a run on server02 now.

michael-schwarz · 2024-03-10T13:34:24Z

The results do not differ much from the last run we did on sv-comp in November (d629d14) which is our official sv-comp release.

The new numbers are:

+2 tasks on no-data-race
+204 tasks on unreach-call
+12 tasks on no-overflow
-26 tasks on mem-safety
+/- 0 tasks on mem-cleanup
- 73 tasks on termination

There are four new unsoundnesses (#1385) that are however also present on master. I will also start a diff with the current master to see which of these changes are actually due to this PR, but I think it at least doesn't like this is a huge regression for now.

sim642 · 2024-03-11T10:18:01Z

#1385 lists two of the unsound cases being from list_search, which is precisely what this PR is supposed to fix (#1302), but looks like doesn't.

michael-schwarz · 2024-03-11T10:26:30Z

Good catch! It looks like it fixes only the issue where both branches are dead that the regression test also addresses.

michael-schwarz · 2024-03-11T12:39:18Z

W.r.t current master, the only changes with this PR are for unreach_call:

unreach_call.zip where 5 tasks fluctuate between TIMEOUT 900s / StackOverflow and Unknown after 880s

Runtimes remain consistent.

I would suggest to merge this now, as it fixes known soundness bugs (the one added to the regression tests) and does not deteriorate SV-COMP results.

jerhard added 2 commits February 6, 2024 12:21

Add failing test case.

978b5e6

apronDomain.join: Check for is_bot_env instead of is_bot.

554a5f0

is_bot returned true for some elements for which is_top_env returned true, leading to too small values being returned.

jerhard added bug unsound labels Feb 6, 2024

jerhard added 2 commits February 6, 2024 18:09

Define bot and top in apron domain correctly.

2311795

Adapt relationPriv for bot being proper bot.

8848276

SchiJoha mentioned this pull request Feb 7, 2024

master thesis: Taming Recursion with Three Context-Sensitive Analyses (Callstring, LoopfreeCallstring, Context Gas) #1340

Merged

Do not side-effect empty relationship

d2ffcf0

Fix escape handling

b57fd0f

Add a top to affeq?

8787f59

michael-schwarz requested a review from sim642 February 18, 2024 15:30

michael-schwarz marked this pull request as ready for review February 18, 2024 15:30

Merge branch 'master' into issue_1302

05bd808

sim642 reviewed Feb 22, 2024

View reviewed changes

sim642 linked an issue Feb 27, 2024 that may be closed by this pull request

Unsoundness with context-insensitive analyses: sv-comp tests "list_search" #1302

Closed

sim642 added the relational Relational analyses (Apron, affeq, lin2var) label Feb 27, 2024

michael-schwarz mentioned this pull request Mar 3, 2024

Principled fix for Apron-Domain Top #1380

Open

michael-schwarz added 2 commits March 3, 2024 11:55

Merge branch 'master' into issue_1302

c008b83

Comment about top and bot over empty environment being different

5f7ce7f

michael-schwarz added 2 commits March 3, 2024 12:26

AffEq: Make top and is_top consistent

b0465a2

Minimize example

171443b

michael-schwarz requested a review from sim642 March 3, 2024 12:37

make function argument explicit to fix gobview test

6c4742c

sim642 added 2 commits March 4, 2024 17:39

Add RelationPriv TODO from PR #1354

db019b5

Fix RelationPriv indentation (PR #1354)

cdca6b7

michael-schwarz mentioned this pull request Mar 10, 2024

Unsoundness on SV-COMP for No-Overflow & Memory Properties #1385

Closed

4 tasks

michael-schwarz added this to the v2.4.0 milestone Mar 11, 2024

sim642 approved these changes Mar 12, 2024

View reviewed changes

michael-schwarz merged commit afd4631 into master Mar 12, 2024
21 checks passed

michael-schwarz deleted the issue_1302 branch March 12, 2024 08:49

michael-schwarz mentioned this pull request Mar 12, 2024

Linear Two-Variable Equalities Analysis #1297

Merged

sim642 changed the title ~~Fix unsound join for top with empty environment in ApronDomain~~ Partially fix unsound join for top with empty environment in ApronDomain Mar 12, 2024

sim642 added a commit that referenced this pull request Mar 12, 2024

Add SKIP to apron test (PR #1354)

11a7f31

sim642 mentioned this pull request May 3, 2024

Fix unsoundness from relation analysis reading special mutexes as integer variables #1441

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partially fix unsound join for `top` with empty environment in `ApronDomain` #1354

Partially fix unsound join for `top` with empty environment in `ApronDomain` #1354

jerhard commented Feb 6, 2024 •

edited

Loading

jerhard commented Feb 6, 2024 •

edited

Loading

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

sim642 Feb 22, 2024

michael-schwarz Feb 23, 2024

sim642 Feb 26, 2024

michael-schwarz Feb 26, 2024 •

edited

Loading

sim642 Feb 27, 2024 •

edited

Loading

michael-schwarz Mar 3, 2024

sim642 Feb 22, 2024

michael-schwarz Mar 3, 2024

stilscher commented Feb 26, 2024

michael-schwarz commented Mar 3, 2024

michael-schwarz commented Mar 3, 2024 •

edited

Loading

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 4, 2024

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 4, 2024

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 4, 2024

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 7, 2024

michael-schwarz commented Mar 10, 2024 •

edited

Loading

sim642 commented Mar 11, 2024

michael-schwarz commented Mar 11, 2024

michael-schwarz commented Mar 11, 2024

Partially fix unsound join for top with empty environment in ApronDomain #1354

Partially fix unsound join for top with empty environment in ApronDomain #1354

Conversation

jerhard commented Feb 6, 2024 • edited Loading

jerhard commented Feb 6, 2024 • edited Loading

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

michael-schwarz commented Feb 18, 2024

sim642 Feb 22, 2024

Choose a reason for hiding this comment

michael-schwarz Feb 23, 2024

Choose a reason for hiding this comment

sim642 Feb 26, 2024

Choose a reason for hiding this comment

michael-schwarz Feb 26, 2024 • edited Loading

Choose a reason for hiding this comment

sim642 Feb 27, 2024 • edited Loading

Choose a reason for hiding this comment

michael-schwarz Mar 3, 2024

Choose a reason for hiding this comment

sim642 Feb 22, 2024

Choose a reason for hiding this comment

michael-schwarz Mar 3, 2024

Choose a reason for hiding this comment

stilscher commented Feb 26, 2024

michael-schwarz commented Mar 3, 2024

michael-schwarz commented Mar 3, 2024 • edited Loading

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 4, 2024

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 4, 2024

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 4, 2024

sim642 commented Mar 4, 2024

michael-schwarz commented Mar 7, 2024

michael-schwarz commented Mar 10, 2024 • edited Loading

sim642 commented Mar 11, 2024

michael-schwarz commented Mar 11, 2024

michael-schwarz commented Mar 11, 2024

Partially fix unsound join for `top` with empty environment in `ApronDomain` #1354

Partially fix unsound join for `top` with empty environment in `ApronDomain` #1354

jerhard commented Feb 6, 2024 •

edited

Loading

jerhard commented Feb 6, 2024 •

edited

Loading

michael-schwarz Feb 26, 2024 •

edited

Loading

sim642 Feb 27, 2024 •

edited

Loading

michael-schwarz commented Mar 3, 2024 •

edited

Loading

michael-schwarz commented Mar 10, 2024 •

edited

Loading