-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Yices2 SMTCOMP 2024 Submission #66
Conversation
Summary of modified submissionsYices2
|
@bobot The summary is showing logics which we didn't submit to. e.g. we didn't submit Yices2 to QF_UFBVDT in the unsatcore track, but it is shown in the summary. |
Indeed the markdown removed the |
#84: Create cvc5-cloud #74: Draft STP submission #70: draft yicesQS submission #68: Create STP-CNFLS #66: Yices2 SMTCOMP 2024 Submission #65: Z3-alpha draft PR #64: Solver submission: cvc5 #63: submission iProver #61: OSTRICH 1.4 #60: SMT-RAT submission #57: Amaya's submission for SMT-COMP 2024 #55: plat-smt submission #54: Add 2024 Bitwuzla submission. #53: 2024 solver participant submission: OpenSMT #52: Z3-Noodler submission #51: Submission Colibri #45: Submission for smtinterpol #42: Adding Algaroba to SMTCOMP 2024
@ahmed-irfan We have executed the latest version of Yices2 on a randomly chosen subset of 20 single-query benchmarks from each logic where it participates. The benchmarks are also scrambled by the competition scrambler (with seed 1). You can find the results here: https://www.fi.muni.cz/~xjonas/smtcomp/yices2.table.html#/table Quick explanation:
Please check whether there are some discrepancies, such as missing/extra logics, unexpected aborts or |
@ahmed-irfan Here are also results of the test run on a small subset of incremental benchmarks: https://www.fi.muni.cz/~xjonas/smtcomp/yices2_inc.table.html#/table The status column has the following meaning:
IMPORTANT: It seems that the current version of Yices2 produces some wrong results on one of the benchmarks. Note that this occurs only after scrambling, e.g.,
If there are any other discrepancies (missing/extra logics, unexpected results), please let me know. Note that for the execution on our cluster, I had to create a new script
and this is the script executed in the runs. This is unrelated to the wrong results, as they occur even without this wrapper. |
@martinjonas I have updated the solver. Can you please run the tests again? |
Sure, done! You can find the new results using the same links as before. And I am glad to see that you managed to fix that wrong result. |
@ahmed-irfan We have finished test runs of model-validation and unsat-core generation tracks. You can find the results here:
The run already uses your updated version of Yices2. As before, you can click on the status of the benchmark to see the output of your solver. If you find any discrepancies or extra/missing logics, please let me know. Note that we selected only SAT benchmarks for model validation and only UNSAT for unsat-core generation. As a result of that, some logics do not contain any benchmarks. So do not be surprised if you have subscribed to one of these logics and you do not have any result for it. In particular, the logics are: For model-generation: QF_UFFP, QF_UFBVDT, QF_UFDTNIA, QF_NIRA. For unsat-core: UFBVDT, FPLRA, QF_AUFBVFP, QF_ABVFPLRA, QF_UFBVDT, QF_UFDTNIA, QF_UFDT, QF_NIA, QF_SNIA. |
The verification of the model using dolmen found one wrong model
(We can give more time for fixing the participation in model track) |
@bobot I have updated the Yices2 executable. Thanks! |
No description provided.