
Tool Submission Discussion #3

Open
ttj opened this issue Jun 24, 2024 · 28 comments

Comments

@ttj
Contributor

ttj commented Jun 24, 2024

We will update this soon after finishing the AWS setup on the submission system, but we are posting now to help people get started with tool preparation (copied from the last iteration). @ChristopherBrix


At this point, you should be updating your tool to support quickly verifying as many benchmarks as you can. Note that the benchmark instances will change based on a new random seed for the final evaluation. We will follow a similar workflow to last year, where tool authors provide shell scripts to install their tool, prepare instances (for example, convert models to a different format), and finally verify an instance. Detailed instructions are available in 2021's git repo.
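The per-instance part of that workflow can be sketched as a minimal run script. This is only an illustrative sketch, not the official interface: the argument names and the `./mytool` binary are assumptions, so consult the 2021 repo for the exact script signatures.

```shell
#!/bin/bash
# Hypothetical sketch of a run script: invoke the solver under a timeout and
# make sure a result file always exists for the harness to parse.
# The argument layout and ./mytool are illustrative assumptions, not the spec.
run_instance() {
  local onnx=$1 vnnlib=$2 results=$3 to=$4
  if ! timeout "$to" ./mytool --net "$onnx" --spec "$vnnlib" > "$results" 2>/dev/null; then
    echo "unknown" > "$results"   # fall back to a parseable verdict on failure
  fi
}
```

The key property is that the script never leaves the harness without a result file, even when the tool crashes or times out.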

You will be able to run and debug your toolkit on the submitted benchmarks online at this link. There, you first need to register. Your registration will be manually activated by the organizers; you'll receive a notification once that's done. Afterwards, you can log in and start with your first submission.

The process is similar to the submission of benchmarks, with a small change compared to last year: you need to specify a public git URL and commit hash, as well as the location of a .yaml config file, where you can specify parameters for your toolkit evaluation. By making those settings part of the repository, they will be preserved for future reference.
You can define a post-installation script to set up any licenses.
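A post-installation script could, for example, drop a license file into place before the tool runs. Everything below (directory, file name, and contents) is a made-up illustration; adapt it to your tool's actual licensing mechanism.

```shell
#!/bin/bash
# Hypothetical post-installation sketch: create a license file where the tool
# expects to find it. All paths and the license text are placeholders.
LICENSE_DIR="$HOME/licenses"
mkdir -p "$LICENSE_DIR"
cat > "$LICENSE_DIR/tool.lic" <<'EOF'
SERVER license.example.org 0
EOF
chmod 600 "$LICENSE_DIR/tool.lic"   # keep license material readable only by you
```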

Once submitted, you're placed in a queue until the chosen AWS instance can be created, at which point your installation and evaluation scripts will be run. You'll see the output of each step and can abort the evaluation early in case there are any issues. Once a submission has terminated, you can use it to populate the submission form for the next iteration, so you don't have to retype everything.

Important: We currently have no limitation on how often you can submit your tool for testing purposes, but we will monitor the usage closely and may impose limits if necessary. Please be mindful of the costs (approx. $3 per hour) each submission incurs. To save costs, you should debug your code locally and then use the website to confirm the results match your expectations.

We strongly encourage tool participants to at least register and have some test submissions on the toolkit website well ahead of the deadline.

@mldiego

mldiego commented Jul 1, 2024

Is the submission system open yet?

@mldiego

mldiego commented Jul 2, 2024

Do our accounts from last year still work? If not, can I get my account activated?

email: [email protected]

@aliabigdeli

@ttj @ChristopherBrix
My account from last year is no longer active. Could you reactivate it?

email: [email protected]

@hodangkhoinguyen

hodangkhoinguyen commented Jul 4, 2024

Please activate account [email protected] when you have a chance. I'm affiliated with NeuralSAT - GMU. Thanks so much.

@shizhouxing

Same here. zshi at cs.ucla.edu

@toladnertum

Same here: [email protected]

@kollerlukas

Same here: [email protected]

@mldiego

mldiego commented Jul 4, 2024

Will the submission deadline be pushed back?

Can we get this AMI added to the submission system? @ChristopherBrix

R2024a matlab_linux
ami-080fce32dc8c15ced
[Copied ami-0b5d9d19591951eb7 from us-east-1]
OwnerAlias: –
Platform: Other Linux
Architecture: x86_64
Owner: 704708324783
Publish date: 2024-04-25
Root device type: ebs
Virtualization: hvm
ENA enabled: Yes

@ttj
Contributor Author

ttj commented Jul 5, 2024

As we were a bit behind in getting the execution system going (some AWS issues needed to be sorted out), we will extend the deadline to July 12. Barring any further AWS issues, which are hopefully about resolved, I don't anticipate further extensions, as we need some time to execute and prepare prior to the conference presentation on July 23.

@phK3

phK3 commented Jul 5, 2024

Can you please activate [email protected] ?

@aliabigdeli

Can you please activate [email protected] ?

@ChristopherBrix

Tool submission is open!
Please try to submit your tools. If you encounter any issues, feel free to ping me here, or write me an email ([email protected])
Currently, AWS has only granted us limited resources, so if you pick the p3 or g5 instance, you might have to wait a bit in the queue before your submission is evaluated. If you just want to make sure that your installation works as intended and don't care about runtime performance yet, please use m5 instances for now; we have more capacity there. I'm working with AWS to resolve this and will post an update once it's done.
I've enabled all accounts.

@ChristopherBrix

@mldiego I've added ami-080fce32dc8c15ced to the list of supported amis.

@shizhouxing

Hi @ChristopherBrix , could you please also add the standard ubuntu 24.04? It's ami-04a81a99f5ec58529. Thanks!

@ChristopherBrix

@shizhouxing I've added ami-0aff18ec83b712f05, that should be the us-west-2 equivalent. Let me know if you need a different one.

@Jubengo

Jubengo commented Jul 8, 2024

I ran into an error earlier which I think is a bug on the server side (submission 71):
I tried to test single networks on acas; the first and third network paths were OK (e.g. vnncomp2024_benchmarks/benchmarks/acasxu_2023/onnx/ACASXU_run2a_1_1_batch_2000.onnx), but the second one was not:
vnncomp2024_benchmarks/benchmarks/acasxu_2023/nx/ACASXU_run2a_5_6_batch_2000.onnx
I think my prepare_instance script is OK, and I also think that the path in instances.csv is OK.

@ChristopherBrix

You can now have a fixed MAC address! This might help with licensing issues.

In your new "Profile" page, you can find an "eni" code. This is used to assign a static MAC address to your AWS instance.
If you share your eni code with collaborators, they can use the same MAC address for their submissions, too.

Please let me know if you encounter any issues!

@Jubengo I'll look into that.

@mldiego

mldiego commented Jul 9, 2024

Setting a fixed MAC address did not work for me. I got the following error during the initialization phase:
Error loading logs, connection timed out.

Submission IDs: 101 and 102

@ChristopherBrix

@ChristopherBrix

@mldiego This is fixed, ENIs should now no longer cause the submission to get stuck.

@ChristopherBrix

@Jubengo I'm afraid this seems to be a bug in your code, or a very weird interaction between your scripts and mine. If I remove the call to pyrat.pyc from your run_instance.sh script, the model paths are all correct (but of course, your tool is not executed). If I include the call, the onnx paths are all messed up.
I'm not really sure how to debug this. Do you set any specific environment variables inside your tool?
Feel free to reach out to me via email to discuss this further: [email protected]

@ChristopherBrix

If you encounter issues with your tool that you need to debug, you can now choose to pause the pipeline after the post-installation script was run. This can be selected in the submission form.
Use the post-installation script to append your public ssh key to the ~/.ssh/authorized_keys file. Do not replace its content, otherwise the server won't be able to connect any longer and the instance will be terminated. After the post-installation script has run, the pause will prevent the pipeline from continuing. You can then ssh into the machine via ssh ubuntu@IP, using the IP shown at the top of your submission details page.
Please note that the instance will be terminated automatically after 4 hours.
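A minimal post-installation snippet for this could look as follows. The key value is a placeholder for your own public key; the important detail is using `>>` (append) rather than `>` (overwrite), for the reason described above.

```shell
#!/bin/bash
# Hypothetical sketch: append (never overwrite) your public key so you can
# ssh in while the pipeline is paused. PUBKEY is a placeholder value.
PUBKEY="ssh-ed25519 AAAAC3Example debug@laptop"
mkdir -p ~/.ssh && chmod 700 ~/.ssh
# '>>' appends; using '>' here would wipe the server's own key and get the
# instance terminated
echo "$PUBKEY" >> ~/.ssh/authorized_keys
chmod 600 ~/.ssh/authorized_keys
```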

@ChristopherBrix

I just switched over to a different AWS account. This caused all ENIs (and the associated MAC addresses) to change, so if you used them for your licenses, you'll have to update them one more time. From now on, I don't anticipate further changes to this.
On the bright side, this solved our quota issues, and we now have enough instances of all three types available to keep the queue short.

@shizhouxing

@ChristopherBrix It looks like "Yaml Config File, relative to repository root" cannot be loaded from a private repository with a personal access token? The submission site is trying to open it using a https://raw.githubusercontent.com/ URL instead of cloning the repository first.

@ChristopherBrix

We were able to debug this. For anyone else encountering issues: Please make sure the URL you specify doesn't end in .git.
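A quick way to normalize the URL before pasting it into the form is shell parameter expansion (the repository URL below is an example, not a real submission):

```shell
# Strip a trailing .git so the config file can be fetched from the raw URL
url="https://github.com/example/tool.git"   # example URL
clean="${url%.git}"
echo "$clean"
```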

@Jubengo

Jubengo commented Jul 11, 2024

Hi @ChristopherBrix, I can't get past the initialisation phase with the ami-0280f3286512b1b99 (Deep Learning AMI), and it does not seem to be an issue on my side (or to work if I retry). Could you look at this?

@ChristopherBrix

@Jubengo Please try again. The problem is that this instance type spends a lot of time doing some background updates. I've tried to detect that and wait until they're done, but it's not foolproof. I've updated the scripts; hopefully this reduces the problem enough to make it usable.

@ttj
Contributor Author

ttj commented Oct 11, 2024

We have shared a draft overleaf report and are also fixing some execution issues identified by some participants. So far we have emailed the listservs; please let us know if you didn't get it and we will share it.

@ttj
Contributor Author

ttj commented Dec 20, 2024

All: the report is nearly finalized pending some final checks, and we plan to submit to arXiv before the end of 2024 to ensure posting with a 2024 date, so please make any final changes before 12/27 at the very latest. We can post a new version to arXiv if further changes are needed after that. Thank you again for your participation, and we wish you a happy holiday season and new year!


10 participants