-
Notifications
You must be signed in to change notification settings - Fork 204
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Memory fault when running Spark on Gramine #1227
Comments
The following code snippet raises a memory fault in jvm.
In my case, the memory fault was triggered within 10 trials. I suspect that something goes wrong when gramine You can run the attached testcase as follow:
|
@JaewonHur This looks like a duplicate of this issue: #1156 ? Please write back if it looks like you're hitting the above issue -- then I'll mark this one as a "duplicate". Currently there is noone (to my knowledge) who is working on solving this bug in Gramine, but we'll try to find some resources on this. |
@TejaswineeL - please check if this is similar to what you are debugging. |
@dimakuv I'm not sure, but it seems it is not related to #1156. The bug was not triggered after modifying It can be the issue of the different semantics between |
@JaewonHur So, there does not seem anything wrong with checkpointing. |
Sorry for the confusion, it seems not related to the checkpointing issue. |
I didn't reproduce this fault using |
@llly Thanks for checking! So the test case just worked, without any issues? |
When the java was built without debug enabled, it took longer to trigger the bug (about 50 trials). |
@llly Could it be that you didn't run the test sufficiently many times to trigger this bug? |
Closing this issue, as it is 1.5 years old, and no follow ups happened. |
Description of the problem
Hi,
Currently I'm trying to run Spark on Gramine, but whenever I run the application, it raises memory fault.
The memory fault occurs non-deterministic (but quite often at a specific pc), and I suspects it is related to the futex?
I tried debugging to find the root cause, but I could not find the clue.
In detail, the memory fault occurs frequently in the
java compiler thread
which synchronizes the accesses usingCodeCacheLock
.I assume the bug should be related to the synchronization.
Could anyone help me getting the clue?
By the way, the first few memory faults on address
0x00000000
are not the bugs, which are also raised when running jvm on the native environment.gramine-spark-stdout.txt
gramine-spark-trace.txt
Steps to reproduce
gramine version:
v1.4.0
java version:
openjdk-11
spark version:
v3.3.2
If you want the minimized testcase, please inform me.
Expected results
no memory fault
Actual results
memory fault
Gramine commit hash
v1.4.0
The text was updated successfully, but these errors were encountered: