-
-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EMR bootstrap script fails #122
Comments
Thank you for the error report @ResidentMario . My apologies in the delayed response. The folks who maintain this repository have been busy lately. Do you have any interest in submitting a patch to resolve this issue? |
I might be able to look into it, but no promises. |
I am trying to debug some other issues with this and found that by using the EMR release |
Apparently emr-5.30 onwards they only support systemd and no longer support upstart. |
I think it has to do with Amazon Linux 2:
|
I tried to make it work with systemd and updated it with the following:
Note: I added a port for the notebook. This runs on bootstrap but there is nothing on port 8889. When I run the ExecStart command manually, via ssh, the notebook opens. So not sure what I'm doing wrong. I also get the following problem: #124 Sources for the new script:
EMR version 5.31.0 |
Hi @hiqbal2, Your script for systemd was super helpful. I got it to work by doing a few changes to this script.
I hope this helps |
@hegde-anish thanks for the help. EMR seems to bootstrap properly now. However, not sure if you got this error when trying to start a dask cluster:
Not sure why this is happening. I am also not sure if there is a difference in behaviour in just calling I don't have any experience with hadoop or dask, so am a little lost on debugging this. |
This modified bootstrap script worked for me, with a few additional fixes:
To fix the latter two issues, I added
and I specified a
EMR version 5.32.0 |
The above worked for me. However, the jupyter notebook now just does not output any values. I tried to start the notebook via ssh and got the following error when trying to do a simple
|
@kqshan this is great, thanks. I didn't find I needed to to unalias, after the bootstrap I had proper pointers to miniconda python/pip. I'm running a newer EMR (emr-6.2.0) so this may be a factor. I removed the version pin for tornado as well. The conda pack issue appears to be from this conda issue. I added Also the version spec for dask-yarn causes a file to be written to the home folder called ''=0.7.0". Some escaping or quoting likely necessary to fix but I just removed the version specification because conda installed 0.8.1 on its own. |
I also ran into this issue when trying the bootstrap script. @davegravy did you test your script? Do you have a version that works with emr-6.2.0? |
What version of |
My script works with emr-6.2.0, yes. I've had no issues with dask & EMR, at least not that I can attribute to the bootstrap. |
Can you share it? |
Sure: https://gist.github.com/davegravy/61e3abb81176f4490032554b70d28c31 |
Hello, i tried install dask with many versions of bootstrap and EMR versions but anything doesnt work. If it's possible share with me what EMR version and dask bootstrap you used. Thanks |
Hi I was using EMR 6.2.0.
This is a private python library my bootstrap script installs. It shouldn't have any bearing on the bootstrap's ability to succeed. |
The EMR bootstrap script currently fails with the following error (found via
stderr
logs):The text was updated successfully, but these errors were encountered: