hadoop/spark #108

Open
skiptoniam opened this issue Sep 15, 2022 · 3 comments

Comments

@skiptoniam

skiptoniam commented Sep 15, 2022

This tutorial seems to be well out of date. I managed to get it running by forking the elasticluster repo and making a lot of changes. I still have a few issues actually running the Hadoop cluster, which I'm working through; I expect some changes to the Ansible playbooks might have caused issues with the Hadoop/Spark cluster.

@ramson33
Contributor

Hi there,

Thanks for letting us know, and sorry there were issues with the tutorial!

I will have a look through and make fixes where necessary.

Kind regards,
Sonia

@skiptoniam
Author

All good. I also had to run elasticluster from a Python 3.7 virtual environment. It didn't work under 3.8.

@andybotting
Member

Hey @skiptoniam, we've had a couple of tickets come through recently about this. I've been trying to make this work with at least Python 3.10 and I've fixed up a few things.

We're trying to work out if we can continue to support this tutorial, as it does feel like upstream aren't really maintaining it.

Maybe with the work you've done here and my work fixing some of the newer Python issues, we can keep it going. Are you going to be using Elasticluster longish term?
