You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What happened: Connecting to a remote EMR cluster from a Jupyter notebook (using YarnCluster for Dask Cluster creation) causes notebook cell to hang. The YarnCluster client is able to successfully submit the job to Yarn on EMR and the application is listed under the running applications tab, however on the notebook client side the cell just hangs. The application on Yarn seemingly continue to run as well and has to be manually killed (nothing in the Yarn application logs seems to be indicating an error)
What you expected to happen: After the job is submitted, the notebook cell should not hang and allow user to submit further Dask transformation code to the Dask cluster created on EMR (Yarn app)
Minimal Complete Verifiable Example:
Hangs after submitting the following code in the notebook cell, no errors are reported (and there is a little asterisk beside the cell)
please note that Dask-yarn is installed on all EMR nodes already
name: test-daskqueue: defaultservices:
dask.scheduler:
# Restrict scheduler to 2 GiB and 1 coreresources:
memory: 2 GiBvcores: 1script: | dask-yarn services schedulerdask.worker:
# Don't start any workers initiallyinstances: 0# Workers can infinite number of timesmax_restarts: -1depends:
- dask.scheduler# Restrict workers to 4 GiB and 2 cores eachresources:
memory: 4 GiBvcores: 2# Distribute this python environment to every worker nodefiles:
environment: /notebooks_deps_pkg.tar.gz# The bash script to start the worker# Here we activate the environment, then start the workerscript: | virtualenv env source env/bin/activate dask-yarn services worker
Anything else we need to know?: In the logs after adding print statement to base skein core.py file (added a print(req) before the return) I see the following in the logs
22/03/04 21:08:19 INFO conf.Configuration: resource-types.xml not found
22/03/04 21:08:19 INFO resource.ResourceUtils: Unable to find 'resource-types.xml'.
22/03/04 21:08:19 INFO skein.Driver: Uploading application resources to hdfs://cluster.ip:8020/user/hadoop/.skein/application_1646182918041_0074
22/03/04 21:08:43 INFO skein.Driver: Submitting application...
22/03/04 21:08:43 INFO impl.YarnClientImpl: Submitted application application_1646182918041_0074
id: "application_1646182918041_0074"<generator object KeyValueStore._input_iter at 0x7f20908370a0>
Then it just hangs in the notebook cell
Environment:
Dask version: 0.8.1
Python version: 3.6.9
Operating System: Ubuntu
Install method (conda, pip, source): Pip
The text was updated successfully, but these errors were encountered:
What happened: Connecting to a remote EMR cluster from a Jupyter notebook (using YarnCluster for Dask Cluster creation) causes notebook cell to hang. The YarnCluster client is able to successfully submit the job to Yarn on EMR and the application is listed under the running applications tab, however on the notebook client side the cell just hangs. The application on Yarn seemingly continue to run as well and has to be manually killed (nothing in the Yarn application logs seems to be indicating an error)
What you expected to happen: After the job is submitted, the notebook cell should not hang and allow user to submit further Dask transformation code to the Dask cluster created on EMR (Yarn app)
Minimal Complete Verifiable Example:
Hangs after submitting the following code in the notebook cell, no errors are reported (and there is a little asterisk beside the cell)
spec.yaml
Anything else we need to know?: In the logs after adding print statement to base skein core.py file (added a
print(req)
before the return) I see the following in the logsThen it just hangs in the notebook cell
Environment:
The text was updated successfully, but these errors were encountered: