You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Before tagging this year's version of the repo at the conclusion of the workshop, we should fix some of these problems and clarify any ambiguities that led to user issues during the walkthrough.
(mostly copied from Slack workspace)
Some pitfalls of the Dask-RAPIDS scikit-learn tutorial, specifically with the ./open_jupyterlab_polaris.sh script:
If you get the username@x3008c0s19b0n0: Permission denied (publickey,keyboard-interactive,hostbased). error, it is likely because you do not have any SSH keypair created on Polaris and installed in the ~/.ssh/authorized_keys list; the compute node is therefore rejecting your direct SSH jump from the login node. From a Polaris login node, run ssh-keygen -t ed25519 and ssh-copy-id polaris
if you get past that, then get no control path specified for "-O" command, it could be an issue with the multiplexed connection socket set up in the earlier step ssh -M -S ~/.ssh/multiplex:polaris.rapids [email protected]. The DNS resolution and other default settings can be very OS dependent and error prone. Suggested alternative below
If you finally get through all of that, and then get an error like port already in use , then try modifying PORTD=8787 to some other number in the script
For that second pitfall, I prefer to use a different global SSH multiplexing setup for all Polaris connections. First make: mkdir ~/.ssh/cm_socket/
Then edit ~/.ssh/config:
Host polaris
HostName polaris.alcf.anl.gov
User <INSERT username>
ControlMaster auto
ControlPath ~/.ssh/cm_socket/%r@%h:%p
ControlPersist 10m
Then the second step turns from ssh -M -S ~/.ssh/multiplex:polaris.rapids [email protected] to simply ssh polaris
Before tagging this year's version of the repo at the conclusion of the workshop, we should fix some of these problems and clarify any ambiguities that led to user issues during the walkthrough.
(mostly copied from Slack workspace)
Some pitfalls of the Dask-RAPIDS scikit-learn tutorial, specifically with the
./open_jupyterlab_polaris.sh
script:username@x3008c0s19b0n0: Permission denied (publickey,keyboard-interactive,hostbased).
error, it is likely because you do not have any SSH keypair created on Polaris and installed in the~/.ssh/authorized_keys
list; the compute node is therefore rejecting your direct SSH jump from the login node. From a Polaris login node, runssh-keygen -t ed25519
andssh-copy-id polaris
ssh -M -S ~/.ssh/multiplex:polaris.rapids [email protected]
. The DNS resolution and other default settings can be very OS dependent and error prone. Suggested alternative belowPORTD=8787
to some other number in the scriptFor that second pitfall, I prefer to use a different global SSH multiplexing setup for all Polaris connections. First make:
mkdir ~/.ssh/cm_socket/
Then edit
~/.ssh/config
:Then the second step turns from
ssh -M -S ~/.ssh/multiplex:polaris.rapids [email protected]
to simplyssh polaris
May need to fix
ALCF_Hands_on_HPC_Workshop/Scikit-learn/open_jupyterlab_polaris.sh
Line 6 in 0b791a0
Also:
git rm
temporary file https://github.com/argonne-lcf/ALCF_Hands_on_HPC_Workshop/blob/master/Scikit-learn/~%24Intro%20to%20Scikit-learn_dask_intelex.pptxREADME.md
toScikit-learn/
. SeeInferenceOptimizations/README.md
for exampleThe text was updated successfully, but these errors were encountered: