You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
$ sky jobs launch test.yaml --cloud aws --cpus 2 -n test-mount-bucketTask from YAML spec: test.yamlManaged job 'test-mount-bucket' will be launched on (estimated):Considered resources (1 node):---------------------------------------------------------------------------------------- CLOUD INSTANCE vCPUs Mem(GB) ACCELERATORS REGION/ZONE COST ($) CHOSEN ---------------------------------------------------------------------------------------- AWS m6i.large 2 8 - us-east-1 0.10 ✔ ----------------------------------------------------------------------------------------Launching a managed job 'test-mount-bucket'. Proceed? [Y/n]: ⚙︎ Translating workdir and file_mounts with local source paths to SkyPilot Storage... Workdir: 'examples' -> storage: 'skypilot-filemounts-vscode-904d206c'. Folder : 'examples' -> storage: 'skypilot-filemounts-vscode-904d206c'. Created S3 bucket 'skypilot-filemounts-vscode-904d206c' in us-east-1 Excluded files to sync to cluster based on .gitignore.✓ Storage synced: examples -> s3://skypilot-filemounts-vscode-904d206c/ View logs at: ~/sky_logs/sky-2025-01-30-23-19-02-003572/storage_sync.log Excluded files to sync to cluster based on .gitignore.✓ Storage synced: examples -> s3://skypilot-filemounts-vscode-904d206c/ View logs at: ~/sky_logs/sky-2025-01-30-23-19-09-895566/storage_sync.log✓ Uploaded local files/folders.Launching managed job 'test-mount-bucket' from jobs controller...Warning: Credentials used for [GCP, AWS] may expire. Clusters may be leaked if the credentials expire while jobs are running. It is recommended to use credentials that never expire or a service account.⚙︎ Launching managed jobs controller on Kubernetes.W 01-30 23:19:33 instance.py:863] run_instances: Error occurred when creating pods: sky.provision.kubernetes.config.KubernetesError: Insufficient memory capacity on the cluster. Required resources (cpu=4, memory=34359738368) were not found in a single node. Other SkyPilot tasks or pods may be using resources. Check resource usage by running `kubectl describe nodes`. Full error: 0/1 nodes are available: 1 Insufficient memory. preemption: 0/1 nodes are available: 1 No preemption victims found for incoming pod.sky.provision.kubernetes.config.KubernetesError: Insufficient memory capacity on the cluster. Required resources (cpu=4, memory=34359738368) were not found in a single node. Other SkyPilot tasks or pods may be using resources. Check resource usage by running `kubectl describe nodes`.Full error: 0/1 nodes are available: 1 Insufficient memory. preemption: 0/1 nodes are available: 1 No preemption victims found for incoming pod.During handling of the above exception, another exception occurred:NotImplementedErrorThe above exception was the direct cause of the following exception:sky.provision.common.StopFailoverError: During provisioner's failover, stopping 'sky-jobs-controller-11d9a692' failed. We cannot stop the resources launched, as it is not supported by Kubernetes. Please try launching the cluster again, or terminate it with: sky down sky-jobs-controller-11d9a692
Launch again:
$ sky jobs launch test.yaml --cloud aws --cpus 2 -n test-mount-bucketTask from YAML spec: test.yamlManaged job 'test-mount-bucket' will be launched on (estimated):Considered resources (1 node):---------------------------------------------------------------------------------------- CLOUD INSTANCE vCPUs Mem(GB) ACCELERATORS REGION/ZONE COST ($) CHOSEN ---------------------------------------------------------------------------------------- AWS m6i.large 2 8 - us-east-1 0.10 ✔ ----------------------------------------------------------------------------------------Launching a managed job 'test-mount-bucket'. Proceed? [Y/n]: ⚙︎ Translating workdir and file_mounts with local source paths to SkyPilot Storage... Workdir: 'examples' -> storage: 'skypilot-filemounts-vscode-b7ba6a41'. Folder : 'examples' -> storage: 'skypilot-filemounts-vscode-b7ba6a41'. Created S3 bucket 'skypilot-filemounts-vscode-b7ba6a41' in us-east-1 Excluded files to sync to cluster based on .gitignore.✓ Storage synced: examples -> s3://skypilot-filemounts-vscode-b7ba6a41/ View logs at: ~/sky_logs/sky-2025-01-30-23-20-51-067815/storage_sync.log Excluded files to sync to cluster based on .gitignore.✓ Storage synced: examples -> s3://skypilot-filemounts-vscode-b7ba6a41/ View logs at: ~/sky_logs/sky-2025-01-30-23-20-58-164407/storage_sync.log✓ Uploaded local files/folders.Launching managed job 'test-mount-bucket' from jobs controller...Warning: Credentials used for [AWS, GCP] may expire. Clusters may be leaked if the credentials expire while jobs are running. It is recommended to use credentials that never expire or a service account.Cluster 'sky-jobs-controller-11d9a692' (status: INIT) was previously in Kubernetes (gke_sky-dev-465_us-central1-c_skypilotalpha). Restarting.⚙︎ Launching managed jobs controller on Kubernetes.⨯ Failed to set up SkyPilot runtime on cluster. View logs at: ~/sky_logs/sky-2025-01-30-23-21-05-243052/provision.logAssertionError: cpu_request should not be None
The text was updated successfully, but these errors were encountered:
To reproduce:
~/.sky/config.yaml
The text was updated successfully, but these errors were encountered: