Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

/test jump-ci icelake skeleton #634

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

MML-coder
Copy link
Collaborator

No description provided.

Copy link

openshift-ci bot commented Jan 14, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign ashishkamra for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@MML-coder
Copy link
Collaborator Author

/test jump-ci icelake skeleton

@openshift-ci openshift-ci bot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Jan 14, 2025
Copy link

openshift-ci bot commented Jan 14, 2025

Hi @MML-coder. Thanks for your PR.

I'm waiting for a openshift-psap member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Copy link

openshift-ci bot commented Jan 14, 2025

@MML-coder: Cannot trigger testing until a trusted user reviews the PR and leaves an /ok-to-test message.

In response to this:

/test jump-ci icelake skeleton

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@kpouget
Copy link
Contributor

kpouget commented Jan 15, 2025

/ok-to-test
/test jump-ci icelake skeleton

@openshift-ci openshift-ci bot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jan 15, 2025
Copy link

topsail-bot bot commented Jan 15, 2025

🟢 Test of 'skeleton test test_ci' succeeded after 00 hours 09 minutes 14 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci

@MML-coder
Copy link
Collaborator Author

/test jump-ci icelake skeleton

Copy link

topsail-bot bot commented Jan 15, 2025

🟢 Test of 'skeleton test test_ci' succeeded after 00 hours 09 minutes 28 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci

@kpouget
Copy link
Contributor

kpouget commented Jan 16, 2025

/test jump-ci icelake fine_tuning

Copy link

topsail-bot bot commented Jan 16, 2025

🔴 Test of 'fine_tuning test prepare_ci' failed after 00 hours 10 minutes 53 seconds. 🔴

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci

Failure indicator:

/logs/artifacts/000__prepare1/000__install_rhoai/000__cluster__deploy_operator_service-mesh/FAILURE | [000__cluster__deploy_operator_service-mesh] ./run_toolbox.py cluster deploy_operator --catalog=redhat-operators --manifest_name=servicemeshoperator --namespace=all --> 2
/logs/artifacts/000__prepare1/000__install_rhoai/FAILURE | CalledProcessError: Command 'set -o errexit;set -o pipefail;set -o nounset;set -o errtrace;ARTIFACT_DIR="/logs/artifacts/000__prepare1/000__install_rhoai" ARTIFACT_TOOLBOX_NAME_SUFFIX="_service-mesh" ./run_toolbox.py cluster deploy_operator --catalog='redhat-operators' --manifest_name='servicemeshoperator' --namespace='all'' returned non-zero exit status 2.
Traceback (most recent call last):
  File "/opt/topsail/src/projects/rhods/library/prepare_rhoai.py", line 57, in install
    install_servicemesh()
  File "/opt/topsail/src/projects/rhods/library/prepare_rhoai.py", line 32, in install_servicemesh
    run.run_toolbox("cluster", "deploy_operator",
  File "/opt/topsail/src/projects/core/library/run.py", line 82, in run_toolbox
    return run(f'{cmd_env} ./run_toolbox.py {group} {command} {_dict_to_run_toolbox_args(kwargs)}', **run_kwargs)
  File "/opt/topsail/src/projects/core/library/run.py", line 105, in run

[...]

@MML-coder
Copy link
Collaborator Author

/test rhoai-light skeleton

Copy link

topsail-bot bot commented Jan 16, 2025

🟢 Test of 'rhoai test test_ci' succeeded after 00 hours 05 minutes 45 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

# RHOAI: run skeleton test test_ci
PR_POSITIONAL_ARGS: ''
PR_POSITIONAL_ARG_0: skeleton-rhoai-light

@kpouget
Copy link
Contributor

kpouget commented Jan 16, 2025

/test rhoai-light fine_tuning fms gating use_smoke_os

Copy link

topsail-bot bot commented Jan 16, 2025

🟢 Test of 'rhoai test test_ci' succeeded after 00 hours 37 minutes 41 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

# RHOAI: run fine_tuning test test_ci
PR_POSITIONAL_ARGS: fms gating use_smoke_os
PR_POSITIONAL_ARG_0: fine_tuning-rhoai-light
PR_POSITIONAL_ARG_1: fms
PR_POSITIONAL_ARG_2: gating
PR_POSITIONAL_ARG_3: use_smoke_os

@kpouget
Copy link
Contributor

kpouget commented Jan 16, 2025

/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: true

Copy link

topsail-bot bot commented Jan 16, 2025

🔴 Test of 'fine_tuning test test_ci' failed after 00 hours 05 minutes 35 seconds. 🔴

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci
PR_POSITIONAL_ARG_1: fms
PR_POSITIONAL_ARG_2: gating
matbench.lts.opensearch.export.enabled: True

Failure indicator:

/logs/artifacts/004__fms_fine_tuning/000__fine_tuning__run_fine_tuning_job/FAILURE | [000__fine_tuning__run_fine_tuning_job] ./run_toolbox.py from_config fine_tuning run_fine_tuning_job --extra={'name': 'fine-tuning', 'pod_count': 1, 'model_name': 'bloom-560m', 'dataset_name': 'twitter_complaints_small.json', 'gpu': 1, 'dataset_replication': 1, 'dataset_response_template': '\n### Label:'} --> 2
/logs/artifacts/004__fms_fine_tuning/FAILURE | CalledProcessError: Command 'set -o errexit;set -o pipefail;set -o nounset;set -o errtrace;ARTIFACT_DIR="/logs/artifacts/004__fms_fine_tuning" ./run_toolbox.py from_config fine_tuning run_fine_tuning_job --extra="{'name': 'fine-tuning', 'pod_count': 1, 'model_name': 'bloom-560m', 'dataset_name': 'twitter_complaints_small.json', 'gpu': 1, 'dataset_replication': 1, 'dataset_response_template': '\n### Label:'}"' returned non-zero exit status 2.
Traceback (most recent call last):
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 156, in _run_test
    run.run_toolbox_from_config("fine_tuning", "run_fine_tuning_job",
  File "/opt/topsail/src/projects/core/library/run.py", line 49, in run_toolbox_from_config
    return run(f'{cmd_env} ./run_toolbox.py from_config {group} {command} {_dict_to_run_toolbox_args(kwargs)}', **run_kwargs)
  File "/opt/topsail/src/projects/core/library/run.py", line 105, in run
    proc = subprocess.run(command, **args)
  File "/usr/lib64/python3.9/subprocess.py", line 528, in run

[...]

@kpouget
Copy link
Contributor

kpouget commented Jan 16, 2025

/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: false
/var tests.fine_tuning.test_settings.gpu: 0
/only test_ci

Copy link

topsail-bot bot commented Jan 16, 2025

🔴 Test of 'fine_tuning test test_ci' failed after 00 hours 05 minutes 31 seconds. 🔴

• Link to the test results.

• Link to the reports index.

Test configuration:

PR_POSITIONAL_ARGS: jump-ci
PR_POSITIONAL_ARG_1: fms
PR_POSITIONAL_ARG_2: gating
matbench.lts.opensearch.export.enabled: False
tests.fine_tuning.test_settings.gpu: 0

Failure indicator:

/logs/artifacts/005__plots/FAILURE | An error happened during the visualization post-processing ... (opensearch secret cannot be generated)
/logs/artifacts/006__prom_plots/FAILURE | An error happened during the visualization post-processing ... (opensearch secret cannot be generated)
/logs/artifacts/FAILURE | Traceback (most recent call last):
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 410, in test
    failed = _run_test_and_visualize()
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 338, in _run_test_and_visualize
    generate_visualization(do_matbenchmarking, test_artifact_dir_p[0])
  File "/opt/topsail/src/projects/fine_tuning/testing/test_finetuning.py", line 389, in generate_visualization
    raise exc
  File "/opt/topsail/src/projects/core/library/run.py", line 178, in run_and_catch

[...]

@kpouget
Copy link
Contributor

kpouget commented Jan 20, 2025

/test jump-ci icelake fine_tuning fms gating
/var matbench.lts.opensearch.export.enabled: false
/var tests.fine_tuning.test_settings.gpu: 0
/only test_ci

Copy link

topsail-bot bot commented Jan 20, 2025

🟢 Test of 'fine_tuning test test_ci' succeeded after 00 hours 05 minutes 47 seconds. 🟢

• Link to the test results.

• Link to the reports index.

Test configuration:

{PR_POSITIONAL_ARGS: jump-ci, PR_POSITIONAL_ARG_1: fms, PR_POSITIONAL_ARG_2: gating,
  matbench.lts.opensearch.export.enabled: false, tests.fine_tuning.test_settings.gpu: 0}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ok-to-test Indicates a non-member PR verified by an org member that is safe to test.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants