Refactor JobsCrawler to add include_job_ids #3658
base: main
✅ 29/29 passed, 5 skipped, 38m22s total
Running from acceptance #8227
@mohanab-db: Thank you for reporting this issue and resolving it!
I have added some minor comments. A more fundamental implementation question: what is the bottleneck when crawling all the jobs?
Is the bottleneck:
- Listing all the jobs
- Listing all clusters
- Combining jobs with clusters
- Assessing the jobs (actually we only assess the clusters related to the jobs)
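One way to answer this question would be to time each of the four phases separately. The sketch below is only an illustration: `timed`, `profile_crawl`, and the four callables passed to it are hypothetical placeholders, not functions from the UCX code base.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(timings, phase):
    """Record the wall-clock seconds spent inside the `with` block under `phase`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[phase] = timings.get(phase, 0.0) + (time.perf_counter() - start)


def profile_crawl(list_jobs, list_clusters, combine, assess):
    """Run the four phases in order and return (result, timings).

    All four callables are hypothetical stand-ins for the real crawler steps.
    """
    timings = {}
    with timed(timings, "list_jobs"):
        jobs = list(list_jobs())  # force lazy iterators so the time is attributed here
    with timed(timings, "list_clusters"):
        clusters = list(list_clusters())
    with timed(timings, "combine"):
        pairs = list(combine(jobs, clusters))
    with timed(timings, "assess"):
        result = list(assess(pairs))
    return result, timings
```

Wrapping each phase in `list(...)` matters if the listing calls return lazy iterators; otherwise the cost of listing would be attributed to whichever later phase consumes them.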
@@ -94,9 +94,10 @@ def _job_clusters(job: BaseJob) -> Iterable[tuple[BaseJob, ClusterSpec]]:


 class JobsCrawler(CrawlerBase[JobInfo], JobsMixin, CheckClusterMixin):
-    def __init__(self, ws: WorkspaceClient, sql_backend: SqlBackend, schema):
+    def __init__(self, ws: WorkspaceClient, sql_backend: SqlBackend, schema, include_job_ids: list[int] | None = None):
Suggested change:
-    def __init__(self, ws: WorkspaceClient, sql_backend: SqlBackend, schema, include_job_ids: list[int] | None = None):
+    def __init__(self, ws: WorkspaceClient, sql_backend: SqlBackend, schema, *, include_job_ids: list[int] | None = None):
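The bare `*` in the suggested signature makes `include_job_ids` keyword-only, so callers cannot pass it positionally by accident. A minimal sketch of the effect, using a hypothetical stand-in class (`JobsCrawlerSketch`) rather than the real `JobsCrawler`:

```python
from __future__ import annotations


class JobsCrawlerSketch:
    """Hypothetical stand-in; only the constructor signature mirrors the suggestion."""

    def __init__(self, ws, sql_backend, schema, *, include_job_ids: list[int] | None = None):
        self.ws = ws
        self.sql_backend = sql_backend
        self.schema = schema
        self.include_job_ids = include_job_ids


# Passing the argument by keyword works.
crawler = JobsCrawlerSketch("ws", "backend", "schema", include_job_ids=[1, 2])

# Passing it positionally raises TypeError because of the `*` marker.
try:
    JobsCrawlerSketch("ws", "backend", "schema", [1, 2])
except TypeError as err:
    positional_error = err
```

Keyword-only arguments also leave room to add further optional parameters later without breaking existing call sites.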
Changes
Check for include_job_ids in assess_jobs task
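A rough sketch of what such a check could look like, assuming the crawler has access to an object with `list()` and `get(job_id)` methods (the Databricks SDK jobs API has methods with these names, but `select_jobs` and the way it is wired in here are hypothetical and not taken from this PR):

```python
from __future__ import annotations

from typing import Any


def select_jobs(jobs_api: Any, include_job_ids: list[int] | None = None) -> list[Any]:
    """Return only the requested jobs, or every job when no filter is given.

    `jobs_api` is assumed to expose `list()` and `get(job_id)`.
    An empty list is treated as "no jobs", not as "all jobs".
    """
    if include_job_ids is None:
        # No filter configured: fall back to crawling every job.
        return list(jobs_api.list())
    # Filter configured: fetch each job individually and skip the full listing.
    return [jobs_api.get(job_id) for job_id in include_job_ids]
```

This sketch does not handle job IDs that no longer exist; a real implementation would need to decide whether to skip or report them.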
Linked issues
#3656
Resolves #..
Functionality
databricks labs ucx ...
...
...
Tests