[DOP-23968] Add .get_exlude_packages() to SparkS3 and Kafka #341

dolfinus · 2025-02-18T11:20:17Z

Change Summary

SparkS3.get_packages() and Kafka.get_packages() return a list of packages that have compile-scope dependencies on Hadoop client packages, which are already part of the Spark/PySpark bundle. In addition, SparkS3 uses the spark-cloud package, which depends on many cloud clients (GCP, Azure and so on) that are not required here.

Added a .get_exclude_packages() method to both of these classes. It relies on the spark.jars.excludes mechanism to tell Ivy2 which transitive packages should be excluded.
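A minimal sketch of how the two lists could be wired into Spark options. The helper name build_spark_jars_config and the Maven coordinates below are illustrative assumptions, not the actual values returned by get_packages() or get_exclude_packages(). The only Spark-specific facts used are the spark.jars.packages and spark.jars.excludes options, both of which take comma-separated lists (excludes in groupId:artifactId form).

```python
from typing import Dict, List


def build_spark_jars_config(packages: List[str], excludes: List[str]) -> Dict[str, str]:
    """Build Spark options that resolve `packages` while skipping `excludes`.

    Hypothetical helper for illustration; not part of the library.
    """
    # spark.jars.packages expects comma-separated groupId:artifactId:version coordinates
    config = {"spark.jars.packages": ",".join(packages)}
    if excludes:
        # spark.jars.excludes expects comma-separated groupId:artifactId pairs
        config["spark.jars.excludes"] = ",".join(excludes)
    return config


# Illustrative values only: stand-ins for whatever the connection classes return.
packages = ["org.apache.spark:spark-hadoop-cloud_2.12:3.5.0"]
excludes = [
    "com.google.cloud.bigdataoss:gcs-connector",
    "org.apache.hadoop:hadoop-azure",
]

spark_config = build_spark_jars_config(packages, excludes)
for key, value in spark_config.items():
    print(f"{key}={value}")
```

The resulting dictionary could then be passed option by option to a SparkSession builder via .config(key, value), so that Ivy2 skips the listed transitive dependencies during resolution.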

Related issue number

Checklist

Commit message and PR title are comprehensive
Keep the change as small as possible
Unit and integration tests for the changes exist
Tests pass on CI and coverage does not decrease
Documentation reflects the changes where applicable
docs/changelog/next_release/<pull request or issue id>.<change type>.rst file added describing change
(see CONTRIBUTING.rst for details.)
My PR is ready to review.

codecov · 2025-02-18T11:28:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 92.00%. Comparing base (60aea9e) to head (7e9941a).
Report is 1 commit behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #341      +/-   ##
===========================================
- Coverage    92.02%   92.00%   -0.03%     
===========================================
  Files          228      228              
  Lines         9867     9875       +8     
  Branches      1013     1013              
===========================================
+ Hits          9080     9085       +5     
- Misses         599      601       +2     
- Partials       188      189       +1


dolfinus self-assigned this Feb 18, 2025

dolfinus force-pushed the feature/DOP-23968 branch from 28f8309 to 47d9c2f Compare February 18, 2025 12:43

dolfinus temporarily deployed to test-pypi February 18, 2025 12:43 — with GitHub Actions Inactive

dolfinus requested review from TiGrib and IlyasDevelopment February 18, 2025 12:58

dolfinus marked this pull request as ready for review February 18, 2025 12:58

IlyasDevelopment approved these changes Feb 18, 2025


[DOP-23968] Add .get_exlude_packages() to SparkS3 and Kafka

7e9941a

dolfinus force-pushed the feature/DOP-23968 branch from 47d9c2f to 7e9941a Compare February 18, 2025 15:02

dolfinus temporarily deployed to test-pypi February 18, 2025 15:02 — with GitHub Actions Inactive

dolfinus enabled auto-merge (rebase) February 18, 2025 15:07

dolfinus merged commit b9a19af into develop Feb 18, 2025
38 checks passed

dolfinus deleted the feature/DOP-23968 branch February 18, 2025 15:10
