
The container image of spark task should be immutable #2956

Merged
merged 8 commits into from
Nov 27, 2024

Conversation

pingsutw
Member

Tracking issue

NA

Why are the changes needed?

flytekit overrides the default_executor_path when the base_image of an ImageSpec is None.

When a workflow contains two Spark tasks, flytekit overrides the default_executor_path only for the first task. This happens because flytekit mutates the base_image of the shared ImageSpec while compiling the first task. As a result, when the second task is compiled, the base_image is no longer None, which prevents flytekit from overriding the default_executor_path for that task.

What changes were proposed in this pull request?

Deep copy the image spec and modify the copy, so the ImageSpec passed in by the user stays immutable.
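The failure mode and the fix can be illustrated with a minimal sketch. The `ImageSpec` dataclass and `compile_task_*` functions below are hypothetical simplifications for illustration, not flytekit's real internals: mutating a shared spec during the first compilation changes what the second compilation sees, while deep-copying first leaves the caller's spec untouched.

```python
from copy import deepcopy
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImageSpec:
    # Simplified stand-in for flytekit's ImageSpec.
    base_image: Optional[str] = None


def compile_task_buggy(spec: ImageSpec) -> str:
    # Mutates the shared spec in place: after the first call sets
    # base_image, every later call sees a non-None value and skips
    # the default_executor_path override.
    if spec.base_image is None:
        spec.base_image = "spark-base"
        return "override applied"
    return "override skipped"


def compile_task_fixed(spec: ImageSpec) -> str:
    # Work on a deep copy, so the caller's spec is never mutated and
    # every task compilation sees the original base_image=None.
    spec = deepcopy(spec)
    if spec.base_image is None:
        spec.base_image = "spark-base"
        return "override applied"
    return "override skipped"


shared = ImageSpec()
print(compile_task_buggy(shared))  # first task: override applied
print(compile_task_buggy(shared))  # second task: override skipped (bug)

shared = ImageSpec()
print(compile_task_fixed(shared))  # first task: override applied
print(compile_task_fixed(shared))  # second task: override applied (fixed)
```

With the deep copy, compiling one task has no side effect on the shared spec, so both Spark tasks in the workflow below get the overridden executor path.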

How was this patch tested?

import datetime
import random
import time
from operator import add

import flytekit
from flytekit import ImageSpec, Resources, task, workflow
from flytekitplugins.spark import Spark

custom_image = ImageSpec(registry="ghcr.io/flyteorg", packages=["flytekitplugins-spark"])


@task(
    task_config=Spark(
        # This configuration is applied to the Spark cluster
        spark_conf={
            "spark.driver.memory": "1000M",
            "spark.executor.memory": "1000M",
            "spark.executor.cores": "1",
            "spark.executor.instances": "2",
            "spark.driver.cores": "1",
            "spark.ui.proxyRedirectUri": "https://dogfood.cloud-staging.union.ai",
            "spark.jars": "https://storage.googleapis.com/hadoop-lib/gcs/gcs-connector-hadoop3-latest.jar",
        }
    ),
    limits=Resources(mem="2000M"),
    container_image=custom_image,
)
def hello_spark1(partitions: int) -> float:
    print("Starting Spark with Partitions: {}".format(partitions))

    n = 1 * partitions
    sess = flytekit.current_context().spark_session
    count = sess.sparkContext.parallelize(range(1, n + 1), partitions).map(f).reduce(add)

    pi_val = 4.0 * count / n
    time.sleep(360)
    return pi_val


def f(_):
    x = random.random() * 2 - 1
    y = random.random() * 2 - 1
    return 1 if x**2 + y**2 <= 1 else 0


@task(
    task_config=Spark(
        # This configuration is applied to the Spark cluster
        spark_conf={
            "spark.driver.memory": "1000M",
            "spark.executor.memory": "1000M",
            "spark.executor.cores": "1",
            "spark.executor.instances": "2",
            "spark.driver.cores": "1",
            "spark.ui.proxyRedirectUri": "https://dogfood.cloud-staging.union.ai",
            "spark.jars": "https://storage.googleapis.com/hadoop-lib/gcs/gcs-connector-hadoop3-latest.jar",
        }
    ),
    limits=Resources(mem="2000M"),
    container_image=custom_image,
)
def hello_spark2(partitions: int) -> float:
    print("Starting Spark with Partitions: {}".format(partitions))

    n = 1 * partitions
    sess = flytekit.current_context().spark_session
    count = sess.sparkContext.parallelize(range(1, n + 1), partitions).map(f).reduce(add)

    pi_val = 4.0 * count / n
    time.sleep(360)
    return pi_val


@workflow
def my_spark2(triggered_date: datetime.datetime = datetime.datetime.now()) -> float:
    """
    Using the workflow is still as any other workflow. As image is a property of the task, the workflow does not care
    about how the image is configured.
    """
    pi1 = hello_spark1(partitions=1)
    pi2 = hello_spark2(partitions=1)
    return pi1

Check all the applicable boxes

  • I updated the documentation accordingly.
  • All new and existing tests passed.
  • All commits are signed-off.

Related PRs

NA

Docs link

NA

Signed-off-by: Kevin Su <[email protected]>
@eapolinario
Collaborator

the failures in the different plugins tests are real. Can you take a look?

@pingsutw
Member Author

> the failures in the different plugins tests are real. Can you take a look?

fixed it

eapolinario previously approved these changes Nov 26, 2024

codecov bot commented Nov 26, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 46.62%. Comparing base (fdb7676) to head (b03fa42).
Report is 2 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #2956      +/-   ##
==========================================
- Coverage   51.25%   46.62%   -4.63%     
==========================================
  Files         200      200              
  Lines       20835    20851      +16     
  Branches     2688     2691       +3     
==========================================
- Hits        10678     9722     -956     
- Misses       9559    10652    +1093     
+ Partials      598      477     -121     


@pingsutw pingsutw merged commit d5ea440 into master Nov 27, 2024
104 checks passed