Monte carlo simulations (non-ML use) and looping pipelines #2958
quantumtrope
started this conversation in
Idea
Replies: 2 comments 3 replies
-
Thanks for the questions! This is very interesting use of Kedro. Any chance you can jump on our slack.kedro.org to ask the question to see if the community have an answer for this? |
Beta Was this translation helpful? Give feedback.
3 replies
-
Hi @quantumtrope, this is a good question and one for which there's no "Kedro canonical" answer. It's a frequently requested feature, you have more context here #1606 In summary, either of the two options you proposed (modular pipelines or custom |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have a question about using Kedro in a non-ML setting. Specifically, I am trying to use Kedro in a data analysis (no learning) and statistical modeling/simulation pipeline. A simplified view of the use case is something like:
So far so good: Kedro defines these operations really nicely and keeps things tidy, along with visualizations and saving data for experiments. (keep in mind that in reality, steps 2 and 3 are probably 6-8 nodes long split across two or three pipelines in Kedro).
Now, the problem is that I need to explore a large space of input parameters. Like sweep an input parameter in 100 steps of log space from 1e-6 to 1e-4. So the (simplified) workflow now becomes:
I know Kedro wasn't built for this, but I want to highlight that the Kedro way is very amenable to general statistical modeling and simulation efforts that don't include ML.
My question is: what's the "Kedro canonical" way to do that? From initial attempts I can see one of two options:
(keep in mind this is also a simplified example, I probably have two or three variables that I want to loop over in similar ways, upping the amount of total pipelines to run to something like 10000+).
Beta Was this translation helpful? Give feedback.
All reactions