Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhancement request: Apply Custom Time Series Functions #196

Open
BenLBurke opened this issue Apr 29, 2022 · 1 comment
Open

Enhancement request: Apply Custom Time Series Functions #196

BenLBurke opened this issue Apr 29, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@BenLBurke
Copy link

I've heard it is in development but wanted to add to the backlog.

I commonly find I am running multiple time-series related functions either with Numpy or utilizing tsfresh's library.

While these are useful, tsfresh's ability to run on spark is available, via their tsfresh.convenience.bindings.spark_feature_extraction_on_chunk feature, it is limited in it's capacity to truly run in SPARK as it performs an applyInPandas transformation.

Would love to see the ability to either utilize these functions or have custom created functions as well.

@rportilla-databricks
Copy link
Contributor

@BenLBurke , this is definitely an interesting request. Can you explain what functions you are looking for in particular? The reason we ask is because applyInPandas is the go-to method if the function you want to apply is completely custom. However, if there is something specific (1 or 2 features from tsfresh) that you like, we can optimize this by using PySpark directly under the hood. Let me know what you're looking for and we can prioritize it.

@tnixon tnixon added the enhancement New feature or request label May 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants