Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Oct 28, 2024: This week in DataFusion #13167

Closed
1 of 3 tasks
alamb opened this issue Oct 29, 2024 · 9 comments
Closed
1 of 3 tasks

Oct 28, 2024: This week in DataFusion #13167

alamb opened this issue Oct 29, 2024 · 9 comments

Comments

@alamb
Copy link
Contributor

alamb commented Oct 29, 2024

Introduction

This ticket is a weekly summary of interesting things happening in DataFusion. Note this is not a complete list (it is what I remember / can find). Please feel free to leave comments on this ticket about things that I may have missed or you think should get wider attention by the community

Loosely inspired by https://this-week-in-rust.org/

DataFusion Related Blogs

Upcoming Releases

Major Projects / Discussions under way

Highlights from last week(s):

(I am sorry if I missed you -- please add a note to this ticket with anything you would like to highlight)

Looking to get more involved? Try code review!

DataFusion has a long history of community members contributing in all aspects of the project. Reviewing PRs is an especially great way to get introduced to the project, help the community and grow your own knowledge -- researching and understanding the code enough to review PRs also often inspires additional ideas for improvements.

We have docs about reviews. TLDR is: look for test coverage, if the change is understandable and well documented, and if the code can be improved. When you think the PR looks good to merge, try @ mentioning one of the committers.

Help wanted

Please feel leave your own comments on the ticket if you are looking for help

Community

Upcoming meetups:

Background:

Previous update: #13035

Andrew's Focus Areas:

We are preparing for the 43.0.0 release and I am personally pretty excited about (and thus actively help / put to the top of my review list)

@alamb alamb pinned this issue Oct 29, 2024
@Omega359
Copy link
Contributor

FYI Caching in DataFusion: Don't read twice is blocked by my corporate DNS likely because the cert is not from a common issuer.

@XiangpengHao
Copy link
Contributor

blocked by my corporate DNS likely because the cert is not from a common issuer.

hmmm interesting, the blog is hosted on GitHub pages

@alamb
Copy link
Contributor Author

alamb commented Oct 29, 2024

Maybe a good reason to repost them on the DataFusion blog 🤔 https://datafusion.apache.org/blog/

@adriangb
Copy link
Contributor

I have some questions / suggestions about the blob post so looking forward to a public forum where we can discuss 😄

@alamb
Copy link
Contributor Author

alamb commented Oct 29, 2024

@XiangpengHao would it be ok to port your blog's content to the DataFusion blog? And we could work together as a community to augment / extend it as needed?

@XiangpengHao
Copy link
Contributor

@XiangpengHao would it be ok to port your blog's content to the DataFusion blog? And we could work together as a community to augment / extend it as needed?

Of course! I'm more than happy to have it ported over, and I'm all ears for any feedback!

@XiangpengHao
Copy link
Contributor

I have some questions / suggestions about the blob post so looking forward to a public forum where we can discuss 😄

I've added a comment section to the blog (by giscus), feel free to share your questions and suggestions there! Looking forward to the discussion 😄

@alamb
Copy link
Contributor Author

alamb commented Oct 31, 2024

Wow -- @Rachelint @jayzhan211 and @Dandandan have another non trivial improvement in grouping performance: #12996

👏

It is really impressive to see what can be done with such careful engineering

@alamb
Copy link
Contributor Author

alamb commented Nov 5, 2024

Next week: #13265

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants