-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Oct 28, 2024: This week in DataFusion #13167
Comments
FYI Caching in DataFusion: Don't read twice is blocked by my corporate DNS likely because the cert is not from a common issuer. |
hmmm interesting, the blog is hosted on GitHub pages |
Maybe a good reason to repost them on the DataFusion blog 🤔 https://datafusion.apache.org/blog/ |
I have some questions / suggestions about the blob post so looking forward to a public forum where we can discuss 😄 |
@XiangpengHao would it be ok to port your blog's content to the DataFusion blog? And we could work together as a community to augment / extend it as needed? |
Of course! I'm more than happy to have it ported over, and I'm all ears for any feedback! |
I've added a comment section to the blog (by giscus), feel free to share your questions and suggestions there! Looking forward to the discussion 😄 |
Wow -- @Rachelint @jayzhan211 and @Dandandan have another non trivial improvement in grouping performance: #12996 👏 It is really impressive to see what can be done with such careful engineering |
Next week: #13265 |
Introduction
This ticket is a weekly summary of interesting things happening in DataFusion. Note this is not a complete list (it is what I remember / can find). Please feel free to leave comments on this ticket about things that I may have missed or you think should get wider attention by the community
Loosely inspired by https://this-week-in-rust.org/
DataFusion Related Blogs
Upcoming Releases
0.52.0
datafusion-sqlparser-rs#1423 (huge kudos to @iffyio for all the reviews)Major Projects / Discussions under way
Highlights from last week(s):
(I am sorry if I missed you -- please add a note to this ticket with anything you would like to highlight)
datafusion-contriub
: discuss: Move into the Apache ORC PMC and develop asapache/orc-rust
datafusion-contrib/datafusion-orc#120. Thanks @waynexia and @Xuanwolike
expressions (prefix matching) #12978 from @adirangbjoin_partial
phase #13111)Date32
/Date64
in aggregate fuzz testing #13041ntile
builtIn function to UDWF #13040 from @jatin510LogicalPlan::CrossJoin
as it is unused #13076 / extended log.rs tests for unary/binary and f32/f64 casting #13034Looking to get more involved? Try code review!
DataFusion has a long history of community members contributing in all aspects of the project. Reviewing PRs is an especially great way to get introduced to the project, help the community and grow your own knowledge -- researching and understanding the code enough to review PRs also often inspires additional ideas for improvements.
We have docs about reviews. TLDR is: look for test coverage, if the change is understandable and well documented, and if the code can be improved. When you think the PR looks good to merge, try
@
mentioning one of the committers.Help wanted
Please feel leave your own comments on the ticket if you are looking for help
Community
Upcoming meetups:
Background:
Previous update: #13035
Andrew's Focus Areas:
We are preparing for the 43.0.0 release and I am personally pretty excited about (and thus actively help / put to the top of my review list)
WindowFunction
Interface (remove built in list ofBuiltInWindowFunction
s) #8709 (very close to finishing thanks @jcsherin @jatin510)The text was updated successfully, but these errors were encountered: