-
Notifications
You must be signed in to change notification settings - Fork 992
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
use arrow format as the memory model #5670
Comments
I doubt. Algos in DT are designed for particular memory layout that R uses for data.frame. Switching this to arrow means that we cannot easily switch between DF and DT without doing a copy. And of course a lot of code rewriting. That of course could be considered, but what benefits do you expect? Without presenting a good reason for that it is rather unlikely. |
Duplicate of #5656 A couple of months ago I looked into arrow -> data.table exporting. I couldn't find anything obvious as the two memory models are simply very different there is no way I can see around a one-time copying / materialization cost. |
It should also be noted that the Arrow libraries (Arrow C++ or arrow-rs or the arrow2 crate) can be much more expensive to build than the current data.table. The polars R package has been removed from CRAN due to build time issue (In other words, building polars takes longer than any R package on CRAN.). |
AFAIR @eddelbuettel can be good person to comment about building R's arrow package |
As the Python packages pandas and polars use arrow format as the memory model, any plan of having it in data.table?
The text was updated successfully, but these errors were encountered: