Text to Columns: allow more than 1000 records as input #274
Comments
It works for me - and I don't see a reason why it would fail. But then, I only tried with a file with 1200 lines of |
Sure. The dataset is included in the attached workflow as a URL on Google Drive. It's a list of 5499 delayed train rides, where each ride has a list of comma-separated station abbreviations that we want to run through Text to Columns. I have to correct myself, though: the maximum number of rows that can be handled appears to be 5000, not 1000. In the attached workflow, if you set Data Sampler to sample 5001 or more rows, or if you bypass Data Sampler, the following error message pops up:
Thank you, this one was great. If you need a quick fix (God knows when we'll release the next version of Prototypes), find the file
to
That is, add
It took me 10 minutes to fix this, and more than an hour to understand why this is sometimes needed. In short, the table is constructed in batches of 5000 rows (I didn't know this!), and the widget failed if some station code never appeared in some batch. Interestingly, just yesterday I encountered the very same problem in a completely different place.
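The failure mode described above can be illustrated with a minimal sketch. This is not the widget's actual code or the actual fix: the function names, the station codes, and the batch size of 2 (standing in for 5000) are all hypothetical. It only shows why deriving the column set separately for each batch gives inconsistent results when a value is missing from one batch, and how fixing the column set up front avoids that.

```python
from itertools import islice


def batches(rows, size):
    """Yield consecutive lists of at most `size` rows."""
    it = iter(rows)
    while chunk := list(islice(it, size)):
        yield chunk


def split_codes(text, sep=","):
    """Split a delimited string into stripped, non-empty codes."""
    return [part.strip() for part in text.split(sep) if part.strip()]


def columns_per_batch(rows, size):
    """Naive approach: derive the column set from each batch separately."""
    return [
        sorted({code for row in chunk for code in split_codes(row)})
        for chunk in batches(rows, size)
    ]


def indicator_table(rows, size):
    """Robust approach: fix the column set first, then fill batch by batch."""
    columns = sorted({code for row in rows for code in split_codes(row)})
    table = []
    for chunk in batches(rows, size):
        for row in chunk:
            present = set(split_codes(row))
            table.append([int(code in present) for code in columns])
    return columns, table


# Hypothetical station codes; "KP" appears only in the second batch.
rides = ["LJ,MB", "LJ,CE", "MB,CE", "LJ,KP"]

per_batch = columns_per_batch(rides, size=2)
columns, table = indicator_table(rides, size=2)

print(per_batch)  # the two batches disagree on the set of columns
print(columns)    # one shared column set for the whole table
```

With the naive approach, the first batch never sees `KP`, so its rows are one column short compared with the second batch; with the shared column set, every row has the same width regardless of which batch it came from.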
5000 seems a strange number in the context of computing - 4096 would have surprised me less ...
True. :) But in this particular context there is no advantage in using round numbers like 4096, so any arbitrary number, including 5000, is OK. :)
Prototypes version
0.21.1
Orange version
3.36.2
Expected behavior
Text to Columns works with an arbitrary number of rows
Actual behavior
Text to Columns crashes when it receives more than 1000 rows at its input
Steps to reproduce the behavior
Connect a suitable data file to the input of Text to Columns that has more than 1000 rows
Additional info (worksheets, data, screenshots, ...)