Pure python solution #3

abhishekkrthakur · 2021-02-08T16:46:33Z

Here is a solution that I thought of. Runs in 5.5 seconds on my machine:

import glob
import os
import time

start = time.time()

path = "../data/"
# Collect every CSV file in the data directory.
all_files = glob.glob(os.path.join(path, "*.csv"))

cols = ["f_0", "f_1", "f_2", "f_3", "f_4", "f_5", "f_6", "f_7", "f_8", "f_9", "target"]
with open("out.csv", "w") as out:
    # Write the header once, then append the data rows of every file.
    out.write("%s\n" % ",".join(cols))
    for filename in all_files:
        with open(filename) as f:
            for idx, line in enumerate(f):
                # Skip each file's own header line (idx == 0).
                if idx > 0:
                    out.write(line)
end = time.time()
print(end - start)
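
The same idea can also be sketched as a reusable function that skips each header with one `readline()` and copies the rest of the file in chunks instead of line by line. This is only a sketch, not something timed in this thread: the name `concat_csvs` and its parameters are hypothetical, and it assumes every input file starts with exactly one header line, ends with a newline, and that the output path does not match the input pattern.

```python
import glob
import shutil


def concat_csvs(pattern, out_path, header):
    """Concatenate the CSV files matching `pattern` into `out_path`.

    Hypothetical helper. Assumes each input file has exactly one header
    line and ends with a newline, and that `out_path` does not match
    `pattern` (otherwise the output would be read as an input).
    """
    with open(out_path, "wb") as out:
        # Write the shared header once.
        out.write((",".join(header) + "\n").encode("utf-8"))
        for filename in sorted(glob.glob(pattern)):
            with open(filename, "rb") as f:
                f.readline()  # skip this file's own header line
                shutil.copyfileobj(f, out)  # copy the remaining bytes in chunks
```

Usage would look like `concat_csvs("../data/*.csv", "out.csv", cols)`. Whether this is actually faster than the line loop depends on the machine and the files, so it is worth timing both.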


sbarthwal · 2021-02-08T20:00:58Z

@abhishekkrthakur, could you please share your machine configuration?
The same script takes 124.09613084793091 s on my machine.
My machine configuration:
MacBook Pro (15-inch, 2019)
2.3 GHz Intel Core i9
16 GB 2400 MHz DDR4
Radeon Pro 560X 4 GB
Intel UHD Graphics 630 1536 MB

Thank you

abhishekkrthakur · 2021-02-08T20:19:32Z

Core i7, 32 GB RAM, but that shouldn't matter. 124 s versus 5 s is a huge difference! Something else is wrong.
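
One way to narrow down a gap like this is to time the read side on its own, file by file. The sketch below is only a diagnostic idea, not something either of us ran: the name `time_line_reads` is hypothetical, and it assumes the inputs are plain text files that can be iterated line by line.

```python
import glob
import time


def time_line_reads(pattern):
    """Return a list of (filename, seconds, line_count) per matching file.

    Hypothetical diagnostic helper: it only reads each file line by line
    and writes nothing, so it isolates read cost from write cost.
    """
    results = []
    for filename in sorted(glob.glob(pattern)):
        start = time.perf_counter()
        with open(filename) as f:
            n_lines = sum(1 for _ in f)  # iterate lines without storing them
        results.append((filename, time.perf_counter() - start, n_lines))
    return results
```

If `time_line_reads("../data/*.csv")` finishes quickly on the slow machine, reading is probably not the bottleneck and the write to `out.csv` would be the next thing to time.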
