Shifu 0.2.5 Eval Step Scalability Improvement

In Shifu 0.2.4, 20 MM records in eval step, the running time is about 20 minutes. We found in the eval step, eval model scores are stored into HDFS and then read to get eval report. For some cases, we don't need to store such big data into HDFS.

One PR https://github.com/ShifuML/shifu/pull/74 from zhang7575 fix this issue. By default score dating is not persisted to HDFS in Shifu 0.2.5. This drops running time from 20 minutes to only 12 minutes with the same eval data set.

Shifu: A Distributed Model Training Framework on Hadoop

DOWNLOAD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shifu 0.2.5 Eval Step Scalability Improvement

Clone this wiki locally