Need smarter way to manage experiments #112

Open
nicholas-leonard opened this issue Jan 26, 2015 · 1 comment
@nicholas-leonard (Owner)

I am thinking we should launch multiple experiments in parallel from a single controller interface. The controller should allow the user to configure and launch experiments. Then the controller could be used to monitor and compare the different experiments.

For visualization we could use one of:

  • qt, but it's a pain to install
  • gnuplot, but it doesn't do much
  • gfx.js: nice visuals; uses JavaScript and the browser to render. Controlled via the command line.
  • iTorch: relatively new, with nice visuals. Uses IPython. By Facebook. Controlled inline. Generates nice docs.

What I like about iTorch is its potential use of notebooks for writing, viewing and interacting with tutorials and experimental reports.

The controller interface could also potentially be built as an iTorch notebook, which would allow users to more easily query data for analysis. This could be done with a simple Lua API: the user calls functions to render diagrams, tables, lists, structures, etc.

To minimize the impact of this change, we could provide functions that accept a script and its command-line arguments and run them on available resources. We could use the parallel package (which uses ssh) to execute the commands on different servers. All experiments would share a common storage space managed by the controller, but experimental data would remain partitioned: data for each experiment is saved in its own directory on the file system so that the controller can access it. This could be accomplished with something simple like rsync, or handled by something more complicated like a server running on each machine. Alternatively, all experiments could listen for incoming requests. For faster response, we could use threads-ffi to keep a pool of async fibers waiting for requests from the controller.
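A minimal sketch of the launch step described above, just to make the idea concrete. The host names, `th train.lua`, and the `/data/experiments` layout are all placeholder assumptions, not existing dp conventions; a real controller would presumably use the parallel package rather than raw `os.execute` over ssh.

```lua
-- Hypothetical controller launch loop: one experiment per server,
-- each writing to its own partition of a shared storage space.
local experiments = {
   {host = 'server1', args = '--learningRate 0.1'},
   {host = 'server2', args = '--learningRate 0.01'},
}

for i, exp in ipairs(experiments) do
   -- each experiment saves into its own directory under the shared space
   local saveDir = string.format('/data/experiments/exp%d', i)
   local cmd = string.format(
      "ssh %s 'th train.lua %s --save %s' &", exp.host, exp.args, saveDir)
   print(cmd)          -- the controller would record the exact command
   -- os.execute(cmd)  -- uncomment to actually launch on the remote server
end
```

With results partitioned per directory, the controller can later pull them back with something as simple as `rsync` for monitoring and comparison.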

I tried to implement something like this through the now closed async PR. But it was too complicated and imploded on me. Maybe now we have the tools to make this happen. But I fear I do not have the time.


Kaixhin commented Aug 16, 2015

I've come across the same thought several times, and even in an environment where only one experiment can run at a time, it would make exploration more systematic.

The design patterns within dp, along with the libraries you mentioned, sound like a good fit. After looking around for similar solutions, I came across some notes on Google's Sibyl system, which may be useful.

I'm going to give something a shot, but I'm going to base the web server on Node.js since that's where my experience lies. This still allows computation nodes and clients in Lua. Any basic functionality requests worth considering in a first iteration?
