
[WIP] Add training loops #599

Open
wants to merge 4 commits into master
Conversation

Joshua-Chin (Contributor)

This adds a convenience class that helps train and evaluate networks, using a scikit-learn style interface.
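The PR's code is not reproduced here, but for context, a minimal sketch of what such a scikit-learn-style wrapper around a Lasagne network might look like (the class name, constructor arguments, and full-batch training are hypothetical simplifications, not the actual PR code):

```python
import theano
import theano.tensor as T
import lasagne

class NeuralNetwork:
    """Wraps a single-input, single-output Lasagne network behind a
    scikit-learn-style fit/predict interface (illustrative sketch only)."""

    def __init__(self, output_layer,
                 loss=lasagne.objectives.categorical_crossentropy,
                 update=lasagne.updates.adam):
        # The first layer in topological order is the InputLayer.
        input_var = lasagne.layers.get_all_layers(output_layer)[0].input_var
        target_var = T.ivector('targets')
        # Training expression: stochastic output (e.g. dropout active).
        prediction = lasagne.layers.get_output(output_layer)
        train_loss = loss(prediction, target_var).mean()
        params = lasagne.layers.get_all_params(output_layer, trainable=True)
        updates = update(train_loss, params)
        self._train = theano.function([input_var, target_var], train_loss,
                                      updates=updates)
        # Prediction expression: deterministic output (dropout disabled).
        test_prediction = lasagne.layers.get_output(output_layer,
                                                    deterministic=True)
        self._predict = theano.function([input_var], test_prediction)

    def fit(self, X, y, epochs=10):
        for _ in range(epochs):
            self._train(X, y)  # full-batch updates; minibatching omitted
        return self

    def predict(self, X):
        return self._predict(X)
```

Usage would then follow the familiar scikit-learn pattern, e.g. `NeuralNetwork(output_layer).fit(X_train, y_train).predict(X_test)`.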

benanne (Member) commented Feb 4, 2016

This is great, but I'm not sure if we would want something like this in the main library at this point. I'm open to discussing it, though! Note that there is also nolearn, which provides the same functionality.

I still plan to come up with some training loop utilities that don't involve encapsulating the model into a class, which tends to fit my own workflow better -- but I guess that's mostly a matter of preference. Hopefully I can draft a proposal for that sometime soon (I've been postponing it for a while now...)

Hopefully I'll have some time to take a closer look at this over the weekend. At a glance, it seems like it might be fairly limited in scope (e.g. only supporting (X, y)-style problems, i.e. supervised learning), and we might want something more broadly applicable. There is a fairly lengthy discussion at #12 which is probably worth (re-)reading, as it touches on some of these concerns.

f0k (Member) commented Feb 4, 2016

Same here... we'd love to have some generic training loop code in Lasagne to make it easier to get started, but we need to get it right! That means the following restrictions in your proposal would need to be lifted:

  • restriction to "single input, single output" network
  • restriction to "X -> y" problems
  • restriction to datasets that fit into host memory at once

The toughest part is probably that the training loop should cooperate with any kind of data iterator. We can provide one for the simple case of iterating over a numpy array, but beyond that it should support whatever people have cooked up already. This will still need some discussion.
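For the simple numpy-array case, such an iterator could look along the lines of the `iterate_minibatches` helper from Lasagne's MNIST example:

```python
import numpy as np

def iterate_minibatches(inputs, targets, batchsize, shuffle=False):
    """Yield minibatches of (inputs, targets), optionally in shuffled order.
    Drops the final incomplete batch, as in the MNIST example."""
    assert len(inputs) == len(targets)
    if shuffle:
        indices = np.arange(len(inputs))
        np.random.shuffle(indices)
    for start_idx in range(0, len(inputs) - batchsize + 1, batchsize):
        if shuffle:
            excerpt = indices[start_idx:start_idx + batchsize]
        else:
            excerpt = slice(start_idx, start_idx + batchsize)
        yield inputs[excerpt], targets[excerpt]
```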

Thank you for bringing this back to our attention, though, and feel free to add your two cents to #12! Meanwhile, nolearn provides what you proposed and then some.

Joshua-Chin (Contributor, Author)

Generic training loop code would be great for Lasagne, but common use cases should be easy. Many applications of neural networks can be phrased as "X -> y" problems.

I have also been working on a more generic batch iterator that handles both numpy arrays and generators over individual examples. I plan to open a pull request for it sometime this coming week.
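That iterator is not shown in this thread; a hypothetical sketch of the idea (function name and semantics assumed, not the actual proposed code) might dispatch on whether the input is indexable:

```python
import numpy as np

def iterate_batches(data, batchsize):
    """Yield batches of up to `batchsize` examples from `data`, which may be
    a numpy array (sliced directly) or a generator of individual examples
    (accumulated and stacked into arrays)."""
    if hasattr(data, '__getitem__'):
        # Indexable container such as a numpy array: slice directly.
        for start in range(0, len(data), batchsize):
            yield data[start:start + batchsize]
    else:
        # Generator of single examples: accumulate, then stack.
        batch = []
        for example in data:
            batch.append(example)
            if len(batch) == batchsize:
                yield np.asarray(batch)
                batch = []
        if batch:  # final, possibly smaller batch
            yield np.asarray(batch)
```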

benanne (Member) commented Feb 9, 2016

I agree that (X, y) should be the default, but if we add this to the main library it should definitely be more flexible than that.
