Revamping the Learn API #102

Closed

eaplatanios opened this issue Apr 28, 2018 · 3 comments

@eaplatanios (Owner)

This is a place to discuss issues with the current design of the learn API and ways to improve it, so that we can have a new implementation ready (hopefully) by the end of May.

The main issues I see currently are:

  • The Model class feels quite awkward to newcomers. Is there a nicer way to define models while staying flexible?
  • There are some issues with Layers. Implementing a new layer adds overhead and some boilerplate, even for simple models. We should probably allow plain functions to be usable as layers (see the sketch after this list). Also, variable scopes and the rest of the op creation context do not work well with layers at this point.
  • How should we manage the variables that layers create and use? How can we allow for regularization and constraints on those variables? @mandar2812, let's move the discussion from Regularization in Tensorflow Scala models #88 here.
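
As a rough illustration of the "functions as layers" idea above, here is a minimal, self-contained sketch. The Layer trait below is a simplified stand-in rather than the actual tf.learn.Layer API, and fromFunction is a hypothetical helper:

    object FunctionLayerSketch {
      // A layer is just a named transformation from T to R.
      trait Layer[T, R] { self =>
        def name: String
        def apply(input: T): R

        // Composition operator mirroring the existing `>>` style.
        def >>[S](next: Layer[R, S]): Layer[T, S] = new Layer[T, S] {
          val name: String = s"${self.name} >> ${next.name}"
          def apply(input: T): S = next(self(input))
        }
      }

      // Lift a plain function into a layer, avoiding the boilerplate of
      // defining a new Layer subclass for simple transformations.
      def fromFunction[T, R](layerName: String)(f: T => R): Layer[T, R] =
        new Layer[T, R] {
          val name: String = layerName
          def apply(input: T): R = f(input)
        }

      // Usage: layers defined inline from function literals.
      val double = fromFunction[Float, Float]("Double")(_ * 2.0f)
      val addOne = fromFunction[Float, Float]("AddOne")(_ + 1.0f)
      val model = double >> addOne // model(3.0f) == 7.0f
    }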

@sbrunk Do you have any thoughts based on use cases you might have tried?

@sbrunk (Contributor) commented Apr 29, 2018

I have only created a few relatively simple sequential CNNs for image classification so far, so I can really only comment on the first point, about the Model class, and even there I've only used one variant.

When I tried it for the first time, I was a bit confused by the Model constructor parameters, especially trainInput and trainInputLayer. It took a while until I realized that they are just the dual of the feature input, but for the labels.

Looking at it now, I actually like how it's done, because it really is quite flexible. I think the main issue is missing documentation (I'd be happy to help where I can). Without it, all the overloaded model constructor variants and type parameter lists like [IT, IO, IDA, ID, IS, I, TT, TO, TDA, TD, TS, T] can be a bit scary. :)
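
For reference, here's a rough sketch of the sequential variant I used, adapted from the MNIST example; the exact constructor signature and layer arguments are from memory, so treat them as approximate rather than the actual API:

    import org.platanios.tensorflow.api._

    // Feature input (MNIST images) and its pipeline of layers.
    val input = tf.learn.Input(UINT8, Shape(-1, 28, 28))
    val layer = tf.learn.Flatten("Input/Flatten") >>
        tf.learn.Linear("Layer_0", 128) >>
        tf.learn.ReLU("Layer_0/Activation") >>
        tf.learn.Linear("OutputLayer", 10)

    // The label input: the dual of the feature input, plus the layer
    // applied to the labels before they reach the loss.
    val trainInput = tf.learn.Input(UINT8, Shape(-1))
    val trainInputLayer = tf.learn.Cast("TrainInput/Cast", INT64)

    val loss = tf.learn.SparseSoftmaxCrossEntropy("Loss/CrossEntropy") >>
        tf.learn.Mean("Loss/Mean")
    val optimizer = tf.learn.GradientDescent(1e-6)

    val model = tf.learn.Model(input, layer, trainInput, trainInputLayer, loss, optimizer)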

It might also make sense to create convenience functions for patterns that are often used this way, like common loss pipelines. So instead of

val loss = tf.learn.SparseSoftmaxCrossEntropy("Loss/CrossEntropy") >>
    tf.learn.Mean("Loss/Mean") >>
    tf.learn.ScalarSummary("Loss/Summary", "Loss")

you could just do something like val loss = SoftmaxCrossEntropyLoss()

You could still use the first variant to compose it yourself if you need to; the second would just be a wrapper for common cases.
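
A sketch of what such a wrapper could look like. SoftmaxCrossEntropyLoss and its namePrefix parameter are hypothetical names, and the body is just the composition from above:

    import org.platanios.tensorflow.api._

    // Hypothetical convenience constructor; the body is exactly the
    // loss composition shown above, with a configurable name prefix.
    def SoftmaxCrossEntropyLoss(namePrefix: String = "Loss") =
      tf.learn.SparseSoftmaxCrossEntropy(s"$namePrefix/CrossEntropy") >>
        tf.learn.Mean(s"$namePrefix/Mean") >>
        tf.learn.ScalarSummary(s"$namePrefix/Summary", namePrefix)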

@eaplatanios (Owner, Author)

I will close this because plans have changed significantly. I will soon publish a proposal for a lot of new features I've been working on over the summer, involving compile-time support for TensorFlow, similar to what Swift for TensorFlow does, but (hopefully :)) with much stronger type guarantees.

@eaplatanios (Owner, Author)

@sbrunk The type parameter lists like [IT, IO, IDA, ID, IS, I, TT, TO, TDA, TD, TS, T] have now been simplified dramatically with #131. This is a good example. :)
