WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough #562

daydreamt · 2020-04-06T20:24:52Z

Hi all, since it's been a while I thought I should maybe give a sign of life and continue from here. This tries to implement #559.

There are still some things I haven't figured out myself yet, so I was planning to only request the review when I'm more ready, but of course feel free to take a look if you want already.

…more reading on relaxed_categorical and transformations of distributions instead

…w temperatures.

…adient work

…pansion where needed

…ing the shape tests;

fehiepsi · 2020-04-07T17:55:38Z

Hi @daydreamt , thanks for the PR! I think the main blocker of your work would be to define custom derivative rules for some of your operators. I'll update the repo to the latest JAX version today to unblock your work.

fehiepsi · 2020-04-08T02:11:08Z

@daydreamt FYI, I think @tbsexton only needs RelaxedOneHotCategorical (or GumbelSoftmax) in his feature request because he wanted to use MCMC (instead of SVI) to draw samples from the relaxed distribution. @tbsexton could you confirm that StraightThrough is not required?

rtbs-dev · 2020-04-08T13:12:26Z

@fehiepsi I was originally only using HMC, though planning to test out SVI as well. As long as not having access to a backward pass doesn't preclude using NUTS for inference of latent variables, should work!

This is in practice a work-around for not having discrete latent variables; see my original example problem here.

fehiepsi · 2020-04-12T17:01:37Z

Thanks, @tbsexton! In your model, you want to infer each ϕ for each cascade, so I guess you can replace

ϕ = ny.sample("ϕ", dist.Dirichlet(np.ones(n_nodes)))  
x0 = ny.sample("x0", dist.Categorical(ϕ))
infectious, hist = spread_jax(s_ij, x0, 5)

by

ϕ = ny.sample("ϕ", dist.Dirichlet(np.ones(n_nodes)))  
infectious, hist = spread_jax(s_ij, ϕ, 5)

Or if you want the prior for ϕ to be more like discrete, you can choose (or define a prior) a suitable temperature variable and use RelaxedOneHotCategorical

ϕ = ny.sample("ϕ", dist.RelaxedOneHotCategorical(temporature, logits=np.ones(n_nodes))))
infectious, hist = spread_jax(s_ij, ϕ, 5)

The reason is with RelaxedOneHotCategorical, the support is "simplex", and there is a transform which transforms a simplex to an "unconstrained" value, which is required for HMC/NUTS. The support of RelaxedOneHotCategoricalStraightThrough is discrete, hence there is no such transform.

If you want something like straight through, you can simply use

ϕ = ny.sample("ϕ", dist.RelaxedOneHotCategorical(temporature, logits=np.ones(n_nodes))))
ϕ_quantize = quantize(ϕ)

by defining "straight-through" quantize operator as in Pyro

def quantize(x):
    return x + jax.lax.stop_gradient((x == np.max(x, -1, keepdims=True)) - x)

. You can use numpyro.deterministic(...) to record those quantized values. I am happy to add new helpers to NumPyro for your convenience when you start using SVI.

rtbs-dev · 2020-04-13T14:16:24Z

@fehiepsi much appreciated! I think I should update the model there to reflect som local changes, but primarily I think it makes more sense to pull the dirichlet out of the plates:

def diff_kg(infections):
    n_cascades, n_nodes  = infections.shape
    n_edges = n_nodes*(n_nodes-1)//2 # complete graph
        
    # node initial infection, relative probability
    ϕ = ny.sample("ϕ", dist.Dirichlet(np.ones(n_nodes))) 
    
    # beta hyperpriors
    u = ny.sample("u", dist.Uniform(np.zeros(n_edges), 
                                         np.ones(n_edges)))
    v = ny.sample("v", dist.Gamma(np.ones(n_edges),
                                       20*np.ones(n_edges)))
    Λ = ny.sample("Λ", dist.Beta(u*v, (1-u)*v))
    s_ij = jax_squareform(Λ)  # adjacency matrix to recover via inference
    
    with ny.plate("n_cascades", n_cascades):
        # infer infection source node
        x0 = ny.sample("x0", dist.Categorical(ϕ))
        # simulate ode and realize
        infectious, hist = spread_jax(s_ij, x0, 5)
        numpyro.sample("obs", dist.Bernoulli(probs=infectious), 
                       obs=infections)

The main idea being that certain nodes in general have a tendency to be "sources", represented by the dirichlet prior, and those manifest as conditional probabilities that each node was the source (given any individual observed infection cascade). That should be realized as one node for the spread_jax sim, or at least, very close to one node (therefore the [relaxed]categorical).

Maybe that dirichlet prior is unnecessary partial pooling? I will definitely give the new relaxed categorical a try. @daydreamt would it be helpful if I tested things out before the PR gets merged?

fehiepsi · 2020-04-13T15:02:48Z

pull the dirichlet out of the plates

Agree that this makes more sense. With this model, you can define RelaxedOneHotCategorical for x0. (FYI, in PyTorch, Categorical samples are 0, 1, 2, 3. If you want OneHot version, you can use RelaxedOneHotCat... or OneHotCat...)

dirmeier · 2022-06-15T09:07:01Z

Hey @daydreamt , any progress on this?

daydreamt · 2022-06-15T11:30:03Z

Hi @dirmeier, not really, please feel free to take over or supersede with another MR.

daydreamt added 9 commits March 29, 2020 21:38

WIP: Add initial experiments based on pytorch's gumbel_softmax: need …

d95aaf8

…more reading on relaxed_categorical and transformations of distributions instead

Move GumbelSoftmaxProbs to continuous, because it needs access to Gumbel

3a847af

rewrite proposal by using gumbel_soft_max_logits in utils

e648ceb

Add first test of correct sampling from GumbelSoftmax for high and lo…

810adba

…w temperatures.

Fix constructor of GumbelSoftmaxProbs; many tests still failing

249112a

add working but not tested version of log_prob for GumbelSoftmaxProbs

e20fc99

Fix gradient for most cases

c4f3ec8

Make most tests, except d_log_prob broadcasting, and test_log_prob_gr…

8790e5a

…adient work

Write out the calculation of the log_prob to prepare for dimension ex…

c67abad

…pansion where needed

daydreamt changed the title ~~WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough #559~~ WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough https://github.com/pyro-ppl/numpyro/issues/559 Apr 6, 2020

daydreamt changed the title ~~WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough https://github.com/pyro-ppl/numpyro/issues/559~~ WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough Apr 6, 2020

daydreamt added 3 commits April 7, 2020 08:59

Fix lint problems

78451d6

Make gumbel_soft_max_probs behavior consistent in preparation of pass…

8ee6ed0

…ing the shape tests;

Fix high/low temperature non-one hot tests

7c6837b

fehiepsi mentioned this pull request Apr 7, 2020

update jax to master 0.1.63 #565

Merged

2 tasks

Merge master into origin/RelaxedOneHotCategoricalStraightThrough

fad229e

fehiepsi added the WIP label Jul 17, 2020

Merge master

33b77a9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough #562

WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough #562

daydreamt commented Apr 6, 2020 •

edited

Loading

fehiepsi commented Apr 7, 2020

fehiepsi commented Apr 8, 2020 •

edited

Loading

rtbs-dev commented Apr 8, 2020

fehiepsi commented Apr 12, 2020

rtbs-dev commented Apr 13, 2020

fehiepsi commented Apr 13, 2020

dirmeier commented Jun 15, 2022 •

edited

Loading

daydreamt commented Jun 15, 2022

WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough #562

Are you sure you want to change the base?

WIP: GumbelSoftmax / RelaxedOneHotCategoricalStraightThrough #562

Conversation

daydreamt commented Apr 6, 2020 • edited Loading

fehiepsi commented Apr 7, 2020

fehiepsi commented Apr 8, 2020 • edited Loading

rtbs-dev commented Apr 8, 2020

fehiepsi commented Apr 12, 2020

rtbs-dev commented Apr 13, 2020

fehiepsi commented Apr 13, 2020

dirmeier commented Jun 15, 2022 • edited Loading

daydreamt commented Jun 15, 2022

daydreamt commented Apr 6, 2020 •

edited

Loading

fehiepsi commented Apr 8, 2020 •

edited

Loading

dirmeier commented Jun 15, 2022 •

edited

Loading