
About resampling #4

Open

ahustr opened this issue Mar 15, 2024 · 2 comments

Comments


ahustr commented Mar 15, 2024

(screenshot of the resampling code in question)
Hello! While studying your code, I noticed that the resampling here seems to use the variance directly. In my understanding, resampling should use the standard deviation, so the term logsigma.unsqueeze(1) should be divided by 2 before exponentiating. I wonder if I missed something.
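For illustration, here is a minimal sketch of what I mean (the function name and shapes are my own, not the repository's exact code): if logsigma stores the log-variance, the noise scale should be exp(logsigma / 2), i.e. the standard deviation.

```python
import torch

def resample(mu, logsigma, num_samples):
    """Illustrative reparameterization sampling; `logsigma` is assumed to be log-variance.

    mu:       (batch, dim) mean embeddings
    logsigma: (batch, dim) log-variance of the embeddings
    returns:  (batch, num_samples, dim) sampled embeddings
    """
    eps = torch.randn(mu.size(0), num_samples, mu.size(1),
                      dtype=mu.dtype, device=mu.device)
    # std = exp(log(sigma^2) / 2) = sigma, hence the division by 2
    std = torch.exp(logsigma.unsqueeze(1) / 2)
    return mu.unsqueeze(1) + std * eps
```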

@SanghyukChun
Collaborator

Hi, thanks for your question.

First, I would like to clarify that this function is not used by PCME++, but only by the sampling-based method, PCME. Hence, it does not change the results of PCME++.

Second, your observation looks correct, which means I made a mistake when I implemented PCME (https://github.com/naver-ai/pcme). That said, I do not think it is a very critical issue: during training, it only affects the Monte-Carlo sampling-based pairwise distance and the VIB loss (= KL divergence). The implementation does not compute the exact PCME loss as originally designed, but since the standard deviation is usually much smaller than the mean distances, I expect the corrected version and the current implementation to show almost identical empirical performance.
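To illustrate the point with a toy example (my own numbers, not the training code): using exp(logsigma) instead of exp(logsigma / 2) scales the noise by sigma^2 rather than sigma, but when the learned log-variance is small, both spreads are negligible compared to typical pairwise mean distances.

```python
import torch

torch.manual_seed(0)
mu = torch.zeros(4, 8)
logsigma = torch.full((4, 8), -4.0)   # small learned log-variance (toy value)
eps = torch.randn(4, 7, 8)

# As implemented: noise scaled by exp(logsigma) = sigma^2
noisy_impl = mu.unsqueeze(1) + torch.exp(logsigma.unsqueeze(1)) * eps
# Corrected:     noise scaled by exp(logsigma / 2) = sigma
noisy_fix = mu.unsqueeze(1) + torch.exp(logsigma.unsqueeze(1) / 2) * eps

# Both spreads stay small relative to typical pairwise mean distances,
# so the sampled distances (and hence the loss) behave similarly in practice.
print(noisy_impl.std().item(), noisy_fix.std().item())
```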


ahustr commented Mar 16, 2024

Yes, it acts more like a weighting parameter then. Thanks for your wonderful paper!
