Questions about disentangled representation learning #8

Ray-Zhen · 2024-08-26T16:38:41Z

Hello,
Thank you for the nice work.
I have a question about the representation projection. In your paper, the text and video representations are each independently projected into K components with a trainable parameter W, but in your implementation, you simply split the representation by reshaping it:

        t_feat = cls.view(a, self.config.center, -1)
        v_feat = video_feat.view(a, b, self.config.center, -1)
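
To make the contrast concrete, here is a minimal sketch of the two variants for the text feature. The tensor sizes, the name `proj`, and the choice to pack the K projections into a single bias-free `nn.Linear` are assumptions made only for illustration; they are not taken from the repository or the paper.

```python
import torch
import torch.nn as nn

# Hypothetical sizes, chosen only for illustration.
batch, dim, K = 2, 8, 4            # K plays the role of self.config.center
cls = torch.randn(batch, dim)      # stand-in for the text [CLS] feature

# (a) What the quoted code does: a parameter-free reshape into K chunks.
t_feat_split = cls.view(batch, K, -1)         # shape: (batch, K, dim // K)

# (b) What the paper's description suggests: K trainable projections.
# Assumption: the K matrices W_k are stacked into one linear layer, so
# output chunk k equals cls @ W_k with W_k of shape (dim, dim // K).
proj = nn.Linear(dim, dim, bias=False)
t_feat_proj = proj(cls).view(batch, K, -1)    # shape: (batch, K, dim // K)
```

Both variants produce tensors of the same shape, but only (b) introduces trainable parameters; in (a) each component is just a contiguous slice of the original feature.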

It would be greatly appreciated if you could provide an explanation.
Thanks in advance for your time and assistance.
Best regards
