
question regarding datasets #31

Open
pianoman4873 opened this issue Mar 5, 2017 · 2 comments

pianoman4873 commented Mar 5, 2017

Hello,
This is not an issue but rather a question:
Where can I get all the datasets you reported on in the paper?
Do you think that training on all the datasets together would improve the results?
What about training for various languages: do you think a model trained on text from mixed languages would perform better or worse than separate models handling each language?

And another question regarding phrases: Google's pretrained word2vec vectors also include phrases. Were those taken into account as well?

yoonkim (Owner) commented Mar 8, 2017

Hi, you can obtain all the datasets here:

https://github.com/harvardnlp/sent-conv-torch

Phrases from word2vec were not taken into account.
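
For reference, here is a minimal sketch (not part of the original thread) of one way to load the pretrained GoogleNews word2vec binary and keep only single-word entries, since phrase entries in that file are joined with underscores (e.g. `New_York`). It assumes gensim >= 4.0; the file path and the `lookup` helper are placeholders.

```python
# Minimal sketch: load the pretrained GoogleNews vectors and skip phrase
# entries (keys containing "_"), keeping only single-word embeddings.
# Assumes gensim >= 4.0 and that the .bin file is in the working directory.
from gensim.models import KeyedVectors

kv = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin", binary=True
)

# Phrase entries such as "New_York" contain an underscore; filter them out.
single_words = [w for w in kv.key_to_index if "_" not in w]
print(f"{len(kv.key_to_index)} total entries, {len(single_words)} single words")

def lookup(token):
    """Return the embedding for a single-word token, or None if absent."""
    if "_" in token or token not in kv.key_to_index:
        return None
    return kv[token]
```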

pianoman4873 (Author)

thanks
