Data

Training vs test data

We train on all of the data, without labels. The labels are used only to evaluate the resulting clusters.

Twitter Airlines Customer Support

The data is available in two versions:

We sampled 500 examples and annotated them. Eight examples were rejected because they were not in English, leaving 492 labeled examples. The remaining examples were labeled UNK.
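As a rough illustration of how the labeled subset might be used to score clusters, here is a minimal sketch. It makes two assumptions that the text above does not state: that examples labeled UNK are skipped during evaluation, and that cluster purity is the metric. The function name `purity` and the toy labels ("baggage", "delay") are hypothetical and not taken from the dataset.

```python
from collections import Counter, defaultdict

UNK = "UNK"  # label given to the unannotated examples


def purity(cluster_ids, labels, ignore_label=UNK):
    """Cluster purity over examples whose label is not `ignore_label`.

    Assumption: UNK examples take part in training but not in scoring.
    """
    by_cluster = defaultdict(Counter)
    for cid, label in zip(cluster_ids, labels):
        if label != ignore_label:
            by_cluster[cid][label] += 1

    total = sum(sum(counts.values()) for counts in by_cluster.values())
    if total == 0:
        raise ValueError("no labeled examples to evaluate")

    # For each cluster, count the examples that carry its majority label.
    majority = sum(max(counts.values()) for counts in by_cluster.values())
    return majority / total


# Toy data (hypothetical): cluster assignments and gold labels.
cluster_ids = [0, 0, 0, 1, 1, 2]
labels = ["baggage", "baggage", "delay", "delay", "UNK", "UNK"]
score = purity(cluster_ids, labels)
print(score)
```

Purity is only one option; any label-based clustering metric could be substituted, as long as it is computed on the labeled examples alone.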

AskUbuntu