rowanz · mayankjobanputra · Aug 14, 2020
diff --git a/data/README.md b/data/README.md
@@ -8,7 +8,7 @@ In `train_full.csv` or `val_full.csv`: we have both the texts of the endings/con
 
 ## regular (shuffled)
 
-This could be more interesting for modeling, and it's the way the test data is formatted. For each `startphrase` (also, split into `sent1`,`sent2`) we have 4 endings, and a label which says the correct one. You can use `test.csv` for submission on the leaderboard here: [https://leaderboard.dev.allenai.org/swag/submission/create](https://leaderboard.dev.allenai.org/swag/submission/create). The fields are exactly the same as `val.csv` and `train.csv` except for the label.
+This could be more interesting for modeling, and it's the way the test data is formatted. For each `startphrase` (also, split into `sent1`,`sent2`) we have 4 endings, and a label which says the correct one. You can use `test.csv` for submission on the leaderboard here: [https://leaderboard.allenai.org/swag/submission/create](https://leaderboard.allenai.org/swag/submission/create). The fields are exactly the same as `val.csv` and `train.csv` except for the label.
 
 
 
@@ -22,4 +22,4 @@ If the source starts with `gold`, it comes from the found data (from an actual v
 
 For training, we also have questions marked as `gen-orig`: these are generated answers that are selected as the *best* answer, while the real answer was selected as the second best (`gold1-orig`)
 
-tl;dr you probably don't have to worry about this one. However, during training, some models work better if also shown the `gen-orig` examples, and some work better if not.
+tl;dr you probably don't have to worry about this one. However, during training, some models work better if also shown the `gen-orig` examples, and some work better if not.