This is a simple collection of linguistic data sets. For now it contains only the data sets included in the ndl package (http://cran.r-project.org/web/packages/ndl/index.html) and those included in the languageR package (http://cran.r-project.org/web/packages/languageR/index.html). I will add more data sets in the future. If you own any of these data sets and feel it should not be included here, please let me know and I will remove it.
To install the package and load a data set, run:
Pkg.clone("LinguisticData")
using LinguisticData
dative = dataset("dative")
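Once a data set is loaded, a quick look at its shape and columns can confirm that it arrived as expected. The following is only a sketch under two assumptions that this README does not state: that the name passed to `dataset` is the file name without the `.rda` extension (as the `dative` example suggests), and that the returned object is a `DataFrame`, so `size` and `names` apply to it.

```julia
using LinguisticData

# Assumption: data sets are addressed by file name without ".rda",
# following the dataset("dative") example above.
dative = dataset("dative")
lexdec = dataset("lexdec")

# Assumption: the returned objects are DataFrames.
println(size(dative))    # number of rows and columns
println(names(dative))   # column names
println(size(lexdec))
```

If `dataset` returns a different type, the two inspection calls would need to be adapted accordingly.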
The following files are included:
- affixProductivity.rda
- alice.rda
- auxiliaries.rda
- beginningReaders.rda
- clean.R
- danish.rda
- danks.rda (version found in the "ndl" package)
- dativeLR.rda (version found in the "languageR" package)
- dative.rda
- dativeSimplified.rda
- durationsGe.rda
- durationsOnt.rda
- dutchSpeakersDistMeta.rda
- dutchSpeakersDist.rda
- english.rda
- etymology.rda
- faz.rda
- finalDevoicing.rda
- havelaar.rda
- heid.rda
- imaging.rda
- latinsquare.rda
- lexample.rda
- lexdec.rda
- lexicalMeasuresClasses.rda
- lexicalMeasures.rda
- moby.rda
- nesscg.rda
- nessdemog.rda
- nessw.rda
- numbers.rda
- oldFrenchMeta.rda
- oldFrench.rda
- oz.rda
- periphrasticDo.rda
- phylogeny.rda
- plurals.rda
- primingHeidPrevRT.rda
- primingHeid.rda
- quasif.rda
- ratings.rda
- regularity.rda
- selfPacedReadingHeid.rda
- serbianLex.rda
- serbian.rda
- serbianUniCyr.rda
- serbianUniLat.rda
- shrinkage.rda
- sizeRatings.rda
- spanishFunctionWords.rda
- spanishMeta.rda
- spanish.rda
- splitplot.rda
- think.rda
- through.rda
- twente.rda
- variationLijk.rda
- verbs.rda
- ver.rda
- warlpiri.rda
- weightRatings.rda
- writtenVariationLijk.rda
To do: add descriptions of the data sets.