Let the Data Flow: Pipelines in R with dplyr and magrittr

Abstract

Pipelines were the best thing to happen in R in 2014. They let us transform messy, inside-out code like sort(unique(round(xs, 2))) into a clear chain of transformations like xs %>% round(2) %>% unique %>% sort. In this talk I lead a tutorial on how to use pipelines for data-cleaning, transformation and presentation with the packages magrittr and dplyr. For beginners, I also review some of the essential R functions to make the most of pipelines.

Tristan is a PhD student in Communication Sciences and Disorders. He uses in R in the Learning To Talk lab to model eye-tracking and speech perception data. @tjmahr, github.com/tjmahr.

Slides

I prepared three sets of slides:

Resources

magrittr vignette
RStudio Data-Wrangling Cheatsheet
Core R vocabulary
Awesome R
Pipelines for Data Analysis (dpylr/magrittr talk by Hadley Wickham)
Best Practices for Scientific Computing
Data Science on the Command Line
Unix Commands for Data Science

Packages

magrittr for pipelines
dplyr for data-frame functions
broom
stringr for string manipulation functions
pipeR an alternative pipeline package (that I haven't tried yet).

License

Obviously, the GPL-2 license applies only to the code and words I wrote, which are in the .Rpres and .md files and are reproduced with markup in the .html files.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
.gitignore		.gitignore
00_pipelines-rpubs.html		00_pipelines-rpubs.html
00_pipelines.Rpres		00_pipelines.Rpres
00_pipelines.md		00_pipelines.md
01_dplyr-rpubs.html		01_dplyr-rpubs.html
01_dplyr.Rpres		01_dplyr.Rpres
01_dplyr.md		01_dplyr.md
02_tables-rpubs.html		02_tables-rpubs.html
02_tables.Rpres		02_tables.Rpres
02_tables.md		02_tables.md
LICENSE		LICENSE
MadR_Pipelines.Rproj		MadR_Pipelines.Rproj
README.md		README.md
data-wrangling-cheatsheet.pdf		data-wrangling-cheatsheet.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Let the Data Flow: Pipelines in R with dplyr and magrittr

Abstract

Slides

Resources

Packages

License

About

Releases

Packages

Languages

License

anouel/MadR_Pipelines

Folders and files

Latest commit

History

Repository files navigation

Let the Data Flow: Pipelines in R with dplyr and magrittr

Abstract

Slides

Resources

Packages

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages