Skip to content

Instantly share code, notes, and snippets.

@hepplerj
Created February 20, 2020 20:58
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save hepplerj/8769597368e8b15c1f0791861cd3c55b to your computer and use it in GitHub Desktop.
Save hepplerj/8769597368e8b15c1f0791861cd3c55b to your computer and use it in GitHub Desktop.
Messy data in R, for teaching the tidyverse
library(charlatan)
library(salty)
library(magrittr)
library(readr)
messydata <- ch_generate('name','job','phone_number', n = 200)
messydata <- messydata %>%
mutate(job = salt_capitalization(job)) %>%
mutate(phone_number = salt_na(phone_number)) %>%
mutate(name = salt_na(name)) %>%
mutate(job = salt_insert(job, shaker$whitespace, p = 0.2))
readr::write_csv(messydata, "messydata.csv")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment