class: title-slide, middle, center # BUS 320 # Topic 4 # Reading and writing data ## Elizabeth Stanny --- layout: true <div class="my-footer"><span>http://estannydotcom.netlify.com</span></div> --- # Learning objectives for the course - Ask the right questions -- - Extract, transform and load relevant data (ETL process) - Extract (`tidyverse`) - Transform (`tidyverse`) - Load data (`tidyverse`) -- - Apply appropriate data analytic techniques - Descriptive statistics - `skimr` - `tidyverse` -- - Interpret and share the results - `rmarkdown` files end with .Rmd --- # Importing and exporting data ### File extension indicates file type .pull-left[ ### [Rectangular well formatted data](https://www.tidyverse.org/packages/#import) - Comma-separated data `,` (.csv) - Pipe-separated data `|` (.psv) - Tab-separated data `\t` (.tsv) - Excel (.xls and .xlsx) - Databases (.sql) - Googlesheets ] -- .pull-right[ ### Other * Portable document format (.pdf) - [pdftools](https://docs.ropensci.org/pdftools/) * Word (.doc) - [antiword](https://docs.ropensci.org/antiword/) * Webpages (.html) - [rvest part of tidyverse](https://rvest.tidyverse.org) * Text (.rtf) - [unrtf](https://docs.ropensci.org/unrtf/) - Images (.png, .jpeg etc) - [magick](https://docs.ropensci.org/magick/) and [tesseract](https://docs.ropensci.org/tesseract/) - Audio and video - [av](https://docs.ropensci.org/av/) - NoSQL databases - [nodbi](https://docs.ropensci.org/nodbi/) ]