melindahiggins2000/Emory_ACBE_March2023_TextParsing

Emory ACBE Meeting March 2023 - R Exercise on Text Parsing


Workshop Instructor:


Preparation for Workshop

  1. Install R https://cran.r-project.org/
  2. Install RStudio Desktop https://posit.co/download/rstudio-desktop/
  3. Install the R packages listed below.
    • See the instructions on installing packages in Chapter 1.3.1 of the ModernDive book.
  4. Download datasets and mapfiles:

OPTIONAL PREP

  1. If you are still actively enrolled in RSPH, you can also run R and RStudio in the Emory RSPH virtual desktop through Apporto, https://rsphemory.apporto.com/.
  2. Online BOOKS to get started:
    1. Statistical Inference via Data Science: A ModernDive into R and the Tidyverse! by Chester Ismay and Albert Y. Kim https://moderndive.netlify.app/.
    2. R in Action, Third Edition: Data analysis and graphics with R and Tidyverse by Robert I. Kabacoff https://www.manning.com/books/r-in-action-third-edition
    3. The Epidemiologist R Handbook https://epirhandbook.com/en/

R packages to be installed for workshop:

See the packages listed on CRAN (click on "Packages"). Any package on CRAN can be installed inside RStudio from the top menu under "Tools/Install Packages".

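The ggthemr package is installed from GitHub, which requires the devtools package to be installed first:

devtools::install_github('Mikata-Project/ggthemr')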
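
As an alternative to the "Tools/Install Packages" menu, CRAN packages can also be installed from the R console. The sketch below is one possible approach, not the workshop's official setup script. The package names are an assumption inferred from the module titles and schedule (ggplot2, dplyr, rmarkdown, data.table) plus devtools; the actual workshop list may differ. The `repos` URL is the standard CRAN cloud mirror and can be replaced with any mirror.

```r
# Assumed package list, inferred from the module titles; adjust as needed.
pkgs <- c("ggplot2", "dplyr", "rmarkdown", "data.table", "devtools")

# Keep only the packages that are not already installed.
to_install <- setdiff(pkgs, rownames(installed.packages()))

# Install whatever is missing from CRAN.
if (length(to_install) > 0) {
  install.packages(to_install, repos = "https://cloud.r-project.org")
}
```

Running the block a second time should do nothing, since already installed packages are skipped.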

Overview of Workshop

This workshop will cover (final list in development):

  • Module 01
    • An introduction to R and RStudio
    • Learning about R Objects and data structures
  • Module 02
    • Introduction to R packages
    • Loading data, Saving data (read/write datasets)
  • Module 03
    • Data visualization with the ggplot2 package
  • Module 04
    • Brief introduction to data wrangling with the dplyr package
  • Module 05
    • Introduction to Rmarkdown
    • Exercise: creating an Rmarkdown report
  • Module 06
    • Using dplyr to join datasets
  • Module 07
    • Putting it all together into a "COVID Case Report"
  • Module 08
    • Open Dialog - audience questions
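
To give a flavor of the wrangling and joining topics in Modules 04 and 06, here is a minimal sketch using dplyr. The data frames, column names, and values are hypothetical toy data made up for illustration; they are not the workshop's datasets, and the workshop exercises may take a different approach.

```r
library(dplyr)

# Hypothetical toy data (not the workshop's data1.csv / data2.csv)
patients <- data.frame(id = c(1, 2, 3), age = c(34, 51, 27))
visits   <- data.frame(id = c(1, 1, 3), cost = c(100, 150, 80))

# Wrangling: keep patients older than 30, then add a derived column
over_30 <- patients %>%
  filter(age > 30) %>%
  mutate(age_decade = floor(age / 10) * 10)

# Joining: keep every patient and attach matching visits by id;
# patients with no visits get NA in the cost column
joined <- left_join(patients, visits, by = "id")

print(over_30)
print(joined)
```

A left join is used here so that patients without any visit still appear in the result; `inner_join()` would drop them instead.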

Schedule - links to slides and R scripts

🗓️ November 11, 2022
⏰ 09:00 - 16:00

Time           Activity
09:00 - 09:25  Module 1 [Slides and R script]
09:25 - 09:50  Module 2 [Slides and R script]
-------------  ----------------
09:50 - 10:00  BIO Break
-------------  ----------------
10:00 - 10:30  Module 3 [Slides and R script]
               R Scripts w/Answers
10:30 - 11:00  Module 4 [Slides and R script]
               R Scripts w/Answers
-------------  ----------------
11:00 - 12:00  Breakout Session 1 [working on exercises from above modules]
12:00 - 13:00  LUNCH BREAK
-------------  ----------------
13:00 - 13:25  Module 5 Abalone Rmarkdown Report and Rmarkdown file
               Abalone RMarkdown w/Answers Rmarkdown file
13:25 - 13:50  Module 6 Merging Datasets Exercise and Rmarkdown file
               data1.csv
               data2.csv
               data.table exercise R script
-------------  ----------------
13:50 - 14:00  BIO Break
-------------  ----------------
14:00 - 14:30  Module 7 COVID Cases Report and Rmarkdown file
14:30 - 15:00  Module 8 Open Q&A
-------------  ----------------
15:00 - 16:00  Breakout Session 2 [working on exercises from modules and more editing on COVID case report]
-------------  ----------------

Workshop Recording

  • To be added later...

Learn more about Rmarkdown at:

More R Resources - get help and inspiration