R is one of the most widely used open-source programming languages in data science. Its growing community of developers has contributed tools such as the tidyverse (an opinionated collection of packages for data import, manipulation, modeling, and visualization), shiny (a package for building interactive web applications from R), and R Markdown (a format for documenting and communicating analysis to stakeholders).
This talk explores the capabilities of R by walking through a raw dataset on a Harvard/MIT online course. We will cover every stage of a data science workflow in R: importing, cleaning, and transforming the data, modeling it, and communicating the results with an interactive web application. The goal is not to teach the fundamentals of R, but to introduce you to the tools that make R so effective.
Presenter: Jason Baik
If you haven't already done so, please install R / RStudio beforehand.
Once you have these installed, I'd recommend installing the tidyverse, an opinionated collection of R packages designed for data science (https://www.tidyverse.org/).
In your console, type
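The command itself is missing from this listing. Assuming the standard CRAN installation is what was meant, it would look something like this:

```r
# Assumed command: install the tidyverse meta-package from CRAN.
# This is a one-time step and needs an internet connection.
install.packages("tidyverse")

# Load the package to confirm the installation worked.
library(tidyverse)
```

If the `library()` call prints the list of attached packages without an error, you should be set for the talk.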
The dataset we will be working with is located here: http://bit.ly/harvard-data
Code & Supply is a community of software professionals supported by members contributing $10 to $60 per month. Become a supporter today at https://codeandsupply.co/join
All attendees are expected to abide by the Code & Supply conduct policies available at https://codeandsupply.co/policies/conduct
Instructions on finding your way around our event space are available at https://codeandsupply.co/hq