Description In this course, part of our Professional Certificate Program in Data Science, we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data…
Description Linear regression is commonly used to quantify the relationship between two or more variables. It is also used to adjust for confounding. This course, part of our Professional Certificate Program in Data Science, covers how to implement linear regression and adjust for confounding in practice using R. In data science applications, it is very common to…
Description Perhaps the most popular data science methodologies come from machine learning. What distinguishes machine learning from other computer-guided decision processes is that it builds prediction algorithms using data. Some of the most popular products that use machine learning include the handwriting readers implemented by the postal service, speech recognition, movie recommendation systems, and…
Description To become an expert data scientist you need practice and experience. By completing this capstone project you will get an opportunity to apply the knowledge and skills in R data analysis that you have gained throughout the series. This final project will test your skills in data visualization, probability, inference and modeling, data wrangling,…
Description As part of our Professional Certificate Program in Data Science, this course covers the basics of data visualization and exploratory data analysis. We will use three motivating examples and ggplot2, a data visualization package for the statistical programming language R. We will start with simple datasets and then graduate to case studies about world…
Description While the United States is one of the world’s wealthiest nations, it is far from the healthiest. Our nation’s burden of disease affects businesses every day, from sick employees and families reducing productivity and increasing costs, to product recalls and failures, to environmental scandals such as toxic chemical emissions harming communities and reputations. Named Runner…
Description Our world’s climate is changing. Of the top twenty hottest years ever recorded, sixteen have occurred in the last two decades. This warming has already had a profound effect. Many feel powerless in the face of this challenge, but you can make a difference. By looking at air quality, nutrition, infectious diseases, and human…
Description How can we eradicate malaria? Explore the scientific and technological underpinnings of malaria, as well as the historical, political, social, and economic contexts in which control, elimination, and eradication efforts unfold. Through foundational lectures and supplemental interviews, this course provides participants with a toolbox of knowledge and skills. Learners will be guided through the…
Description In this course, part of our Professional Certificate Program in Data Science, you will learn valuable concepts in probability theory. The motivation for this course is the circumstances surrounding the financial crisis of 2007-2008. Part of what caused this financial crisis was that the risk of some securities sold by financial institutions was underestimated. To…
Description In the first act of William Shakespeare’s Hamlet, the Ghost of the dead King of Denmark appears to his son, setting off a chain of events that culminates in the play’s notoriously bloody finale. But how would this mysterious figure have been understood in Shakespeare’s time? Harvard professor Stephen Greenblatt guides learners through an exploration…