Description Matrix Algebra underlies many of the current tools for experimental design and the analysis of high-dimensional data. In this introductory data analysis course, we will use matrix algebra to represent the linear models that commonly used to model differences between experimental units. We perform statistical inference on these differences. Throughout the course we will…
Description In this course you’ll learn various statistics topics including multiple testing problem, error rates, error rate controlling procedures, false discovery rates, q-values and exploratory data analysis. We then introduce statistical modeling and how it is applied to high-throughput data. In particular, we will discuss parametric distributions, including binomial, exponential, and gamma, and describe maximum…
Description If you’re interested in data analysis and interpretation, then this is the data science course for you. We start by learning the mathematical definition of distance and use this to motivate the use of the singular value decomposition (SVD) for dimension reduction and multi-dimensional scaling and its connection to principle component analysis. We will…
Description This course bridges the gap between introductory and advanced courses in Python. While there are many excellent introductory Python courses available, most typically do not go deep enough for you to apply your Python skills to research projects. In this course, after first reviewing the basics of Python 3, we learn about tools commonly…
Description Improving access to healthcare is only as useful as the quality of care provided. Many agree that quality is important – but what is it? How do we define it? How do we measure it? And most importantly, how might we make it better?The course is designed for those who care about health and…
Description From the Syrian refugee crisis to the West Africa Ebola outbreak, humanitarian emergencies have reached unprecedented dimensions and proportions. As need for humanitarian aid grows, how can efforts to alleviate human suffering evolve with it? This course from the Harvard Humanitarian Initiative and HarvardX seeks to prepare learners to recognize and analyze emerging challenges…
Description Causal diagrams have revolutionized the way in which researchers ask: What is the causal effect of X on Y? They have become a key tool for researchers who study the effects of treatments, exposures, and policies. By summarizing and communicating assumptions about the causal structure of a problem, causal diagrams have helped clarify apparent…
Description The weather forecasts we see every day are based on an army of meteorological sensing networks and intensive computer modeling. Before the rise of these technologies, predictions were made by methods like discerning cloud formations and wind directions. This course will explore the science behind weather systems by teaching the observational skills needed to…
Description A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging. Part of our Professional Certificate Program in Data Science, this course explains how to use Unix/Linux as a tool for managing files and directories on your computer and…
Description In this course, part of our Professional Certificate Program in Data Science,we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data…