David M. Diez, Christopher D. Barr, Mine Çetinkaya-Rundel

Digital versions Two PDF versions: full screen or tablet
LaTeX source available     Yes
Exercises Yes
Solutions Odd numbered problems
Solution Manual No
License Creative Commons
  • Third edition (July 2015)
  • Full color hardcover version from Amazon for under $25
  • Black and white paperback version from Amazon or CreateSpace for $10 or less
  • Text has been used at Duke and Princeton and is the text for the Coursera course taught by the third author
  • Companion data sets available on website
  • Labs based on freely available R and RStudio
  • Short videos for each section of some chapters
  • For more information and to download

As the authors write in the preface, “Data is messy, and statistical tools are imperfect. But, when you understand the strengths and weaknesses of these tools, you can use them to learn about the real world.” This book is full of examples and exercises on topics of current interest pulled from the popular media and published research.

In addition to the exercises at the end of each chapter, a novel feature is the incorporation of in-chapter exercises, meant to be done immediately, with answers below in the footnotes.

Chapter Summaries

  1. Introduction to data. Data structures, variables, summaries, graphics, and basic data collection techniques.
  2. Probability. The basic principles of probability. An understanding of this chapter is not required for the main content in Chapters 3-8.
  3. Distributions of random variables. Introduction to the normal model and other key distributions.
  4. Foundations for inference. General ideas for statistical inference in the context of estimating the population mean.
  5. Inference for numerical data. Inference for one or two sample means using the normal model and t distribution, and also comparisons of many means using ANOVA.
  6. Inference for categorical data. Inference for proportions using the normal and chi-square distributions, as well as simulation and randomization techniques.
  7. Introduction to linear regression. An introduction to regression with two variables. Most of this chapter could be covered after Chapter 1.
  8. Multiple and logistic regression. An introduction to multiple regression and logistic regression for an accelerated course.