# Week 5 - Filling In (9/19-23)

This week introduces one new statistical concept, the hypothesis test, and is otherwise about **practice** and **solidifying concepts**.
I'm also going to take a step back and give some more context to some of the things we're talking about.

Our learning outcomes are:

- Compute and interpret a hypothesis test
- Understand how to read and interpret Python errors
- Understand how the quantitative techniques we are learning in this class fit in a broader landscape of epistemologies

## Content Overview

| Element | Length |
|---|---|
|  | 5m6s |
|  | 14m51s |
|  | 12m24s |
|  | 2000 words |
|  | 25m44s |
|  | 653 words |
|  | 7m28s |
|  | 3m43s |
|  | 5m10s |

This week has **1h14m** of video and **2653 words** of assigned readings. This week's videos are available in a Panopto folder.

## Deadlines

- Week 5 Quiz is due on **Thursday** at 8AM.
- Assignment 2 is due on **Sunday, September 25, 2022** at 11:59 PM.
- Midterm A is next week, on **September 28**.

## Assignment 1 Solution

The Assignment 1 solution is on Piazza.

## Course Glossary

If you haven't yet, I highly recommend consulting the course glossary. Please post on Piazza if you have suggested additions!

The glossary is also likely to be useful in studying for the exam next week.

## Writing Functions

I've used Python *functions* in a few of my example notebooks.
The function notebook talks more about them, how to write them, and how to use them.
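
As a small illustration of the idea (this sketch is not taken from the function notebook; the name `summarize` and the sample values are made up), a function packages a computation so it can be reused on different inputs:

```python
import statistics


def summarize(values):
    """Return the count, mean, and sample standard deviation of a sequence of numbers."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "sd": statistics.stdev(values),  # sample standard deviation (n - 1 denominator)
    }


# Hypothetical data, for illustration only
ratings = [3.5, 4.0, 2.5, 5.0, 4.5]
summary = summarize(ratings)
print(summary)
```

Once defined, `summarize` can be called on any other list of numbers without repeating the computation.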

## Comparing Distributions

This video describes how to use Q-Q plots to compare data against a distribution.
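
A minimal sketch of the comparison behind a Q-Q plot, assuming SciPy's `scipy.stats.probplot` and simulated data (the video may use different tools or data). It pairs the sorted observations with the quantiles a normal distribution would produce; when the data are roughly normal, the pairs fall close to a straight line and the fit correlation `r` is near 1.

```python
import numpy as np
from scipy import stats

# Simulated data for illustration only: one normal sample, one right-skewed sample
rng = np.random.default_rng(20220919)
normal_data = rng.normal(loc=50, scale=10, size=500)
skewed_data = rng.exponential(scale=10, size=500)

# probplot returns the theoretical quantiles, the sorted data, and a
# least-squares line fit; r close to 1 means the points lie near a line.
(theo_q, ordered), (slope, intercept, r_normal) = stats.probplot(normal_data, dist="norm")
_, (_, _, r_skewed) = stats.probplot(skewed_data, dist="norm")

print(f"normal sample: r = {r_normal:.3f}")
print(f"skewed sample: r = {r_skewed:.3f}")
```

To draw the plot itself, `probplot` also accepts a `plot` argument such as `matplotlib.pyplot` or a Matplotlib axes object.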

## Testing Hypotheses

## Cartoon

Read XKCD #882: Significant.

This is called **p-hacking**: running tests until we find one that is significant.
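
To see why this is a problem, here is a small simulation sketch, assuming SciPy's `ttest_ind` and made-up data in which there is no real effect at all. Both groups in every test are drawn from the same distribution, so any "significant" result is a false positive. The sample sizes, seed, and number of tests are arbitrary choices for illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(882)
n_tests = 1000
alpha = 0.05

p_values = []
for _ in range(n_tests):
    # Both groups come from the same distribution, so the null hypothesis is true
    a = rng.normal(size=30)
    b = rng.normal(size=30)
    p_values.append(stats.ttest_ind(a, b).pvalue)

p_values = np.array(p_values)
false_positive_rate = np.mean(p_values < alpha)
print(f"{false_positive_rate:.1%} of tests were 'significant' with no real effect")

# Chance that at least one of 20 independent tests is significant under the null
p_any_of_20 = 1 - (1 - alpha) ** 20
print(f"P(at least one of 20 significant) = {p_any_of_20:.2f}")
```

Roughly one test in twenty crosses the 0.05 threshold by chance alone, so running twenty tests and reporting only the significant one is more likely than not to produce a spurious finding.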

## T-tests

This video discusses the *t*-test in more detail, and the different kinds of *t*-tests that we can run.
It also introduces degrees of freedom.
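
The following sketch shows several common *t*-test variants using SciPy; the video may cover a different set, and the data here are simulated purely for illustration. The degrees of freedom shown are the standard ones for the one-sample, paired, and equal-variance two-sample tests.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
before = rng.normal(loc=100, scale=15, size=25)
after = before + rng.normal(loc=3, scale=5, size=25)  # measured on the same units as `before`
other = rng.normal(loc=105, scale=15, size=40)        # an independent group

one_sample = stats.ttest_1samp(before, popmean=100)      # is the mean equal to 100?
paired = stats.ttest_rel(before, after)                  # paired (dependent) samples
pooled = stats.ttest_ind(before, other)                  # independent, assumes equal variances
welch = stats.ttest_ind(before, other, equal_var=False)  # independent, unequal variances

# Degrees of freedom for the tests with simple closed forms
df_one_sample = len(before) - 1
df_paired = len(before) - 1                # number of pairs minus 1
df_pooled = len(before) + len(other) - 2

# A paired test is a one-sample test on the within-pair differences
diff_test = stats.ttest_1samp(before - after, popmean=0)

for name, res in [("one-sample", one_sample), ("paired", paired),
                  ("pooled", pooled), ("Welch", welch)]:
    print(f"{name:>10}: t = {res.statistic:.3f}, p = {res.pvalue:.4f}")
```

Welch's test uses an approximate, usually non-integer, degrees of freedom, which is why it is not computed by hand here.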

## Tying It Together

I will be adding a notebook reading here to tie together some Week 4 and 5 material.

## Epistemology

In this video, I talk about how the quantitative data science methods we are learning fit into a broader picture of sources of knowledge.

## Week 5 Quiz

The Week 5 quiz is about material **through this point**.
The subsequent videos are to help you better understand and contextualize material.

## One Sample Notebook

The One Sample notebook demonstrates how to compute a one-sample *t*-test and how to draw a Q-Q plot to compare a distribution with the normal distribution.
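
As a rough sketch of what a one-sample *t*-test computes (this is not the notebook's code; the data and the hypothesized mean `mu0` are made up), the statistic is the difference between the sample mean and the hypothesized mean, divided by the standard error, and the two-sided p-value comes from a *t* distribution with `n - 1` degrees of freedom:

```python
import numpy as np
from scipy import stats

# Simulated sample and hypothesized mean, for illustration only
rng = np.random.default_rng(42)
sample = rng.normal(loc=5.3, scale=1.2, size=40)
mu0 = 5.0

n = len(sample)
mean = sample.mean()
se = sample.std(ddof=1) / np.sqrt(n)   # standard error of the mean
t_stat = (mean - mu0) / se
p_value = 2 * stats.t.sf(abs(t_stat), df=n - 1)  # two-sided p-value

# The same test using SciPy's built-in function
result = stats.ttest_1samp(sample, popmean=mu0)

print(f"by hand: t = {t_stat:.4f}, p = {p_value:.4f}")
print(f"SciPy:   t = {result.statistic:.4f}, p = {result.pvalue:.4f}")
```

The two lines of output should agree, which is a useful check that the formula matches what the library function does.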

### Resources

NIST Handbook on quantitative measures (has info on 1-sample and 2-sample *t*-tests)

## Python Errors

This video discusses common Python errors and how to read errors.
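
A small example of reading an error, using a made-up `total_cost` function and price table (not from the video). The code catches the error only so that it can print the traceback text and keep running; normally Python would display the same text and stop.

```python
import traceback

# Hypothetical data, for illustration only
prices = {"apple": 1.25, "pear": 0.80}


def total_cost(items):
    """Add up the price of each item name in `items`."""
    return sum(prices[name] for name in items)


try:
    total_cost(["apple", "banana"])  # "banana" is not a key in `prices`
except KeyError as err:
    error_type = type(err).__name__
    error_text = traceback.format_exc()
    print(error_text)
```

Read the traceback from the bottom up: the last line gives the exception type and message (here a `KeyError` naming the missing key), and the lines above it list the chain of calls that led there, with the most recent call last.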

## Python Libraries

## Learning More

In this video I talk about how I go about expanding my own data science knowledge and techniques, with the goal of giving you ideas for how you can continue learning beyond this class.

## Practice

There are a few things you can do to keep practicing the material:

- The HETREC data contains two data sets besides the movie data: Delicious bookmarks and Last.FM listening records. Download this data set and apply some of our exploratory techniques to it.
- Download the SBA data from Week 4's activity and describe the distributions of more of the variables.
- Apply the inference techniques from Week 4 to statistically test the differences you observed in Assignment 1.

## More Examples

Some more examples from my own work (these are *not* all cleaned up to our checklist standards):

- Data summary from book gender paper - shows a number of descriptive things, including a stacked area chart; it also uses Plotnine.
- Linkage statistics from book data - shows some matplotlib things, and computing data linking statistics.

## Tutorials

The tutorial notebooks include many useful things, and have a couple of additions moved over from Week 4 - Inference (9/12-16).

## Assignment 2

Assignment 2 is due on **Sunday, September 25, 2022**.