Tutorial Notebooks
This is a collection of notebooks with tips and consolidated references for the various Python and Pandas topics that we are discussing.
- Fun with Numbers
- Writing and Using Functions
- Selecting Data
- Reshaping Data
- Building Data — building up arrays and data series
- Indexing
- Missing Data
- Drawing Charts
- Chart Finishing Touches
- Penguin Inference
- One Sample T-test and Distribution Comparision
- Correlation
- Regressions (goes with Week 8)
- Example: Sessionization
- Logistic Regression
- SciKit-Learn Logistic Regression
- SciKit-Learn Pipelines and Regularization — also includes a significance test
- Advanced SciKit-Learn pipeline and logistic regression example (on Towards Data Science)
- Tricks with Boolean Series
- Movie Decomposition
- K-Means Example (uses the chi-papers data from Week 13)
- Fetching CHI Papers creates the
chi-papers.csv
file from Internet sources - Tuning Hyperparameters
- MovieLens Time Series
- Git repo & workflow example