Skip to content

Commit 437e7f0

Browse files
committed
add data quality checks to lab 5
1 parent 45b72ca commit 437e7f0

File tree

2 files changed

+8
-1
lines changed

2 files changed

+8
-1
lines changed

labs/lab_05.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Lab 5 - IN PROGRESS
1+
# Lab 5
22

33
## Objective
44

@@ -22,6 +22,12 @@ You'll [pair](../docs/pairing.md) in your group, repository, and dataset from [L
2222

2323
---
2424

25+
### [Data quality checks](../lectures/lecture_05.md#data-quality-checks)
26+
27+
Set up at least three different data quality checks with pytest.
28+
29+
---
30+
2531
## Submit
2632

2733
[Submit the links to the pull requests via CourseWorks.](https://courseworks2.columbia.edu/courses/210480/assignments)

lectures/lecture_05.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -106,6 +106,7 @@ Seeing projects with `<name>2.py`. Splitting code up into smaller files will hel
106106
## Data quality checks
107107

108108
- Can be flexible, like checking for standard deviation or X% matching certain criteria
109+
- [Things to check for when cleaning data](https://computing-in-context.afeld.me/lecture_17.html#data-cleaning)
109110
- There are tools that help with this
110111

111112
---

0 commit comments

Comments
 (0)