Checking Your Work

email twitter instagram facebook

Lessons

Standards in this Lesson

CSTA Standards

1B-AP-15: Test and debug (identify and fix errors) a program or algorithm to ensure it runs as intended.
2-AP-17: Systematically test and refine programs using a range of test cases
3B-AP-21: Develop and use a series of test cases to verify that a program performs according to its design specifications.

K-12CS Standards

6-8.Computing Systems.Troubleshooting: Comprehensive troubleshooting requires knowledge of how computing devices and components work and interact. A systematic process will identify the source of a problem, whether within a device or in a larger system of connected devices.
9-12.Computing Systems.Troubleshooting: Troubleshooting complex problems involves the use of multiple sources when researching, evaluating, and implementing potential solutions. Troubleshooting also relies on experience, such as when people recognize that a problem is similar to one they have seen before or adapt solutions that have worked in the past.

Oklahoma Standards

OK.L1.IC.C.02: Test and refine computational artifacts to reduce bias and equity deficits.

Textbook Alignment

IM Algebra 1

IM.Alg1.1.15: Comparing Data Sets

Practices in this Lesson

K12CS

P6: Testing and Refining Computational Artifacts

Math

MP.3: Construct viable arguments and critique the reasoning of others

Science and Engineering

SEP.3: Planning and Carrying Out Investigations

Social Justice

SJ.12: Students will recognize unfairness on the individual level (e.g., biased speech) and injustice at the institutional or systemic level (e.g., discrimination).
SJ.13: Students will analyze the harmful impact of bias and injustice on the world, historically and today
SJ.14: Students will recognize that power and privilege influence relationships on interpersonal, intergroup and institutional levels and consider how they have been affected by those dynamics.
SJ.15: Students will identify figures, groups, events and a variety of strategies and philosophies relevant to the history of social justice around the world.

Students consider the concept of trust and testing — how do we know if a particular analysis is trustworthy?

Lesson Goals

Students will be able to… - Create a subset of data to verify that a given transformation works as-advertised, using attributes of the transformation and the dataset.

Student-facing Lesson Goals

Let’s learn how to test the trustworthiness of a data analysis.

Materials

🔗Confirming Analysis 30 minutes

Overview

Students learn how to create a Testing Table, which is small enough to reason about and can be used to test whether code does the right thing.

Launch

Samples are taken in Data Science and Computer Programming for different reasons. One of the main purposes of Data Science is to take a representative sample from a larger population, and use information from the sample to infer what’s true about the whole population.

Uber and Google are making self-driving cars, which use artificial intelligence to interpret sensor data and make decisions about whether a car should speed up, slow down, or slam on the brakes. This AI is trained on a lot of sample data, which it learns from. What might be the problem if the sample data only included roads in California?
Why might it be a bad thing to only test medicines on men (or only on women), before prescribing them to the general public?

Testing Matters!

A good Testing Table should be representative of the population, and relevant to what’s being analyzed. A good Testing Table should have…

At least the columns that matter — whether we’ll be ordering or filtering by those columns.
Enough rows to include different circumstances that are relevant to the task at hand. For instance, if our code is supposed to extract certain cats from the animals table, our Testing Table should include at least one animal that’s not a cat.
Rows that aren’t already sorted, if our analysis is supposed to sort for us.

Data scientists usually think in terms of samples that best serve the purpose of performing inference: Samples should be representative of the entire population, and large enough to get us fairly close to the truth about that population.

How can we trust that our code is correct?

You’ve already written lots of code to analyze data. Millions of lines of code just like yours are run on datasets every day. The results are used to tell is where a drug is safe or not, whether someone should be put on the "no-fly" list, how much someone needs to pay for health insurance, and more. But programmers are only human, and everyone makes mistakes! And with so-called "A.I. Code Generators" out there writing more and more code for us, we need better and better ways of verifying that code does exactly what it claims to do!

Programmers need to think in terms of Testing Tables that best serve the purpose of verifying that their code does what it’s supposed to: The Tables should be designed to call attention to any imperfections in the code’s instructions.

Investigate

Testing Tables can also be used to verify that a certain analysis is correct.

An AI writes code that claims to filter out any shelter data to show only the cats.

Could we test it using a Table that already contains only cats?
- No! The AI’s code might do nothing at all and just return the Table it was given. It would give the right answer for the wrong reason! We need to find out if it actually removes non-cat Rows.
Could we test it using a Table that has no cats at all?
- No! The AI’s code might always return a table with no rows (regardless of species!), so giving it a Table without cats will give the right answer for the wrong reason! We need to find out if it actually keeps rows for cats.
Could we test it using a Table that has only cats and dogs?
- No! Maybe the AI’s code just removes dogs. We need to see if it removes other species as well.

Verifying that code does what it does is an important part of checking our work! That’s why writing examples is so valuable: it’s a chance to think about how the program should work, without worrying as much about how it works.

The AI writes a function called fixed-cats and claims that, given a table of animals, it produces a table with only fixed cats.

Do you trust it? How could you test it?
Which animals would you use in a Testing Table?
Complete “Trust, but verify …”.
Open the Trust but Verify Starter File. There are 3 versions of fixed-cats. Are they all correct? If not, which ones are broken?

An AI writes a function called fixed-cats and claims that, given a table of animals, it produces a table with only dogs five years or older.

Do you trust it? How could you test it?
Which animals would you use in a Testing Table?
Turn to “Trust, but verify…” (2). Using the same Starter File, construct a Testing Table and figure out which (if any) of the functions are correct!

Synthesize

Complex analysis has more room for mistakes, so it’s critical to think about a Testing Table that allows us to trust that our code really does what it’s supposed to!

How would you check whether or not a facial recognition system was equally accurate for everyone?

🔗When AI isn’t Intelligent… 20 minutes

Launch

Law enforcement in many towns has started using facial-recognition software to automatically detect whether someone has a warrant out for their arrest. A lot of facial-recognition software, however, has been trained on sample data containing mostly white faces. Why might this be a problem?

Investigate

Read "Summarizing US Congress Testimony on Artificial Intelligence", or watch this 10-minute video The Coded Gaze: Bias in Artificial Intelligence.
Complete Can Software be Biased?

Synthesize

Discuss the article and/or video, revisiting the following questions:

What are some concerns that experts and activists have raised about Artifical Intelligence?
What are some solutions that would address these concerns?
How would you test whether or not a facial recognition system was equally accurate for everyone?

These materials were developed partly through support of the National Science Foundation, (awards 1042210, 1535276, 1648684, and 1738598). Bootstrap by the Bootstrap Community is licensed under a Creative Commons 4.0 Unported License. This license does not grant permission to run training or professional development. Offering training or professional development with materials substantially derived from Bootstrap must be approved in writing by a Bootstrap Director. Permissions beyond the scope of this license, such as to run training, may be available by contacting contact@BootstrapWorld.org.

Checking Your Work

🔗Confirming Analysis 30 minutes

Overview

Launch

Investigate

Synthesize

🔗When AI isn’t Intelligent…​ 20 minutes

Launch

Investigate

Synthesize

🔗When AI isn’t Intelligent… 20 minutes