Programming in Python for Data Science – Unit tests, corner cases

How can we be so sure that the code we wrote is doing what we want it to?

Does our code work 100% of the time?

These questions can be answered by using something called unit tests.

Assert Statements

assert 1 == 2, "1 is not equal to 2."

AssertionError: 1 is not equal to 2.

Detailed traceback: 
  File "<string>", line 1, in <module>

assert 1 == 1, "1 is not equal to 1."
print('Will this line execute?')

Will this line execute?

assert 1 == 2, "1 is not equal to 2."
print('Will this line execute?')

AssertionError: 1 is not equal to 2.

Detailed traceback: 
  File "<string>", line 1, in <module>

assert 1 == 2

AssertionError: 

Detailed traceback: 
  File "<string>", line 1, in <module>

Why?
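
One way to see why: an assert statement behaves roughly like an if statement that raises an AssertionError when its condition is false, and the message after the comma is optional. The sketch below only illustrates that behaviour; the helper name check is hypothetical and is not part of the lesson.

```python
def check(condition, message=None):
    """Hypothetical helper that mimics `assert condition, message`."""
    if not condition:
        if message is None:
            raise AssertionError()       # no message given, so the error text is empty
        raise AssertionError(message)


# A true condition raises nothing, so the code after it keeps running.
check(1 == 1, "1 is not equal to 1.")

# A false condition with no message produces an AssertionError with empty text.
try:
    assert 1 == 2
except AssertionError as error:
    empty_message = str(error)

# A false condition with a message carries that message in the error.
try:
    assert 1 == 2, "1 is not equal to 2."
except AssertionError as error:
    given_message = str(error)

print(repr(empty_message))
print(repr(given_message))
```

Because the error is raised at the assert line, any statement after a failing assert never executes unless the error is caught.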

Where do assert statements come in handy?

Up to this point, we have been writing our functions first and only testing whether they work afterwards.

Some programmers use a different approach: writing tests before the actual function. This is called Test-Driven Development.

This may seem a little counter-intuitive, but we’re creating the expectations of our function before the actual function code.

Often we have an idea of what our function should be able to do and what output is expected.

If we write our tests before the function, it helps us understand exactly what code we need to write, and it helps us avoid large, time-consuming bugs down the line.

Once we have a series of tests for the function, we can put them into assert statements as an easy way of checking that all the tests pass.

What to test?

def exponent_a_list(numerical_list, exponent=2):
    new_exponent_list = list()
    
    for number in numerical_list:
        new_exponent_list.append(number ** exponent)
    
    return new_exponent_list

assert exponent_a_list([1, 2, 4, 7], 2) == [1, 4, 16, 49], "incorrect output for exponent = 2"

assert exponent_a_list([1, 2, 3], 3) == [1, 8, 27], "incorrect output for exponent = 3"

assert type(exponent_a_list([1,2,4], 2)) == list, "output type not a list"
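
Beyond the three checks above, it can help to test the default argument, less typical numbers, and whether the input is left untouched. The extra tests below are a sketch of what else might be worth checking, assuming the function is meant to leave its input list unchanged; they are suggestions rather than a required set.

```python
def exponent_a_list(numerical_list, exponent=2):
    new_exponent_list = list()
    for number in numerical_list:
        new_exponent_list.append(number ** exponent)
    return new_exponent_list


# The default exponent should square each element.
assert exponent_a_list([1, 2, 4, 7]) == [1, 4, 16, 49], "default exponent should square"

# Negative numbers and zero are valid inputs too.
assert exponent_a_list([-2, -1, 0], 3) == [-8, -1, 0], "incorrect output for negative inputs"

# Anything raised to the power 0 is 1.
assert exponent_a_list([5, 10], 0) == [1, 1], "incorrect output for exponent = 0"

# Assumption: the function should not modify the list it was given.
original = [1, 2, 3]
exponent_a_list(original, 2)
assert original == [1, 2, 3], "input list should not be modified"
```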

False Positives

def bad_function(numerical_list, exponent=2):
    new_exponent_list = [numerical_list[0] ** exponent] # seed list with first element
    for number in numerical_list[1:]:
        new_exponent_list.append(number ** exponent)
    return new_exponent_list

assert bad_function([1, 2, 4, 7], 2) == [1, 4, 16, 49], "incorrect output for exponent = 2"
assert bad_function([2, 1, 3], 3) == [8, 1, 27], "incorrect output for exponent = 3"

bad_function([], 2)

IndexError: list index out of range

Detailed traceback: 
  File "<string>", line 1, in <module>
  File "<string>", line 2, in bad_function

Just because all our tests pass does not mean our program is correct.

It’s common for tests to pass even though the code still contains errors.

Let’s take a look at the function bad_function(). It’s very similar to exponent_a_list except that it separately computes the first entry before doing the rest in the loop.

This function looks like it would work perfectly fine, but what happens if we get an input argument for numerical_list that has no first element to index?

Let’s write some unit tests using assert statements and see what happens.

Here, it looks like our tests pass at first.

But what happens if we try our function with an empty list?

We get an unexpected error! How do we avoid this?

Write a lot of tests and don’t be overconfident, even after writing a lot of tests!

Checking an empty list in our bad_function() function is an example of checking a corner case.

A corner case is an input that is reasonable but a bit unusual and may trip up our code.
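
The corner cases discussed here can be written as tests in their own right. The sketch below reuses both functions from this section and checks the empty list and a single-element list; catching the IndexError with try/except is just one way to confirm that bad_function() fails on the empty list.

```python
def bad_function(numerical_list, exponent=2):
    new_exponent_list = [numerical_list[0] ** exponent]  # fails when the list is empty
    for number in numerical_list[1:]:
        new_exponent_list.append(number ** exponent)
    return new_exponent_list


def exponent_a_list(numerical_list, exponent=2):
    new_exponent_list = list()
    for number in numerical_list:
        new_exponent_list.append(number ** exponent)
    return new_exponent_list


# Corner cases for the loop-only version: both pass.
assert exponent_a_list([], 2) == [], "empty list should return an empty list"
assert exponent_a_list([3], 2) == [9], "incorrect output for a single element"

# bad_function copes with a single element...
assert bad_function([3], 2) == [9], "incorrect output for a single element"

# ...but not with an empty list, where indexing position 0 raises IndexError.
try:
    bad_function([], 2)
    raised_index_error = False
except IndexError:
    raised_index_error = True

print("bad_function fails on an empty list:", raised_index_error)
```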

Testing Functions that Work with Data

import pandas as pd

def column_stats(df, column):
    stats_dict = {'max': df[column].max(),
                  'min': df[column].min(),
                  'mean': round(df[column].mean()),
                  'range': df[column].max() - df[column].min()}
    return stats_dict

data = {'name': ['Cherry', 'Oak', 'Willow', 'Fir', 'Oak'], 
        'height': [15, 20, 10, 5, 10], 
        'diameter': [2, 5, 3, 10, 5], 
        'age': [0, 0, 0, 0, 0], 
        'flowering': [True, False, True, False, False]}
         
forest = pd.DataFrame.from_dict(data)
forest

	name	height	diameter	age	flowering
0	Cherry	15	2	0	True
1	Oak	20	5	0	False
2	Willow	10	3	0	True
3	Fir	5	10	0	False
4	Oak	10	5	0	False

assert column_stats(forest, 'height') == {'max': 20, 'min': 5, 'mean': 12.0, 'range': 15}
assert column_stats(forest, 'diameter') == {'max': 10, 'min': 2, 'mean': 5.0, 'range': 8}
assert column_stats(forest, 'age') == {'max': 0, 'min': 0, 'mean': 0, 'range': 0}

Systematic Approach

We can design our function systematically by following a general set of steps for writing programs.

1. Write the function stub: a function that does nothing but accepts all input parameters and returns the correct datatype.

def exponent_a_list(numerical_list, exponent=2):
    return list()

2. Write tests to satisfy the design specifications.

def exponent_a_list(numerical_list, exponent=2):
    return list()
   
assert type(exponent_a_list([1,2,4], 2)) == list, "output type not a list"
assert exponent_a_list([1, 2, 4, 7], 2) == [1, 4, 16, 49], "incorrect output for exponent = 2"
assert exponent_a_list([1, 2, 3], 3) == [1, 8, 27], "incorrect output for exponent = 3"

AssertionError: incorrect output for exponent = 2

Detailed traceback: 
  File "<string>", line 1, in <module>

3. Outline the program with pseudo-code.

def exponent_a_list(numerical_list, exponent=2):

    # create a new empty list
    # loop through all the elements in numerical_list
    # for each element calculate element ** exponent
    # append it to the new list 
    
    return list()
    
assert type(exponent_a_list([1,2,4], 2)) == list, "output type not a list"
assert exponent_a_list([1, 2, 4, 7], 2) == [1, 4, 16, 49], "incorrect output for exponent = 2"
assert exponent_a_list([1, 2, 3], 3) == [1, 8, 27], "incorrect output for exponent = 3"

AssertionError: incorrect output for exponent = 2

Detailed traceback: 
  File "<string>", line 1, in <module>

Pseudo-code is an informal, high-level description of the code and operations that we wish to implement.

In this step, we are essentially writing the steps that we anticipate needing to complete our function as comments within the function.

So for our function, the pseudo-code includes:

# create a new empty list
# loop through all the elements in numerical_list
# for each element calculate element ** exponent
# append it to the new list

4. Write code and test frequently.

def exponent_a_list(numerical_list, exponent=2):
    new_exponent_list = list()
    
    for number in numerical_list:
        new_exponent_list.append(number ** exponent)
    
    return new_exponent_list
    
assert type(exponent_a_list([1,2,4], 2)) == list, "output type not a list"
assert exponent_a_list([1, 2, 4, 7], 2) == [1, 4, 16, 49], "incorrect output for exponent = 2"
assert exponent_a_list([1, 2, 3], 3) == [1, 8, 27], "incorrect output for exponent = 3"

5. Write documentation.

def exponent_a_list(numerical_list, exponent=2):
    """Creates a new list containing each element of the input list raised to an exponent.
    
    Parameters
    ----------
    numerical_list : list
        The list from which to calculate exponential values
    exponent : int or float, optional
        The exponent value (the default is 2, which implies the square).
    
    Returns
    -------
    new_exponent_list : list
        A new list containing each element of the input list raised
        to the specified exponent
        
    Examples
    --------
    >>> exponent_a_list([1, 2, 3, 4])
    [1, 4, 9, 16]
    """
    new_exponent_list = list()
    for number in numerical_list:
        new_exponent_list.append(number ** exponent)
    return new_exponent_list
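
The Examples section of a docstring can itself be checked as a test. The sketch below uses Python's standard doctest module, which the lesson does not cover, to run the >>> example and compare the real output with the documented one; the docstring is shortened here for space.

```python
import doctest


def exponent_a_list(numerical_list, exponent=2):
    """Creates a new list with each element raised to an exponent.

    Examples
    --------
    >>> exponent_a_list([1, 2, 3, 4])
    [1, 4, 9, 16]
    """
    new_exponent_list = list()
    for number in numerical_list:
        new_exponent_list.append(number ** exponent)
    return new_exponent_list


# Collect the ">>>" examples found in the function's docstring.
finder = doctest.DocTestFinder()
tests = finder.find(exponent_a_list, name="exponent_a_list",
                    globs={"exponent_a_list": exponent_a_list})

# Run each example and compare the actual output with the documented output.
runner = doctest.DocTestRunner(verbose=False)
for test in tests:
    runner.run(test)

results = runner.summarize(verbose=False)
print(f"{results.attempted} example(s) run, {results.failed} failed")
```

If the documented output ever drifts from what the function returns, the failed count becomes non-zero, which flags documentation that no longer matches the code.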