Year

2020

Scope

9 cookie recipes
scored 1-100 by 11 tasters

Variables

4 ingredients
3 levels each
2 age groups

Observations

7 adults
4 kids

Script

R

Method

Location Dispersion Model
Response Surface Model
Linear Model
Quadratic Model

Chocolate Chip Cookie Tasting Experiment

To find out the best recipe with a limited sample size (as one would imagine how difficult it is to get access to human tasters in the Los Angeles county during covid-19 quarantine), my project partner Prakul and I developed and executed an effective experiment design involving 11 tasters, to gather the critical data points that allowed us to map out the main effects of key cookie ingredients. Every taster only needed to take 9 bites and score it 1 to 100 to give us sufficient information about 4 ingredients, each with 3 different levels of ingredient mix.

Findings

Chewiness is the main determining factor for adults. Bread flour to all-purpose flour ratio peaked at half/half mix. Sweetness is the biggest likeness factor for kids: the sweeter the better for the 11 to 16 age group. White sugar 100% is the best for kids. And when we have adults and kids together, we see a “tug of war” between the two factors.

Design

This is a classical 3x3 matrix graeco-latin square design, which is a 4 dimensional orthogonal array.

graeco-latin-square

⦿  Column is the 1st dimension: sugar, in three levels of
    100% white, 50% white and 50% brown, 100% brown sugar.
⦿  Row is the 2nd dimension: flour, in three levels of mixing
    all-purpose and bread flour, i.e. 100%, 50% 50%, 100%.
⦿  Latin letter is the 3rd dimension: fat, in three levels of
    mixing butter and coconut oil.
⦿  Greek letter is the 4th dimension: chocolate, in three levels
    of mixing chips and chunks.

Design execution is fundamental to the success of the experiment involving individuals tasting cookies. A million things can go wrong and get us unreliable data. Prakul and I worked out all the details to block the two operations with the same ingredients from the same brand, the same baking procedure and equipment, the same tasting activity with a baseline cookie first as score 70 and tasters eyes closed. And we randomized the baking, labeling and tasting procedure so that what cookie tasting sequence won’t skew our results.

Analysis

99 data points came from 11 tasters, each giving 9 scores over the 9 recipes.

observation data points

Here’s how the data grouped per recipe look and color coded into adult and kid.

boxplot

Here’s the Main Effects of the four ingredients, showing how tasters score on each level of the four ingredients.

main-effect

After running on response surface model on full-factorial with two dominant factors: flour and sugar, we look at how the two factors interact for all age groups.

interaction-plot

We built several models and came to below bivariate quadratic model which can be visualized into contour plot, which tells the story of the sugar x flour “tug war”. Depending on the adult-kid ratio in our sampling, we see a “sweet spot” where the highest score at 100% white sugar and 50/50 all-purpose and bread flour ratio.

contour-plot quadratic-model

Our perfect chocolate chip cookie recipe is tested to satisfy everyone.