Oh, the temptation to "fix" the data
Posted: Wed Mar 09, 2011 12:54 pm
My daughter (5th grade) is doing a science fair project on sizing paper. One of the tests that she wanted to do was compare the texture of paper with sizing added to paper with no sizing. Since we couldn't figure out a way to measure this quantitatively, we figured we would do it qualitatively (by touching it, maybe drawing on it, then comparing).
Her big sister is doing the "Take a musical step back in time" project from this site, and she developed a survey form to have listeners rate the sound from different home made record players. Little sister copied this idea and made a survey form to have users test the paper. She decided to just have them touch the paper, rather than write on it, so she could use the same paper for all the volunteers. (Each piece of paper is about 20 minutes of work, so 6 samples times a survey size of 10 volunteers would be 20 hours of preparation time to make paper!)
Here's the problem: Little sister just didn't have the best survey design skills. Her first two volunteers (mom and dad) gently tried to tell her that her test was really hard to do, but she didn't really want to hear it. The form was really hard to fill out--keeping track of samples numbered one to six, and scores from one to five and put the right number by the right number. And, rating texture of paper is not something that comes natural to people who (unlike dear daughter) are not papermaking fanatics. From her point of view, the "fuzzy" paper has "bad texture" -- the fibers are falling out, loose fibers will scratch off as you try to write on it. From some users point of view (despite instructions to the contrary), the "fuzzy" paper feels like a soft cotton sweater, fun to the fingers, and thus has a "good texture."
Now that the data is in, we can see that the fuzziest paper (with no sizing added to keep the fibers from flying away) has been rated "best" by about a third of the users (contrary to instructions), and "worst" by the rest. It's clear that they filled in the survey scores backwards to what she was trying to ask.
I think the only solution is to either throw out this part of the experiment -- either redesign the survey, make it easier for volunteers, and re-do it, or to throw out the survey just use the quantitative data that she already has (she did strength tests and absorbency tests as well as another qualitative experiment where she folded origami with the different papers) and be done. It's a breach of scientific ethics to just "reverse the scores" on the bad surveys, even if we are sure that the data would make the most sense that way. This survey can't be saved? If we throw it out, do we put any mention of it on the science fair board?
Her big sister is doing the "Take a musical step back in time" project from this site, and she developed a survey form to have listeners rate the sound from different home made record players. Little sister copied this idea and made a survey form to have users test the paper. She decided to just have them touch the paper, rather than write on it, so she could use the same paper for all the volunteers. (Each piece of paper is about 20 minutes of work, so 6 samples times a survey size of 10 volunteers would be 20 hours of preparation time to make paper!)
Here's the problem: Little sister just didn't have the best survey design skills. Her first two volunteers (mom and dad) gently tried to tell her that her test was really hard to do, but she didn't really want to hear it. The form was really hard to fill out--keeping track of samples numbered one to six, and scores from one to five and put the right number by the right number. And, rating texture of paper is not something that comes natural to people who (unlike dear daughter) are not papermaking fanatics. From her point of view, the "fuzzy" paper has "bad texture" -- the fibers are falling out, loose fibers will scratch off as you try to write on it. From some users point of view (despite instructions to the contrary), the "fuzzy" paper feels like a soft cotton sweater, fun to the fingers, and thus has a "good texture."
Now that the data is in, we can see that the fuzziest paper (with no sizing added to keep the fibers from flying away) has been rated "best" by about a third of the users (contrary to instructions), and "worst" by the rest. It's clear that they filled in the survey scores backwards to what she was trying to ask.
I think the only solution is to either throw out this part of the experiment -- either redesign the survey, make it easier for volunteers, and re-do it, or to throw out the survey just use the quantitative data that she already has (she did strength tests and absorbency tests as well as another qualitative experiment where she folded origami with the different papers) and be done. It's a breach of scientific ethics to just "reverse the scores" on the bad surveys, even if we are sure that the data would make the most sense that way. This survey can't be saved? If we throw it out, do we put any mention of it on the science fair board?