Gadfly on the Wall: Classroom Grades Show Learning Better Than Standardized Test Scores
“Not everything that counts can be counted, and not everything that can be counted counts.”
This summer my family suffered a tremendous blow.
My grandmother, Ce Ce, died.
She was in her 90s and had been unwell since before COVID. But she was also our matriarch, the point around which so much of our interrelations orbited and met.
After the funeral, I found myself at my uncle’s house somehow tasked with watching over several young cousins who had had just about enough of sitting around quietly in itchy suits and dresses.
To get a moment to myself, I set them a task: go downstairs among the assorted relatives and ask them to tell you a story about Ce Ce. Best story wins.
They went off like an explosion. And when they came back, they each had a touching tale about Ce Ce.
One was about how she defended a niece who wanted to marry someone of another faith. Another story was a fond recollection of the sweet and sour spaghetti sauce she used to make, the recipe of which is lost forever.
I was even surprised to hear some stories I had never known like that after my grandfather died, a semi-famous painter had asked Ce Ce on a date!
When my little cousins’ recitations were done, they were united in one thing – wanting to know who won.
I stumbled. I stammered.
I really had no way of judging such a thing.
They had all brought back such wonderful stories. Who won? We were ALL enriched by hearing them.
And that’s kind of how I feel about learning.
It is a fool’s errand to try and compare one person’s acquisition of knowledge with another. But that’s exactly what our current education system is built on.
Unless opted out by a parent or guardian, every public school child in America is required to take standardized tests in grades 3-8 and once in high school.
And the results of these tests are used to make high stakes decisions about which classes the students can enroll in, which enrichments, field trips or remediation they require, and even how much funding will be given or withheld from the schools and districts where they attend.
As a result, the social effects of poverty and racial discrimination end up being transformed into numbers. Thus, instead of being seen as indictments of the economic and racist status quo, they are viewed as the problem of schools and the individual students, themselves.
Standardized tests purport to show that poor children and/or children of color aren’t learning at the same rate as other children. So by the end of 12th grade they have learned less. When they are discriminated against in the job market then, that discrimination is justified – because it is not based on economics or race; it is based on numbers.
However, to perform this alchemy, we have to ignore the fact that standardized assessments are not the only way to determine whether students have learned anything. In fact, for the majority of students’ school experience that learning is assessed by something else entirely – classroom grades.
What if we took classroom grades as seriously as we take standardized test scores?
What if we valued them MORE?
The world would be a very different place.
The entire narrative of failing students and failing schools would turn on its head. After all, graduation rates have steadily increased over the last decade.
Students are completing more courses and more difficult courses. And students are even getting higher grades in these classes!
How is that possible?
The new analysis comes from the U.S. Department of Education, and tracks transcripts of a representative sample of high school graduates in 1990, 2000, 2009, and 2019.
It does not include scores from 2020 and 2022 when both classroom grades and national test scores fell. But that’s clearly because of the pandemic and the fact that most students educations and testing schedules were disrupted.
Before COVID, students increasingly were taking higher-level courses, and their Grade Point Averages (GPAs) were steadily rising — from an average of 2.68 in 1990 to 2.94 in 2000, 3.0 in 2009, and 3.11 in 2019.
This is true of students from all backgrounds, but disparities still existed. On average, white and Asian students had higher GPAs than Black and Hispanic students. Though girls, overall, had higher GPAs than boys.
However, on the National Assessment of Educational Progress (NAEP), given to a sample of students across the country, test scores during the same period did not show a similar increase. Math and reading scores in 2019 were slightly lower than in 2009 and unchanged from 2005. Science scores haven’t budged since 2009.
Why the disparity?
Scholars, teachers, parents and students have been complaining about the validity of standardized testing for more than a century. But business interests make billions of dollars off the industry it creates. Guess which group policymakers continue to heed over the other.
It doesn’t take much to show why classroom grades are better at assessing student learning. Compare them with standardized test scores.
Students earn grades based on a wide range of assessments, activities, and behaviors – quizzes, class participation, oral and written reports, group assignments, homework, and in-class work.
Standardized tests, on the other hand, are not assigned on such a multifaceted range of factors. Instead, they are designed to obtain a measure of student proficiency on a specified set of knowledge and skills within limited academic areas, such as mathematics or reading.
Classroom grades are tapestries sown from many patches showing a year’s worth of progress. Standardized tests are at best snapshots of a moment in time.
In class, students can speak with teachers about grades to get a better sense of how and why they earned the marks they did. They can then use this explanation to guide them in the future thus tailoring the classroom experience to individuals.
The value seen in standardized test is its apparent comparability. Scores are supposed to reflect student performance under roughly the same conditions, so the results can be equated and analyzed.
So the biggest difference isn’t a matter of validity, it is pragmatism. Test scores can be used to rate students from all over the country or the world. They can be used to sort kids into a hierarchy of best to worst. Though why anyone would want to do that is beyond me. The purpose of education is not like the National Football League (NFL). It’s to encourage learning, not competition based on a simulation of learning.
And there is evidence that classroom grades are more valid than standardized test scores.
After all, high stakes assessments like the Scholastic Aptitude Test (SAT) do NOT accurately predict future academic success as classroom grades, in fact, do.
Kids with perfect scores on the SAT or American College Testing (ACT) tests don’t achieve more than kids who received lower scores or never took the tests in the first place.
Numerous studies have shown this to be true. The most recent one I’ve seen was from 2014.
Researchers followed more than 123,000 students who attended universities that don’t require applicants to take these tests as a prerequisite for admission. They concluded that SAT and ACT test scores do not correlate with how well a student does in college.
However, classroom grades do have predictive value – especially when compared to standardized tests. Students with high grades in high school but middling test scores do better in college than students with higher test scores and lower grades.
Why? Because grades are based on something other than the ability to take one test. They demonstrate a daily commitment to work hard. They are based on 180 days (in Pennsylvania) of classroom endeavors, whereas standardized tests are based on the labor of an afternoon or a few days.
Classroom grades would not have such consistent predictive value if they were nothing but the result of grade inflation or lenient teachers.
In fact, of the two assessments – classroom grades and standardized tests – one is far more essential to the daily learning of students than the other.
We could abolish all standardized testing without any damage to student learning. In fact, the vacuum created by the loss of these high stakes tests would probably result in much less teaching to the test. Days, weeks, months of additional class time would suddenly appear and much more learning would probably take place.
Academic decisions about which classes students can enroll in or what remediation is necessary could just as easily be made based on classroom grades and teacher observations. And funding decisions for schools and districts could be made based on need and equity – not the political football of standardized testing.
However, getting rid of classroom grades would be much more disruptive. Parents and students would have few measures by which to determine if students had learned the material. Teachers would have fewer tools to encourage children to complete assignments. And if only test scores remained, the curriculum would narrow to a degree unheard of – constant, daily test prep with no engagement to ones life, critical thinking or creativity.
To be fair, there are mastery-based learning programs that try to do without grades, but they are much more experimental and require a complete shift in how we view learning. This is a more holistic system that requires students to demonstrate learning at one level before moving ahead to the next. However, it is incredibly labor intensive for teachers and often relies heavily on edtech solutions to make it viable.
I’m not saying this is an impossible system or even taking a stance on its value. But a large scale shift away from classroom grades would be chaotic, confusing and probably a failure without serious support, scaffolding and parental, teacher and student buy-in.
At the end of the day, classroom grades are the best tool we have to determine whether learning has taken place and to what degree. We should do everything we can to change the way policymakers prefer the standardized approach to the personalized one.
To return to a fuller quote by sociology professor Cameron with which I began this article:
‘It would be nice if all of the data which sociologists require could be enumerated because then we could run them through IBM machines and draw charts as the economists do. However, not everything that can be counted counts, and not everything that counts can be counted.”
Thus, the urge to quantify student learning seems predicated on the popular maxim: If you can’t measure it, you can’t manage it.
Standardized testing is about managing students – sorting them into valuable and disposable for the workforce.
Classroom grades are actually concerned with the project at hand – assessment of learning.
Which brings me back to my little cousins.
When I told them I couldn’t possibly pick a winner between them based on their stories, there were lots of groans of annoyance.
They viewed the whole project as a competition and they wanted to win.
I hope on reflection they’ll see that we all won.
Everything isn’t a contest. We are not all opponents.
If they can grasp that, it would be the greatest lesson I could teach.
This blog post has been shared by permission from the author.
Readers wishing to comment on the content are encouraged to do so via the link to the original post.
Find the original post here:
The views expressed by the blogger are not necessarily those of NEPC.