High-Stakes Testing = Negative Effects on Student Achievement

Written by Jack Hassard

On November 9, 2011

In earlier posts, I have advocated banning high-stakes testing as a means of making significant decisions about student performance (achievement in a course, passing a course—end-of-year-tests, being promoted, and graduating from high school).

I suggested this because the research evidence does not support continuing the practice in American schools.

The research reported here sheds light on high-stakes testing, and shows why they should not be used to make decisions about students’ achievement, teachers’ performance, or to make sanctions or offer rewards to schools.

High-Stakes Tests: No Effect on Student Achievement

The Board on Testing and Assessment of the National Research Council issued a report entitled Incentives and Test-Based Accountability in Education.

The report concludes that using test-based (high-stakes testing) incentives has not created positive effects on student achievement.  It says that school-incentives such as those of the No Child Left Behind Act produce some of the highest effects in the programs studied, but only in elementary mathematics, and the improvements were miniscule.  Exit exams, which are used in 25 states, typically given in each of the major content areas at the end-of-the-year have actually decreased graduation rates.

What do tests measure?

We rely on tests to inform us about academic learning, but we fail to consider not only what tests don’t measure, but the limitations on what they do measure.

We get ourselves in real trouble when we think that a score on a NCLB test, or a CRCT type of test is actually a good measure of student academic learning.  We get ourselves in further trouble when we believe that the score represents what students know. The hole gets deeper when we think that changes in student test scores (positive or negative) are caused by the performance of teachers.

The authors of the National Research Council report on Incentives and Test-Based Accountability in Education had this to say about tests:

The tests that are typically used to measure performance in education fall short of providing a complete measure of desired educational outcomes in many ways. This is important because the use of incentives for performance on tests is likely to reduce emphasis on the outcomes that are not measured by the test.

Collateral Effects

The first is teaching to the test.  The curriculum becomes narrow as we teach to the test.  This often causes us to stray from interesting activities.  We give less time to project-based work, and hand-on-collaborative activities.

Using projects and hands-on activities takes away time needed to drill students on the content of the test, or in the case of elementary schools, these take away time to teach math and reading/language arts.

One of the constraints of test-based incentives is that there are many goals of teaching that are not measured by bubble tests such as curiosity, persistence, ability to solve problems, or to collaborate.  Yet, these might be as important as the content that is tested.

But as the Board of Testing report reveals, the tests that we use do not do a great job in measuring the performance in the tested areas such as science, mathematics, English, or social studies.  Since the tests in these areas are based on the outline of content as represented in the content standards of each subject, there simply is not enough time to test students in each content standard.

Constructing a Test is not So Simple

There are seven major areas of standards:

  1. Science as Inquiry
  2. Physical Science
  3. Life Science
  4. Earth Space Science
  5. Science and Technology
  6. Science in Personal and Social Perspectives
  7. History and Nature of Science

In these seven areas there are 64 content standards just for grades K-8.

If you then look at the details of the Science Standards for any one of the 64 content standards, one finds at least three fundamental concepts and principles that underlie the standards.

So at the least, we have 192 concepts to measure on a test.

What is a test maker to do?

Look at this example.

If you were to develop a test for Grade 5, you would need to develop a domain chart that included about 96 concepts.  If you wrote one test item for each concept, then the test would be 96 items long.

But, that’s too long a test, so the test must be reduced in number, to say 30 or 40 items, meaning that not all of the content standards have been measured.

And what is worse, we are only using one test item to “measure” performance on each standard. Wouldn’t it be more valid if we used two or more test items to “measure” each standard? If we do, then we end up testing fewer standards.

So high-stakes tests fall short in measuring the standards in most content areas, yet we continue to use them to make decisions about student, teacher and school performance.

As the National Research Council report suggested

…tests also fall short in measuring performance in the tested  subjects and grades in important ways.   Some aspects of performance in many tested subjects are difficult or even impossible to assess with current tests.  As a result, tests can measure only a subset of the content of a tested subject.

We can define what a test measures, but in the current era of high-stakes testing, the tests that are being used to measure performance in any subject (math, science, English) do not represent the full scope of the curriculum, and have been shown to be ineffective in increasing student achievement.  End-of-year tests, such as those given in Georgia, are high-stakes tests, and should not be used to determine if a student should graduate.  The evidence is that end-of-year tests actually result in decreasing graduation rates.

Suggestions

The authors of the Incentives and Test-Based Accountability in Education report recommend that since we do not yet know how to use test-based incentives consistently to make positive effects, policy makers should support and look at alternative evaluation models.  Furthermore, policy makers should make use of basic research and make choices from a number of options.  They go on to say that:

We call on researchers, policy makers, and educators to examine the evidence in detail and not to reduce it to a simple thumbs-up or thumbs-down verdict. The school reform effort will move forward to the extent that everyone, from policy makers to parents, learns from a thorough and balanced analysis of each success and each failure.

We would wish that policy makers would use the report to put a moratorium on using high-stakes tests to make decisions about students, teachers and schools.

Do you think this will happen? Comment and tell us what you think.

You May Also Like…

Beyond the Scientific Method by Charles R. Ault

Beyond the Scientific Method by Charles R. Ault

The guest post by Dr. Charles Ault, an Emeritus Professor of science education, challenges the current scientific educational approach. He critiques the traditional “unity” view that suggests science processes are universally applicable across all scientific disciplines. Dr. Ault advocates for recognizing the diversity and uniqueness of methods within each scientific discipline. He demonstrates this using examples of how geology and physics operate uniquely. He also introduces his concept of “scientific diversity,” which emphasizes that the interpretation and implementation of core scientific ideas should be adaptable to the specific discipline they are applied to. Furthermore, he suggests rethinking the traditional scientific method and adapting it to the disciplinary contexts.

Beyond Science Standards

Beyond Science Standards

I want to tell you about a book that was just published in the field of science education. It’s a book that I was asked to write the forward. I’ve never been asked to write a forward for a book. But I was honored, and I want to encourage you to examine this book, whether you are a science educator, a scientist or a citizen interested in public education.

A Letter from A Teen Living in 2051 about Education and the Climate Crisis

A Letter from A Teen Living in 2051 about Education and the Climate Crisis

This post focuses on education and climate as seen by a teen living in Atlanta in the year 2051.  I originally published it on April 21,  2012.  Although a work of fiction, it is presented here as a reminder of the consequences of making decisions based on faulty reasoning and ignorance.  I am re-publishing it today ahead of the 2021 United Nations Climate Change Conference being held in Glasgow, Scotland

0 Comments

We would enjoy reading your comments

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Discover more from Citizen Jack

Subscribe now to keep reading and get access to the full archive.

Continue reading