State Standards Rise in Reading, Fall in Math

The data used to determine the grades in Figure 1 are available here.

Much ado has been made about setting high standards over the past year. In his first major address on education policy, given just two months after he took the oath of office, President Barack Obama put the issue on the national agenda. They ought “to stop lowballing expectations for our kids,” he said, adding that “the solution to low test scores is not lowering standards—it’s tougher, clearer standards.” In March 2010, Secretary of Education Arne Duncan accused educators of having “lowered the bar” so they could meet the requirements set by the federal education law, No Child Left Behind (NCLB), which requires that all students be proficient in reading and math by the year 2014.

Current conversations about creating a common national standard largely focus on the substantive curriculum to be taught at various grade levels. Even more important, we submit, is each state’s expectations for student performance with respect to the curriculum, as expressed through its proficiency standard. Curricula can be perfectly designed, but if the proficiency bar is set very low, little is accomplished by setting the content standards in the first place.

To see whether states are setting proficiency bars in such a way that they are “lowballing expectations” and have “lowered the bar” for students in 4th- and 8th-grade reading and math, Education Next has used information from the recently released 2009 National Assessment of Educational Progress (NAEP) to evaluate empirically the proficiency standards each state has established. This report is the fourth in a series in which we periodically assess the rigor of these standards (see “Johnny Can Read…in Some States,” features, Summer 2005; “Keeping an Eye on State Standards,” features, Summer 2006; and “Few States Set World-Class Standards,” check the facts, Summer 2008).

The 2009 NAEP tests in reading and math were given to a representative sample of students in 4th- and 8th-grade in each state. NAEP, called “the nation’s report card,” is managed by the Department of Education’s National Center for Education Statistics and is currently the “gold standard” of assessments. Its proficiency standard is roughly equivalent to the international standard established by those industrialized nations that are members of the Organisation for Economic Co-operation and Development (OECD). If a state identifies no higher a percentage of students as being proficient on its own tests than NAEP does, then the state can be said to have set its standards at a world-class level. To ascertain objectively whether state standards are high or low, and whether they are rising or falling, we compare the percentage of students deemed proficient by each state with the percentage proficient as measured by NAEP. The state assessment data used in this report consist of those compiled in 2009 by the 50 states and the District of Columbia.

States have strong incentives not to set world-class standards. If they do, more of their schools will be identified as failing under NCLB rules, and states will then be required to take corrective actions to bring students’ performance up to the higher standard. As a result, the temptation for states to “lowball expectations” is substantial. Perhaps for this reason, a sharp disparity between NAEP standards and the standards in most states has been identified in all of our previous reports. In 2009, the situation improved in reading, but deteriorated further in math.

Every state, for both reading and math (with the exception of Massachusetts for math), deems more students “proficient” on its own assessments than NAEP does. The average difference is a startling 37 percentage points. In Figure 1, we provide a uniform ranking of the rigor of state standards using the same A to F scale used to grade students (see sidebar for the specifics on the methodology we used).

The Grading

In 2003, 2005, 2007, and 2009, 4th- and 8th-grade students took both state and NAEP tests in math and reading. The grades reported here are based on the comparison of state and NAEP proficiency scores in 2009, and changes for each are calculated relative to 2003 (Figure 2). For each available test, we computed the difference between the percentage of students who were proficient on the NAEP and the percentage reported to be proficient on the state’s own tests for the same year. We also computed the standard deviation for this difference. We then determined how many standard deviations each state’s difference was above or below the average difference of all observations in 2009, 2007, and 2005 on each test. The scale for the grades was set so that if grades had been randomly assigned and so were in a normal distribution, 10 percent of the states would earn As, 20 percent Bs, 40 percent Cs, 20 percent Ds, and 10 percent Fs. The grade given to each state is based on how much easier it was to be labeled proficient on the state assessment than on the NAEP. For example, on the 4th-grade math test in 2009, West Virginia reported that 60.8 percent of its students had achieved proficiency, but 28.1 percent were proficient on the NAEP. The overall grade for each state was determined by comparing the difference with the standard deviation from the average for all states for all four years on the tests for which the state reported proficiency percentages. In the case of West Virginia for 4th-grade math, the difference (60.8 percent – 28.1 percent = 32.7 percentage points) is about 0.02 standard deviations worse than the average difference between the state test and the NAEP over the three years, which is 32.4 percent. This earned West Virginia a C for its standards in 4th-grade math. We are therefore generous in that we do not require the meeting of any stipulated cutoff in the differences with NAEP to award a specific grade: no single state would be ranked A, say, if we required for this a difference with NAEP smaller than 5 percentage points. Instead, we rank states against each other in accordance to their current position in the distribution of differences over all the years for which we have observations (2003, 2005, 2007, and 2009).

Racing to the Top?

Ironically, Tennessee received an F and had the lowest standards of all states, despite the fact that it is one of the two winners in the first phase of the bitterly contested Race to the Top (RttT) competition sponsored by the Obama administration’s Department of Education. Indeed, Tennessee has had the lowest standards of all states since 2003. Based on its own tests and standards, the state claimed in 2009 that over 90 percent of its 4th-grade students were proficient in math, whereas NAEP tests revealed that only 28 percent were performing at a proficient level. Results in 4th-grade reading and at the 8th-grade level are much the same. With such divergence, the concept of “standard” has lost all meaning. It’s as if a yardstick can be 36 inches long in most of the world, but 3 inches long in Tennessee.

Delaware, the other RttT First Phase winner, also had below-average standards, for which we awarded a grade of C- and ranked it 36th of the 50 states and the District of Columbia. Delaware claimed that 77 percent of its 4th-grade students were proficient in math, when NAEP shows that only 36 percent were. In 8th-grade reading, Delaware said 81 percent of its students were proficient, but NAEP put the figure at 31 percent.

From these findings one might conclude that the Obama administration is having a huge policy impact by getting states like Tennessee and Delaware to set standards they have been unwilling to establish in the past. But Tennessee earned almost full marks (98 percent) on the section of the competition (weighted a substantial 14 percent of all possible points) devoted to “adopting standards and assessments,” even though its standards have remained extremely low ever since the federal accountability law took hold. The proof will be in the pudding. If Tennessee and Delaware and other states now shift their standards dramatically upward, RttT will win over those who think it is performance, rather than promises, that should be rewarded.

Disparities in State Standards

Despite the incentive to lowball expectations, five states—Hawaii, Massachusetts, Missouri, New Mexico, and Washington—have set their standards at or close to the world-class level, earning them an A. Notice that we award grades purely for the expected standard for performance, not actual proficiency. New Mexico earned the same mark as Massachusetts, even though only about one-quarter of its students are proficient, while half of Massachusetts students score at that level. The two deserve equal grades, however, because both are rigorous in their expectations. Another eight states—Colorado, Maine, Minnesota, Montana, New Hampshire, New Jersey, Rhode Island, and Vermont—earned a B for their standards.

President Obama is undoubtedly correct, however, in suggesting that many states are “lowballing expectations.” Of the remaining 38 states, 27 earned a C, and 8—Arizona, Idaho, Illinois, Maryland, Michigan, New York, Texas, and Virginia—a D. Three states—Alabama, Nebraska, and Tennessee—had such low standards that we awarded them an F. All of the states that earned grades of F have been ranked D or below in all three of our previous reports. This suggests that once a standard, however low, has been set, it tends to persist—another reason to be concerned about promises from Delaware and Tennessee.

Changes in Standards

Secretary of Education Duncan is not altogether correct in suggesting that educators are lowering the bar, however. Figure 2 shows that in 2009 the differences between state and NAEP standards shrank by 0.08 standard deviations as compared to the average for the three prior surveys. This is a reversal of the trend of declining standards we observed between 2003 and 2007. Eight states improved the overall rigor of their assessments by a full letter grade or more since 2007: Georgia, Mississippi, Montana, New Jersey, New Mexico, Oklahoma, Utah, and West Virginia. By contrast, we gave just four states—Alaska, California, New York, and South Carolina—grades that were at least a full letter grade worse than they received in 2007.

The reversal in the overall trend is, however, driven wholly by an improvement in the rigor of reading assessments, which set expectations that are higher by 0.49 standard deviations in 4th grade and by 0.26 standard deviations in 8th grade. As a matter of fact, 17 states increased the rigor of their 4th-grade reading assessments by a whole letter grade since 2007, and 17 states did the same for 8th grade. But math standards have slipped by 0.12 standard deviations in 4th grade and by 0.31 in 8th grade. This means that at least some of the state-reported improvements in mathematics proficiency are misleading.

Converging on a De Facto National Standard?

Most changes to standards, as we noted, have been fairly small: only 12 states have made changes to their standards that alter their standing by a whole letter grade. But since our last report two states, Hawaii and South Carolina, have made major alterations to state assessments. The results of these moves have been at odds: while Hawaii’s increased alignment with NAEP raised its grade from a B+ in 2007 to an A, South Carolina dropped from an A to a C-.

States nonetheless seem to be continuing their trajectory of convergence toward standards of similar rigor in math (which, given the slipping standards noted above, constitutes a downward convergence), but are more divergent in reading since 2007, particularly in 4th grade. If the convergence of math standards were to continue, we could gradually attain something like a national standard. But it would take a great deal of national patience to achieve a national standard by convergence creep.

In this report, as in previous ones, we assess the rigor of standards that states set. This is an important task, as it reminds states that whether students have or have not learned cannot be a matter of how the test is designed and where the “proficiency line” is drawn. Rather, setting high standards for proficiency is the first step in the journey toward actually improving the learning of a high percentage of students. According to NAEP, less than one-third of students are proficient in reading and a similar proportion in math nationwide. For the sake of the children of this country, we should be doing much better than that.

Paul E. Peterson is professor of government at Harvard University, senior fellow at the Hoover Institution, and editor-in-chief at Education Next. Carlos Xabel Lastra-Anadón is a research fellow at the Program on Education Policy and Governance at Harvard University.

State Standards Rise in Reading, Fall in Math

Latest Issue

Spring 2025

Get a Sample copy of The Journal

NEWSLETTER

Business + Editorial Office

Discover

More Information