In addition to his active role on the committee, Mr. Ewert also assisted in the early stages of the project by agreeing to modify the third grade assessment instruments so that class size information could be collected in May 1997 throughout the state. He regularly answered questions about the assessment process and resulting data. It was through his assistance that the Center obtained the data files from Advanced Systems in Measurement and Evaluation, Inc., the state's contractor for the assessment program. Ms. Cherie Dudley of Advanced Systems was helpful in providing information regarding the structure of the computer data files, codes used, scoring methods, and other factors.
Finally, we would like to thank the sixteen thousand third graders in
New Hampshire's public schools who answered an additional question so that
this study would be possible.
At the beginning of the project it became clear that the necessary class size information had not been collected during the 1996 testing. The Center and the NH Department of Education staff discussed how such an analysis could be performed. It was agreed to collect the necessary class size information during the tests that were to be administered in May 1997. While the intent at that time was to collect necessary class size data from both third and sixth grades, the actual data collection occurred only at the third grade level.
There were 16,419 third grade public school students at the time the tests were administered. Of these 15,799 took the Language Arts test and 15,944 took the mathematics test.
The tests were scored and analyzed by Advanced Systems in Measurement and Evaluation, Inc., a Dover, NH, firm under contract with the New Hampshire Department of Education. The Department released its annual report on the 1997 tests in October 1997. Shortly thereafter a database of results was provided to the Center by Advanced Systems.
The results contained in these files have been analyzed in conjunction with other readily available data and reports to form the basis for the findings and conclusions contained in this report.
As stated in RSA 193-C, the state law that established the NHEIAP, the
purpose of the state tests and curriculum frameworks are "to establish
what New Hampshire students should know and be able to do and to develop
and implement effective methods for assessing that learning and its application
so that local decisions about curriculum development and delivery can be
made." We hope that this report contributes to that purpose.
Results of the study demonstrated that:
Each student was asked on the student questionnaire that was part of the assessment process to answer the following question:
| Class Size |
Number of Students
|
| 1-14 students |
455
|
| 15-20 students |
4,204
|
| 21-24 students |
6,293
|
| 25-27 students |
3,312
|
| 28-30 students |
1,580
|
| 31+ students |
90
|
| not tested/spoiled |
485
|
| Total |
16,419
|
The number of students in very small classes (1-14) is also small, only 2.8% of all third graders. The reasons for excluding the results for students in this smallest class size grouping are discussed in the Appendix: The Smallest Class Size Group.
Figure 1 displays the relative percentage of the students that fall
into each category.

Proficiency levels in both language arts and mathematics are Advanced, Proficient, Basic, and Novice.
We aggregated the students by class size and examined the results.
|
|
||||||
| Class
Size |
Advanced
|
Proficient
|
Basic
|
Novice
|
Not Tested
|
Total
|
| 15-20 |
4.4%
|
24.4%
|
43.6%
|
25.6%
|
2.5%
|
100.0%
|
| 21-24 |
4.8%
|
25.8%
|
43.2%
|
24.2%
|
2.0%
|
100.0%
|
| 25-27 |
5.2%
|
27.0%
|
40.1%
|
24.9%
|
2.8%
|
100.0%
|
| 28-30 |
4.9%
|
26.1%
|
43.7%
|
22.0%
|
3.4%
|
100.0%
|
| All Students |
4.5%
|
24.8%
|
41.6%
|
24.4%
|
4.6%
|
100.0%
|

There is only a very slight and inconsistent variation in results among class sizes of 15-30.
Proficiency Level in Mathematics
Table 3 and Figure 3 show the distribution of proficiency levels in
mathematics among each of the class size groupings.
|
|
|||||||
| Class
Size |
Advanced
|
Proficient
|
Basic
|
Novice
|
Not Tested
|
Total
|
|
| 15-20 |
11.8%
|
25.6%
|
44.5%
|
17.5%
|
0.6%
|
100.0%
|
|
| 21-24 |
13.1%
|
26.7%
|
42.7%
|
17.2%
|
0.3%
|
100.0%
|
|
| 25-27 |
15.3%
|
27.3%
|
39.9%
|
16.7%
|
0.8%
|
100.0%
|
|
| 28-30 |
17.3%
|
27.6%
|
40.8%
|
13.9%
|
0.4%
|
100.0%
|
|
| All Students |
13.1%
|
25.8%
|
41.5%
|
16.9%
|
2.7%
|
100.0%
|
|

The percentage of students at the Advanced and Proficient levels is
somewhat greater in larger class sizes. This relationship is consistent,
though small, across the groupings studied.
Writing Score
As part of the language arts test, each student is asked to write a
short essay of a few sentences. Two independent reviewers use the Annotated
Holistic Scoring Method to score a student's writing sample. Each reviewer
can assign an integer score from 1 to 6 with higher scores indicating better
writing. Each student, therefore, receives a combined score of an integer
in the range 2-12.
The average writing score of all third grade students in 1997 was 6.49.
We investigated whether the average writing score differed by class size
and what difference might exist between boys and girls. The results are
shown in Table 4 and Figure 4.
| Class Size |
Count of Students
|
|
||
|
Boys
|
Girls
|
Both Genders
|
||
| 15-20 |
4,204
|
5.96
|
7.02
|
6.48
|
| 21-24 |
6,293
|
6.06
|
7.02
|
6.53
|
| 25-27 |
3,312
|
6.13
|
6.93
|
6.53
|
| 28-30 |
1,580
|
6.10
|
6.95
|
6.53
|
| All Students |
16,419
|
6.02
|
6.98
|
6.49
|

Table 5 and Figure 5 display the percentage of correct answers by gender
and by class size.
|
|
|||
| Class Size |
|
|
|
| 15-20 |
75.2%
|
72.2%
|
73.7%
|
| 21-24 |
75.5%
|
73.7%
|
74.6%
|
| 25-27 |
74.6%
|
72.6%
|
73.6%
|
| 28-30 |
75.9%
|
74.2%
|
75.1%
|
| All Students |
75.1%
|
72.8%
|
73.9%
|

Table 6 and Figure 6 display the percentage of correct answers by gender
and by class size.
|
|
|||
| Class Size |
|
|
|
| 15-20 |
71.5%
|
72.3%
|
71.9%
|
| 21-24 |
72.0%
|
73.8%
|
72.9%
|
| 25-27 |
72.3%
|
74.3%
|
73.3%
|
| 28-30 |
73.8%
|
75.8%
|
74.8%
|
| All Students |
72.0%
|
73.5%
|
72.7%
|
On average, boys had 1.5% more correct answers than girls. This is a smaller difference than the opposite gender difference in language arts identified above.

We investigated whether the average score of students in different size
classes differed in any consistent or significant manner. Table 7 and Figure
7 display the results.
|
|
||||
| Class Size |
Boys
|
Girls
|
Both Genders
|
|
| 15-20 |
6.45
|
6.72
|
6.58
|
|
| 21-24 |
6.62
|
6.75
|
6.68
|
|
| 25-27 |
6.65
|
6.77
|
6.71
|
|
| 28-30 |
6.65
|
6.77
|
6.71
|
|
| All Students |
6.57
|
6.73
|
6.65
|
|

Table 8 and Figure 8 display the differences we found in the average
score of students in different size classes and gender.
|
|
||||
| Class Size |
Boys
|
Girls
|
Both Genders
|
|
| 15-20 |
4.83
|
5.00
|
4.91
|
|
| 21-24 |
4.96
|
4.96
|
4.96
|
|
| 25-27 |
5.12
|
5.26
|
5.19
|
|
| 28-30 |
5.37
|
5.52
|
5.44
|
|
| All Students |
4.97
|
5.07
|
5.02
|
|

Girls had slightly higher scores (0.1 point) than boys on this part of the test. This is in contrast to the multiple choice mathematics questions where boys' achievement surpassed that of the girls.
We decided to explore other factors that may be related to achievement test results. Because little information is collected on each student by the NHEIAP, there was no way to investigate what individual background factors might be related to achievement. However, previous reports on New Hampshire's public schools have found factors such as family income and parental education to be strongly related to student achievement. We proceeded to collect school- and community-wide data related to these factors and to aggregate student results accordingly. The result is that, in contrast to class size, these demographic factors show strong relationships to achievement.
|
|
|||||
|
% of Students in Schools Receiving Free and Reduced Price
Lunches
|
Number of Students
|
Number of Schools
|
Writing
|
Open-Ended Language Arts
|
Open Ended Mathematics
|
|
0.00-6.88%
|
3,249
|
35
|
7.03
|
7.07
|
5.70
|
|
6.89-14.01%
|
3,248
|
42
|
6.73
|
6.70
|
5.27
|
|
14.02-22.34%
|
3,272
|
57
|
6.57
|
6.67
|
5.04
|
|
22.35-34.05%
|
3,256
|
58
|
6.20
|
6.47
|
4.88
|
|
34.05-77.21%
|
3,246
|
58
|
5.85
|
6.13
|
4.08
|
|
16,271
|
|||||



The schools were ordered according to these figures and then aggregated into five groups (each with approximately the same number of students) and average scores were calculated for all students in each of the five groups of schools.
Table 10 and Figures 10a-c display the results. These results are not
as striking or consistent as those above. Nevertheless, they do confirm
in a general way that communities with higher family incomes are more likely
to exhibit higher average achievement test results.
|
|
|
||||||
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
||



A similar analysis was done for the percentage of each community's population with incomes below 185% of the official poverty level in 1989. The results are very similar, showing that in communities with poorer populations, achievement of third grade students is generally lower.
Both factors show a strong and consistent relationship with the third grade test results. The higher the percentage of adults in the community with a college education, the higher the average achievement results of third graders (Figures 11 a-c). The higher the percentage of adults in the community with a high school education or less, the lower the test results (Figures 12a-c).






The 1990 US Census was used to obtain for each school or school district the percentage of all households that were renters. Test scores were sorted by schools based on this factor and aggregated into five groupings, each with as close to 20% of the students as possible.
As shown in Figures 13a-c, this factor shows only a muted relationship to achievement. There appears to be little or inconsistent difference in average test results in the 60% of schools where the percentage of renters is 23% (homeowners are 77% and above). For schools where the renters are an even lower percentage of all households, average test results are higher in those communities with very low percentages of renters. Because of these results and the weakness of the proxy factor itself, it would be inappropriate to conclude that schools where larger percentages of students are relatively transient have lower average achievement levels. The data in this study is not adequate to draw a conclusion either way in this regard.






A close review of these graphs indicates how small a variation is seen in average achievement across class size groupings in comparison to that seen across groupings based on other factors.
In addition to average achievement, it is possible to also look at the complete distribution of scores. This can most easily be done with the multiple-choice components of the language arts and mathematics tests.
Each student taking the language arts test answered 35 multiple-choice
questions and obtained a score in the range from 0 - 35 correct. Figure
15a displays the percentage of students who achieved each score. The curve
is an expected normal curve with negative skew and a peak (mode) of 30.

Figure 15c displays another set of distribution curves. One is for students in class sizes of 15-20 students while the other is for students in classes of 28-30 students. In this case there is little apparent difference. The two curves are nearly identical.


The figures for the multiple-choice mathematics results are very similar to those for language arts presented above. They are not reproduced here because they do not provide any new insights but simply lead to the same conclusion.
Statistically, there is little relationship between the class size in which third grade students spend most of their classroom time and their average achievement or the distribution of their achievement test scores. In contrast, factors associated with the economic situation of the families sending children to the school (as measured by the percentage of students at the school eligible for free and reduced-price lunches) is related to measured achievement levels. So too is the degree to which adults in the community are themselves educated.
E. A Surprise: Schools with Higher
Income Families Have Larger Class Sizes
As described above, students in larger classes obtained higher average
scores in the open-ended mathematics test items than did those in smaller
classes. Although the strength of this relationship is much smaller than
those related to socioeconomic variables such as family income and adult
education, it is the largest and most consistent of those related to class
size. It is also contrary to intuition.
Because we have shown that achievement is most strongly (and inversely) related to the percentage of students receiving free and reduced school lunches, we asked whether it was possible that the larger class sizes had a disproportionate share of students from schools where few students received any lunch subsidy.
Such a possibility seemed contrary to intuition. A common assumption is that schools in communities of higher socioeconomic status are most likely to have higher spending per pupil and, therefore, likely to have smaller average class sizes, not larger.
We determined the percentage of students in each class size that come
from each of the 5 school lunch groupings. The results are shown in Table
11.
|
|
||||||
|
|
||||||
| % of Students in Schools Receiving Free or Reduced Price School Lunch |
15-20
|
21-24
|
25-27
|
28-30
|
All Class Sizes
|
|
| 0-6.88% |
9.9%
|
22.7%
|
22.3%
|
36.1%
|
19.8%
|
|
| 6.89-14.01% |
16.2%
|
19.9%
|
22.9%
|
29.8%
|
19.8%
|
|
| 14.02-22.34% |
28.4%
|
16.3%
|
18.5%
|
11.3%
|
19.9%
|
|
| 22.35-34.05% |
29.8%
|
18.2%
|
11.1%
|
15.8%
|
19.8%
|
|
| 34.05-77.21% |
14.3%
|
22.9%
|
25.2%
|
7.0%
|
19.8%
|
|
| Unknown |
1.2%
|
0.0%
|
0.0%
|
0.0%
|
0.9%
|
|
|
100.0%
|
100.0%
|
100.0%
|
100.0%
|
100.0%
|
||
We then asked whether the open-ended math scores of the class sizes could be predicted based simply on the skewed distribution of students from the different school lunch groupings.
The average score of all students on the open-ended math questions was 5.02 while for those in class sizes of 28-30 students it was 5.44, 0.42 higher than the average. The predicted result based solely on the distribution of school lunch groupings is 5.26. In other words, after accounting for the unexpected preponderance of students from the lower lunch subsidy schools, the difference from the average score is only 0.18. This remaining unexplained difference is not large and our conclusion is that there is little or no relation between class size and the results in this portion of the NHEIAP.
The surprise that schools from "wealthier" communities tend to have larger class sizes, however, deserved further investigation.
Dividing all students into 4 class size groupings and simultaneously into 5 school lunch subsidy groupings, however, results in 20 different subgroups. Some of these have too few students from too few schools to be able to draw useful statistical conclusions. The schools were therefore divided into only two school lunch groupings, each containing approximately half the students. 8,131 students were from schools where 18.75% or fewer students receive school lunch subsidy and 8,140 were from schools with higher school lunch subsidy rates. For ease of reference, the former group is termed the "wealthier" schools and the latter the "poorer" schools. The latter group has more students from low-income families than the former group. (148 students are from schools with 9 or fewer 3rd graders. These schools are unidentified and thus listed as "unknown" in regard to school lunch subsidy.)
Table 12 and Figure 16 display the percentage of students in each class
size grouping that come from the wealthier and poorer schools.
|
|
|||||||
|
|
|||||||
|
15-20
|
21-24
|
25-27
|
28-30
|
All Class Sizes
|
|||
| Wealthier
0-18.75% |
37.6%
|
48.3%
|
60.3%
|
77.2%
|
49.5%
|
||
| Poorer
18.76-77.21% |
61.2%
|
51.6%
|
39.7%
|
22.8%
|
49.6%
|
||
| Unknown |
1.2%
|
0.0%
|
0.0%
|
0.0%
|
0.9%
|
||
| Total |
100.0%
|
100.0%
|
100.0%
|
100.0%
|
100.0%
|
||

This finding raises the following question: "Is it possible that there is a measurable and consistent advantage of smaller class size that is masked by the increasing percentage of students from wealthier schools in large class sizes?"
To determine whether this might be true, we re-analyzed the five components
of the NHEIAP this time dividing all students into the two groups based
on school lunch subsidy rates: the wealthier and poorer schools. The results
are displayed in Figures 17a (open-ended language arts), Figure 17b (open-ended
mathematics), Figure 17c (writing score), Figure 17d (multiple choice language
arts), and Figure 17e (multiple choice mathematics).





There is considerable variation but we found no strong and consistent relationship between class size and resulting scores for either group. At the same time, students from the wealthier schools consistently score higher than the students from the poorer schools.
The only exception to this occurs in open-ended mathematics scores in
the largest class size grouping. In this case, the students from the "poorer"
schools did somewhat better than students in "wealthier" schools. They
scored much better than the students from poorer schools in other class
size groupings. This exception is so striking that it deserves further
investigation.
The presence of a teacher's aide in the classroom could affect student achievement. It is possible that larger classes are more likely to have such an aide than smaller classes. As we lack information about the presence of teacher’s aides in each classroom, we cannot determine to what extent this factor could explain or confound some of our results.
Our research does not conclude that smaller classes are without benefit. There may well be benefits to students and teachers in categories such as fewer discipline problems, better morale, more individual attention, or greater class cohesiveness. However, those benefits do not result in significantly different achievement levels as measured by the NHEIAP.
The NHEIAP 3rd grade test attempts to measure achievement of a limited set of specific skills and knowledge related to mathematics and language arts. It does not attempt to measure knowledge in subject areas such as science, an understanding of community or nation, achievement in art or music, social skills such as cooperation, respect for others, or traits such as dependability, self-motivation, and imagination. Accordingly we cannot come to any conclusion about the effect that class size might or might not have on these important topics.
Much of this analysis relies on average scores. Each is the average
of hundreds or thousands of individual scores. Within each grouping, some
students score higher and others lower than the average. There is no attempt
to imply that all students in a particular school or classroom environment
will have test scores of a certain level. It is certain that some students
in poorer schools will fall at the extreme upper end of the achievement
spectrum while some students from the wealthier schools will score very
poorly and vice versa. These averages do not predict the results for any
one student nor do they imply that all students in a particular group will
have similar achievement.
There are many rationales for smaller class sizes in the elementary grades and those should be discussed and debated. But the data do not lend support to the notion that reducing class sizes in a school will result in higher average test scores.
Decisions about appropriate elementary school class sizes are continually made by each school board and school administration in the state. The decision about when to split a class that is "too large" or to redistribute students by adding another section are important decisions for each school and district. There are many factors that should be taken into consideration, including physical crowding, level of discipline problems, teacher and student morale, operating costs, available classroom space, etc. Decisions about appropriate class size, however should not be based on an expectation that reducing class sizes will necessarily result in higher levels of student achievement, at least as measured by the third grade NHEIAP tests.
However, this study shows that underlying community demographics (whether measured by income, education, or other factors) are strongly related to average NHEIAP scores of students. Factors that are most predictive of higher achievement test scores are ones over which teachers, schools, and school boards have no control.
Policy makers should proceed with extreme caution in using NHEIAP test scores as a measure of the overall accomplishments of a school or district. Using the results without accounting for differences in the underlying community demographics will be misleading and potentially damaging.
Recently enacted New Hampshire legislation requires the Department to report "comparisons with state averages" and "statewide rankings of each district and school". While the Department must comply with this law, it should go further and provide additional information that would be more informative and useful.
The state's annual NHEIAP results should be tabulated, analyzed, and presented so that districts or schools are shown how they fare in comparison to those with which they share common population characteristics. The Department should divide schools and districts into 4-6 groups based upon demographic factors that are known to be related to achievement. Test results from a school should first be compared to the average and distribution of all schools in the same demographic group.
A school should be considered to be performing well or poorly to the extent that it falls below or above average for its group.
Such a system is not novel. Georgia and other states already provide this kind of analysis when reporting the results of their statewide assessment tests.
A common working assumption has been that some (as yet undetermined) uniform dollar amount of state aid per pupil distributed to each school district will suffice to accomplish this end. Such an assumption is contrary to the findings that student body and community demographics have great impact on performance as measured by the NHEIAP tests.
If all schools are to be held to the same fixed standard of performance measured by assessment test results, regardless of the differences in the makeup of their student bodies and communities, then it is important to recognize that different resource mixes will be needed.
An example: one measure of an adequate education may be that fewer than 15% of students in a school score at the novice level in 3rd grade NHEIAP tests. If so, then it must be recognized that different resources will be needed to accomplish this in a district where only 10% of the adults have completed a college education than in one where 80% have done so.
It will be important for any new formula for state aid to take into account such differences. The formula should be constructed with a weighting mechanism that modifies the amount provided per pupil to each district based on one or more demographic factors. Exactly what factors should be used and how much they should be weighted can be worked out and altered over time. However, from the beginning it should be clear that a uniform resource distribution per pupil will not result in comparable levels of student performance in schools that do not have comparable student populations.
This will be especially important if, in the next few years, the state
legislature or Board of Education should enact standards of performance
and distribute rewards or penalties to schools or districts that are based,
in part, on outcome measures such as the NHEIAP results.
What does this research show? A 1997 review of the research by the Education Commission of the States concluded, "The research about whether small classes improve student achievement has produced mixed results." Some studies - like ours - have shown little or no relationship between class size and achievement. Other studies, most recently from Tennessee’s Project STAR, have shown a positive relationship between smaller classes and achievement. Yet others have found a positive relationship only for certain students (non-English speakers), or only when other conditions are met (changed teaching methods).
Our results speak for themselves. We believe they reflect accurately the situation among New Hampshire 3rd graders in 1997.
One important distinction between our research and that of other studies is the size of the population being studied. Almost all of the other studies have looked at small samples of students or schools and then tried to draw statistically valid conclusions about the untested student population. Our research, however, looked at all 3rd graders in all public schools in the entire state. We can describe the entire population, not just a sample. It also has the advantage of being a study of the very population New Hampshire policy makers are concerned about – New Hampshire students. Studies done in locales outside our borders are too often dismissed by policy-makers as irrelevant to their concerns because "of course, our students/schools/teachers/systems are different."
This bibliography has been selected to place an emphasis on the reports and studies of the past decade, but also to include some of the more influential reports and publications of the late 1970s and 1980s.
-----
Achilles, C. (no date). Summary of Recent Class-size Research with an Emphasis on Tennessee's Project STAR and its Derivative Research Studies. Center of Excellence for Research and Policy on Basic Skills, Tennessee State University, Nashville, TN.
Achilles, C. M. (1997, October). "Small Classes, Big Possibilities." The School Administrator, 6-15.
Achilles, C., Harman, P., & Egelson, P. (1995). "Using Research Results on Class Size to Improve Pupil Achievement Outcomes." Research in the Schools, 2(2), 23-30.
Achilles, C., Kiser-Kling, K., Owen, J., & Aust, A. (1994). Success Starts Small: Life in a Small Class. Final Report. University of North Carolina, Greensboro, NC.
Achilles, C., Nye, B., & Bain H. (1994 1995). "Test-score 'Value' of Kindergarten for Pupils in Three Class Conditions at grades 1, 2, 3." National Forum of Educational Administration and Supervision Journal (NFEAS), 12(l), 3-15.
Achilles, C., Nye, B., & Zaharias, J. (1995, April). Policy Use of Research Results: Tennessee’s Project Challenge. Paper presented at the annual convention of the American Educational Research Association, San Francisco, CA.
Achilles, C., Nye, B., & Zaharias, J. (1995, April). Policy Use of Research Results: Tennessee's Project Challenge. Paper presented at the annual convention of the American Educational Research Association, San Francisco, CA.
Achilles, C., Nye, B., Zaharias, J., & Fulton, B. (1993). The Lasting Benefits Study in Grades 4 and 5 (1990-1991): A Legacy from Tennessee’s Four Year (K-3) Class Size Study (1985-1989), Project STAR. Paper presented at North Carolina Association for Research in Education, Greensboro, NC.
Achilles, et. al. (1994). The Multiple Benefits of Class-Size Research: A Review of STAR’s Legacy, Subsidiary and Ancillary Studies. (ERICdocument number ED 390913).
Bain, H., & Achilles, C. (1986). "Interesting Developments on Class Size." Phi Delta Kappan, 67 (9), 662-5.
Bain, H., & Jacobs, R. (1990, September). "The Case for Smaller Classes and Better Teachers." Streamlined Seminar, Volume 9, No. 1. (ERIC document number ED 322632).
Bain, H., Achilles, C., Zaharias, J., & McKenna, B. (1992). "Class Size Does Make a Difference." Phi Delta Kappan, 74, 253-256.
Beattie-Smith, G. (1994). Class Size Regulation: A Dossier of International Comparisons. Research and Information on State Education Trust, England.
Berger, M. (1982, April). Class Size Is Not the Issue. Paper presented at the annual meeting of the National School Boards Association, Atlanta, Georgia.
Blatchford, P., & Mortimore, P. (1994). "The Issue of Class Size for Young Children in Schools: What Can We Learn From Research?" Oxford Review of Education, 20 (4), 411-28.
Boozer, M., & Rouse, C. (1995, June). Intraschool Variation in Class Size: Patterns and Implication. (Working Paper No. 344). National Bureau of Economic Research, Industrial Relations Section, Washington, DC. (ERIC document number ED 385935).
Bourke, S. (1986). "How Smaller is Better: Some Relationships Between Class Size, Teaching Practices and Student Achievement." American Educational Research Journal, 23, 558-571.
Bracey, G. (1995). "Debunking the Myths About Money for Schools." Educational Leadership, 53 (3), 65-9.
Brophy, J., & Evertson, C. (1981). Student Characteristics and Teaching. Longman, New York.
Burstall, C. (1979). "Time to Mend the Nets: A Commentary on the Outcomes of Class-Size Research." Trends in Education, Autumn 1979, 27-33.
Cacha, F. (1982). "The Class Size and Achievement Controversy." Contemporary Education, 54, 13-17.
Center for School Assessment. (1986, July). PRIME TIME: The Relationship Between Class Size and Achievement for First Grade Students in Indiana 1984-85. Indiana Department of Education, Indianapolis, IN.
Chase, C., Mueller, D., & Walden, J. (1986, December). PRIME TIME: Its Impact on Instruction and Achievement. Final Report. Indiana Department of Education, Indianapolis, IN.
Cockman, D., Bryson, J., & Achilles, C. (1989). High School Size and Student Participation in School Activities: North Carolina, 1988-89. Paper presented to the North Carolina Association of Research in Education Annual Conference, Research Triangle Park, NC.
Cohen, L. (1983). Class Size and Instruction. Longman, New York.
Cooley, W., & Bickel, W. (1986). Decision-oriented Education Research. Kluwer-Nijhoff, Boston.
Cooper, H. (1989). "Does Reducing Student-to-Instructor Ratios Affect Achievement?" Educational Psychologist, 24 (1), 79-98.
Correa, H. (1993). "An Economic Analysis of Class Size and Achievement in Education." Education Economics, 1 (2), 129 135.
Cullen, B. (1979). "Lessons from Class Size Research – An Economist’s Perspective." Trends in Education, Winter 1979, 29-32.
Day, C., Tolley, H., Hadfield, M., Parkins, E., & Watling R. (1997, February). Class Size Research and the Quality of Education. National Association of Headteachers, Haywards Heath, West Sussex, England. (http://acorn.educ.nottingham.ac.uk/SchEd/res/exec.html)
Dewhurst, J. (1993). "Class Size and Pupil Achievement in Primary Schools: A Review of the Research Evidence." Education 3-13, March 1993, 15-18.
Educational Research Service (1980). Class Size Research: A Critique of Recent Meta-Analyses. Arlington, Virginia.
Egelson, P., Harman, P., & Achilles, C. (1996). Does Class Size Make a Difference? Recent Findings from State and District Initiatives. Southeastern Regional Vision for Education (SERVE), Greensboro, NC.
Ellis, T. (1984). Class Size. ERIC Digest #11, ERIC Clearinghouse on Educational Management, Eugene, Oregon. (ERIC document number ED 259454 84).
Evertson, C., & Folger, J. (1989, March). Small Class, Large Class: What Do Teachers Do Differently? Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
Fagley, W. (1993). "More Spending Not the Solution to School Woes." Viewpoint on Public Issues, Macinac Center for Public Policy. (http://www.macinac.org/viewpoin/1993cv/v9329.htm).
Ferguson, R., & Ladd, H. (1996). "How and Why Money Matters: An Analysis of Alabama Schools" in H.F. Ladd (Ed.), Holding Schools Accountable. Performance-based Reform in Education (pp 265-298). The Brookings Institution. Washington, DC.
Filby, N., et. at. (1980). What Happens in Smaller Classes? A Summary Report of a Field Study. Far West Laboratory for Educational Research and Development, San Francisco, California.
Finn, J. (1998, April). Class Size and Students at Risk: What is Known? What is Next? U. S. Department of Education. (http://www.ed.gov/pubs/ClassSize/intro.html)
Finn, J., & Achilles, C. (1990). "Answers about Questions about Class Size: A Statewide Experiment." American Educational Research Journal, 27, 557-577.
Finn, J., & Voelkl, K. (1994). "Class size" in T. Husen & T. Postlethwaite (Eds.), International Encyclopedia of Education (2nd edition). Pergamon Press, Oxford, England.
Finn, J., Achilles, C., Bain, H., Folger, J., Johnston, J., Lintz, M., & Word, E. (1990). "Three Years in a Small Class." Teaching and Teacher Education, 6(2), 127-136.
Finn, J., Fulton, D., Zaharias, J., & Nye, B. (1989). "Carry-over Effects of Small Classes." Peabody Journal of Education, 67(l), 75-84.
Folger, J. (1989). "Lessons for Class Size Policy and Research." Peabody Journal of Education, 67 (1), 123-132.
Folger, J. (1989). "Project STAR and Class Size Policy." Peabody Journal of Education, 67 (1), 1-16.
Folger, J., & Breda, C. (1989). "Evidence from Project STAR About Class Size and Student Achievement." Peabody Journal of Education, 67 (1), 17-33.
Fowler, W., Jr. (1992, April). What Do We Know about School Size? What Should We Know? Paper presented at the American Educational Research Association, San Francisco, CA.
Galton, M. (1996). "The Class-size Dilemma: Why Research Findings Do Not Confirm the Obvious." Education Review, 10 (1), 28-35.
Gilman, D., & Tillitski, C. (1990). The Longitudinal Effects of Smaller Classes: Four Studies. (ERIC document number ED 326313).
Glass G., & Smith, M. (1979). "Meta-Analysis of Research on Class Size and Achievement." Educational Evaluation and Policy Analysis, 1, 2-16.
Glass, G., & Smith, M. (1978). Meta-analysis of Research on the Relationship of Class Size and Achievement. Far West Laboratory for Educational Research & Development, San Francisco, CA.
Glass, G., Cahen, L., Smith, M., & Filby, N. (1982). School Class Size: Research and Policy. Sage Publications, Beverly Hills, London, New Delhi.
Goettler-Sopko, S. (1992). The Effect of Class Size on Reading Achievement. (ERIC document number ED 325826).
Gross, C. (1968). An Investigation into the Effectiveness of New Hampshire Public Schools, Office of Legislative Services, New Hampshire General Court, Concord, NH.
Hanushek, E. (1986). "The Economics of Schooling: Production and Efficiency in Public Schools." Journal of Economic Literature, 24, 1141-1177.
Harder, H. (1990). "A Critical Look at Reduced Class Size." Contemporary Education, 62 (1), 28-30.
Harvey, B. (1993, December). An Analysis of Grade Retention for Pupils in K-3. Unpublished doctoral dissertation. University of North Carolina, Greensboro, NC.
Harvey, B. (1994). The Effect of Class Size on Achievement and Retention in the Primary Grades: Implications for Policy Makers. (ERIC document number ED 369172).
Hazzard, J. (1989). "Reducing Class Size – Affordable? Efficient?" Education Week, November 22, 1989.
Hedges, L., & Stock, W. (1983). "The Effects of Class Size: An Examination of Rival Hypotheses." American Educational Research Journal, 17, 141-52.
Hedges, L., Laine, R., & Greenwald, R. (1994, April). "Does Money Matter? A Meta-analysis of Studies of the Effects of Differential School Inputs on Student Outcomes." Educational Researcher, 23, 5-14.
Holliday, W. (1992). "Should We Reduce Class Size?" Science Teacher, 59 (1), 14-17.
Hopkins, D. (1995). "Class Size, Teaching, and OFSTED." Education, 17 November 1995, 16.
Kazal-Thresher, D. (1993). "Educational Expenditures and School Achievement: When and How Money Can Make a Difference." Educational Researcher, 22 (2), 30-32.
Kemp, J. (1990). "Who Will Speak for the Children?" Contemporary Education, 62 (1).
King, J. (1994). "Meeting the Educational Needs of At-risk Students: A Cost Analysis of Three Models." Educational Evaluation and Policy Analysis, 16(l), 1-19.
Kiser-Kling, K. (1995). Life in a Small Teacher-pupil Ratio Class. Unpublished Ed.D. dissertation. University of North Carolina, Greensboro, NC.
Klein, K. (1985). "The Research on Class Size." Phi Delta Kappan, 66, 578-80.
Laine, R., Greenwald, R., & Hedges, L. (1995). "Money Does Matter: A Research Synthesis of a New Universe of Education Production Function Studies" in L. Picus & J. Wattenbarger (Eds.), Where Does the Money Go? Resource Allocation in Elementary and Secondary Schools (pp. 44-70). Corwin Press, Thousand Oaks, CA.
Levin, H. (1988). "Cost-effectiveness and Educational Policy." Educational Evaluation and Policy Analysis, 10(l), 51-69.
Lindsay, P. (1982). "The Effect of High School Size on Student Participation, Satisfaction, and Attendance." Educational Evaluation and Policy Analysis, 4(l), 57-65.
Lindsay, P. (1984). "High School Size, Participation in Activities, and Young Adult Social Participation: Some Enduring Effect of Schooling." Educational Evaluation and Policy Analysis, 6(l), 73-83.
Malloy, L., & Gilman, D. (1989). "The Cumulative Effects on Basic Skills Achievement of Indiana's PRIME TIME, A State Sponsored Program of Reduced Class Size." Contemporary Education, 60, 169-172.
McGiverin, J., Gilman, D., & Tillitski, C. (1989). "A Meta-analysis of the Relation between Class Size and Achievement." Elementary School Journal, 90, 47-56.
McIntyre, W., & Marion, S. (1989). The Relationship of Class Size to Student Achievement: What the Research Says. College of Education, University of Maine at Orono. (ERIC document number ED 323643).
Miles, K. (1995). "Freeing Resources for Improving Schools: A Case Study of Teacher Allocation in Boston Public Schools." Educational Evaluation and Policy Analysis, 17, 476-493.
Mitchell D., & Beach S. (1990). "How Changing Class Size Affects Classrooms and Students." Policy Briefs, number 12, Far West Laboratory for Educational Research and Development, San Francisco, California.
Mitchell, D., et al. (1989). "Modeling the Relationship between Achievement and Class Size: A Re-Analysis of the Tennessee Project STAR Data." Peabody Journal of Education, 67 (1), 34-74.
Mortimore, P., & Blatchford, P. (1993). The Issue of Class Size. A briefing paper prepared for the Paul Hamlyn Foundation.
Mosteller, F. (1995). "The Tennessee Study of Class Size in the Early School Grades." The Future of Children, 5(2), 113-27.
Mosteller, F. (1995). "The Tennessee Study of Class Size in the Early School Grades." The Future of Children, 5 (2). (http://www.futureofchildren.org/cri/08cri.htm).
Mueller, D., Chase, C., & Walden, J. (1988). "Effects of Reduced Class Size in Primary Classes." Educational Leadership, 45(7), 48 53.
Mulder, J. (1990). "Class Size Revisited: The Glass/ERS Debate." Contemporary Education, 62 (1).
National Confederation of Parent Teacher Associations (1991). The State of Schools in England and Wales. NCPTA, London, England.
National Confederation of Parent Teacher Associations (1996). The State of Schools Report 1996. NCPTA, London, England.
Nye, B., Achilles, C., Boyd-Zaharias, J., & Fulton, B. (1993). Project Challenge Third-Year Summary Report::An Initial Evaluation of the Tennessee Department of Education "At Risk" Student/Teacher Ratio Reduction Project in Seventeen Counties 1989-90 Through 1991-92. Center of Excellence for Research in Basic Skills, Tennessee State University, Nashville, Tennessee.
Nye, B., Boyd-Zaharias, J., Fulton, B., Achilles, C., Cain, V., & Tollett, D. (1994). The Lasting Benefits Study: A Continuing Analysis of the Effect of Small Class Size in Kindergarten through Third Grade on Student Achievement Test Scores in Subsequent Grade Levels: Seventh Grade Technical Report. Center of Excellence for Research in Basic Skills, Tennessee State University, Nashville, Tennessee.
Nye, B., et. al. (1991). The Lasting Benefits Study: A Continuing Analysis of the Effect of Small Class Size in Kindergarten through Third Grade on Student Achievement Test Scores in Subsequent Grade Levels: Fourth Grade Technical Report. Center of Excellence for Research in Basic Skills, Tennessee State University, Nashville, Tennessee. (ERIC document number ED 338440).
Nye, B., et. al. (1992). Small is Far Better: A Report on Three Class-Size Initiatives: Tennessee Student Teacher Achievement Ratio (STAR) Project, Lasting Benefits Study, and Project Challenge as a Policy Application (Preliminary Results). (ERIC document number ED 354091).
Nye, B., et. al. (1992, May). "Smaller Classes Really Are Better." American School Board Journal, 179 (5), 31-33.
Nye, B., et. al. (1993). Class Size Research from Experiment to Field Study to Policy Application. (ERIC document number ED 356558).
Nye, K. (1995). The Effect of School Size and the Interaction of School Size and Class Type on Selective Student Achievement Measures in Tennessee Elementary Schools. Unpublished doctoral dissertation. University of Tennessee: Knoxville, TN.
Odden, A. (1990). "Class Size and Student Achievement: Research-based Policy Alternatives." Educational Evaluation and Policy Analysis, 12, 213-227.
OFSTED (1995). Class Size and the Quality of Education. OFSTED, London, England.
Robinson, G. & Wittebols, J. (1986). Class Size Research: A Related Cluster Analysis for Decision Making. Educational Research Service, Arlington, VA.
Robinson, G. (1986). Class-Size Research: A Related Cluster Analysis for Decision Making. Educational Research Service.
Robinson, G. (1990). "Synthesis of Research on Effects of Class Size." Educational Leadership, 47(7), 80-90.
Scheck, C. (1994). "The Effect of Class Size on Student Performance." Journal of Education for Business, 70, 104-11.
Shapson, S. (1980). "An Experimental Study of the Effects of Class Size". American Educational Research Journal, 17, 141-152.
Shapson, S., Wright, E., Eason, G., & Fitzgerald, J. (1980). "An Experimental Study of Effects of Class Size." American Educational Research Journal, 17, 141-152.
Slavin, R. (1989). (Ed.). School and Classroom Organization. Lawrence Erlbaum Associates, Hillside, NJ.
Slavin, R. (1989). "Class Size and Student Achievement: Small Effects of Small Classes." Educational Psychologist, 24 (1), 99-110.
Slavin, R. (1990). "Class Size and Student Achievement:: Is Smaller Better?" Contemporary Education, 62 (1), 6-12. (ERIC document number EJ 431933).
Slavin, R., & Madden, N. (1989). "What Works for Students at Risk: A Research Synthesis." Educational Leadership, 46(5), 4-13.
Slavin, R., & Madden, N. (1995). "Success For All: Creating Schools and Classrooms Where All Children Can Read" in Oakes & Quarts (Eds.), Creating New Educational Communities. NSSE Yearbook, Part 1. University of Chicago Press, Chicago, IL, 70-86.
Slavin, R., Madden, N., Dolan, L., Wasik, B., Ross, S., & Smith, L. (1994). Success for All: Longitudinal Effects of Systemic School Reform in Seven Districts. A presentation at the annual meeting of the American Educational Research Association, New Orleans.
Slavin, R., Madden, N., Karweit, N., Livermon, B., & Dolan, L. (1990). "Success for All: First-year Outcomes of a Comprehensive Plan for Reforming Urban Education." American Educational Research Journal, 27, 255-278.
Smith, G. (1995). "Living with Oregon’s Measure 5." Phi Delta Kappan, 76 (6), 452-61.
Smith, M., & Glass, G. (1979). Relationship of Class-Size to Classroom Processes, Teacher Satisfaction and Pupil Affect: A Meta-Analysis. Far Western Laboratory for Educational Research and Development, San Francisco, California. (ERIC document number ED 190698).
Smith, M., & Glass, G. (1980). "Meta-Analysis of Research on Class Size and Its Relationship to Attitudes and Instruction." American Educational Research Journal, 17, 419-33.
Swift, M., & Spivack, G. (1968). "The Assessment of Achievement-related Classroom Behavior." Journal of Special Education, 2(2) 137-149.
Tabberer, R. (1994). School and Teacher Effectiveness. National Foundation for Education Research, London, England.
Thompson, S. (1978). "Class Size". School Management Digest, ERIC Clearinghouse on Educational Management, University of Oregon, Eugene, Oregon.
Tillitski, C. (1990). "The Longitudinal Effect of Prime Time, Indiana’s State Sponsored Reduced Class Size Program." Contemporary Education, 62 (1).
Tomlinson, T. (1988). Class size and public policy: Politics and panaceas. U.S. Department of Education, Office of Educational Research and Improvement. Washington, DC.
Tomlinson, T. (1989). "Class Size and Public Policy: Politics and Panaceas." Educational Policy, 3, 261-273.
Tomlinson, T. (1990). "Class Size and Public Policy: The Plot Thickens." Contemporary Education, LXII(1), 17-23.
Turner, M. (1990). "Prime Time: A Reflection." Contemporary Education, 62 (1).
U. S. Department of Education (1998, May). Reducing Class Size: What Do We Know? (http://www.ed.gov/pubs/ReducingClass/intro.html)
Underwood, S., & Lumsden, L. (1994). "Class Size." Research Roundup, 11 (1). National Association of Elementary School Principals, Alexandria, Virginia.
Varble, M. (1990). "Smaller Class Size = Higher Achievement Scores?" Contemporary Education, 62 (1).
Watling R., (1996, October). Annotated Bibliography: Class Size Research and the Quality of Education. School of Education, University of Nottingham, England. (http://acorn.educ.nottingham.ac.uk/SchEd/res/bib.html)
Wenglinsky, H. (1997). When money matters. How Educational Expenditures Improve Student Performance and When They Don't. Educational Testing Service, Policy Information Center, Princeton, NJ.
Wisconsin Department of Public Instruction (1994, July). Effect of Class Size Reduction on Student Achievement, Safety, and Attendance. Madison, Wisconsin.
Word, E., Achilles, C., Bain, H., Folger, J., Johnston, J., & Lintz N. (1990). "Project STAR Final Executive Summary: Kindergarten through Third Grade Results (1985-1989)." Contemporary Education, 62 (1).
Word, E., Johnston, J., Bain H., et. al. (1990). Student/Teacher Achievement Ratio (STAR) Tennessee’s Class Size Study, Final Summary Report. Tennessee Department of Ecuation, Nashville.
Word, E., Johnston, J., Bain H., Fulton, B., Zaharias, J., Achilles, C., Linz, M., Folger, J., & Breda, C. (1994). The State of Tennessee’s Student/Teacher Achievement Ratio (STAR) Project: Technical Report 1985-1990. Tennessee Department of Ecuation.
Word, E., Johnston, J., Bain, H., Fulton, D., Boyd-Zaharias, J., Lintz, M., Achilles, C., Folger, J., & Breda, C. (1990). Student/Teacher Achievement Ratio (STAR): Tennessee's K-3 Class-size Study. Tennessee State Department of Education, Nashville, TN.
Wright, E., Shapson, S., Beason, G., & Fitzgerald, J. (1977). Effects
of Class Size in the Junior Grades. Ontario Ministry of Education.
Intuition holds that smaller class sizes in public elementary schools should result in better educational results, whether measured by standardized tests or other means. Yet our analysis showed that the results among students in this grouping were considerably lower on every measure than those of the four groupings covering class sizes from 15-30. This result, so contrary to intuition, required us to investigate the makeup of this group further.
Differences in achievement results cannot be attributed to class size if there is reason to believe that the student populations in different class sizes were not reasonably equivalent.
We determined the number and percentage of students in each class size
grouping who were coded for special education. Table A1 and Figure A1 display
the results. Special education students seem to be relatively equally distributed
in class size groupings of 15-30 students, constituting between 11.5% and
12.3% of students in each group. However, they constituted 19.1% of the
students in classes of 1-14 students. Special education students are disproportionately
represented in the smallest classes.
| Class Size |
Special Education
|
Non Special Education
|
Total Students
|
% Special Education
|
| 1-14 |
87
|
368
|
455
|
19.1%
|
| 15-20 |
510
|
3,694
|
4,204
|
12.1%
|
| 21-24 |
773
|
5,520
|
6,293
|
12.3%
|
| 25-27 |
398
|
2,914
|
3,312
|
12.0%
|
| 28-30 |
182
|
1,398
|
1,580
|
11.5%
|
| All Students |
2,231
|
14,188
|
16,419
|
13.6%
|

|
|
|||
| Class Size |
Special Education Students
|
Non-Special Education
|
All Students
|
| 1-14 |
4.16
|
6.43
|
6.02
|
| 15-20 |
4.69
|
6.72
|
6.48
|
| 21-24 |
4.76
|
6.76
|
6.53
|
| 25-27 |
4.69
|
6.76
|
6.53
|
| 28-30 |
4.83
|
6.72
|
6.53
|
| All Students |
4.70
|
6.72
|
6.49
|

Similar results were found for other components of the language arts assessment program as well as the mathematics tests.
If all students were randomly assigned to classes there would be no disproportionate share of special education students in the smallest classes.
We concluded that this group, in addition to being very small in number of students, differed greatly in makeup from the four larger class groupings. This grouping contains an unusually high concentration of students from very small schools and has a disproportionately high representation of special education students.
We have therefore not included the results from this smallest class size grouping in the body of our report.