 |
 |






Quotes on this page originally
posted at FairTest.
About Promotion and Retention of
Students "Another
misuse of standardized achievement test scores
is making promotion
and retention decisions for individual students solely on the
basis of these scores. This is an undesirable practice for
a number of reasons. Perhaps the most important reason is
that national standardized achievement tests are not built to
serve this purpose...they cannot provide complete coverage of
any local curriculum. Consequently, it would be
inappropriate to base a promotion or retention policy strictly
on a students achievement test scores. "Achievement
test scores may certainly enter into a promotion or retention
decision. However, they should be just one of the many
factors considered and probably should receive less weight than
factors such as teacher observation, day-to-day classroom
performance, maturity level, and attitude." --
Excerpted from Stanford Achievement Test Series, Ninth
Edition: Guide for Organizational Planning (Harcourt Brace
Educational Measurement. 1997, Pp 43-44.)
|
|
|
What's Wrong with Standardized Tests?
-- borrowed and slightly adapted from
FairTest
- What is a "High-Stakes" Standardized Test?
Many people are calling standardized tests that are
being used for punitive measures such as retaining a
student, or denying a student graduation a
"High-Stakes" test. Because of state and
federal mandates, more tests are being used for this
purpose. (See sidebars on this page for statements from testing
companies about this practice. See here for a list of
organizations opposed to such testing practices.)
- Aren't Standardized Tests a fair evaluation tool if
every child has to take it?
By their nature, standardized tests are created to be
easily graded, usually in a multiple-choice format. Each question has only one answer, and
these questions do
not measure the ability to think or create. Also by their
nature, these tests reward the ability to quickly answer
superficial questions that do not require critical
thought. Their use encourages a narrowed curriculum,
outdated methods of instruction, and harmful practices
such as retention in grade and tracking. Some
students just do not take these types of tests well. They also
assume all test-takers have been exposed to a white,
middle-class background. (See here
for a former teacher's experience with this type of bias
on an Ohio Proficiency Test.)
- Aren't Standardized Tests the most objective way to
assess children?
Only the scoring of multiple-choice answers is
"objective" because it is done by machine. But
correct answers that are only partially marked correctly
(the "bubble" not filled all the way in, or
stray marks outside the bubble) are marked wrong.
Human beings, who are quite subjective, choose what items
to include on the test, the wording, content, the
determination of the "correct" answer, and the
type of testing situation. Short answer questions
and essay questions are evaluated by people who may not
have an education degree, who are taught in a day or two
how to "score" a test, and who have a high
caseload of tests to score each day (see here).
And now, companies have developed machines to assess
writing. As we know, machines cannot understand
intent. They do not recognize creative use of
language. Yet, in the future, machines could be grading
your child's test which is used to determine promotion to
another grade or even graduation.
-
Aren't test scores the most reliable
form of assessment?
A test is completely reliable if you get exactly the
same results the second time you administered it.
All existing tests have "measurement
error." This means that an individual's score
may vary from day to day because of testing conditions, or
the test-taker's emotional state. As a result, many
individual score are frequently not the most accurate of a
child's capability. Test scores of young children
and scores on sub-sections of tests (reading one day, math
another day, science another day, social science another
day, etc.) are much less reliable than test scores on
adults or whole tests.
[top]
- Do test scores reflect real differences among people?
Not necessarily. To create a norm-referenced
test (a test where half of the test-takers score above
average, and the other half below average), test makers
must make small differences among people appear
large. Content differs from one test to another, so
even tests that claim to measure the same thing often
produce very different results.
- Don't test makers remove bias from tests?
Most test-makers review items for biases, such as
offensive words, etc. But this is inadequate, since many
forms of bias are not superficial (such as a child being
asked to write about something that has never happened to
him/her). Some test makers also use statistical
bias-reduction techniques, but these techniques cannot
detect underlying bias in the test's form or
content. As a result, biased cultural assumptions
built into the test as a whole are not exposed or removed
by test-makers.
- Do tests reflect what we know about how students
learn?
No. Standardized tests are based in behaviorist
psychological theories from the 19th century. While
our understanding of the brain and how people learn and
think has progressed enormously, tests have largely
remained the same. Behaviorism assumed that
knowledge could be broken into separate bits and that
people learned by passively absorbing these bits.
Today, cognitive and developmental psychologists
understand that knowledge is not separable bits and that
people (including children) learn by connecting what they
already know with what they are trying to learn. If
they cannot actively make meaning out of what they are
doing, they do not learn or remember. Most
standardized tests do not incorporate the modern theories
and are still based on recall of isolated facts and narrow
skills.
- Do multiple-choice tests measure important student
achievement?
Multiple-choice tests are a very poor yardstick of
student performance. They do not adequately measure the ability
to write, to use math, to make meaning from text when
reading, to understand scientific methods or reasoning, or
to grasp social science concepts. These tests also
do not adequately measure thinking skills or assess what
people can do on real-world tasks.
- Aren't the test scores helpful to teachers?
Standardized, multiple-choice tests were not originally
designed to provide help to teachers. Classroom
surveys show teachers do not find scores from standardized
tests very helpful. The tests do not provide
specific information that can help a teacher understand
what to do next in working with a student because they do
not indicate how the student learns or thinks. Good
evaluation would provide helpful information to teachers.
[top]
- Are readiness or screening tests helpful?
Readiness tests, used to determine if a child is ready
for school, are very inaccurate and unsound. They
encourage overly academic, developmentally inappropriate
primary schooling. Screening tests for disabilities
are often not adequately validated; they also promote a
view of children as having deficits to be corrected,
rather than having individual differences and strengths on
which to build. We know that not everyone learns in
the same way, yet we continue to insist that instruction
and testing are "standardized."
- So are there more effective ways to evaluate student achievement
or ability?
Yes. Good teacher observation, documentation of
student work (portfolios to chart student progress
throughout the year), and performance-based assessment,
all of which involve the direct evaluation of student
effort on real learning tasks, provide useful material for
teachers, parents, the community and the government.
- For further resources see:
- FairTest
- "The Case Against Standardized Testing" by
Alfie Kohn
- Authentic Assessment
- Additional Resources
-- 2004
[top]
|
|
|
|


Community
 Come
back to visit
our message
board. Your opinion is important to us. (It is currently
off-line, but will be back soon.) We
want to hear from anyone who wants to speak for, against, and
about the requirements for testing and other educational
matters. ******* "...our
expectations for student performance, which are most positive
and effective when they are realistic, need to take ability
level and socioeconomic status into account when interpreting
scores and making conclusions about school effectiveness." --
Dr. James H. McMillan, professor, Virginia Commonwealth
University ******* About
Inappropriate
Purposes "The popular
press and professional literature have furnished countless
examples of how standardized test results have been used in
inappropriate ways. Many of the common misuses stem from
depending on a single test score to make an important decision
about a student or class of students. All who wish to
interpret test scores must be made aware of the intended uses of
the scores, the limitations of the scores, and the most common
misunderstandings about them. Here [is one] inappropriate
[use] of the results from the ITBS batteries. "To
decide to retain students at a grade level. There
is considerable disagreement among educators about the
appropriateness of grade retention. If a retention
decision is to be made, classroom assessment data gathered by
the teacher over a period of months is likely to form a highly
relevant and accurate basis for making such a decision. A
test score can make a valuable contribution to the array of
evidence that should be considered. However, a test score
from an achievement battery should not be used along in making
such a significant decision." --
Excerpted from Iowa Test of Basic Skills Interpretive Guide
for School Administrators Levels 5-14 M Complete and Survey
(Riverside Publishing. 1996. P. 17.) |