III. Confounding Factors
This section focuses on features of the assessment task that could
reduce the likelihood that students’ responses will provide information
about what they know and do not know. Students should not get a
question wrong if they understand the targeted idea and they should not
get it right if they do not. Three criteria are provided:
comprehensibility, appropriateness of task context, and resistance to
test-wiseness. Additionally, a section on cost effectiveness, in
which the reviewer reflects on the amount of time needed to complete
the task, is present.
III. A. Comprehensibility
More Details
This criterion addresses whether students are likely to understand the
task statement, diagrams, and symbols. The reviewer should look
for unfamiliar general vocabulary (Technical scientific or mathematical
terminology is addressed under Sufficiency.), and language that may not
be familiar to poor readers or students whose first language is not
English. The reviewer should check that photographs, diagrams,
graphs, and symbols are accurate, well labeled, and intelligible.
Check all applicable boxes and provide explanations as appropriate.
-
It is not clear what question is being asked. Explain.
-
The task uses unfamiliar vocabulary that is not clearly defined, or words or phrases that have unclear, confusing, or ambiguous meanings. Explain.
Some
words, although correct, may lead students astray. For example,
in the phrase “scientific finding,” the word “finding” has a special
meaning in science that may be unfamiliar to students. Note all
places where words, both general and scientific, do not have clear and
straightforward meanings.
-
The task uses unnecessarily complex sentence structure when plain
language could have been used, or ambiguous punctuation that make the
task difficult to comprehend. Explain.
-
There is inaccurate information in any part of the task (including
diagrams and data tables) that may be confusing to students who have a
correct understanding of the science. Explain.
-
There are comprehensibility issues with the diagrams, graphs, and data tables. Explain.
-
There is more than one correct answer. Explain
-
Other. Provide a brief explanation.
Score
Guidelines
Scoring Comprehensibility for Multiple-Choice (4-Choice) Items
Problems in the stem should be weighted more than problems in the
answer choices. The reviewer is expected to use his/her professional
judgment regarding the overall effect on comprehensibility. For
cases where a part of the item is found to be incomprehensible to
students then the following scoring scheme is suggested:
If box 6 is checked
(i.e., there is more than one correct answer), subtract one point for each answer choice that can be considered correct.
Students will NOT be able to evaluate: |
Score |
Zero answer choices |
5 |
One answer choice |
4 |
Two answer choices |
3 |
Three answer choices |
2 |
The stem |
1 |
All four answer choices |
1 |
Scoring notes:
III. B. Appropriateness of Task Context
More Details
A task context sets up particular situations or stories involving
people, objects, or events, including real-world occurrences or
idealized conditions. The context should be understandable,
interesting, and sensible to most students.
Judgments about the suitability of task contexts are
difficult in the absence of knowledge of students’ particular life
experiences. However, the reviewer should note contexts that they
feel may be inappropriate. In addition, all situations that are
described should include plausible quantities and dimensions and should
be accurate and credible. Students should not have to spend time
wondering if there is a mistake in the question. Task contexts
should be scientifically and mathematically accurate. Idealizations
(such as frictionless world) should be clearly noted.
Steps for reviewers:
- If
the task is not based on a particular situation, story, event, or
object, check the first statement in Box F and move on to the next
section.
- If the task does make use of a
particular situation, story, event, or object (e.g. a cow, a speeding
car, etc.) check the second statement and consider the following
statements. The reviewer should check these statements if they believe
that the problem is significant enough to affect the student’s
response.
Check all applicable boxes and provide explanations as appropriate.
-
The context may be unfamiliar to most students. Explain.
-
The context may advantage or disadvantage students because of their
interest or familiarity with the context. Explain.
-
The context is complicated so that students might have to spend a lot
of time trying to figure out what the context means. Explain.
-
The information and quantities that are used are not reasonable or believable. Explain.
-
The context does not accurately represent scientific or mathematical
realities or make clear when idealizations are involved. Explain.
-
Other. Provide a brief explanation.
score
Guidelines
Scoring
the Appropriateness of Task Context for Multiple-Choice Items: Deduct 1
point (from a total of 5 points) for each statement that is checked, if
the student's response would be affected by it.
Scoring notes:
III. C. Resistance to Test-Wiseness
More Details
This
criterion addresses characteristics of the task that might (1) allow
students to make a satisfactory response using only general test-taking
skills without understanding the idea being tested or (2) mislead
students into choosing an incorrect answer. For example, for
selected-response tasks, one answer (especially the correct answer)
should not be distinctly different from the other answers. It
should not be longer or more elaborate. It should not be more
qualified than the others, using terms such as “always,” “never,” or
“everyone.” It is also important to pay attention to the use of
logical opposites, which may make it easy to eliminate answer
choices. Lists of common student misconceptions and interview
data if available are provided to help identify plausible
distractors. These test-wiseness issues should be noted and
explained in the boxes below.
Check all applicable boxes and provide explanations as appropriate.
-
Some of the distractors are not plausible. Explain.
-
One of the answer choices differs significantly in length or contains a
different amount of detail from the other response options. Explain.
-
One of the answer choices is qualified differently from the other
response options, and/or uses giveaway words such as “usually” or
“never.” Explain.
-
The use of logical opposites leads the student to eliminate the other answer choices. Explain.
-
One of the answer choices contains vocabulary at a different level of
difficulty from the other response options. Explain.
-
The language in one of the answer choices mirrors or is obviously
closely related to the language in the stem. Explain.
-
Other. Provide a brief explanation.
score
Guidelines
Scoring
Resistance to Test-Wiseness for Multiple-Choice (4-Choice) Items:
Deduct 0-1 points (from a total of 5 points) for each distractor, and
1-4 points for the correct answer choice, if they can be evaluated
(successfully or unsuccessfully) through test-wiseness. The table below
gives scoring ranges for some possible situations. The ranges are
intended to allow the reviewer to use his/her professional judgment
regarding the overall effect on test-wiseness. In particular, the large
range associated with the correct answer choice allows the reviewer to
assess a severe penalty, if the person can select the correct answer
without any content knowledge.
Note: Successfully evaluating the correct answer choice means
identifying it as the correct answer. Successfully evaluating a
distractor means identifying it as a wrong answer. Unsuccessfully
evaluating the correct answer choice means thinking that it is a wrong
answer, and unsuccessfully evaluating a distractor means thinking that
it is the correct answer.
|
Score |
No test-wiseness issues |
5 |
One distractor can be eliminated using test-wiseness methods |
4-5 |
Two distractors can be eliminated using test-wiseness methods |
3-5 |
Three distractors can be eliminated using test-wiseness methods |
1-4 |
The correct answer can be determined using test-wiseness methods |
1-4 |
Scoring notes:
III. D. Cost Effectiveness
More Details
This section is provided to determine whether the task seems to require
a great deal of time and effort to complete, relative to what is
gained. The focus here is primarily on the amount of time and
effort required, and the goal is simply to flag the most costly items,
not to locate items on a scale of cost-effectiveness. Do not
consider comments made under necessity, sufficiency, comprehensibility,
and test-wiseness here. Suggestions for improving the
cost-effectiveness of an item should be noted in Box H.
Check the applicable box and provide explanations as appropriate.
Does it take a long time for students to:
Read the question
Decipher the answer choices
Interpret the graph, read the data table
Make calculations
Other
Score
Scoring notes: