Escolar Documentos
Profissional Documentos
Cultura Documentos
_1592151_1
Criterion-referenced assessment
The alternative approaches to assessment we explore here are all criterion
referenced rather than norm-referenced. That is, we will be concerned
with identifying changes in performance by individual students, not
comparison of one student's performance with the performance of other students
in the group, or with national scales.
Criterion-referenced performances are compared to a pre-specified
criterion/criteria such as a learning outcome(s) or a predetermined benchmark
performance. The performance is then assessed as being competent/having
achieved mastery or not? Or some other similar term.
Norm-referenced performances on the other hand are compared
against the performance of the group or a similar group of students.
The individual student's performance is then given a rating or rank based on the
norm (i.e., the mean or average) of the group. For example, it might be decided
that the top 5% in the class will get HD, the next 10% - D, the next 20% - C, the
next 35% - D, and the bottom 30% will fail.
This norm-referenced method of assigning grades allows administrators to vary
the numbers 'passing' a course without seeming to do it in a subjective manner
or ranking students along a continuum of 'performance'.
In contrast to norm-referenced assessment, one can set up criteria for awarding
grades and then check which of the categories the observed performance most
closely fits. This criterion-referenced assessment is particularly suited to
assessing performance skills such as 'Correctly refitting the cylinder head' of an
engine, or the voice procedure of an office receptionist, or an oral presentation of
a topic the students selected, or the development of an argument in a piece of
extended writing.
GRADE CRITERIA
3 HC - Demonstrates mastery over the strategy or skill specific to the task or
situation; Can perform the strategy or skill without error and with little or no
conscious effort. 3 C - Carries out the strategy specific to the task or situation
without significant error. 2 AC - Makes a number of errors when performing the
strategy or skill specific to the task or situation but can complete a rough
approximation of it. 1 NYC - Makes many critical errors when performing the
strategy or skill specific to the task or situation
(This rating scale is adapted from Marzano et al. (1993) Assessing Student
Outcomes, p. 66)
While criterion-referenced assessment is particularly suited to performance
tasks, it can also be applied to the products of assessment tasks such as an
essay or other extended forms of writing or to models or work samples in art,
carpentry, dressmaking, mathematics, and so on.
Benchmarking
Where more than one person is involved in assessing a particular assessment
task, inter-marker reliability can be improved by establishing benchmarks. For
example, when a number of markers have to rate a large number of responses to
the same writing task, inter-marker reliability can be improved by having
samples of student responses which are deemed to be excellent (HD),
satisfactory (C) and unsatisfactory (F).
These samples are frequently regarded as benchmarks and they can be used to
develop consensus amongst the team of markers of what to expect and look for
in the products they are about to assess.
Ideally one would select examples for each category of the marking scale being
used, but in practice it is less time consuming to only illustrate what is expected
at the 'top', 'middle' and 'bottom' of the scale. Ambiguous or overlapping
responses are then fitted in between two of the clearly-defined categories.
Benchmarking of performance assessment can be achieved through the use of a
panel in the early stages of assessment through which consensus is established,
or by the use of moderators. Assessments of sporting performance where
judgement is required, such as in diving or gymnastics, typically use a panel
whose scores are averaged.
classroom + home activity; under test conditions; in a work setting, etc. The
tasks must be very closely related to the pre-specified learning outcomes. The
tasks must reflect the previously determined level of competency or expected
outcomes. The tasks must be a valid sample of the possible domain or range of
tasks that could be set to assess these outcomes. The rating criteria should be
illustrated by typical samples of students' work for, at least, the top, middle and
bottom range of performance.
Authentic assessment
To be able to assess students' behaviour we must get information about the
desired behaviour. This information may be gathered by making direct
observations of the student in a number of different situations. The information is
then assigned to predetermined categories which are frequently represented by
numerical or literal symbols. The conditions under which the information is
gathered can be natural or normal (e.g., in the classroom, workplace, workshop),
or formal (e.g., examinations, structured interviews). Performances are usually
assessed under natural or normal conditions or simulations of natural or normal
conditions. Such assessments are frequently referred to as authentic
assessments. Diane Hart (1994, p. 9) defines authentic assessment as those
information gathering activities which involve: students in tasks that
are worthwhile, significant and meaningful. Such assessments look and feel
like learning activities ... They involve higher order thinking skills and the
coordination of a broad range of knowledge.
They communicate to students what it means to do their work well by making
explicit the standards by which that work will be judged. In this sense authentic
assessments are standard-setting, rather than standardized, assessment tools.
Brady and Kennedy (2005, p. 3) define authentic assessment on page 3 and go
on to provide four major interpretations of authentic assessment (p. 43) and list
of advantages (p. 44). They state that authentic assessment should satisfy at
least three criteria:
1. Authentic assessment involves a variety of assessment strategies
that capture the quality of a student's work.
2. These assessments explore a student's normal daily performance
rather than focusing exclusively on tests.
3. Such assessments reflect the actual learning and teaching of the
classroom and beyond.
When one examines Griffin and Nixs
(1991, p. 91) list of authentic
assessment strategies shown below
[with some additions] it is clear that
most of the methods for gathering
assessment information can be
classified as authentic assessment:
1. Performances
a. In practical or manipulative tasks or
tests (or physical performances)
b. In reading tasks or writing tasks such
as essays
c. In oral tasks such as debates and
speeches
d. In teacher made paper and pencil
tests
e. In commercial (standardised) tests
2. Products
a. Projects on themes or topical issues
b. Homework assignments or tasks
c. Finished practical samples
3. Processes
a. Participation in social activities
b. Activities during individual or group
work
c. Contributions to class or other group
activities
d. Interactions in interviews
e. Casual conversations and events
4. Records
a. Student histories (and portfolios)
b. Examination results published in the
media
c. Surveys, evaluation reports or
research reports
d. Response patterns on questionnaires
5. Personal qualities
a. Perseverance and commitment to
attainment of goals
b. Entrepreneurship and risk-taking
tendencies?
Are there any surprises? What results are unexpected? What anomalies exist?
Is there evidence of improvement or decline? If so, what might have caused the changes?
What questions do these data raise?
Are these results consistent with other achievement data?
Are there alternative explanations for these results?
By what criteria are we evaluating student work?
What is the performance standard? How good is "good enough"?
How do our results compare to those of similar schools?
but also began report writing about Insects and Minibeasts. Incidental
writing has also
occurred including writing stories, journal writing, lists and labels.
Year 9 Example:
Comment to Parents:
SOSE Teacher:
YOUR CHILD is an engaged student who displayed a good understanding of
concepts covered in Studies of Society and Environment throughout this
semester. S/he is well organised and takes pride in presenting he/r work in a neat
and tidy manner. YOUR CHILD is being encouraged to offer he/r opinion during
class discussions more often as they are frequently insightful. This will help he/r
to gain a better understanding of the concepts being covered. YOUR CHILD is to
be commended on he/r excellent results achieved in the Bombing of Darwin Test.
Maths Teacher:
YOUR CHILD is a polite student who is successfully completing the Year 9
Mathematics course at a high level of achievement. S/he is commended for he/r
topic test results where s/he has demonstrated a high level of mathematical
understanding, problem solving and communication skills. YOUR CHILD is being
encouraged to closely monitor the due dates for homework tasks so s/he is able
to consistently complete these within the set time frames. Both assignments
have been neatly completed.
Description of Course:
SOSE
Students are required to provide evidence of their learning and achievement
from the state/territory curriculum. Students have addressed a number of key
historical periods of time including the Bombing of Darin, Indigenous Australians,
European settlement, the Gold Rush and the Development of Australia as a
nation separate from England. Students developed their knowledge of a range of
historical research skills and methodology.
Mathematics
Students were required to provide evidence of their learning and achievement in
the state/territory and Australian curricula in the areas of Measurement, Number
and Algebra. Students reflected on themselves as learners of Mathematics and
developed a range of skills and concepts. They applied these to solve problems.
There was a focus on the analysis and deconstruction of problems which were
required to be effectively communicated. Students had many opportunities to
provide solutions in a range of forms including mental, written and electronic
methods