Escolar Documentos
Profissional Documentos
Cultura Documentos
Lingayen Campus
Lingayen, Pangasinan
S:Y 2016-2017
(Assessment 1)
Submitted by:
Lyka G. Sison
III_BSEd Filipino
Submitted to:
Instructor
6.4 Exercises
D.
1.
An expectancy table tries to predict how well a person will perform on one exam
from the results of a previous exam. You may use an expectancy table to
determine if a person will graduate from his or her SAT scores.
3. Enumerate the three types of validity evidences. Which of these types of validity is the
most difficult to measure? Why?
There are essentially three main types of evidence that may be collected: content-
related evidence of validity, criterion-related evidence of validity and construct-
related evidences of validity.
Regardless of how construct validity is defined, there is no single best way to study
it. In most cases, construct validity should be demonstrated from a number of
perspectives. Hence, the more strategies used to demonstrate the validity of a
test, the more confidence test users have in the construct validity of that test, but
only if the evidence provided by those strategies is convincing.
In short, the construct validity of a test should be demonstrated by an
accumulation of evidence. For example, taking the unified definition of construct
validity, we could demonstrate it using content analysis, correlation coefficients,
factor analysis, ANOVA studies demonstrating differences between differential
groups or pretest-posttest intervention studies, factor analysis, multi-trait/multi-
method studies, etc. Naturally, doing all of the above would be a tremendous
amount of work, so the amount of work a group of test developers is willing to put
into demonstrating the construct validity of their test is directly related to the
number of such demonstrations they can provide. Smart test developers will stop
when they feel they have provided a convincing set of validity arguments.
4. What is the relationship between validity and reliability? Can a test be reliable and yet
not valid? Illustrate.
If data are valid, they must be reliable. If people receive very different scores on a
test every time they take it, the test is not likely to predict anything. However, if a
test is reliable, that does not mean that it is valid. For example, we can measure
strength of grip very reliably, but that does not make it a valid measure of
intelligence or even of mechanical ability. Reliability is a necessary, but not
sufficient, condition for validity.
5. Discuss the different measures of reliability. Justify the use of each measure in the
context of measuring reliability.
7.8 Exercises
A.
B.
1.
5. Solving an algebraic verbal problem involving two linear equations in two unknowns
8. Writing an autobiography
8.9 Exercises
NORM-REFERENCED GRADING
A norm-referenced grading system assumes that classroom achievement will
naturally vary among a group of heterogeneous students because of differences in
such characteristics as prior knowledge, learning skills, motivation, and aptitude.
Under ideal circumstances (hundreds of scores from a diverse group of students),
this variation produces a bell-shaped, or "normal," distribution of scores that
ranges from low to high, has few tied scores, and has only a very few low scores
and only a very few high scores. For this reason, norm-referenced grading
procedures are also referred to as "grading on the curve."
In norm-referenced systems students are evaluated in relationship to one
another (e.g., the top 10% of students receive an A, the next 30% a B, etc.). This
grading system rests on the assumption that the level of student performance
will not vary much from class to class. In this system the instructor usually
determines the percentage of students assigned each grade, although this
percentage may be determined (or at least influenced) by departmental
expectations and policy.
The performance of the student is evaluated relative to the performance of other
students within a group.
The performance of the student depends on the performance of the other student
in the class.
Advantages:
Disadvantages:
One objection to norm-referenced systems is that an individual's grade is
determined not only by his/her achievements, but also by the achievements of
others. This may be true in a large non-selective lecture class, where we can be
fairly confident that the class is representative of the student population; but in
small classes (under 40) the group may not be a representative sample. One
student may get an A in a low-achieving section while a fellow student with the
same score in a higher-achieving section gets a B.
A second objection to norm-referenced grading is that it promotes competition
rather than cooperation. When students are pitted against each other for the few
As to be given out, they're less likely to be helpful to each other.
The performance of a student is not only determined by his achievement, but also
the achievement of the other students. It promotes competition among the
students rather than cooperation. It cannot be used when the class size is smaller
than 40. Not all the student can pass the given subject or course.
Possible modification :
When using a norm-referenced system in a small class, the allocation of grades can
be modified according to the caliber of students in the class. One method of
modifying a norm-referenced system is anchoring. Jacobs and Chase in Developing
and Using Tests Effectively: A Guide for Faculty, 1992, describe the following
ways to use an anchor:
"If instructors have taught a class several times and have used the same or an
equivalent exam, then the distribution of test scores accumulated over many
classes can serve as the anchor. The present class is compared with this
cumulative distribution to judge the ability level of the group and the
appropriate allocation of grades. Anchoring also works well in multi-section
courses where the same text, same syllabus, and same examinations are used.
The common examination can be used to reveal whether and how the class
groups differ in achievement and the grade in the individual sections can be
adjusted accordingly... If an instructor is teaching a class for the first time and
has no other scores for comparison, a relevant and well-constructed teacher-
made pretest may be used as an anchor."
Advantages
The performance of the students will not be affected by the performance of the whole
class. It promotes cooperation among the students. All students may pass the subject or
course when they meet the standard set by the teacher.
Disadvantages
It is difficult to set a reasonable standard if it is not stated in the grading policies of the
institution. All students may not pass the subject or course when they do not meet the
standard set by the teacher or the institution.
Advantages:
Students are not competing with each other and are thus more likely to
actively help each other learn. A student's grade is not influenced by the caliber
of the class.
Disadvantages:
Possible modifications: