Escolar Documentos
Profissional Documentos
Cultura Documentos
Cognitive psychology
interested in understanding how people solve problems
discover and understand the details of
History
developed during World War II
develop procedures for the selection and classification of
aircrew
This method also has parts, involving several different
people in different roles
collecting the data and
analyzing it
questionnaires.
original critical incident technique
Concurrent recording of these observations
is recommended in the original papers, but
they were acknowledged to be impractical
in many of the situations the critical
incident technique was used
e.g., during combat missions.
The observations gathered are then
categorized and interpreted by analysts and
put into a final report that summarizes the
findings.
the general procedure for the critical incident
technique
Someone performs a task in the real world
e.g., combat missions.
An observer (who may or may not be the
"someone" performing the task) reports critical
incidents after the fact in interviews or
questionnaires administered by an analyst.
The analyst categorizes and interprets the
observations.
The analyst writes a summarizing report of the
data and interpretations.
modified procedure of Usability studies in UI design have
A user thinks aloud as he or she performs a task using a
prototype of the computer system being evaluated,
usually in a laboratory setting, usually videotaped or
using screen-and-voice capture software.
An analyst (who is not the user performing the task)
looks at the recording of the think-aloud protocol
session and reports critical incidents using the UAR
format.
The analyst categorizes and interprets the observations.
The analyst writes a summarizing report of the data
and interpretations.
Differences between the two procedure
users take the role of the "someone
performing the task,"
the usability analyst takes both the roles of
the observer and the analyst in the original
critical incident technique.
Watching the videotape and writing up
UARs takes the place of after-the-fact
interviews or questionnaires.
Definition of the term “critical incident” (关键事
件)
The definitions used in the 1954
Psychological Bulletin paper are as follows:
“By incident is meant any observable
human activity that is sufficiently complete
in itself (本身) to permit inferences
and predictions to be made about the
person performing the act."
Definition of the term “critical incident”
“To be critical, an incident must occur in a
situation where the purpose or intent ( 目
的 ) of the act seems fairly clear to the
observer and where its consequences are
sufficiently definite (明确) to leave
little doubt concerning its effects."
"Such incidents are defined as extreme
behavior, either outstandingly effective or
ineffective with respect to attaining the
general aims of the activity."
The three important concepts mapped into UAR format
3.1.2 Ethics For Empirical Studies
Testing the Interface,not the participant
Voluntary Participation
Maintain Anonymity
Informed Consent
Laws
You are testing the Interface, Not the Participant
Your attitude
you are testing the interface, not the participant
including
procedures
data analysis
You are testing the Interface, Not the Participant
What we do?
No matter what the participants do,
always ask yourself what about your system
indicate them in that direction,
rather than blaming the participant for not
knowing enough or not paying attention or not
reading.
You are testing the interface, not the participant.
Voluntary Participation
Laws
laws that govern the use of humans in empirical
studies.
consist of federal regulations
tell you
when they apply and
what you have to do to comply with them
what types of observations are exempt from the
regulations
Institutional Review Board( 评审 委员会 )
Be dictated when an organization must form
"Institutional Review Board"
to oversee all observations of people
all universities that receive federal funding must have
IRBs
Laws
Most recent version is available in the website
the regulations for the "Protection of Human Research
Subjects,“
http://www.access.gpo.gov/nara/cfr/cfr-
retrieve.html#page1
In USA,it is your and your organization's
responsibility to understand how these laws
apply to your work
In otherconutry, it is your responsibility to
determine what the relevant laws are and apply
them
All the steps like maintaining anonymity and
informed consent has to be followed even if your
are exempted from government regulations
3.2 How to Conduct a Think-Aloud Usability Test
documentation
online help
other resources
tasks
participants
E.g.,
Choosing task to test the date/time control panel
The Content of the Tasks
pick tasks that reflect actual or expected
use of the system
If asimilar system is already in place (even a
paper-based system), actual tasks can be
determined by observing what people already do
with the existing system.
The Content of the Tasks
pick several tasks from the many you
identify and include them in the test suite.
When You identify many real tasks through
observation, interview, or sometimes data
collected for other purposes
include the most frequent tasks in the test
getdata on how your system supports the tasks
users will do on the most regular basis
The Content of the Tasks
include tasks that cover the range of
functionality of the system
exercise every part of it.
If you do not do this, then the entire system is not
tested, only the subset represented in the tasks
you chose.
Make sure you include not only tasks that create
things from scratch but also delete or modify
existing things.
The Content of the Tasks
include error-recovery tasks
e.g., put the
person in a situation where they
would be had they made an error and ask them to
recover from it.
include in the test suite very important
tasks, though possibly infrequent
E.g., emergency procedures are used
infrequently, but should be tested because they
are safety-critical.
The Need for Training to Do the Tasks
the training must be counted as part of the
test suite of tasks
If require training to accomplish those tasks.
The task set becomes
Should be Rather than
1."Learn to do 1."Task A."
Task A“
2."Task A,"
The Need for Training to Do the Tasks
This allows you to test training materials as
well as the system
test training materials :
lectures
online tutorials.
The Duration of the Tasks
users cannot do tasks for a relatively long
time, E.g.,
more than an hour at a time, or
more than two hours on a single day.
but
modify them
we see that
the usability target in this case will not require user
training and
that the tasks will last no more than a few minutes .
These facts make it reasonable to require participants to
carry out all tasks in one single test session.
the Date/Time control panel no tasks would integrate
with other applications.
Example: Choosing Tasks to Test the Date/Time Control
Panel
The suite of tasks to use in testing the usability of
the Date/Time control panel would include the
following:
Set the time
Set the date
Set the time zone
3.2.3 Preparing For Observation
Note :
Before running any actual think-aloud sessions, there are
several preparations need to make.
Contents
Setting Up a Realistic Situation for Data Collection
Writing Up the Task Scenarios
Recruiting Users
Setting Up a Realistic Situation for Data Collection
This situation will differ depending on the
system you are testing.
If system is a desktop system for a PC, an office-like set-
up is fine.
traditionally set-up for usability labs in large
corporations, Microsoft and Sun.
Setting Up a Realistic Situation for Data Collection
For many more types of computer systems,
realistic set-ups take on new characteristics.
Testing the interface to a microwave oven is best done in
a kitchen-like setting.
Testing a PDA can be done almost anywhere, but you
need to pay attention to lighting conditions that will be
found in offices, in cars, outdoors, and on public
transportation.
Navigation devices to be placed in cars need to be tested
while driving (or in driving simulators).
Setting Up a Realistic Situation for Data Collection
The variety is endless today with the ubiquity of
computer systems.
In general, think carefully about how your
system will eventually be used and set up the
situations to match those uses as closely as
possible.
Setting Up a Realistic Situation for Data Collection
Difference situations , difference the data-collection
process.
traditionally capture the actions and voice of the users :
fixed video cameras
microphones.
In the more mobile situations, more dynamic data-
capturing techniques must be employed.
wireless microphones
operate a camcorder and follow the user around as
they work.
Some new software capture and playback the user's
actions and voice with a portable computer.
Setting Up a Realistic Situation for Data Collection
each situation has its own requirements, the solutions
will vary. capture at least :
what users do with your system (including observations about
what is being displayed to your users)
what users say as they think aloud, synchronizing what they
say with what they do.
When record the user's actions, you will need record the
time during the session.
some video recorders can put a timestamp on the signal.
screen-capture software have a system clock within the area of the
screen when the software is capturing.
Writing Up the Task Scenarios
Write a statements of tasks for the
participants to perform. How to do?
each task is written on a separate sheet of paper
given to the user one task at a time.
After explaining
ask the participant if any questions exist about
the think-aloud procedure or anything else so far.
Example: Introducing Participants to a Think-Aloud
Usability Test of the Date/Time Control Panel
For preparation the test of the Date/Time control panel,
incorporated all the information into a script.
The analyst follow the script when testing the software.
Following the script are several recordings that capture
our delivery of this introductory material to a typical
user.
As with all informal observations, the analyst may not
have adhered perfectly to the script.
Deviations or omissions from the script in the recordings can be
found
Typically experienced analysts are able to follow a script closely
without appearing to be unnatural or to lack spontaneity.
Recordings of Participant Briefings
1. The first recording is of the analyst describing the purpose of the
study in general terms. Refer to that subsection of this module,
the script above, and the consent from, as you view it. Think
about what the analyst is doing according to the procedures
presented in this module and what she is doing that deviates from
those procedures.
Click Introduction_&_Consent to view the recording.
After you have viewed the recording, click Intro_Critique for our critique of
the recording.
2. The second recording is of the analyst introducing the think-
aloud technique. Refer to the relevant previous subsection, and to
the script above, as you view it. Think about how the analyst is
following the procedures presented in this section and what she is
doing that deviates from those procedures.
1. Click How-to-TA to view the recording.
2. After you have viewed the recording, click TA_Critique for our critique of the
recording.
Recordings of Participant Briefings
1. The third recording is of the analyst demonstrating the think-
aloud technique and giving the participant a chance to practice.
Refer to that subsection of this section and the script above as
you view it. Think about where the analyst follows and deviates
from the procedures presented in this section.
1. Click Practice to view the recording.
2. After you have viewed the recording, click Practice_Critique for our critique
of the recording.
2. The last recording is of the analyst explaining the rules of the
observation and giving the participant the task. Refer to that
subsection of this section and the script above, as you view the
recording. Think about where the analyst follows and deviates
from the procedures presented in this section.
1. Click Rules_& Tasks to view the recording.
2. After you have viewed the recording, click Rules-&-Tasks_Critique for our
critique of the recording.
3.2.5 Conducting the Observation
Contents
Introduce the observation phase
Begin the observation
Introduce yourself
contents
Establish criteria for Critical Incidents
Paper UARs
continue to be needed for many reasons
easier to carry and hand out copies at a meeting
Understanding
guessing
TA UAR 2
TA UAR 3
Example : Analyze The Observation Of Date/Time
Summarizing Reports
Example : A summarizing Report for the
Date/Time
Summarizing Report
Communicate the Results of the usability
analysis
If the problem is severe it should not exceed three
pages
If the problem is small then ranked them
In the report usability aspect must be fixed along
with usability problem
If needed produce “highlight” videotape to
support your report that summarizes all the
problems mentioned in the UAR’s
Example: A Summarizing Report for the Date/Time Control
Panel
If the think-aloud data were all you had to
report on, a summarizing report would not
be necessary, because
thethree UARs themselves would be sufficiently
short for everyone to read.
For finding possible redesigns include the
28 UARs of Heuristic Evaluation
SampleOfSummarizingReport.doc
Take assessment
Exercise 7
3.3Think-Aloud Testing vs. Heuristic Evaluation
3.3.1 Comparing Think-Aloud Usability
Testing With Heuristic Evaluation
3.3.1 Comparing Think-Aloud Usability Testing With Heuristic
Evaluation
In the section, we will discuss
Many usability aspects identified in
HE are
confirmed in Think-Aloud Usability Tests
When HE predictions are not confirmed by
Think-Aloud.
“False Alarms” Vs True Problems
HE21:
Is inefficient, since a visual search through such a long
list takes too long .
TA:
...of course, it isn't in any kind of order, so we just have
to sit here and scan through lots and lots and lots of
cities (in an exasperated voice)…and then when we
realize we went in the wrong direction, we have to go
alllll the way back up (the user sighs).
evidence of the user's perception of inefficiency
And …
HE27:
thevery incompleteness of the list of cities would
cause users confusion.
TA:
he fails to find either Manila or the Philippines
and finally gives up on the task (time 16:30:00)
All this evidence supports the problem
identified in HE28:
theListBox is a bad control for setting the time
zone.
Example for supporting good feature
HE5 and HE16
TA
Theuser has no difficulty using the OK button in
several instances (times 16:30:10 and 16:30:50),
HE22
seeing the time zone setting on the Date/Time tab
TA
"of course, we are still in the wrong time zone"
When HE Predictions are not Confirmed by Think-Aloud Usability Tests
In such circumstances
probably decide to fix these problems predicted
by heuristic evaluation,
Based on their relative severity and cost of
applying the fixes.
Think-Aloud Usability Tests Can Show Things HEs Can't
Show
some problems heuristic evaluation cannot show in
principle
Some ohter does not show in practice.