Escolar Documentos
Profissional Documentos
Cultura Documentos
System
Lecture 8: Automatic indexing, Abstracting and Bibliographies
Introduction
3.
Process of indexing
The basic idea behind Luhns theory was that the more
frequent the occurrence of a term in a given document,
the more significant is that term in denoting the subject
content of the document. Therefore, by counting the
frequency of occurrences of all the words in a given
document one can identify the most significant words
that can represent the subject of the document.
4.
Definitions
Types of Abstract
Abstract may differ according to their writer, purpose, and style. Guinchat
and Menou suggest that the various types of abstract can be distinguished
by:
Their length, which normally ranges from a few dozen to several hundred words,
and is occasionally over a thousand
The amount of detail, certain abstracts (known as indicative abstracts) simply
provide a brief summary, whereas others (known as informative abstracts)
include a varying number of points that are likely to interest the user
The inclusion of judgments or critical analysis, which may amount to some form
of evaluation of the document
Whether the indexer deals with the whole document or only with aspects that
are likely to interest the user (known as slanted abstract)
Whether the author of the abstract is the author of the original document or
some other person
The language used, which may be a natural language or a more formalized
(artificial) language.
Kinds of abstract
Abstract by writer
Abstracts
Abstracts by purposes
Abstracts are written with certain purposes in mind, and therefore there may be
different abstracts to serve different purposes. Borko and Bernier have identified
four different types of abstracts: the indicative abstract, informative abstract,
critical abstracts, and special purpose abstract.
Informative abstract to provide readers with quantitative and qualitative
information as presented in the parent document.
Indicative abstract indicates what the parent document is all about
Critical abstract, kind of critical comments or review by the abstractor.
Some abstracts may have been written to serve a special purpose or with a
specific category of users in mind. Such abstracts are called slanted or special
purpose abstracts.
Modular abstract, abstractor is expected to prepare different kinds of abstractsindicative, informative, critical, and so on
Abstracts by form
Qualities of abstracts
Abstract should possess the following qualities:
1.
2.
3.
4.
Dissemination of information
Selection of information by the end-user
Retrieval of information, especially in computerized
information retrieval systems
2.
3.
5.
6.
7.
Guidelines
Guidelines for abstracts
1.
2.
3.
4.
Read the whole document at least once to obtain a clear idea about
its essential contents and special features, such as tables, illustrations,
and list of references.
After reading, examine the original document carefully with
emphasis on the author abstract, if any, the first and the last
paragraphs, and the key sections, such as introduction, purpose,
conclusion, summary and recommendations. Also take note of the
footnotes, if any.
Underline, if necessary, the key phrases and sentences, while reading
and examining the original document.
Write the abstract in the style and the manner in which you feel
most comfortable and efficient of course being consistent with the
principles and rules formulated for the purpose.
Guidelines
5.
6.
7.
8.
9.
10.
Automatic abstracting
Automatic abstracts
Abstracting
Bibliographies
Preparing a bibliography
Bibliographies
Types of bibliographies
Vocabulary Control
Introduction
2.
Thesauri
2.
3.
4.
(b)
To control the term used in indexing, providing a means of translating the natural
language authors, indexers and enquirers into the more constrained language used for
indexing and retrieval.
To ensure, through the provision of a controlled language, consistent practice between
different indexers, employed by the same agency, or between indexers employed by
the different agencies in a cooperative network;
To limit the number of terms that need to be assigned to a document. The terms
assigned to a document should represent, as specifically as possible, the concepts
described by the author, but they need not include terms of broader conference and
can therefore be displayed effectively in a thesaurus
To serve as a search aid in retrieval, including retrieval from free-text systems
Structure: A thesaurus displays, through its structure the synonymous, hierarchical and
other relationships between their terms which together comprise an indexing
language.
Summary
References