Escolar Documentos
Profissional Documentos
Cultura Documentos
discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/216518301
CITATIONS
READS
13
103
1 AUTHOR:
Alexander Halavais
Arizona State University
42 PUBLICATIONS 421 CITATIONS
SEE PROFILE
are printing presses or microcomputers. Central control is more likely when the means of
communication are concentrated, monopolized, and scarce (1985, p.5). From early in
the evolution of computer networks, they appeared to engender the sort of massively
distributed and decentralized communication conducive to democratic deliberation.
This promise has gone largely unfulfilled. In particular, we have not seen the sort
of anticipatory democracy Alvin Toffler predicted would increasingly involve the
citizen in policy creation (1970, 1978). This is not due to a lack of active discussion
supported by computer networks. While these on-line discussions are not
unproblematicand work has been done that demonstrates the degree to which such
discussion can vary from what we might consider the rational ideal at the root of
participatory democracymany such discussions can and do lead to informed consensus
over policy issues. The question is how to link this consensus to those who actively shape
policy (Dahlgren, 2000).
Many see the Internet as a technology encouraging discussion and integration at
the community level, and hope that virtual communities can serve to enhance rather
than distract from traditional, geographically-bound local communities (e.g., Agre &
Schuler, 1997; Doheny-Farina, 1998). James Fishkin (1995, p. 24) notes that democracy
as it is practiced today in America often appears more like the Spartan tradition of
measuring the level of applause to elect a representative (during an event called the
Shout) than the deliberative model found in Athens. The quantity of response is held to
be more important than the content. Electronic media has until this point enhanced rather
than reversed this trend. Legislators and policy-makers are sometimes aware of the
volume of the telephone calls, emails, and faxes they receive, but all detail is lost in the
shout. From the perspective of policy-makers, public deliberation is generally limited
to what is visible in the national daily newspapers. In those cases where public opinion is
in conflict with the general opinions represented in the mainstream newsas in the
nomination of Zo Baird, for examplethe only indication is often this sort of direct
logging of telephone calls, faxes, and mail (Page, 1996, c. 4).
The challenge is to find ways of representing discussion and deliberation, so that the
public discourse finds its way to appropriate representatives. Some have placed the onus
of this communication on the citizen, and recommended ways in which activists can
make use of new communication technologies to make their voice heard (e.g., Kush,
2000; Browning & Weitzer, 1996). Thus far, computer networking has, on the whole,
reinforced existing political processes, rather than causing radical change (Davis, 1999).
The history of using computer networks as catalysts for policy discussions
reaches into the 1970s (Hiltz & Turoff, 1993). Such attempts examine the formal
structure of the software and rules for discussion in the hope that an idealized public
sphere of discussion can be obtained. The Online Deliberative Discourse Project at
Harvards Berkman Center for Internet and Society1 and attempts by the Summit
Education Initiative2, for example, are exploring ways in which computer networking can
be used to establish temporary public fora to discuss policy issues. Such projects are
promising, but they ignore the discussion that already exists informally across the
Internet on email lists and discussion boards, blogs, and homepages.
1
2
http://cyber.law.harvard.edu/projects/deliberation/
http://www.seisummit.org/od-overview.htm
of quality. The resulting ranking system has proven effective, if only by the popularity of
the search engine.
Second, a new feature on the Google site trolls the news sites on the web and
generates a summary. Beyond this basic function it also automatically groups stories with
similar content from different sources. Other systems, like NewsSeer, perform a filtering
function by tracking a users preferred stories and proffering similar ones on subsequent
visits. While PageRank filters sites by popularity, Google News (and similar systems) are
biased in favor of those sites most rapidly or recently updated.
Finally, Googles Zeitgeist report is generated on a weekly basis. It assumes
that the terms searched for on the popular search engine are in some way indicative of the
collective thoughts and aspirations of its millions of users, Google assembles weekly lists
of the largest gaining and losing search terms. It provides both overall lists of these
changes, as well as more topically specialized lists (see fig. 1). Alexa provides a similar
metric for websites, suggesting in their Movers and Shakers listings which sites have
had the greatest increases in traffic. Companies offering corporate intelligence, or web
intelligence offer similar trend-watching to their clients. This study utilizes such an
approach to track and summarize the topics of conversation on a sample of blogs,
emphasizing differences and anomalies, either in topic or vocabulary.
The first of these approaches, the use of automated hyperlink analysis to
determine pages with the highest reputation, has already been applied to the blogging
world. MIT Media Labs Blogdex indexes nearly 10,000 blogs and analyzes the links
among them (Kahney, 2001). In fact, it includes an element of the zeitgeist approach as
well, by ranking sites according to changes in linkages rather than simply linkages.
The second approach finds a parallel in meta blogs and announcement sites.
The original role of blogs was to provide a kind of filtering of the web: a cross between a
site providing bookmarks, new site announcements, and reviews. Many successful
collective blogs remain more or less true to this vision, filtering either web sites or
mainstream news. Some of the most popular group blogs, like Slashdot and Metafilter,
perform this function. Unlike Googles site, however, these filters tend not to detect not
general trends. Others do a better job of summarizing blog content, and as processes for
syndicating content become more widespread, sites that summarize content will likely
gain in popularity. Google News benefits from a set of news sources that tend to be fairly
uniform in terms of topics and approaches, something that certainly cannot be said of the
blogosphere. As with Google News, there are also sites that focus on the most recently
updated blogs, including Weblogs.com and Daypop.com.
The final approach, inspired by Googles Zeitgeist, is the one followed here.
Rather than recording the changes in keyword frequency, the changes in word frequency
were recorded from week to week, in an attempt to gauge what could be considered the
hot topics. This measurement of topicality among blogs can then be considered in
relation to a similar measure of news sites to determine the ways in which they differ.
Finally, once the broad outlines of blog discourse are discovered, questions related to
those topics can be examined and illustrated with specific key phrases from the original
work.
A separate tally was generated of words that appear in only one of the lists.
Overall Blog-Agenda
The final product of the collection was five lists of words that represented the
greatest changes from week to week. From these, the obvious artifacts were removed
(e.g., the days of the month or month of the year). What remained, with some
consistency, was the major news stories of the periods studied, with a few items that
seemed to have an impact more specifically on the blog community.
A list of the top five topics during each period is provided in table 1. In practice,
assembling news-related items was relatively straightforward, and these dominated the
top of the lists. An informal comparison to the top stories reported in the news during the
same period makes clear that these also represented the most commonly shared content of
personal blogs. Unfortunately, given that the measures were not very finely grained,
determining the lag between release and comment, or the degree to which particular
stories were more or less popular, was difficult.
There were some unusual emphases. It seemed, for example, more unusual or
controversial topics were more likely, naturally, to be discussed. There also seemed to be
an enduring interest in the events in the Israeli-Palestinian conflict, as well as possible US
involvement in Iraq. There was a further emphasis on environmental issues, and on issues
broadly related to the Republican party.
Some items seemed to be unique to the blog world and of interest particularly to
the blog community. For example, the Friday Five4, a set of personal questions asked
each Friday and answered on personal blogs, seemed to be popular among a relatively
large number of blogs in the sample. The results tended to present themselves in the
results. For example, when the Friday Five from August 9 asked questions about the
authors cars, this led to a spike in not only the word car but the names of a number of
automobile manufacturers. Similar trends were detectable but remained subsidiary to a
focus on news items.
Finally, some words were impossible to clearly categorize, yet seemed to change
significantly from week to week. The first period, for example, saw a substantial rise in
the words anyway and simple. These were used in a variety of contexts. Two
possibilities present themselves as explanations. First, it may be that with a larger sample
these anomalies will recede. Given the impact a blog dedicated to space exploration had
on the averages (by repeating the word scramjet many times, for example), it is certain
that a larger sample would yield more consistent results. Second, it is possible that those
who are exposed to a particular word are more likely to continue to use it in their own
work. Normally, this possibility would seem remote, but the increase of the use of the
words sign and signs during the period a new film titled Signs was released is only
partially attributable to mention of the name of the movie and its discussion. It seems that
the term is used more frequently even when not in connection with the film. Again, more
finely grained sampleson a daily basis, for examplewould provide evidence if this is
the case.
Crystal Gazing
This represents a very preliminary effort at analyzing changes in Weblog activity.
Even as such, it introduces some potentially interesting insights. The Blogdex project has
demonstrated that mainstream news items are the most popular target of links among
4
http://www.fridayfive.org
bloggers, and this survey not surprisingly supports this by indicating that popular news
stories are the most common topic of discussion across a sample of pages. However,
there does seem to be an emphasis on issues that are particularly contentious. As Hiler
(2002b) has suggested, it may be that blogs largely represent an extension of commentary
found in other news media. A comparison of topics found in blogs and on the opinion
pages of newspapers might yield interesting similarities.
A significant limitation of the approach used here is that it focuses on centrality
measures of textual changes. I cannot think of another corpus that would be as eclectic as
the text collected from these blogs; many are not even internally cohesive, so it is
preposterous to expect the whole to make sense. The identified trends rarely rose beyond
a few dozen blogs within the sample. Therefore, any indication that this represents a
plurality of interest or opinion is probably an overstatement. Nonetheless, it is interesting
to find common threads among such a diverse set of sources.
For an automated blog weather report to be created, the context of these
changing terms must be analyzed and categorized. The largely interpretive categorization
undertaken here is neither reliable nor scalable. A number of options are available for
determining clusters of words within the text, and for summarizing the nature of these
clusters. Future work will concentrate on integrating these approaches.
Blogs, as a popular and accessible form of public discourse, represent a
potentially valuable source of public opinion and deliberation. Unless the ideas presented
in blogs are in some way collected and summarized, they risk being lost to the policymaker and to the larger public. This study represents a first step in the direction of
making that discourse transparent and available.
Bibliography
Agre, Phillip and Schuler, Douglas (eds.). (1997). Reinventing technology, rediscovering
community. Norwood, NJ: Ablex.
Bengston, David N. & Fan, David P. (2001). Trends in attitudes toward the recreation fee
demonstration program on the national forests: A computer content analysis approach.
Journal of Park and Recreation Administration, 19(4), 1-21.
Browning, Graeme & Daniel J. Weitzer (1996). Electronic democracy: Using the Internet
to influence American politics. Medford, NJ: ITI.
Dahlgren, Peter (2000). The Internet and the democratization of civic culture. Political
Communication, 17, 335-340.
Davis, Richard (1999). The web of politics. Oxford: Oxford University Press.
Doheny-Farina, Stephen (1998). Wired neighborhood. New Haven, CT: Yale University
Press.
Fishkin, James S. (1995). The voice of the people. New Haven, CT: Yale University
Press.
Gallagher, David F. (2002, 10 June). A rift among bloggers. New York Times, p. C4.
Hiler, John (2002a). Borg journalism. Microcontent News. April 1. Retrieved from the
web, August, 2002, http:/ /www.microcontentnews.com/ articles/ borgjournalism.htm .
Hiler, John (2002b). Blogosphere, the emerging media ecosystem. Microcontent News.
May 28. Retrieved from the web, August, 2002, http:// www.microcontentnews.com/
articles/ blogosphere.htm .
Hiltz, Starr Roxanne & Turoff, Murray (1993). The network nation, revised ed.
Cambridge, Mass.: MIT Press.
Kahney, Leander (2001, 30 July). Tracking bloggers with blogdex. Wired News.
Retreieved from the web, August, 2002, http:// www.wired.com/ news/ culture/
0,1284,45546,00.html .
Kush, Christopher (2000). Cybercitizen: How to use your computer to fight for all the
issues you care about. New York: St. Martins.
Kweit, Mary Grisez & Kweit, Robert W. (1987). The politics of policy analysis: The role
of citizen participation in analytic decision making. In Jack DeSario & Stuart Langton
(eds.), Citizen Participation in Public Decision Making, Contributions in Political
Science, no. 158, pp. 19-38. New York: Greenwood Press.
Nissan, Ephraim (ed.). (1994). From information to knowledge: Conceptual and content
analysis by computer. London: Intellect.
Page, Benjamin I. (1996). Who deliberates? Mass media in modern democracy. Chicago:
University of Chicago Press.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1998). The PageRank citation ranking:
Bringing order to the web. Stanford Digital Library Technologies Project. Retrieved
August, 2002 from http://stanford.edu/ ~backrub/ pageranksub.ps
Pennings, Paul & Keman, Hans (2002). Towards a new methodology of estimating party
policy positions. Quality & Quantity, 36(1), 55-79.
Pool, Ithiel de Sola (1983). Technologies of freedom. Cambridge, Mass.: Harvard
University Press.
Popping, Roel. (2000). Computer-assisted text analysis. Thousand Oaks, CA: Sage.
Toffler, Alvin (1970). Future shock. New York: Random House.
Toffler, Alvin (1978). Introduction of future-conscious politics. In Clement Bezold (ed.),
Anticipatory Democracy: People in the Politics of the Future. New York: Random
House.
Wadsworth, Deborah (1997). The publics view of public schools. Educational
Leadership, 54(5), 44-48.