Escolar Documentos
Profissional Documentos
Cultura Documentos
Pr
ev
ie
ted
gh
yri
op
-C
Ma
te
ria
l
Ma
te
ria
l
ted
gh
yri
op
Pr
ev
ie
-C
ISBN
Printed in Italy by
L.E.G.O. S.p.A.
ria
l
TIME
Ma
te
Hyperhistory
ted
Pr
ev
ie
-C
op
yri
gh
ore people are alive today than ever before in human history.
And more of us live longer today than ever before. Life expectancy is increasing (Figure ; see also Figure ) and poverty is decreasing (Figure ), even if the degree of global inequality is still scandalous.
As a result, disability is becoming the biggest health-related issue for
humanity.
To a large measure, the lines representing the trends on Figure and
Figure have been drawn by our technologies, at least insofar as we
develop and use them intelligently, peacefully, and sustainably.
Sometimes we forget how much we owe to ints and wheels, to
sparks and ploughs, to engines and computers. We are reminded of our
deep technological debt when we divide human life into prehistory and
history. Such a signicant threshold is there to acknowledge that it was
the invention and development of ICTs (information and communication technologies) that made all the difference between who we were,
who we are, and, as I shall argue in this book, who we could be and
become. It is only when systems to record events and hence accumulate
and transmit information for future consumption became available
that lessons learnt by past generations began to evolve exponentially, in
a soft or Lamarckian1 way, and so humanity entered into history.
80
70
60
50
30
1950
1960
1970
1980
1990
2000
Period
World
Less developed regions
ria
l
40
2010
2020
Ma
te
90
2030
2040
2050
ted
Fig. . Life Expectancy at Birth for the World and Major Development Group,
.
op
yri
gh
Source: Population Division of the Department of Economic and Social Affairs of the
United Nations Secretariat (). World Population Prospects: The Revision Highlights.
New York: United Nations.
Global poverty
-C
70
$2* a day
60
Pr
ev
ie
50
South Asia
$1.25* a day
40
$1.25* a day
(excl. China)
30
20
N
87
90
93
96
China
1981 84
0
East Asia and
Pacific
99 2002 05
08
Sub-Saharan
Africa
Latin America
and Caribbean
Middle East and
north Africa
Eastern Europe and
Central Asia
1981
2005
2008
Fig. . Poverty in the World Dened as the Number and Share of People Living
Below $. a Day (at prices) in .
Source: World Bank. # The Economist Newspaper Limited, London ( February ).
TIME
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
No ICTs
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
TIME
gh
ted
Ma
te
ria
l
op
yri
Create
Generate
Collect
information
Pr
ev
ie
-C
Recycle
Erase
Record
Store
Consume
Use
Distribute
Transmit
Process
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
Instructions
Consider the two diagrams in Figure and Figure . Figure is famous,
almost iconic. It is known as Moores Law and suggests that, over the
TIME
16-Core SPAEC T3
Six-Core Core i7
Six-Core Xeon 7400
10-Core Xeon Westmere-EX
2,600,000,000
8-Core POWER7
Dual-Core Itanium 2
Quad-Core z196
AMD K10
Quad-Core Itanium Tukwila
POWER6
8-Core Xeon Nehalem-EX
Itanium 2 with 9MB cache
Six-Core Opteron 2400
AMD K10
Core i7 (Quad)
Core 2 Duo
Itanium 2
Cell
1,000,000,000
100,000,000
AMD K8
10,000,000
68000
8086
Z80
6809
8008
MOS 6502
4004
RCA 1802
1971
ted
2,300
8085
6800
8080
80186
8088
1980
1990
gh
10,000
Ma
te
80386
80286
100,000
Atom
AMD K7
AMD K6-III
AMD K6
Pentium III
Pentium II
AMD K5
Pentium
80486
1,000,000
Barton
ria
l
Transistor count
Pentium 4
2000
2011
Date of introduction
yri
op
Source: Wikipedia.
Pr
ev
ie
-C
$100,000,000,000,000
UNIVAC I
$1,000,000,000,000
$10,000,000,000
IBM 7090
Commodore 64
$100,000,000
Altair 8800
$1,000,000
ria
l
ENIAC
Mac I
$100
$1
1940
1960
1970
ted
1950
Ma
te
$10,000
1980
1990
Gateway G6-200
2000
THE
iPad2
2010
HAMILTON
PROJECT
gh
yri
Pr
ev
ie
-C
op
TIME
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
$25.0
ria
l
$15.0
$10.0
Ma
te
Billions of Dollars
$20.0
$5.0
2010
2011
2012F
gh
2009
ted
$0.0
2013F
2014F
2015F
op
yri
Pr
ev
ie
-C
have to involve screens or keyboards. It may be a matter of a neuroprosthetic device implanted in the brain. Of course, in all human
computer interactions, the better the process, the computationally
greedier the ICT in question is likely to be. It takes a lot of MIPS to
make things easy. This is the reason why new operating systems can
hardly run on old computers.
We know that what our eyes can see in the worldthe visible
spectrum of the rainbowis but a very small portion of the electromagnetic spectrum, which includes gamma rays, X-rays, ultraviolet, infrared,
microwaves, and radio waves. Likewise, the data processing spectrum
that we can perceive is almost negligible compared to what is really
going on in machine-to-machine and humancomputer interactions.
An immense number of ICT applications run an incalculable number of
instructions every millisecond of our lives to keep the hyperhistorical
TIME
ria
l
Ma
te
60
ted
50
gh
yri
op
30
-C
BILLIONS
40
10
Pr
ev
ie
20
2003
2010
2015F
2020F
World Population
Connected Devices
90%
80%
ria
l
70%
Connection Devices
World Population
50%
ted
40%
Ma
te
BILLIONS
60%
30%
gh
20%
op
yri
10%
0%
2003
2015F
2020F
-C
2010
Pr
ev
ie
TIME
Data
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
4500
4000
ria
l
Exabytes
5000
3500
3000
Ma
te
2500
2000
1500
1000
0
2006
2007
2008
2009
2011
2012
2013F
2014F
2015F
yri
2010
gh
1970
ted
500
-C
op
Source: based on IDC white paper, The Diverse and Exploding Digital Universe, March
, and IDC white paper, Worldwide Big Data Technology and Service
Forecast, March .
Pr
ev
ie
TIME
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
problem of dening big with large (the NSF and NHI seem to be
happy with it), understanding big data in terms of small tools
suggests that data are too big or large only in relation to our current
computational power. This is misleading. Of course, big, as many
other terms, is a relational predicate: a pair of shoes may be too big for
you, but ne for me. It is also trivial to acknowledge that we tend to
evaluate things non-relationally, in this case as absolutely big, whenever the frame of reference is obvious enough to be left implicit.
A horse is a big animal, no matter what whales may think. Yet these
two simple points may give the impression that there is no real
trouble with big data being a loosely dened term referring to the
fact that our current computers cannot handle so many gazillions of
data efciently. And this is where two confusions seem to creep in.
First, that the epistemological (that is, knowledge-related) problem with
big data is that there is too much of it (the ethical problem concerns how
we use them, more on this presently). And, second, that the solution to
the epistemological problem is technological: more and better techniques and technologies, which will shrink big data back to a manageable size. The epistemological problem is different, and it requires
an epistemological solution.
Consider the problem rst. Big data came to be formulated after
other buzz expressions, such as infoglut or information overload,
began to fade away, yet the idea remains the same. It refers to an
overwhelming sense that we have bitten off more than we can chew,
that we are being force-fed like geese, that our intellectual livers are
exploding. This is a mistake. Yes, we have seen that there is an obvious
exponential growth of data on an ever-larger number of topics, but
complaining about such over-abundance would be like complaining
about a banquet that offers more than we can ever eat. Data remain an
asset, a resource to exploit. Nobody is forcing us to digest every
available byte. We are becoming data-richer by the day; this cannot
be the fundamental problem.
Since the problem is not the increasing wealth of data that is
becoming available, clearly the solution needs to be reconsidered: it
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
TIME
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
Memory
Hyperhistory depends on big data, but there are two myths about the
dependability of digital memory that should be exposed in this rst
chapter.
The rst myth concerns the quality of digital memory. ICTs have a
kind of forgetful memory. They become quickly obsolete, they are
volatile, and they are rerecordable. Old digital documents may no
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
TIME
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
vinyl to the CD. Huge quantities of data were left behind, becoming
lost, unavailable, or inaccessible.
According to a Research Report by IBIS World, the data
recovery industry saw its overall revenue over the ve years to
fall at an annualized rate of . per cent to total $ billion, with a
decline of . per cent in .11 This may seem counter-intuitive. Big
data is growing and so are the problems concerning damaged, corrupted, or inaccessible les and storage media. The industry that takes
care of such problems should be ourishing. The explanation is that
cloud or online storage has expanded the options for data recovery
and data loss prevention. If you use Dropbox, Google Docs, Apple
iCloud, or Microsoft Skydrive, for example, and your computer is
damaged, the les are still available online and can be easily recovered,
so you will not need a data recovery service. Yet, this seems to be just a
question of transition and hence time. Cloud computing has put
pressure on an industry specialized in computers at a consumer
level. The more our gadgets become mere terminals, the less we
need to worry ourselves about the data. But the storage of those
data still relies on physical infrastructures, and these will need increasing maintenance. The data recovery industry will disappear, but a new
industry dedicated to cloud computing failures is already emerging. It
is not a matter of relying on the brute force of redundancy (having
more than one copy of the same le). This strategy is not available at a
global level, because of the second myth about the dependability of
digital memory, the one concerning the quantity of digital memory.
Since , the world has been producing many more data than
available storage.12 This despite the fact that, according to Kryders
Law (another generalization), storage density of hard disks is increasing more quickly than Moores Law, so that it is predicted that in
a disk of terabytes will be . inches in size and will cost about $.
Unfortunately, this will not be enough, because even the growth
projected by Kryders Law is slow when compared to the pace at
which we generate new data. Think of your smartphone becoming
too full because you took too many pictures, and make it a global
TIME
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
ria
l
ria
l
Connectivity
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te
TIME
1.2E+09
1E+09
800000000
ria
l
Value
600000000
Ma
te
Thousands
Nodes
400000000
gh
10
15
Number of nodes
20
25
yri
ted
200000000
op
Pr
ev
ie
-C
ria
l
Conclusion
Pr
ev
ie
-C
op
yri
gh
ted
Ma
te