Escolar Documentos
Profissional Documentos
Cultura Documentos
Field?
www.linkedin.com/in/annetoar
2
History of databases
IDMS
Magnetic tape
flat (sequential) files
Precomputer
technologi
es:
Printing
press
Dewey
decimal
system
1940-50
Punched
cards
ADABAS
Magnetic Disk
HBase
Access
System R
Postgres
Oracle V2
MySQL
Dynamo
MongoDB
Redis
VoltDB
Neo4J
1950-60
1960-70
1970-80
19902000
1980-90
20002010
SQL Server
Relational
Model
defined
Sybase
Informix
IMS
Network Model
Hierarchical
model
IndexedSequential Access
Mechanism
(ISAM)
Ingres
DB2
dBase
Aerospike
Hana
Riak
Cassandra
Vertica
Hadoop
2005
2013
Map Reduce
Big Table
Hadoop at Yahoo
Collective
intelligence
Programs that use
inputs from crowds
to simulate
intelligence
Predictive
analytics
Programs that
extrapolate from
past to future
NoSQL
Web
servers
Memcached
sServers
Database
Servers
Shard (A-F)
Shard (G-O)
Shard (P-Z)
No means yes!
New SQL
Name DOB
Salary
Sales
Dick
21/12/60 67,000
78980
Expens
es
3244
100
1
100
2
100
3
100
4
100
5
Jane
12/12/55 55,000
67840
2333
67890
6436
Dan
15/03/75 65,200
98770
2345
43240
3214
Block ID
Name DOB
1001
Dick
21/12/60 67,000
Salary
1002
Jane
12/12/55 55,000
1003
1004
Dan
1005
15/03/75 65,200
Sales Expense
s
7898 3244
0
6784 2333
0
6789 6436
0
9877 2345
0
4324 3214
0
Block
1
2
3
4
5
Dick
Jane
21/12/60 12/12/5
5
67,000
55,000
78980
67840
3244
2333
Robert
17/02/8
0
22,000
67890
6436
Dan
15/03/75
Steven
11/11/81
65,200
98770
2345
76,000
43240
3214
Analytical queries
Row oriented database
SELECT
SUM(salary) FROM
saleperson
Block ID
Name DOB
1001
Dick
21/12/60 67,000
Salary
1002
Jane
12/12/55 55,000
1003
1004
Dan
1005
15/03/75 65,200
Sales Expense
s
7898 3244
0
6784 2333
0
6789 6436
0
9877 2345
0
4324 3214
0
Block
1
2
3
4
5
Dick
Jane
21/12/60 12/12/5
5
67,000
55,000
78980
67840
3244
2333
Robert
17/02/8
0
22,000
67890
6436
Dan
15/03/75
Steven
11/11/81
65,200
98770
2345
76,000
43240
3214
Compression
Row oriented database
Block ID
Name DOB
1001
Dick
21/12/60 67,000
Salary
1002
Jane
12/12/55 55,000
1003
1004
Dan
1005
15/03/75 65,200
Sales Expense
s
7898 3244
0
6784 2333
0
6789 6436
0
9877 2345
0
4324 3214
0
Block
1
2
3
4
5
Dick
Jane
21/12/60 12/12/5
5
67,000
55,000
78980
67840
3244
2333
Robert
17/02/8
0
22,000
67890
6436
Dan
15/03/75
Steven
11/11/81
65,200
98770
2345
76,000
43240
3214
Inserts
Row oriented database
INSERT INTO
salesperson
Block ID
Name DOB
1001
Dick
21/12/60 67,000
Salary
1002
Jane
12/12/55 55,000
1003
1004
Dan
1005
15/03/75 65,200
Sales Expense
s
7898 3244
0
6784 2333
0
6789 6436
0
9877 2345
0
4324 3214
0
Block
1
2
3
4
5
Dick
Jane
21/12/60 12/12/5
5
67,000
55,000
78980
67840
3244
2333
Robert
17/02/8
0
22,000
67890
6436
Dan
15/03/75
Steven
11/11/81
65,200
98770
2345
76,000
43240
3214
SSD and
in-memory databases
In-memory databases
Cost of RAM
falling 50% each
18 months.
Some databases
can fit entirely
within the RAM
of a single
server or cluster
of servers
Data Scientist
Thank you