Escolar Documentos
Profissional Documentos
Cultura Documentos
Projetos |
Cursos Conecte-se
Tutoriais
Boletim de Notícias
Torne-se um membro.
Ascensão do Cientista de Dados Apoie um site independente.
Faça ótimos gráficos.
4 de junho de O que você ganha
2009
Tópico
Desenho ,
Estatística
Foto de majamarko
Como todos nós já lemos até agora, o economista-chefe do Google, Hal Varian , comentou
em janeiro que o próximo trabalho sexy nos próximos 10 anos seria o de estatísticos.
Obviamente, concordo plenamente . Heck, eu iria um passo além e diria que eles são sexy
agora - mentalmente e fisicamente.
No entanto, se você ler o restante da entrevista de Varian, saberá que, por estatísticos , ele
realmente quis dizer um título geral para alguém que é capaz de extrair informações de
grandes conjuntos de dados e, em seguida, apresentar algo útil para não- especialistas em
dados.
Essas habilidades realmente se encaixam perfeitamente com a dissertação de Ben Fry sobre
Design de Informação Computacional (2004). No entanto, Fry dá um passo adiante e defende
um campo totalmente novo que combina as habilidades e talentos de áreas de especialização
muitas vezes desconexas:
Think about all the visualization stuff you’ve been most impressed with or the groups that
always seem to put out the best work. Martin Wattenberg. Stamen Design. Jonathan Harris.
Golan Levin. Sep Kamvar. Why is their work always of such high quality? Because they’re
not just students of computer science, math, statistics, or graphic design.
They have a combination of skills that not just makes independent work easier and quicker; it
makes collaboration more exciting and opens up possibilities in what can be done.
Oftentimes, visualization projects are disjoint processes and involve a lot of waiting. Maybe
a statistician is waiting for data from a computer scientist; or a graphic designer is waiting for
results from an analyst; or an HCI specialist is waiting for layouts from a graphic designer.
Let’s say you have several data scientists working together though. There’s going to be less
waiting and the communication gaps between the fields are tightened.
How often have we seen a visualization tool that held an excellent concept and looked great
on paper but lacked the touch of HCI, which made it hard to use and in turn no one gave it a
chance? How many important (and interesting) analyses have we missed because certain
ideas could not be communicated clearly? The data scientist can solve your troubles.
An Application
This need for data scientists is quite evident in business applications where educated
decisions need to be made swiftly. A delayed decision could mean lost opportunity and profit.
Terabytes of data are coming in whether it be from websites or from sales across the country,
but in an area where Excel is the tool of choice (or force), there are limitations, hence all the
tools, applications, and consultancies to help out. This of course applies to areas outside of
business as well.
Even if you’re not into visualization, you’re going to need at least a subset of the skills that
Fry highlights if you want to seriously mess with data. Statisticians should know APIs,
databases, and how to scrape data; designers should learn to do things programmatically; and
computer scientists should know how to analyze and find meaning in data.
Basically, the more you learn, the more you can do, and the higher in demand you will be as
the amount of data grows and the more people want to make use of it.
Related
47 Comments
jeff — June 4, 2009 at 3:31 am
i’m curious about how you chose the term “data scientist” to describe this role. that’s
precisely the title we used for folks on the data team at facebook, chosen somewhat
arbitrarily as a contraction of “data analyst” and “research scientist”, with the same
skills in mind as you mention above. i also titled my chapter for “beautiful data”
“information platforms and the rise of the data scientist”. quite amazing if you
formulated the phrase separately! something in the air…
@jeff – ha, yes, there must be something in the air. i read a grant proposal two or
three years ago pushing for a new area of study called “data science.” it’s stuck
with me ever since.
Nathan – Nice synthesis and thanks for the shout-out. Ben Fry’s model captures the
various fields that comprise this interdisciplinary ‘data science’ quite elegantly. I’d
even venture to add some bidirectional arrows. Between the four core activities —
Munging, Modeling, Visualizing, and Interacting — there’s a lot of feedback.
And as far as sexiness goes, I’m still holding my breath for People magazine to release
its Sexiest Data Scientist Alive issue. It still may be a decade or more away.
@Michael – re:feedback definitely. fry gets into this as its one of his arguments for an
interdisciplinary field – whereas in a collaboration, a person would have to explain to
another what he wanted, have some misunderstandings along the way, and then wait.
I don’t see that you can’t apply the ‘data scientist’ skill set to the ‘senior
management in a major corporation’ job.
Being able to gather/create, parse/analyze, then present data in a meaningful way
would likely not hinder you.
I like Fry’s approach but in this time and age you’d have to accept that these skills are
not necessarily mastered by the same person.
The best mathematician could not the best graphic artist who is not necessarily the best
interface designer who is not always a subject-matter expert, etc.
Some people have all those skills and then some, but that’s more the exception than the
rule.
What about “storyteller?” Not trying to trivialize the issue at all, but the ability to
effectively communicate the relevance and import of the findings would seem to be the
skill that ties it all together. Completing the analysis isn’t the end of the project, getting
the HiPPO’s sign-off is. If all the effort doesn’t go toward meeting an organizational
goal, it’s wasted.
@Craig: The best answer is to look at senior managers whose work you admire, and
see how they did it. For example, often specialized knowledge is replaced by the
ability to notice, nurture, and exploit technical talent.
And note well: if you can’t find a senior manager you’d like to imitate, that’s a sign
that being an executive will make you unhappy.
@Craig, @Jérôme – i think it’s not so much about learning all there is to know
about all the fields. Instead, you’re learning a collection of skills from the fields with
the primary purpose of visualization (or computational information design). So for
example, you might learn graphic design, but you’re not going to learn logo design;
you’re going to learn how to display data.
From a management standpoint, you’ll know what everyone is talking about and the
work involved which makes it easier to delegate and to keep things moving.
Pingback: Analytics Team » Blog Archive » The role of the data scientist
I would add functional expertise to the list of skills. You need to understand the domain
you are analyzing if you really want to understand your data.
Subhankar Ray,
I am not sure what you are getting at, but these are quite difficult questions.
While such theorems can be assisted with computers (see the four color theorem
and Coq proof assistant), theorems are derived through mathematical induction,
construction, negation, etc., not though empirical data. Quasi-empirical data,
however is used, meaning the results of enumeration (such as the ever-growing
list of prime numbers) or random evaluation of a complex mathematical object
(such as Monte Carlo methods on probability density functions), allow us
approximations or enumerations for further mathematical consideration
It’s interesting that the graphic you represented in this post has similarity to the job of
librarians…
We aquire books (or information), filter these books, or the information, to the right
user or client; or mine the stacks or systems in pursuit of these information; the files
are represented via catalog (cards or systems); and the reference service refines and
interact with our users.
Do you think that to be a Data Scientist is to have a librarian expertise too? And vice-
versa?
having worked with information scientists, there are definitely several parallels
in what we do.
Well, given that it was a sexy maths lady who first pointed me to this article in the first
place… ^^
Pingback: The Rise of the Data Scientist [From Flowing Data] | Computational Legal
Studies
The post here makes data visualization seem so complex (and I’m sure it is..) that I’m
almost afraid to call simple graphs/maps ‘data visualization’.
Or, is this blog (that one of your comment-ers suggested on your 37 visualizations
post) also visualization?
http://thisisindexed.com/
I’m into visualization because reading lots of little words hurts my eyes, because I
believe there’s a more efficient way to convey information, and it really is awesome
looking sometimes. But, does everything count?
Jessica — June 30, 2009 at 7:14 pm
To summarize… Robert Kosara gives a lot of definitions that I think would rule out the
links in my previous comment as “data visualizations”.
And then…the reason data visualization isn’t popular might also be because it’s so
exclusive/scientific/precise that people don’t want to touch/enjoy/appreciate it! What
do you think?
Here is a definition of visualization from (a blog that was also suggested by a reader in
your 37 visualizations post) http://eagereyes.org
http://eagereyes.org/theory/Definition-of-Visualization.html
The definitions (to me at least… casual graph-onlooker) seem pretty intense. And
going back to your post about why data visualization wasn’t popular… it might be
because data visualization is viewed as a science.. very precise/definite/intense that
people are afraid to enjoy/appreciate it at all.
Pingback: Ivan Frantar (ivanico) 's status on Wednesday, 08-Jul-09 08:14:31 UTC -
Identi.ca
Pingback: Flow » Blog Archive » Daily Digest for July 9th - The zeitgeist daily
Pingback: Cientista de Dados > Geek de Dados > Designer « Visualizing Economics
Pingback: Coast to Coast Bio Podcast » Arquivo do blog » Episódio 23: Então, por
que você estava falando sobre iPhones?
Para os passageiros, quanto mais longe Há muitas variáveis a serem Isso é o que você obtém quando soma A nuclear ainda é a mais comum,
você mora do local de trabalho, mais consideradas, mas para pessoas de todos os dias que o adulto americano há milhões de lares nos Estados
cedo precisa sair de casa para chegar renda média, aqui vai uma sugestão, médio passa dormindo, comendo, se Unidos com uma estrutura famili
ao trabalho no horário. Quanto esse baseada em quando você começa a deslocando e fazendo outras diferente.
horário de início muda quanto mais poupar e quando quer se aposentar. atividades.
longe você chega?
Sobre
Contato
Twitter
Boletim de Notícias
RSS