Data Mining - Steps and Functionalities

Uploaded by Raj Endran

DATA MINING

Data Mining:
Intelligent methods are applied to extract useful information or patterns

Data Mining: A KDD Process:
Data mining is the core of the knowledge discovery process
Steps of a KDD Process:

Data Cleaning
Handles noisy, inconsistent, and incomplete data
Missing values
Noisy data: binning, clustering, etc.
Inconsistencies: tools, functional dependencies
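
The cleaning techniques listed above can be sketched in a few lines. This is a minimal illustration, assuming missing values are marked as None and filled with the attribute mean, and that noise is smoothed by equal-frequency bin means; the function names and the price list are hypothetical.

```python
from statistics import mean


def fill_missing_with_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def smooth_by_bin_means(values, bin_size):
    """Sort the values, split them into bins of bin_size,
    and replace every value by the mean of its bin."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        smoothed.extend([mean(bin_values)] * len(bin_values))
    return smoothed


# Hypothetical sorted prices, three values per bin.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
print(fill_missing_with_mean([10, None, 30]))
print(smooth_by_bin_means(prices, 3))
```

Other fill strategies (a global constant, the most probable value) and other smoothing rules (bin medians, bin boundaries) would fit the same structure.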

Data Integration
Schema Integration

Entity Identification problem

Redundancy
Correlation Analysis
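
One way to detect a redundant attribute during integration is to measure how strongly it correlates with another. The sketch below assumes numeric attributes and uses the Pearson correlation coefficient; the function name and the sample columns are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical attributes: the second is just the first in different units.
price_usd = [1, 2, 3, 4]
price_doubled = [2, 4, 6, 8]
print(pearson(price_usd, price_doubled))
```

A coefficient close to +1 or -1 suggests one attribute can be derived from the other, so one of them may be dropped.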

Data Selection
Select only the task relevant data

Data Transformation
Transform or consolidate data
Smoothing, normalization, feature construction
Data reduction, compression
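
Normalization is the easiest of these transformations to show concretely. The sketch below assumes two common variants, min-max scaling and z-score scaling; the slides do not say which one is intended, and the function names are hypothetical.

```python
from statistics import mean, pstdev


def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Linearly rescale values into the range [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so map them all to new_min.
        return [new_min for _ in values]
    return [new_min + (v - lo) / (hi - lo) * (new_max - new_min) for v in values]


def z_score_normalize(values):
    """Rescale values to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


print(min_max_normalize([10, 20, 30]))
print(z_score_normalize([2, 4, 4, 4, 5, 5, 7, 9]))
```

Min-max scaling preserves the shape of the distribution but is sensitive to extreme values; z-score scaling is usually preferred when the minimum and maximum are unknown or unreliable.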

Pattern Evaluation
Interestingness Measures

Knowledge Presentation
Visualization

Data Mining Functionalities:

Descriptive
Characterizes general properties of the data

Predictive
Performs inference on the data to make predictions

Mining
Multiple kinds of patterns can be mined in parallel, at various granularities

Concept/class description
Association Analysis
Classification and Prediction
Cluster Analysis
Outlier Analysis
Evolution Analysis

Concept/Class Description:

Data can be associated with classes or concepts
Examples: Computers, Printers; BigSpenders vs. BudgetSpenders

Class/concept descriptions
Classes and concepts can be summarized in concise and precise terms

Data Characterization
Data Discrimination

Data Characterization:

Summarization of the general characteristics of a target class
Data is collected and aggregated
OLAP roll-up operation
Attribute-oriented induction
Results: charts, cubes, rules
Example: characteristics of customers
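
A roll-up can be illustrated as aggregating a measure at a coarser level of a hierarchy. The sketch below assumes records stored as dictionaries and rolls sales up from city to country; the field names and figures are hypothetical.

```python
from collections import defaultdict


def roll_up(rows, group_key, measure):
    """Sum `measure` over all rows that share the same value of `group_key`."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[group_key]] += row[measure]
    return dict(totals)


# Hypothetical sales records at city level.
sales = [
    {"city": "Chennai", "country": "India", "amount": 100.0},
    {"city": "Mumbai", "country": "India", "amount": 200.0},
    {"city": "Recife", "country": "Brazil", "amount": 50.0},
]

# Rolling up from city to country summarizes the data at a coarser granularity.
print(roll_up(sales, "country", "amount"))
```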

Data Discrimination:

Compares a target class with one or more contrasting classes
Classes may be user-specified
Examples:
Products whose sales increased vs. decreased
Regular shoppers vs. occasional shoppers
Output includes comparative measures
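
A comparative measure can be as simple as the same statistic computed for each class, plus their ratio. The sketch below assumes the mean as that statistic; the function name, field names, and shopper records are hypothetical.

```python
from statistics import mean


def compare_classes(rows, class_key, measure, target, contrast):
    """Compare the mean of `measure` for a target class and a contrasting class."""
    target_values = [r[measure] for r in rows if r[class_key] == target]
    contrast_values = [r[measure] for r in rows if r[class_key] == contrast]
    target_mean = mean(target_values)
    contrast_mean = mean(contrast_values)
    return {
        "target_mean": target_mean,
        "contrast_mean": contrast_mean,
        "ratio": target_mean / contrast_mean,
    }


# Hypothetical shopper records.
shoppers = [
    {"type": "regular", "visits_per_month": 8},
    {"type": "regular", "visits_per_month": 12},
    {"type": "occasional", "visits_per_month": 1},
    {"type": "occasional", "visits_per_month": 3},
]
print(compare_classes(shoppers, "type", "visits_per_month", "regular", "occasional"))
```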

Association Analysis:

Discovery of association rules

Form: X ⇒ Y
Multi-dimensional: Age(X, 20-29) ⇒ buys(X, Laptop)
Single-dimensional: the rule involves a single predicate (e.g., buys)
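
Association rules are usually ranked by support and confidence, two standard interestingness measures that these slides do not spell out. The sketch below computes both for a single-dimensional rule over a hypothetical list of transactions; the function names are illustrative.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent => consequent."""
    both = support(transactions, set(antecedent) | set(consequent))
    return both / support(transactions, antecedent)


# Hypothetical market-basket data.
baskets = [
    {"laptop", "mouse"},
    {"laptop", "mouse", "bag"},
    {"laptop"},
    {"mouse"},
]

# Rule: buys(X, laptop) => buys(X, mouse)
print(support(baskets, {"laptop", "mouse"}))
print(confidence(baskets, {"laptop"}, {"mouse"}))
```

A rule is typically reported only if both values exceed user-chosen minimum thresholds.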

Classification and Prediction:

Classification
Finds models that describe and differentiate classes or concepts
Predicts the class of objects whose label is unknown
Models are derived from training data
Model forms: rules, decision trees, neural networks, formulae
Preceded by relevance analysis (to eliminate irrelevant attributes)

Prediction
Derived model is used for prediction
Data value prediction
Class label prediction (classification)
Trend identification
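
To make the train-then-predict cycle concrete, here is a minimal sketch of a one-level decision tree (a decision stump) on a single numeric attribute. It is only one simple instance of the model forms listed above, not a method the slides prescribe; the function names and the spending data are hypothetical.

```python
from collections import Counter


def train_stump(values, labels):
    """Pick the threshold on one numeric attribute that misclassifies
    the fewest training samples. Returns (threshold, left_label, right_label)."""
    best = None
    for threshold in sorted(set(values)):
        left = [l for v, l in zip(values, labels) if v <= threshold]
        right = [l for v, l in zip(values, labels) if v > threshold]
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = sum(l != left_label for l in left) + sum(l != right_label for l in right)
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    _, threshold, left_label, right_label = best
    return threshold, left_label, right_label


def predict(model, value):
    """Assign a class label to a new value using the trained stump."""
    threshold, left_label, right_label = model
    return left_label if value <= threshold else right_label


# Hypothetical training data: monthly spend and a known class label.
spend = [120, 150, 200, 900, 1100, 1500]
labels = ["BudgetSpender"] * 3 + ["BigSpender"] * 3

model = train_stump(spend, labels)
print(model)
print(predict(model, 180), predict(model, 1000))
```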

Cluster Analysis:

Unsupervised
Class labels are missing in the training set
Maximize intra-class similarity
Minimize inter-class similarity
Can produce a hierarchy of classes
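
As an illustration of grouping unlabeled data, the sketch below runs k-means on one-dimensional points. k-means is one partitioning method chosen here for brevity; the slides do not name a specific algorithm, and the function name, starting centroids, and data are hypothetical.

```python
def k_means_1d(points, centroids, iterations=10):
    """Cluster 1-D points by alternating assignment and centroid update."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical unlabeled data with two obvious groups.
points = [1, 2, 3, 10, 11, 12]
centroids, clusters = k_means_1d(points, centroids=[1, 12])
print(centroids)
print(clusters)
```

Points within a cluster end up close to their own centroid and far from the other one, which is the similarity objective stated above.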

Outlier Analysis

Objects that do not comply with the general behavior of the data

Noise vs. rare events
Fraud detection
Statistical tests
Deviation-based methods
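
A simple statistical test flags values that lie far from the mean in units of standard deviation. The sketch below assumes a z-score rule with a threshold of 2; the threshold, function name, and transaction amounts are hypothetical choices, not values from the slides.

```python
from statistics import mean, pstdev


def z_score_outliers(values, threshold=2.0):
    """Return the values whose distance from the mean exceeds
    `threshold` population standard deviations."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return []  # All values are identical, so nothing deviates.
    return [v for v in values if abs(v - mu) / sigma > threshold]


# Hypothetical transaction amounts with one suspicious entry.
amounts = [10, 12, 11, 13, 12, 11, 95]
print(z_score_outliers(amounts))
```

Whether a flagged value is noise to discard or a rare event worth investigating (such as fraud) still has to be decided from domain knowledge.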

Evolution Analysis:

Trend detection
Time series data
Involves other functionalities
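
Trend detection on a time series can be sketched with a moving average that smooths short-term fluctuation. This is a minimal illustration under the assumption that comparing the first and last smoothed values is enough to label the trend; the function names, window size, and series are hypothetical.

```python
def moving_average(series, window):
    """Average each run of `window` consecutive values."""
    return [
        sum(series[i:i + window]) / window
        for i in range(len(series) - window + 1)
    ]


def trend_direction(series, window=3):
    """Label the overall trend by comparing the ends of the smoothed series."""
    smoothed = moving_average(series, window)
    if smoothed[-1] > smoothed[0]:
        return "upward"
    if smoothed[-1] < smoothed[0]:
        return "downward"
    return "flat"


# Hypothetical monthly sales figures.
monthly_sales = [1, 2, 3, 4, 5]
print(moving_average(monthly_sales, 3))
print(trend_direction(monthly_sales))
```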
