Bem-vindo(a) ao Scribd!

CCC C C: A Kolmogorov - Smirnov Correlation-Based Filter For Microarray Data

Enviado por

0% acharam este documento útil (0 voto)

17 visualizações2 páginas

Filter algorithm using F -measure has been used with feature redundancy removal based on the Kolmogorov -Smirnov (KS) test. Computationally efficient K -S correlation-based selection algorithm has been developed and tested on three high -dimensional microarray datasets. Results are quite encouraging and several improvements are suggested.

Descrição original:

Título original

asm

Direitos autorais

Formatos disponíveis

DOCX, PDF, TXT ou leia online no Scribd

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Denunciar este documento

Direitos autorais:

Attribution Non-Commercial (BY-NC)

Formatos disponíveis

Baixe no formato DOCX, PDF, TXT ou leia online no Scribd

Sinalizar o conteúdo como inadequado

0% acharam este documento útil (0 voto)

17 visualizações2 páginas

CCC C C: A Kolmogorov - Smirnov Correlation-Based Filter For Microarray Data

Enviado por

Nisarg Shah

Direitos autorais:

Attribution Non-Commercial (BY-NC)

Formatos disponíveis

Baixe no formato DOCX, PDF, TXT ou leia online no Scribd

Sinalizar o conteúdo como inadequado

Pular para a página

Você está na página 1de 2

Pesquisar no documento

c

A Kolmogorov -Smirnov Correlation-Based Filter

for Microarray Data c
c c
c c
c
c
À

c

c
A Kolmogorov-Smirnov Correlation-Based Filter
for Microarray Data

Nisarg Shah
522
MBA (Tech)
Telecom

ABSTRACT.

A filter algorithm using F -measure has been used with feature redundancy removal based on the Kolmogorov -Smirnov (KS)
test for rough equality of statistical distributions. As a result computationally efficient K -S Correlation- Based Selection
algorithm has been developed and tested on three high -dimensional microarray datasets using four types of classifiers.
Results are quite encouraging and several improvements are suggested.

1 Introduction
Feature ranking and feature selection algorithms applicable to large data mining problems with very high
number of features that are potentially irrelevant for a given task are usually of the filter type [1]. Filter
algorithms remove features that have no chance to be useful in further data analysis, independently of particular
predictive system (predictor) that may be used on this data. In the simplest case feature filter is a function
returning a relevance index £(S|D ) that estimates, given the data D, how relevant a given feature subset S is
for the task (usually classification, association or approximation of data). Since the data and the task are
usually fixed and only the subsets S vary, the relevance index will be written as £(S).

This index may result from a simple calculation of a correlation coefficient or entropy -based index, or it may be
computed using more involved algorithmic procedures (for example, requiring creation of partial decision tree,
or finding nearest neighbours of some vectors).

For large problems simpler indices have an obvious advantage of being easier to calculate, requiring an effort on
the order of è(à), while more sophisticated procedures based on distances may require è(à2) operations.

Relevance indices may be computed for individual features ÿ 1 , providing indices that establish a
ranking order £(ÿ1 ) _ £(ÿ2 ) · · · _ £(ÿ ). Those features which have the lowest ranks are subsequently filtered
out.

For independent features this may be sufficient, but if features are correlated many of them may be redundant.
Ranking does not guarantee that a small subset of important features will be found. In pathological situations a
single best feature may not even be a member of the

Você também pode gostar

Solution Chapter8
Documento8 páginas
Solution Chapter8
Bunga Safhira Wirata
50% (2)
Probability & Stochastic Processes (EL-GY 6303) Hw11 Solution
Documento8 páginas
Probability & Stochastic Processes (EL-GY 6303) Hw11 Solution
alamali88
Ainda não há avaliações
Detect Key Genes in Classification of Microarray Data
Documento13 páginas
Detect Key Genes in Classification of Microarray Data
Leo trinh
Ainda não há avaliações
Performance Еvaluation of Тracking Аlgorithm Incorporating Attribute Data Processing via DSmT
Documento5 páginas
Performance Еvaluation of Тracking Аlgorithm Incorporating Attribute Data Processing via DSmT
Mia Amalia
Ainda não há avaliações
Reliability Assessment of Redundant Electrical Power
Documento7 páginas
Reliability Assessment of Redundant Electrical Power
MArcelo
Ainda não há avaliações
Pec-Cs 701e
Documento4 páginas
Pec-Cs 701e
suvamsarma67
Ainda não há avaliações
KNN
Documento11 páginas
KNN
woshidapangmao
Ainda não há avaliações
Performance Еvaluation of Тracking Аlgorithm Incorporating Attribute Data Processing via DSmT
Documento5 páginas
Performance Еvaluation of Тracking Аlgorithm Incorporating Attribute Data Processing via DSmT
Mia Amalia
Ainda não há avaliações
New Approach To Similarity Detection by Combining Technique Three-Patch Local Binary Patterns (TP-LBP) With Support Vector Machine
Documento10 páginas
New Approach To Similarity Detection by Combining Technique Three-Patch Local Binary Patterns (TP-LBP) With Support Vector Machine
IAES IJAI
Ainda não há avaliações
Title: Status: Purpose: Author(s) or Contact(s)
Documento4 páginas
Title: Status: Purpose: Author(s) or Contact(s)
Shafayet Uddin
Ainda não há avaliações
Garkani Nejad2011
Documento10 páginas
Garkani Nejad2011
nabil
Ainda não há avaliações
Scanning The Issue: 4922 Ieee Transactions On Intelligent Transportation Systems, Vol. 21, No. 12, December 2020
Documento6 páginas
Scanning The Issue: 4922 Ieee Transactions On Intelligent Transportation Systems, Vol. 21, No. 12, December 2020
pippo50@
Ainda não há avaliações
Parameter Identification of Induction Motors by Using Genetic Algorithms
Documento8 páginas
Parameter Identification of Induction Motors by Using Genetic Algorithms
Maroof Ahmed
Ainda não há avaliações
Classification Algorithms in Achieving Partitioning Optimization For VLSI Applications
Documento3 páginas
Classification Algorithms in Achieving Partitioning Optimization For VLSI Applications
Sudheer Reddy
Ainda não há avaliações
Automated 3D Scenes Reconstruction For Mobile Robots Using Laser Scanning
Documento6 páginas
Automated 3D Scenes Reconstruction For Mobile Robots Using Laser Scanning
Ian Medeiros
Ainda não há avaliações
Bayesian Networks-Based Approach For Power Systems Fault Diagnosis
Documento12 páginas
Bayesian Networks-Based Approach For Power Systems Fault Diagnosis
ananthsiva
Ainda não há avaliações
Bozeman 2001
Documento22 páginas
Bozeman 2001
turtlelatte2000
Ainda não há avaliações
Markov Chain Monte Carlo Sampling Using A Reservoir Method
Documento11 páginas
Markov Chain Monte Carlo Sampling Using A Reservoir Method
Kevin Rios
Ainda não há avaliações
Linear Congruential Generator For Pseudo-Random Number Generation With R
Documento5 páginas
Linear Congruential Generator For Pseudo-Random Number Generation With R
Marcos Marcos
Ainda não há avaliações
Radar Target Identification Using Spatial Matched Filters
Documento7 páginas
Radar Target Identification Using Spatial Matched Filters
danny escurra
Ainda não há avaliações
Jels Lda
Documento11 páginas
Jels Lda
mehdimatinfar
Ainda não há avaliações
Tencon NTL Detection
Documento6 páginas
Tencon NTL Detection
Ajit Kumar
Ainda não há avaliações
EE602 Assignment
Documento10 páginas
EE602 Assignment
Rajat Soni
Ainda não há avaliações
Dong He, Zhenrong Zhang, Feng Yang: Design of INS Data Acquisition Device Based On STM32
Documento4 páginas
Dong He, Zhenrong Zhang, Feng Yang: Design of INS Data Acquisition Device Based On STM32
kenny Chou
Ainda não há avaliações
Journal of Applied Research and Technology
Documento4 páginas
Journal of Applied Research and Technology
PK
Ainda não há avaliações
Mobile Robot SLAM Methods Improved For Adapting To Search and Rescue Environments
Documento6 páginas
Mobile Robot SLAM Methods Improved For Adapting To Search and Rescue Environments
example
Ainda não há avaliações
Networking Anomaly Detection Using DSNS and Particle Swarm Optimization With Re-Clustering
Documento6 páginas
Networking Anomaly Detection Using DSNS and Particle Swarm Optimization With Re-Clustering
zaig
Ainda não há avaliações
A SOM Based Approach For Visualization of GSM Network Performance Data
Documento10 páginas
A SOM Based Approach For Visualization of GSM Network Performance Data
vatsalshah24
Ainda não há avaliações
Pesgm2022 000071
Documento5 páginas
Pesgm2022 000071
Vedran Peric
Ainda não há avaliações
Data-Driven Learning-Based Classification Model For Mitigating False Data Injection Attacks On Dynamic Line Rating Systems
Documento16 páginas
Data-Driven Learning-Based Classification Model For Mitigating False Data Injection Attacks On Dynamic Line Rating Systems
賴慶明
Ainda não há avaliações
Machine Learning and AI
Documento10 páginas
Machine Learning and AI
smedixon
Ainda não há avaliações
Improved NN-JPDAF For Joint Multiple Target Tracking and Feature Extraction
Documento26 páginas
Improved NN-JPDAF For Joint Multiple Target Tracking and Feature Extraction
VAHID KESHAVARZI
Ainda não há avaliações
Chapter 6
Documento18 páginas
Chapter 6
MD HOSSAIN
Ainda não há avaliações
Ieee 2010 Titles: Data Alcott Systems (0) 9600095047
Documento6 páginas
Ieee 2010 Titles: Data Alcott Systems (0) 9600095047
Sudharsan_Kuma_8469
Ainda não há avaliações
Non-Divergent Estimation Algorithm in The Presence of Unknown Correlations
Documento5 páginas
Non-Divergent Estimation Algorithm in The Presence of Unknown Correlations
Thiago Rocha
Ainda não há avaliações
Analysis and Comparison of Machine Learning Approaches For Transmission Line Fault Prediction in Power Systems
Documento8 páginas
Analysis and Comparison of Machine Learning Approaches For Transmission Line Fault Prediction in Power Systems
ዛላው መና
Ainda não há avaliações
Energy Theft and Defective Meters Detection
Documento6 páginas
Energy Theft and Defective Meters Detection
Andrea Velásquez Encalada
Ainda não há avaliações
Formato Proyecto I+D
Documento3 páginas
Formato Proyecto I+D
Vanessa Fontalvo reniz
Ainda não há avaliações
Exploration of Co-Relation Between Depression and Anaemia in Pregnant Women Using Knowledge Discovery and Data Mining Algorithms and Tools
Documento9 páginas
Exploration of Co-Relation Between Depression and Anaemia in Pregnant Women Using Knowledge Discovery and Data Mining Algorithms and Tools
CyberJournals Multidisciplinary
Ainda não há avaliações
Primera Expo Traduccion
Documento3 páginas
Primera Expo Traduccion
erock94
Ainda não há avaliações
1204-Article Text-154-1-10-20210111
Documento8 páginas
1204-Article Text-154-1-10-20210111
Wildan Attariq
Ainda não há avaliações
System Identification Using Intelligent Algorithms
Documento13 páginas
System Identification Using Intelligent Algorithms
Anonymous gJHAwzE
Ainda não há avaliações
Data Mining Full
Documento19 páginas
Data Mining Full
Tejaswini Reddy
Ainda não há avaliações
Algorithms 17 00111
Documento12 páginas
Algorithms 17 00111
jamel-shams
Ainda não há avaliações
Automatic Bearing Fault Classification Combining S
Documento6 páginas
Automatic Bearing Fault Classification Combining S
Harshad Pawar Patil
Ainda não há avaliações
Sensor Perf
Documento12 páginas
Sensor Perf
enaam1977
Ainda não há avaliações
Investigation of The Problem of Classifying Unbalanced Datasets in Identifying Distributed Denial of Service Attacks
Documento8 páginas
Investigation of The Problem of Classifying Unbalanced Datasets in Identifying Distributed Denial of Service Attacks
ali ali
Ainda não há avaliações
Feature Selection Based On Mutual Information: Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy
Documento13 páginas
Feature Selection Based On Mutual Information: Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy
toufik1986
Ainda não há avaliações
Half-Broken Rotor Bar Detection On IM by Using Sparse Representation Under Different Load Conditions
Documento5 páginas
Half-Broken Rotor Bar Detection On IM by Using Sparse Representation Under Different Load Conditions
kais alvi
Ainda não há avaliações
Analytical Calculations For TRL Calibration - 2019-03-07 - Microwave Journal
Documento9 páginas
Analytical Calculations For TRL Calibration - 2019-03-07 - Microwave Journal
Arun Kumar
Ainda não há avaliações
2005 COST ReviewSchemesFingPrintQualityComputation
Documento6 páginas
2005 COST ReviewSchemesFingPrintQualityComputation
lmtp80
Ainda não há avaliações
Tesis Reliabilitas 1
Documento8 páginas
Tesis Reliabilitas 1
Mochammad Iqbal Velayati Fajrin
Ainda não há avaliações
Spectral Efficiency of Uplink SCMA System With CSI Estimation
Documento7 páginas
Spectral Efficiency of Uplink SCMA System With CSI Estimation
Moayad Mkhlef
Ainda não há avaliações
New Approach For Selecting Multi-Point Relays in The Optimized Link State Routing Protocol Using Self-Organizing Map Artificial Neural Network: OLSR-SOM
Documento8 páginas
New Approach For Selecting Multi-Point Relays in The Optimized Link State Routing Protocol Using Self-Organizing Map Artificial Neural Network: OLSR-SOM
IAES IJAI
Ainda não há avaliações
My Datasetttt
Documento4 páginas
My Datasetttt
Saranya Dhilipkumar
Ainda não há avaliações
Aghamohammadixxx PDF
Documento6 páginas
Aghamohammadixxx PDF
henrydcl
Ainda não há avaliações
2009 Parameter Estimationofa DCMotor
Documento6 páginas
2009 Parameter Estimationofa DCMotor
mohamed
Ainda não há avaliações
Selection Method of Steel Grade With Required Hardenability
Documento4 páginas
Selection Method of Steel Grade With Required Hardenability
hawwwww
Ainda não há avaliações
Applied Energy
Documento1 página
Applied Energy
api-363114237
Ainda não há avaliações
RJournal 2010-2 Sariyar+Borg
Documento7 páginas
RJournal 2010-2 Sariyar+Borg
Asmita Mekalu
Ainda não há avaliações
Ccqre PDF
Documento33 páginas
Ccqre PDF
esr
Ainda não há avaliações
Intelligent Technologies for Automated Electronic Systems
No Everand
Intelligent Technologies for Automated Electronic Systems
S. Kannadhasan
Ainda não há avaliações
BMDE Assignment - Prashant's Group
Documento8 páginas
BMDE Assignment - Prashant's Group
Nisarg Shah
Ainda não há avaliações
Rakesh BC2
Documento11 páginas
Rakesh BC2
Nisarg Shah
Ainda não há avaliações
BPMN
Documento27 páginas
BPMN
lchernandez
100% (2)
MCD
Documento25 páginas
MCD
Nisarg Shah
Ainda não há avaliações
Cognitive Dissonance
Documento4 páginas
Cognitive Dissonance
Harsh Rautela
Ainda não há avaliações
Negotiation
Documento5 páginas
Negotiation
Nisarg Shah
Ainda não há avaliações
Chap 13
Documento59 páginas
Chap 13
api-3763138
Ainda não há avaliações
Sampling Distribution (Assignment 2)
Documento2 páginas
Sampling Distribution (Assignment 2)
Feroz khan
0% (1)
Problem Set Solution QT I I 17 Dec
Documento22 páginas
Problem Set Solution QT I I 17 Dec
Pradnya Nikam bj21158
Ainda não há avaliações
Chapter 3
Documento45 páginas
Chapter 3
Fawaz
Ainda não há avaliações
DIRECT": Modified T Vacation Policy For An M/G/1 Queueing System With An Unreliable Server and Startup
Documento11 páginas
DIRECT": Modified T Vacation Policy For An M/G/1 Queueing System With An Unreliable Server and Startup
Christopher Rose
Ainda não há avaliações
Some Statistical Methods in Anachem
Documento39 páginas
Some Statistical Methods in Anachem
shaine
Ainda não há avaliações
One-Way ANOVA: (Independent Group and Repeated Measures)
Documento36 páginas
One-Way ANOVA: (Independent Group and Repeated Measures)
al7abeeb
Ainda não há avaliações
Business Statistics - Session 9
Documento60 páginas
Business Statistics - Session 9
caprolactamcl4571
Ainda não há avaliações
Worked Example Bayes Minimum
Documento3 páginas
Worked Example Bayes Minimum
Nguyễn Hoàng Phương Trần
Ainda não há avaliações
Brent 1974
Documento3 páginas
Brent 1974
mbmbmbmbmb222
Ainda não há avaliações
G023: Econometrics: J Er Ome Adda
Documento154 páginas
G023: Econometrics: J Er Ome Adda
Lilia Xa
Ainda não há avaliações
Model Paper of Business Statics MBA
Documento5 páginas
Model Paper of Business Statics MBA
Piyush Kumar
Ainda não há avaliações
Twenty Statistical Errors Even YOU Can Find in Biomedical Research Articles - Tom Lang CMJ 2004
Documento10 páginas
Twenty Statistical Errors Even YOU Can Find in Biomedical Research Articles - Tom Lang CMJ 2004
isojukou
Ainda não há avaliações
Domenella Lezma Replication
Documento26 páginas
Domenella Lezma Replication
Gonzalo Lezma Florida
Ainda não há avaliações
Analytical Data. Final
Documento48 páginas
Analytical Data. Final
farjana
Ainda não há avaliações
Forecasting For Economics and Business 1st Edition Gloria Gonzalez Rivera Solutions Manual
Documento36 páginas
Forecasting For Economics and Business 1st Edition Gloria Gonzalez Rivera Solutions Manual
brownismkanacka9oir
100% (27)
Gianola-Fernando-Bayesian Methods-1986-J Anim Sci
Documento28 páginas
Gianola-Fernando-Bayesian Methods-1986-J Anim Sci
jose raul perez
Ainda não há avaliações
Lesson 12 - Parameter and Statistic
Documento16 páginas
Lesson 12 - Parameter and Statistic
jennifer
Ainda não há avaliações
Hasil 3 Frint
Documento2 páginas
Hasil 3 Frint
Afenk
Ainda não há avaliações
Dana Analytics Unitwise Questions
Documento2 páginas
Dana Analytics Unitwise Questions
bhavya.shivani1473
Ainda não há avaliações
Analysis Skewness in GARCH Model
Documento18 páginas
Analysis Skewness in GARCH Model
ALVARO MIGUEL JESUS RODRIGUEZ SAENZ
Ainda não há avaliações
The Normal Distribution
Documento9 páginas
The Normal Distribution
Elfren Bulong
Ainda não há avaliações
06 03 Linear Regression
Documento13 páginas
06 03 Linear Regression
John Bofarull Guix
Ainda não há avaliações
EE 325: Probability and Random Processes
Documento4 páginas
EE 325: Probability and Random Processes
gobinath balamurugan
Ainda não há avaliações
Probability All in Details
Documento129 páginas
Probability All in Details
yousef shaban
Ainda não há avaliações
Areas of Normal Curve
Documento24 páginas
Areas of Normal Curve
Joyzmarv Aquino
100% (1)
CertBus ASQ CSSBB Study Materials Braindumps With Real Exam PDF
Documento18 páginas
CertBus ASQ CSSBB Study Materials Braindumps With Real Exam PDF
afreen timberwalla
100% (1)
Normal Distribution and Z-Score
Documento22 páginas
Normal Distribution and Z-Score
Jafar Nory
Ainda não há avaliações