Quality Stage Rules

Copyright: Attribution Non-Commercial (BY-NC)

Uploaded by abhivarala

Investigation Stage: The Investigation stage is used to perform data investigation. When data passes to this stage, free-form data is divided into individual tokens, which are then analyzed. The Investigation stage supports two types of investigation:

1) Character Investigation
2) Word Investigation

1) Character Investigation: Character investigation examines the individual characters of a value. When we perform character investigation on the data, the stage checks each and every character in the string. There are two types of character investigation:

a) Character Discrete investigation (single column)
b) Character Concatenate investigation (multiple columns)

2) Word Investigation: Word investigation examines entire words rather than single characters. It is normally performed on multi-domain fields (such as an address).
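As a rough illustration of the tokenizing step described above, the following Python sketch splits a free-form address into tokens and assigns each token a class symbol. The function names, the small street-type table, and the class symbols (`^` for a number, `T` for a street type, `+` for any other word) are assumptions made for this example and are not the stage's actual classification table.

```python
import re
from collections import Counter

# Hypothetical lookup of known street-type words (an assumption for this sketch).
STREET_TYPES = {"ST", "STREET", "RD", "ROAD", "AVE", "AVENUE"}


def word_pattern(value):
    """Split a free-form value into tokens and return (tokens, pattern)."""
    # Tokens are runs of letters or digits; punctuation and spaces separate them.
    tokens = re.findall(r"[A-Za-z0-9]+", value.upper())
    symbols = []
    for tok in tokens:
        if tok.isdigit():
            symbols.append("^")   # numeric token
        elif tok in STREET_TYPES:
            symbols.append("T")   # recognized street type
        else:
            symbols.append("+")   # any other word
    return tokens, "".join(symbols)


def pattern_frequencies(values):
    """Count how often each token pattern occurs across the values."""
    return Counter(word_pattern(v)[1] for v in values)


if __name__ == "__main__":
    sample = ["123 Clay Field Rd", "45 West End St", "West End 4000"]
    for pattern, count in pattern_frequencies(sample).items():
        print(pattern, count)
```

Grouping values by pattern rather than by raw text is what makes it practical to review a multi-domain field: a handful of patterns usually covers most of the rows.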

Objectives of the investigation process:

a) It identifies values that do not match the column metadata, where the column name says one thing and the data is another (for example, a driver license number).
b) It identifies values that overlap adjacent fields and therefore require a realignment of fields.

It also discovers additional tokens such as name prefixes, name suffixes, and street unit types, and it verifies the usefulness of the data.

Three masks are available for character investigation:

1) Mask C: inspects the actual value in the column. It displays the data character by character.
2) Mask T: inspects the type of data in a character position.
3) Mask X: excludes the character from the frequency count.
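The effect of the three masks can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, and the type symbols used for mask T (`a` for a letter, `n` for a digit, `b` for a blank, any other character shown as itself) are assumed for the example.

```python
from collections import Counter


def apply_mask(value, mask):
    """Apply a C/T/X mask to value, one mask letter per character position.

    Positions beyond the shorter of value and mask are ignored.
    """
    out = []
    for ch, m in zip(value, mask.upper()):
        if m == "C":
            out.append(ch)            # keep the character itself
        elif m == "T":
            if ch.isalpha():
                out.append("a")       # alphabetic position
            elif ch.isdigit():
                out.append("n")       # numeric position
            elif ch == " ":
                out.append("b")       # blank position
            else:
                out.append(ch)        # special characters shown as-is
        elif m == "X":
            continue                  # excluded from the result
        else:
            raise ValueError(f"unknown mask character: {m!r}")
    return "".join(out)


def mask_frequencies(values, mask):
    """Frequency count of the masked form of each value."""
    return Counter(apply_mask(v, mask) for v in values)


if __name__ == "__main__":
    ids = ["AB-1234", "CD-5678", "12-ABCD"]
    print(mask_frequencies(ids, "TTTTTTT"))
    print(apply_mask("AB-1234", "CCXTTTT"))
```

With an all-T mask, the first two sample values collapse into the same pattern while the third stands out, which is the kind of mismatch the investigation is meant to surface.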

The Investigation stage supports one input and multiple outputs.

Standardize Stage: The Standardize stage is one of the stages in QualityStage and is used to convert data into a standardized format. After the Investigation stage, the data moves to the Standardize stage. The Standardize stage applies three types of standardization rules:

a) Domain preprocessor rules
b) Domain specific rules
c) Validation rules

Domain Preprocessor Rules: These rule sets do not perform standardization. Instead, they parse the columns in each record and assign each token to the appropriate domain-specific column set, such as name, area, or address.

Ex: Rules -> USPREP -> OK

A typical input looks like this:

Field       Value
Name 1      John doe
Name 2      123 clay field, brisbane
Address 1   c/o smith james
Address 2   West end 4000
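To make the idea of domain preprocessing concrete, here is a minimal Python sketch that routes the sample fields above into name, address, and area groups. The heuristics and function names are invented for this illustration and are far cruder than a real rule set; in particular, the sketch moves whole fields, whereas the preprocessor rules described above work token by token.

```python
import re


def guess_domain(value):
    """Very rough, hypothetical heuristics for routing a field to a domain."""
    text = value.strip().lower()
    if text.startswith("c/o "):
        return "name"       # "care of" introduces a person
    if re.match(r"\d+\s", text):
        return "address"    # a leading house number suggests a street address
    if re.search(r"\b\d{4}$", text):
        return "area"       # a trailing 4-digit code suggests locality/postcode
    return "name"           # default when nothing else matches


def preprocess(record):
    """Group the values of a record by guessed domain, ignoring column names."""
    domains = {"name": [], "address": [], "area": []}
    for value in record.values():
        domains[guess_domain(value)].append(value)
    return domains


if __name__ == "__main__":
    record = {
        "Name 1": "John doe",
        "Name 2": "123 clay field, brisbane",
        "Address 1": "c/o smith james",
        "Address 2": "West end 4000",
    }
    for domain, values in preprocess(record).items():
        print(domain, values)
```

The point of the example is that the column names are not trusted: "Name 2" holds an address and "Address 1" holds a name, so the content decides the domain.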

Domain Specific Rules: These rule sets check, at the level of each individual domain, whether the data is valid or invalid. They are mostly applied to three domains: name, address, and area.

Validation Rules: The validation rules are used to standardize common business data, including date, name, email ID, phone number, social security number, and credit card numbers. The data is validated and any errors are reported.
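The validate-and-report idea can be sketched in Python as follows. The field names, the date format, the 10-digit phone length, and the simple email pattern are all assumptions chosen for the example; they do not reproduce the behavior of any particular validation rule set.

```python
import re
from datetime import datetime

# Deliberately simple email shape check (an assumption, not a full validator).
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate_date(value, fmt="%Y-%m-%d"):
    """True if value is a real calendar date in the assumed format."""
    try:
        datetime.strptime(value, fmt)
        return True
    except ValueError:
        return False


def validate_email(value):
    """True if value looks like local@domain.tld."""
    return EMAIL_RE.match(value) is not None


def validate_phone(value, digits=10):
    """True if value contains exactly the assumed number of digits."""
    return len(re.sub(r"\D", "", value)) == digits


VALIDATORS = {
    "date": validate_date,
    "email": validate_email,
    "phone": validate_phone,
}


def validate_record(record):
    """Return a list of (field, value) pairs that failed validation."""
    errors = []
    for field, check in VALIDATORS.items():
        if field in record and not check(record[field]):
            errors.append((field, record[field]))
    return errors


if __name__ == "__main__":
    row = {"date": "2024-02-30", "email": "a@example.com", "phone": "(555) 123-4567"}
    print(validate_record(row))
```

Returning the failures as data, rather than stopping at the first bad value, mirrors the "validate and report the error" behavior described above.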
