Reg. No. :
55279
Seventh Semester Computer Science and Engineering
CS 2032 DATA WAREHOUSING AND DATA MINING
(Common to Sixth Semester Information Technology)
(Regulation 2008)
Time : Three hours                                   Maximum : 100 marks
Answer ALL questions.
List the three important issues that have to be addressed during data integration.
3. 4. 5. 6. 7. 8. 9. 10.
www.Vidyarthiplus.com
11. (a) What is a data warehouse? With the help of a neat sketch, explain the various components in a data warehousing system. (16)
Or
(b) What is a multiprocessor architecture? List and discuss the steps involved in mapping a data warehouse to a multiprocessor architecture. (16)
12. (a) (i) Distinguish between Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP). (4)
(ii) What is business analysis? List and discuss the basic features that are provided by reporting and query tools used for business analysis. (12)
Or
(b) Giving suitable examples, describe the various multi-dimensional schema. (16)
13. (a) (i) List and discuss the classification of data mining systems. (8)
(ii) List and discuss the steps for integrating a data mining system with a data warehouse. (8)
Or
(b) (i) Describe the issues and challenges in the implementation of data mining systems.
(ii) What is the significance of interestingness measures in a data mining system? Give examples.
14. (a) (i) What is classification? With an example, explain how support vector machines can be used for classification. (10)
(ii) What are the prediction techniques supported by a data mining system? (6)
Or
(b) Apply the Apriori algorithm to the following data set. State and discuss each step in the Apriori algorithm. (16)
Trans ID    Items Purchased
101         Apple, Orange, Litchi, Grapes
102         Apple, Mango
103         Mango, Grapes, Apple
104         Apple, Orange, Litchi, Grapes
105         Pears, Litchi
            Apple, Orange, Strawberry, Litchi, Grapes
            Strawberry, Grapes
            Apple, Orange, Grapes
The set of items is {Apple, Orange, Strawberry, Litchi, Grapes, Pears, Mango}. Use 0.3 for the minimum support value.
15. (a) What is grid based clustering? With an example, explain an algorithm for grid based clustering. (16)
Or
(b) Consider five points {X1, X2, X3, X4, X5} with the following coordinates as a two dimensional sample for clustering :
X1 = (0.5, 2.5); X2 = (0, 0); X3 = (1.5, 1); X4 = (5, 1); X5 = (6, 2)
Illustrate the K-means partitioning algorithm using the above data set. (16)
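As an illustration of the K-means question on the five points X1 = (0.5, 2.5), X2 = (0, 0), X3 = (1.5, 1), X4 = (5, 1), X5 = (6, 2), the following is a minimal sketch and not a model answer. The question paper does not state the number of clusters or the starting centroids, so k = 2 and the initial centroids X1 and X4 are assumptions made here for illustration only. The function name `k_means` is likewise hypothetical.

```python
from math import dist

# Coordinates as given in the question.
POINTS = {
    "X1": (0.5, 2.5),
    "X2": (0.0, 0.0),
    "X3": (1.5, 1.0),
    "X4": (5.0, 1.0),
    "X5": (6.0, 2.0),
}


def k_means(points, initial_centroids, max_iter=100):
    """Alternate assignment and centroid update until assignments stop changing."""
    centroids = list(initial_centroids)  # copy so the caller's list is untouched
    assignment = None
    for _ in range(max_iter):
        # Assignment step: each point joins the cluster of its nearest centroid.
        new_assignment = {
            name: min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for name, p in points.items()
        }
        if new_assignment == assignment:
            break  # converged: no point changed cluster
        assignment = new_assignment
        # Update step: move each centroid to the mean of its member points.
        for j in range(len(centroids)):
            members = [points[n] for n, c in assignment.items() if c == j]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[j] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return assignment, centroids


# Assumed starting point: k = 2, initial centroids X1 and X4 (not given in the paper).
assignment, centroids = k_means(POINTS, [POINTS["X1"], POINTS["X4"]])
print(assignment)
print(centroids)
```

Under these assumed starting centroids the assignments stop changing after the first update, grouping X1, X2, X3 in one cluster and X4, X5 in the other. A different choice of k or of initial centroids may give a different partition, which is worth stating in a written answer.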