
1. What is a snowflake schema?

A:- It is a dimensional model composed of a central fact table and a set of dimensions that are normalized into sub-dimension tables. It is more normalized than a star schema, but it is slower to query and more complex to design.

2. What are the different types of dimension?

A:- Conformed, junk (also called garbage), slowly changing (SCD), fast changing, role-playing, and degenerate dimensions.

3. What is a junk dimension?

A:- A junk dimension is a grouping of low-cardinality flags and indicators, that is, a table of attributes that do not belong in the fact table or in any of the existing dimension tables.

4. What is a mini dimension? Where is it used?

A:- A mini dimension is a small table that holds the rapidly changing attributes split out of a large dimension. It is used for fast/rapidly changing dimensions, where one or more attributes change frequently over time, so that the main dimension table does not grow with every change.

5. What is a factless fact table and what is a coverage fact table?

A:- A factless fact table does not contain any measure or numeric column; it consists only of dimension keys. A coverage fact table is a factless fact table that records every combination that could have occurred (for example, every product on promotion), so that it can be compared with the actual facts to find what did not happen.

6. What is the difference between a star schema and a snowflake schema?

A:-
   1. Design: a star schema is easier to design than a snowflake schema.
   2. Performance: a star schema is faster, because a snowflake schema needs many more joins to fetch the same data.
   3. Normalization: a snowflake schema is normalized; a star schema is denormalized.
   4. Redundancy: a snowflake schema stores far less redundant data than a star schema.

7. What is a factless fact table? In which case did you use one in your project?

A:- A factless fact table does not contain any measure or numeric column; it consists only of dimension keys. In our project we used a factless fact table to track product promotion events, for example to determine which promoted products did not sell and which consumer IDs opted in for a product.

8. How did you implement SCD in your project? Which SCD type did you implement?

A:- We implemented SCD Type 2 in our project.

9. How many types of loading techniques are available?

A:- There are two types of loading technique: initial load (full load) and incremental load.

10. How do you handle nulls in sequential files?

A:- Not sure, but I think that if the Nullable option is set to No in the column metadata, the column will not accept nulls.

11. What is the difference between a sequential file, a data set, and a file set?

A:-

Property        Sequential file          Data set                          File set
Processing      Sequential by default    Parallel                          Parallel
Extension       .csv, .txt, .xls         .ds                               .fs
Files created   One file                 Two (descriptor and data file)    Two (descriptor and data file)
Format          Native                   Binary                            ASCII
Used in         Source and target        Intermediate stages               Intermediate stages
Data volume     Up to 2 GB               More than 2 GB                    More than 2 GB
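
The star versus snowflake comparison from questions 1 and 6 can be illustrated with a small sketch. It uses Python's built-in sqlite3 module. All table names, column names, and rows (dim_product_star, dim_product_snow, dim_category, fact_sales) are invented for this illustration and do not come from any real project. The same total per category needs one join in the star layout and two joins in the snowflake layout, while the snowflake layout stores each category name only once.

```python
import sqlite3

# In-memory database; every table and row here is hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Star schema: the category name is stored directly on the product dimension.
CREATE TABLE dim_product_star (
    product_key   INTEGER PRIMARY KEY,
    product_name  TEXT,
    category_name TEXT
);

-- Snowflake schema: the category is normalized into its own table.
CREATE TABLE dim_category (
    category_key  INTEGER PRIMARY KEY,
    category_name TEXT
);
CREATE TABLE dim_product_snow (
    product_key  INTEGER PRIMARY KEY,
    product_name TEXT,
    category_key INTEGER REFERENCES dim_category(category_key)
);

-- Central fact table shared by both layouts.
CREATE TABLE fact_sales (
    product_key INTEGER,
    amount      REAL
);

INSERT INTO dim_product_star VALUES
    (1, 'Tea', 'Beverage'), (2, 'Coffee', 'Beverage'), (3, 'Soap', 'Household');
INSERT INTO dim_category VALUES (10, 'Beverage'), (20, 'Household');
INSERT INTO dim_product_snow VALUES (1, 'Tea', 10), (2, 'Coffee', 10), (3, 'Soap', 20);
INSERT INTO fact_sales VALUES (1, 5.0), (2, 7.5), (3, 2.0), (1, 4.0);
""")

# Star: one join from the fact table to the denormalized dimension.
star_sql = """
SELECT p.category_name, SUM(f.amount)
FROM fact_sales f
JOIN dim_product_star p ON p.product_key = f.product_key
GROUP BY p.category_name
ORDER BY p.category_name
"""

# Snowflake: an extra join is needed to reach the category name.
snow_sql = """
SELECT c.category_name, SUM(f.amount)
FROM fact_sales f
JOIN dim_product_snow p ON p.product_key = f.product_key
JOIN dim_category c ON c.category_key = p.category_key
GROUP BY c.category_name
ORDER BY c.category_name
"""

star_result = conn.execute(star_sql).fetchall()
snow_result = conn.execute(snow_sql).fetchall()

# Redundancy check: how many rows store the text 'Beverage' in each layout.
star_copies = conn.execute(
    "SELECT COUNT(*) FROM dim_product_star WHERE category_name = 'Beverage'"
).fetchone()[0]
snow_copies = conn.execute(
    "SELECT COUNT(*) FROM dim_category WHERE category_name = 'Beverage'"
).fetchone()[0]

print(star_result, snow_result, star_copies, snow_copies)
```

Both queries return the same totals; the difference is only in the number of joins and in how often the category text is repeated.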

12. How do you delete a data set from the command line?

A:- orchadmin rm datasetname
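
The factless fact table use case from question 7 (finding promoted products that did not sell) can be sketched roughly as follows. This is a minimal illustration using Python's built-in sqlite3 module, not the project's actual implementation. The tables dim_product, fact_promotion, and fact_sales, their columns, and all rows are hypothetical. The promotion table holds only dimension keys and no measure, and a left join against the sales fact table exposes the promoted products with no matching sale on the promoted date.

```python
import sqlite3

# In-memory database; every table and row here is hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (
    product_key  INTEGER PRIMARY KEY,
    product_name TEXT
);

-- Factless fact table: dimension keys only, no measure column.
CREATE TABLE fact_promotion (
    product_key   INTEGER,
    promotion_key INTEGER,
    date_key      INTEGER
);

-- Ordinary fact table with a measure (amount).
CREATE TABLE fact_sales (
    product_key INTEGER,
    date_key    INTEGER,
    amount      REAL
);

INSERT INTO dim_product VALUES (1, 'Tea'), (2, 'Coffee'), (3, 'Soap');
INSERT INTO fact_promotion VALUES
    (1, 100, 20240101), (2, 100, 20240101), (3, 101, 20240102);
INSERT INTO fact_sales VALUES (1, 20240101, 5.0), (3, 20240101, 2.0);
""")

# Promoted products with no sale on the promoted date:
# the LEFT JOIN keeps every promotion row, and rows without
# a matching sale have NULL in the sales columns.
unsold = conn.execute("""
SELECT DISTINCT p.product_name
FROM fact_promotion fp
JOIN dim_product p ON p.product_key = fp.product_key
LEFT JOIN fact_sales s
       ON s.product_key = fp.product_key
      AND s.date_key = fp.date_key
WHERE s.product_key IS NULL
ORDER BY p.product_name
""").fetchall()

print(unsold)
```

Tea is excluded because it sold on its promotion date. Soap is included because its only sale fell on a different date from its promotion.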
