Você está na página 1de 26

ISSS602 Data Analytics Lab

Lab Exercise 1: First date with JMP Pro

ISSS602 Data Analytics Lab

Lab Exercise 1:
First date with JMP Pro

Learning Outcome

By the end of this session, you will be able to:

 understand the basic components and operations of JMP Pro


 use the table manipulation and tabulation functions of JMP to perform data
import, data cleaning, data recoding, data transformation and
 perform interactive tabulation analysis.

1
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Versions history
Version Description of changes Date Editor
1.0.0 First working draft 20/12/2013 Kam
1.2.0 Final proof-reading 02/01/2013 Kam
1.3.0 Minor editing and replacing the data 23/08/2013 Kam
1.4.0 Revise base on JMP Pro 11 05/01/2014 Kam
1.5.0 Minor editing and replacing the data 20/08/2014 Kam
1.6.0 Replaced the hands-on dataset and minor revision 9/1/2014 Kam
1.7.0 Revised based on JMP Pro 12 2/9/2015 Kam
1.8.0 Replaced the hands-on dataset and minor revision 14/01/2016 Kam
1.9.0 Minor revision 12/08/2016 Kam
2.0.0 Revised based on JMP Pro 13 and change dataset 12/01/2017 Kam

2
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Contents
1.0 Overview ..................................................................................................................................... 4
1.1 Getting Start with JMP ............................................................................................................ 4
1.1.1 Launching JMP and getting started................................................................................. 4
1.1.3 Loading data into JMP ..................................................................................................... 5
1.2 Data Preparation ................................................................................................................... 10
1.2.1 Using Tabulate functions to check data accuracy ......................................................... 10
1.2.2 Correct the error using recode function ....................................................................... 13
1.3 Reformatting data type ......................................................................................................... 15
1.4 Using table functions to derive summary statistics .............................................................. 20
1.5 Working with interactive tabular analysis ............................................................................ 23

3
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

1.0 Overview

1.1 Getting Start with JMP

This exercise aims to familiarise you with the basic components of JMP. You will also learn
how to use the table manipulation and tabulation functions of JMP to perform data import,
data cleaning, data recoding, data transformation and interactive tabulation analysis.

1.1.1 Launching JMP and getting started

 From the Office desktop click on the JMP icon .

When JMP is launched, you will see two windows, namely: the Tip of the Day window and
the JMP Home window.

The Tip of the Day window gives useful hints on using JMP. The tip displayed is randomly
selected and changes every time JMP program is launched.

The JMP Home Window menus, across the top, can also be used to perform JMP functions.
The JMP toolbar, located beneath the Menu bar, provides many shortcuts and useful tools.

4
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

 At the Tip of the Day window, click on the Close button to close the window.

1.1.3 Loading data into JMP

In this section, you will learn how to import an existing data file in Comma-Separated Values
(http://en.wikipedia.org/wiki/Comma-separated_values) format into JMP as JMP’s Data
Table. The file name is REALIS2016.csv (Note: assume that it is found in \ISSS602\lab01
folder). It provides private property transaction records from 1 January 2016 to 31
December 2016.

 At the JMP Home Window, select File -> Open.

The Open Data File dialog appears.

5
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

 At the Files of type:, select Text File (*.txt, *.csv, *.dat, *.tsv).

 Navigate to C:\ISSS602\lab01 folder and click on REALIS2016.csv .

 For Open as, click on the radio button in front of Data with Preview.

 Click on the Open button.

6
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

The Import dialog window appears. You will use this interface to specify the characteristics
of the data.

 Click on the Next button.

Your screen should look similar to the figure below. This interface allows you to specify the
data types such as numerical or character.

7
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

 Click on the Import button.

The REALIS2016 data table appears.


Disclosure Button Red Triangle Column Name Data Grid

File Name
Tables Panel

Columns Pane

Row Pane

The Columns Pane shows that there are a total of 20 variables in the data table. The
categorical variables (11) are indicated by the icon and the continuous variables (9) are
indicated by icon.

The Row Pane shows that this data table contains a total of 20,266 records.

DIY: With reference to the lab exercise note, try to understand the
various components such as Table Panel, Column Panel and Row
Panel of JMP data table.

You can now save the Data Table.

 At the JMP menu, click File -> Save.

8
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

The Save dialog window appears.

 Navigate to an appropriate output folder, such as C:\ISSS602\lab01.

 For File name:, type REALIS2016.

 For Save as type:, select JMP Data Table (*.jmp) from the drop-down list.

 Click on the Save button.

9
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Notice that the name of the data table located at the upper right hand corner of the data
table has been changed to REALIS2016.

1.2 Data Preparation

Best Practice: Once data is imported into JMP, you should examine the Data Table carefully
for accuracy and completeness. Accuracy refers to the degree that data types in the
imported dataset matches the original dataset. Completeness refers to the degree that the
number of records and fields in the imported dataset matches the original dataset.

1.2.1 Using Tabulate functions to check data accuracy

In this section you will learn how to use the Tabulate Functions, an interactive tabulating
environment of JMP, to detect inconsistent records in a data table.

 At the JMP menu, click Analyze -> Tabulate.

10
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

The Tabulate dialog appears.

 At the Control Panel, click on Planning Area.

 Drag and drop it on to the Drop zone for rows dimension.

11
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

A frequency table showing the count of records in each Planning Area appears.

Notice that there are two Yishun planning areas. One is in upper case and the other one is
in lower case. This will affect subsequent analysis if it is not been corrected.

 Click to close Tabulate window.

12
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

1.2.2 Correct the error using recode function

In this section, you will learn how to correct the error identified in section 1.2.1 using the
Recode function of JMP.

 At the REALIS2016 Data Table, click on the Planning Area field name.

 At JMP Menu, select Cols -> Recode.

13
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

The Recode dialog appears.

 Click on YISHUN under the New Value column.

 Retype the word as Yishun.

 Click on the Done button.

 Select In Place from the drop-down list.

Notice that JMP replaced all records with the word YISHUN under the Planning Area field to
Yishun.

14
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

DIY: Use the Tabulate Functions to check if the data inconsistency


error has been corrected properly.

1.3 Reformatting data type

JMP is very picky as to the data type of the variables that are going to be analysed. For
certain operations, the data must be continuous and others categorical (i.e. nominal or
ordinal). For example, the Postal District, Postal Sector and Postal Code variables should be
in categorical data format instead of continuous data format. The Sale Date field should be
in month/year format.

In this section, you will learn how to change the above variables into the appropriate data
types.

First, you will learn how to change the data type of the Sale Date field.

 At Column Name, right-click on the Sale Date field.

 Select Column Info from the context menu.

15
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

The dialog box containing the Column Info of Sale Date appears.

 At the Format field, select Date, then m/y from the drop-down list.

 Click on the OK button.

16
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Notice that the records under the Sale Date field have changed to month/year format (i.e.
01/2014).

Next, you are going to change the data type of Postal District, Postal Sector and Postal Code
to categorical.

 Press on the Ctrl button of the keyboard then click on Postal District, Postal Sector
and Postal Code fields.

 Right click and select Column Info from the context menu.

17
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

The Column Setting dialog window appears.

18
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

 At the Data Type field of Postal District, select Character from the drop-down list.

Notice that JMP will automatically change the Modeling Type from Continuous to Nominal.

DIY: Repeat the same steps to change the data type of Postal Sector and
Postal Code.

• Click on the OK button.

Notice that the symbols in front of Postal District, Postal Sector and Postal Code in the
Column Pane are now represented by red histogram symbols.

19
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

1.4 Using table functions to derive summary statistics

In this section, you will learn how to construct a summary table using the Tables functions
of JMP.

First, you will use the static Table -> Summary function of JMP.

 At the JMP Menu, select Tables -> Summary.

The Summary dialog appears.

20
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

 Click on No. of Units and select Sum from the Statistics drop-down list.

Your screen should look similar to the figure below.

21
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Next,

 Click on Area (sq m) and select Mean from the Statistics drop-down list.

 Click on Transacted Price and select Mean from the Statistics drop-down list.

 Click on Transacted Price and select Median from the Statistics drop-down list.

 Click on Unit Price (psm) and select Mean from the Statistics drop-down list.

 Click on Unit Price (psm) and select Median from the Statistics drop-down list.

Your screen should look similar to the figure below.

• Click on the OK button.

A summary table looks similar to the table below appears.

 Click to close the Summary table.

22
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

1.5 Working with interactive tabular analysis

Now you will learn how to construct summary table interactively using the Tabulate
function of JMP.

 At the menu bar of REALIS2016 data table window, select Analyze -> Tabulate.

The Tabulate dialog window appears.

First, you will create a summary table of numbers of properties transacted by property type.

 At the Control Panel, click on Property Type.

 Drag-and-drop it on to the Drop zone for rows dimension.

 At the Control Panel, click on No. of Units.

 Drag-and-drop it onto the Resulting Cells dimension.

Your screen should look similar to the figure below.

Next, you will compute the mean and median unit selling price of each property type.

 At the Control Panel, click on Unit Price ($ psf).

 Drag-and-drop it on to the Drop zone for column dimension.

 Select New Column Table with Analysis Columns from the context menu.

23
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Your screen should look similar to the figure below.

Notice that the default summary statistic is Sum.

You will change it to mean.

 At the Control Panel, click on Mean.

 Drag-and-drop it on the Sum directly below Unit Price ($ psf).

Now, your screen should look similar to the figure below.

The beauty of Tabulate is that it is highly interactive. Users can change the table structure
easily by dragging and dropping the variables to the appropriate row or column dimensions.
For example, you can create a cross-table showing the mean transaction price by Property
Type and Planning Region.

 At the Tabulate pane, click on No. of Units.

 Right-click and select Delete from the context menu.

24
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Your screen should look similar to the figure below.

 Next, At the Control Panel window, click on Planning Region.

 Drag-and-drop it on to the Drop zone for columns dimension. Place it on top of the
Unit Price ($ pcf).

25
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro

Your screen should look similar to below.

Congratulation! You had just created a cross-table.

26

Você também pode gostar