Opencv Java Tutorials

OpenCV Java Tutorials Documentation
Release 1.0
Luigi De Russis, Alberto Sacco
Oct 26, 2017

Contents
1 Installing OpenCV for Java 3

1.1 Introduction to OpenCV for Java . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Install the latest Java version . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Install the latest Eclipse version . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.4 Install OpenCV 3.x under Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.5 Install OpenCV 3.x under macOS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.6 Install OpenCV 3.x under Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.7 Set up OpenCV for Java in Eclipse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.8 Set up OpenCV for Java in other IDEs (experimental) . . . . . . . . . . . . . . . . . . . . . . . . . 8
2 Your First Java Application with OpenCV 9

2.1 A Java application with OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 What we will do in this tutorial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.3 Create a New Project . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.4 Add a User Library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.5 Create a simple application . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3 Your First JavaFX Application with OpenCV 13

3.1 A JavaFX application with OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.3 Your First Application in JavaFX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.4 Installing e(fx)clipse plugin and Scene Builder . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.5 Working with Scene Builder . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.6 Key Concepts in JavaFX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3.7 Managing GUI Interactions With the Controller Class . . . . . . . . . . . . . . . . . . . . . . . . . 20
3.8 Video Capturing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4 OpenCV Basics 23
4.2 Getting started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.3 Color channel checkbox . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
4.4 Load an Image and Add it to the Stream . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.5 Calculate a Histogram . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
4.6 Draw the Histogram . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
5 Fourier Transform 31
5.1 Goal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
i
5.2 What is the Fourier Transform? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
5.4 Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
5.5 Load the file . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
5.6 Applying the DFT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
5.7 Applying the inverse DFT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
5.8 Analyzing the results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
6 Face Detection and Tracking 37

6.1 Goal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
6.2 Cascade Classifiers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
6.4 Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
6.5 Loading the Classifiers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
6.6 Detection and Tracking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
7 Image Segmentation 43
7.1 Goal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
7.2 Canny edge detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
7.3 Dilatation and Erosion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
7.5 Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
7.6 Using the Canny edge detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
7.7 Canny Result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
7.8 Using the Background Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
7.9 Background Removal Result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
8 Object Detection 49
8.1 Goal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
8.2 Morphological Image Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
8.4 Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
8.5 Image processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
8.6 Morphological Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
8.7 Object tracking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
9 Camera Calibration 55
9.1 Goal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
9.2 What is the camera calibration? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
9.3 Calibration Pattern . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
9.5 Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
9.6 Pattern Recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
9.7 Saving Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
9.8 Camera Calibration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
10 Indices and tables 63
ii
OpenCV Java Tutorials Documentation, Release 1.0
Note: We are in the process to update these tutorials to use Java 8, only.
Contents:
Contents 1
2 Contents
CHAPTER 1
Installing OpenCV for Java
Introduction to OpenCV for Java
As of OpenCV 2.4.4, OpenCV supports desktop Java development. This tutorial will help you install OpenCV on your
desktop operating system.
Install the latest Java version
Download the latest Java JDK from the Oracle website. Now you should be able to install the last Java JDK by opening
the file just downloaded.
Install the latest Eclipse version
Download the latest Eclipse version at the Eclipse Download page choosing the Eclipse IDE for Java
Developers version (suggested).
Extract the downloaded compressed file and put the resulting folder wherever you want to. You dont need to install
anything. Alternatively, you can try the Eclipse installer.
Install OpenCV 3.x under Windows
First of all you should download the OpenCV library (version 3.x) from here.
Then, extract the downloaded OpenCV file in a location of your choice. Once you get the folder opencv put in
wherever you prefer.
Now the only two things that you will need are: the opencv-3xx.jar file located at \opencv\build\java
and the opencv_java3xx.dll library located at \opencv\build\java\x64 (for 64-bit systems) or
3
\opencv\build\java\x86 (for 32-bit systems). The 3xx suffix of each file is a shortcut for the current OpenCV
version, e.g., it will be 300 for OpenCV 3.0 and 330 for OpenCV 3.3.
Install OpenCV 3.x under macOS
The quickest way to obtain OpenCV under macOS is to use Homebrew. After installing Homebrew, you have to check
whether the XCode Command Line Tools are already installed on your system.
To do so, open the Terminal and execute: xcode-select --install If macOS asks for installing such tools,
proceed with the download and installation. Otherwise, continue with the OpenCV installation.
To install OpenCV (with Java support) through Homebrew, you need to edit the opencv formula in Home-
brew, to add support for Java: brew edit opencv In the text editor that will open, change the line:
-DBUILD_opencv_java=OFF in -DBUILD_opencv_java=ON then, after saving the file, you can effectively
install OpenCV: brew install --build-from-source opencv
After the installation of OpenCV, the needed jar file and the dylib library will be located at /usr/
local/Cellar/opencv/3.x.x/share/OpenCV/java/, e.g., /usr/local/Cellar/opencv/3.3.
1/share/OpenCV/java/.
Please, notice that this method doesnt work if you update OpenCV from a previous version: you need to uninstall
OpenCV and install it again. Also, Homebrew doesnt produce the needed files in some systems without Apache Ant
(e.g., in /usr/local/bin/ant).
Install OpenCV 3.x under Linux
Please, note: the following instructions are also useful if you want to compile OpenCV under Windows or macOS.
Linux package management systems (apt-get, yum, etc.) may provide the needed version of the OpenCV library.
As first step, download and install CMake and Apache Ant, if you dont have any of these. Download the OpenCV
library from its website. Extract the downloaded OpenCV file in a location of your choice and open CMake ( cmake-
gui ). Put the location of the extracted OpenCV library in the Where is the source code field (e.g.,
/opencv/) and insert the destination directory of your build in the Where to build the binaries field (e.g.,
/opencv/build). At last, check the Grouped and Advanced checkboxes.
4 Chapter 1. Installing OpenCV for Java

Now press Configure and use the default compilers for Unix Makefiles. Please, be sure to have installed
a C/C++ compiler. In the Ungrouped Entries group, insert the path to the Apache Ant executable (e.g., /
apache-ant-1.9.6/bin/ant). In the BUILD group, unselect:
BUILD_PERF_TESTS
BUILD_SHARED_LIBRARY to make the Java bindings dynamic library all-sufficient
BUILD_TESTS
BUILD_opencv_python
In the CMAKE group, set to Debug (or Release) the CMAKE_BUILD_TYPE
In the JAVA group:
insert the Java AWT include path (e.g., /usr/lib/jvm/java-1.8.0/include/)
insert the Java AWT library path (e.g., /usr/lib/jvm/java-1.8.0/include/jawt.h)
insert the Java include path (e.g., /usr/lib/jvm/java-1.8.0/include/)
1.6. Install OpenCV 3.x under Linux 5

insert the alternative Java include path (e.g., /usr/lib/jvm/java-1.8.0/include/linux)

insert the JVM library path (e.g., /usr/lib/jvm/java-1.8.0/include/jni.h)
Press Configure twice, and the CMake window should appear with a white background. If not, fix the red lines and
press Configure again. Now, press Generate and close CMake.
Now open the terminal, go to the build folder of OpenCV and compile everything with the command: make
-j. Notice that the -j flag tells make to run in parallel with the maximum number of allowed job threads, which
makes the build theoretically faster. Wait for the process to be completed... If everything went well you should have
opencv-3xx.jar in the /opencv/build/bin directory and libopencv_java3xx.so in the /opencv/
build/lib directory. The 3xx suffix of each file is a shortcut for the current OpenCV version, e.g., it will be 300 for
OpenCV 3.0 and 330 for OpenCV 3.3. This is everything you need.

Set up OpenCV for Java in Eclipse
Open Eclipse and select a workspace of your choice. Create a User Library, ready to be used on all your next projects:
go to Window > Preferences....
From the menu navigate under Java > Build Path > User Libraries and choose New.... Enter a
name for the library (e.g., opencv) and select the newly created user library. Choose Add External JARs...,
browse to select opencv-3xx.jar from your computer. After adding the jar, extend it, select Native library
location and press Edit....
1.7. Set up OpenCV for Java in Eclipse 7

Select External Folder... and browse to select the folder containing the OpenCV libraries (e.g.,
C:\opencv\build\java\x64 under Windows).
In case of MacOS, if you installed OpenCV without Homebrew, you need to create a soft link with .dylib extension for
the .so file. E.g., from the terminal, type: ln -s libopencv_java300.so libopencv_java300.dylib
Set up OpenCV for Java in other IDEs (experimental)
If you are using IntelliJ, you can specify the location of the library with the VM argument -Djava.library.
path=/opencv/build/lib.

CHAPTER 2
Your First Java Application with OpenCV
Note: We assume that by now you have already read the previous tutorials. If not, please check previous tutorials at
http://opencv-java-tutorials.readthedocs.org/en/latest/index.html. You can also find the source code and resources at
https://github.com/opencv-java/
A Java application with OpenCV
This tutorial will guide you through the creation of a simple Java console application using the OpenCV library in
Eclipse.
What we will do in this tutorial
In this guide, we will:

Create a new Java Project
Add a User Library to the project
Write some OpenCV code
Build and Run the application
Create a New Project
Open Eclipse and create a new Java project; open the File menu, go to New and click on Java Project.
9
In the New Java Project dialog write the name of your project and click on Finish.
Add a User Library
If you followed the previous tutorial (Installing OpenCV for Java), you should already have the OpenCV
library set in your workspaces user libraries; if not please check out the previous tutorial. Now you should be ready
to add the library to your project. Inside Eclipses Package Explorer just right-click on your projects folder and
go to Build Path --> Add Libraries....
Select User Libraries and click on Next, check the checkbox of the OpenCV library and click Finish.
10 Chapter 2. Your First Java Application with OpenCV

Create a simple application
Now add a new Class to your project by right-clicking on your projects folder and go to New --> Class. Write a
name of your choice for both the package and the class then click on Finish. Now we are ready to write the code of
our first application. Lets start by defining the main method:
public class HelloCV {

public static void main(String[] args){
System.loadLibrary(Core.NATIVE_LIBRARY_NAME);
Mat mat = Mat.eye(3, 3, CvType.CV_8UC1);
System.out.println("mat = " + mat.dump());
}
}
First of all we need to load the OpenCV Native Library previously set on our project.
System.loadLibrary(Core.NATIVE_LIBRARY_NAME);
Then we can define a new Mat.
Note: The class Mat represents an n-dimensional dense numerical single-channel or multi-channel array. It can be
used to store real or complex-valued vectors and matrices, grayscale or color images, voxel volumes, vector fields,
point clouds, tensors, histograms. For more details check out the OpenCV page.
Mat mat = Mat.eye(3, 3, CvType.CV_8UC1);
The Mat.eye represents a identity matrix, we set the dimensions of it (3x3) and the type of its elements.
As you can notice, if you leave the code just like this, you will get some error; this is due to the fact that eclipse cant
resolve some variables. You can locate your mouse cursor on the words that seem to be errors and wait for a dialog to
pop up and click on the voice Import.... If you do that for all the variables we have added to the code the following
rows:
import org.opencv.core.Core;
import org.opencv.core.CvType;
import org.opencv.core.Mat;
We can now try to build and run our application by clicking on the Run button. You should have the following output:
2.5. Create a simple application 11

The whole source code is available on GitHub.
12 Chapter 2. Your First Java Application with OpenCV

CHAPTER 3
Your First JavaFX Application with OpenCV
A JavaFX application with OpenCV
This tutorial will guide you through the creation of a simple JavaFX GUI application using the OpenCV library in
Eclipse.

Install the e(fx)clipse plugin and (optionally) Scene Builder.
Work with Scene Builder.
Write and Run our application.
Your First Application in JavaFX
The application you will write by following this tutorial is going to capture a video stream from a webcam and, then,
it will display it on the user interface (GUI). We will create the GUI with Scene Builder: it is will have a button, which
will allow us to start and stop the stream, and a simple image view container where we will put each stream frame.
13
Installing e(fx)clipse plugin and Scene Builder
In Eclipse, install the e(fx)clipse plugin, by following the guide at http://www.eclipse.org/efxclipse/install.

html#fortheambitious. If you choose not to install such a plugin, you have to create a traditional Java
project, only. Download and install JavaFX Scene Builder 2.0 from http://www.oracle.com/technetwork/java/
javafxscenebuilder-1x-archive-2199384.html.
Now you can create a new JavaFX project. Go to File > New > Project... and select JavaFX
project....
Choose a name for your project and click Next.
14 Chapter 3. Your First JavaFX Application with OpenCV

Now add your OpenCV user library to your project and click Next.
3.4. Installing e(fx)clipse plugin and Scene Builder 15

Choose a name for your package, for the FXML file and for the Controller Class. The FXML file will contain the
description of your GUI in FXML language, while the Controller Class will handle all the method and event which
have to be called and managed when the user interacts with the GUIs components.
Working with Scene Builder
If you have installed Scene Builder you can now right click on your FXML file in Eclipse and select Open with
SceneBuilder. Scene Builder can help construct you gui by interacting with a graphic interface; this allows
you to see a real time preview of your window and modify your components and their position just by editing the
graphic preview. Lets take a look at what Im talking about. At fist the FXML file will have just an AnchorPane. An
AnchorPane allows the edges of child nodes to be anchored to an offset from the anchorpanes edges. If the anchorpane
has a border and/or padding set, the offsets will be measured from the inside edge of those insets. The anchorpane
lays out each managed child regardless of the childs visible property value; unmanaged children are ignored for all
layout calculations. You can go ahead and delete the anchorpane and add a BorderPane instead. A BorderPane lays
out children in top, left, right, bottom, and center positions.

You can add a BorderPane by dragging from the Container menu a borderpane and then drop it in the Hierarchy
menu. Now we can add the button that will allow us to start and stop the stream. Take a button component from the
Controls menu and drop it on the BOTTOM field of our BP. As we can see, on the right we will get three menus
(Properties, Layout, Code) which are used to customize our selected component. For example we can change text of
our button in Start Camera in the Text field under the Properties menu and the id of the button (e.g. start_btn)
in the fx:id field under the Code menu.
3.5. Working with Scene Builder 17

We are going to need the id of the button later, in order to edit the button properties from our Controllers methods.
As you can see our button is too close to the edge of the windows, so we should add some bottom margin to it; to do
so we can add this information in the Layout menu. In order to make the button work, we have to set the name of the
method (e.g. startCamera) that will execute the action we want to preform in the field OnAction under the Code
menu.

Now, we shall add an ImageView component from the Controls menu into the CENTER field of our BP. Lets also
edit the id of the image view (e.g. currentFrame), and add some margin to it.
Finally we have to tell which Controller class will mange the GUI, we can do so by adding our controller class name
in the Controller class field under the Controller menu located in the bottom left corner of the window.
We just created our first GUI by using Scene Builder, if you save the file and return to Eclipse you will notice that
some FXML code has been generated automatically.
3.5. Working with Scene Builder 19

Key Concepts in JavaFX
The Stage is where the application will be displayed (e.g., a Windows window). A Scene is one container of Nodes
that compose one page of your application. A Node is an element in the Scene, with a visual appearance and an
interactive behavior. Nodes may be hierarchically nested . In the Main class we have to pass to the start function our
primary stage:
public void start(Stage primaryStage)
and load the fxml file that will populate our stage, the root element of the scene and the controller class:
FXMLLoader loader = new FXMLLoader(getClass().getResource("FXHelloCV.fxml"));

BorderPane root = (BorderPane) loader.load();
FXController controller = loader.getController();
Managing GUI Interactions With the Controller Class
For our application we need to do basically two thing: control the button push and the refreshment of the image view.
To do so we have to create a reference between the gui components and a variable used in our controller class:
@FXML
private Button button;
@FXML
private ImageView currentFrame;
The @FXML tag means that we are linking our variable to an element of the fxml file and the value used to declare the
variable has to equal to the id set for that specific element.
The @FXML tag is used with the same meaning for the Actions set under the Code menu in a specific element.
for:
<Button fx:id="button" mnemonicParsing="false" onAction="#startCamera" text="Start

Camera" BorderPane.alignment="CENTER">
we set:
@FXML
protected void startCamera(ActionEvent event) { ...
Video Capturing
Essentially, all the functionalities required for video manipulation is integrated in the VideoCapture class.
private VideoCapture capture = new VideoCapture();
This on itself builds on the FFmpeg open source library. A video is composed of a succession of images, we re-
fer to these in the literature as frames. In case of a video file there is a frame rate specifying just how long is
between two frames. While for the video cameras usually there is a limit of just how many frames they can dig-
italize per second. In our case we set as frame rate 30 frames per sec. To do so we initialize a timer (i.e., a
`ScheduledExecutorService`) that will open a background task every 33 milliseconds.

Runnable frameGrabber = new Runnable() { ... }

this.timer = Executors.newSingleThreadScheduledExecutor();
this.timer.scheduleAtFixedRate(frameGrabber, 0, 33, TimeUnit.
MILLISECONDS);
To check if the binding of the class to a video source was successful or not use the isOpened function:
if (this.capture.isOpened()) { ... }
Closing the video is automatic when the objects destructor is called. However, if you want to close it before this you
need to call its release function.
this.capture.release();
The frames of the video are just simple images. Therefore, we just need to extract them from the VideoCapture object
and put them inside a Mat one.
Mat frame = new Mat();
The video streams are sequential. You may get the frames one after another by the read or the overloaded >> operator.
this.capture.read(frame);
Now we are going to convert our image from BGR to Grayscale format. OpenCV has a really nice function to do this
kind of transformations:
Imgproc.cvtColor(frame, frame, Imgproc.COLOR_BGR2GRAY);
As you can see, cvtColor takes as arguments:

a source image (frame)
a destination image (frame), in which we will save the converted image.
an additional parameter that indicates what kind of transformation will be performed. In this case we use
COLOR_BGR2GRAY (because of imread has BGR default channel order in case of color images).
Now in order to put the captured frame into the ImageView we need to convert the Mat in a Image. We first create a
buffer to store the Mat.
MatOfByte buffer = new MatOfByte();
Then we can put the frame into the buffer by using the imencode function:
Imgcodecs.imencode(".png", frame, buffer);
This encodes an image into a memory buffer. The function compresses the image and stores it in the memory buffer
that is resized to fit the result.
Note: imencode returns single-row matrix of type CV_8UC1 that contains encoded image as array of bytes.
It takes three parameters:

(.png) File extension that defines the output format.
(frame) Image to be written.
(buffer) Output buffer resized to fit the compressed image.
3.8. Video Capturing 21

Once we filled the buffer we have to stream it into an Image by using ByteArrayInputStream:
new Image(new ByteArrayInputStream(buffer.toArray()));
Now we can put the new image in the ImageView. With Java 1.8 we cannot perform an update of a GUI element in a
thread that differs from the main thread; so we need to get the new frame in a second thread and refresh our ImageView
in the main thread:
Image imageToShow = grabFrame();

Platform.runLater(new Runnable() {
@Override public void run() { currentFrame.setImage(imageToShow); }
});
The source code of the entire tutorial is available on GitHub.

CHAPTER 4
OpenCV Basics

Create a basic checkbox interaction to alter the color of the video stream.
Add a basic checkbox interaction to alpha over a logo to the video stream.
Display the video stream histogram (both one and three channels).
Getting started
For this tutorial we can create a new JavaFX project and build a scene as the one realized in the previous one. So weve
got a window with a border pane in which:
in the BOTTOM we have a button inside a HBox:
<HBox alignment="CENTER" >

<padding>
<Insets top="25" right="25" bottom="25" left="25"/>
</padding>
<Button fx:id="button" alignment="center" text="Start camera" onAction="
#startCamera" />
</HBox>
in the CENTER we have a ImageView:
23
<ImageView fx:id="currentFrame" />
Color channel checkbox
Lets open our fxml file with Scene Builder and add to the RIGHT field of our BorderPane a vertical box VBox. A
VBox lays out its children in a single vertical column. If the VBox has a border and/or padding set, then the contents
will be layed out within those insets. Also it will resize children (if resizable) to their preferred heights and uses its
fillWidth property to determine whether to resize their widths to fill its own width or keep their widths to their
preferred (fillWidth defaults to true). A HBox works just like a VBox but it lays out its children horizontally instead
of vertically.
Now we can put inside the VBox a new checkbox, change its text to Show in gray scale, and set an id (e.g.,
grayscale).
<CheckBox fx:id="grayscale" text="Show in gray scale" />
Lets also add a title to this section by putting a text before our new checkbox, but still inside the VBox. Then, set its
text to Controls (we can find the text element under the Shapes menu).
<Text text="Controls" />
In the Scene Builder we now have:
24 Chapter 4. OpenCV Basics

The graphic interface is complete for the first task, now we need to work on the controller; in the previous tutorial we
could control the number of channels displayed on screen with the line:
In order to control this conversion with the check box, we have to link the check box with a FXML variable:
@FXML
private CheckBox grayscale;
Now we can implement the control by adding a simple if condition which will perform the conversion only if our
check box is checked:
if (grayscale.isSelected())
{
}
Load an Image and Add it to the Stream
The next step is to add another check box which, if checked, will trigger the display of an image over the camera
stream. Lets start by adding the image to the project; create a new folder in the root directory of your project and
put the image in there. In my project I have a resources folder with a Poli.png image. Go back to Eclipse and
refresh your project (you should have the new folder in it). Lets open the FXML file with Scene Builder and add a
new checkbox below the one that controls the stream colors; we have to set the text, the name of the method in the
OnAction field and an id. In the code we will have for example:
<CheckBox fx:id="logoCheckBox" text="Show logo" onAction="#loadLogo" />
In the controller file we have to define a new variable associated with the checkbox, the method set on the OnAction
field and adapt the code so that it will display the logo when the checkbox is checked and the stream is on. Variable:
@FXML
private CheckBox logoCheckBox;
loadLogo method: In this method we are going to load the image whenever the logoCheckBox id selected (checked).
In order to load the image we have to use a basic OpenCV function: imread. It returns a Mat and takes the path of the
image and a flag (> 0 RGB image, =0 grayscale, <0 with the alpha channel).
@FXML
protected void loadLogo()
{
if (logoCheckBox.isSelected())
this.logo = Imgcodecs.imread("resources/Poli.png");
}
Adapt the code.

We are going to add some variants to the code in order to display our logo in a specific region of the stream. This
means that for each frame capture, before the image could be converted into 1 or 3 channels, we have to set a ROI
(region of interest) in which we want to place the logo. Usually a ROI of an image is a portion of it, we can define the
ROI as a Rect object. Rect is a template class for 2D rectangles, described by the following parameters:
Coordinates of the top-left corner. This is a default interpretation of Rect.x and Rect.y in OpenCV. Though, in
your algorithms you may count x and y from the bottom-left corner.
4.4. Load an Image and Add it to the Stream 25

Rectangle width and height.
Rect roi = new Rect(frame.cols()-logo.cols(), frame.rows()-logo.rows(), logo.cols(),

logo.rows());
Then we have to take control of our Mats ROI, by doing so we are able to add our logo in the disired area of the
frame defined by the ROI.
Mat imageROI = frame.submat(roi);
We had to make this operation because we can only add Mats with the same sizes; but how can we add two Mat
together? We have to keep in mind that our logo could have 4 channels (RGB + alpha). So we could use two functions:
addWeighted or copyTo. The addWeighted function calculates the weighted sum of two arrays as follows:
dst(I)= saturate(src1(I) alpha + src2(I)* beta + gamma)*
where I is a multi-dimensional index of array elements. In case of multi-channel arrays, each channel is processed
independently. The function can be replaced with a matrix expression:
dst = src1*alpha + src2*beta + gamma
Note: Saturation is not applied when the output array has the depth CV_32S. You may even get result of an incorrect
sign in the case of overflow.
Parameters:
src1 first input array.
alpha weight of the first array elements.
src2 second input array of the same size and channel number as src1.
beta weight of the second array elements.
gamma scalar added to each sum.
dst output array that has the same size and number of channels as the input arrays.
So well have:
Core.addWeighted(imageROI, 1.0, logo, 0.7, 0.0, imageROI);
The second method (copyTo) simply copies a Mat into the other. Well have:
Mat mask = logo.clone();

logo.copyTo(imageROI, mask);
Everything we have done so far to add the logo to the image has to perform only IF our checkbox is check and the
image loading process has ended successfully. So we have to add an if condition:
if (logoCheckBox.isSelected() && this.logo != null)

{
Rect roi = new Rect(frame.cols() - logo.cols(), frame.rows() - logo.rows(), logo.
cols(),logo.rows());
Mat imageROI = frame.submat(roi);

// add the logo: method #1
Core.addWeighted(imageROI, 1.0, logo, 0.7, 0.0, imageROI);

// add the logo: method #2
// Mat mask = logo.clone();

// logo.copyTo(imageROI, mask);
}
Calculate a Histogram
A histogram is a collected counts of data organized into a set of predefined bins. In our case the data represents the
intensity of the pixel so it will have a range like (0, 256).
Since we know that the range of information value, we can segment our range in subparts (called bins); lets identify some parts
1. dims: The number of parameters you want to collect data of.

2. bins: It is the number of subdivisions in each dim. In our example, bins = 256
3. range: The limits for the values to be measured. In this case: range = [0,255]
Our last goal is to display the histogram of the video stream for either RGB or in grayscale. For this task we are going
to define a method in our controller class that takes a Mat (our current frame) and a boolean that will flag if the frame
is in RGB or in grayscale, for example:
First thing we need to do is to divide the frame into other n frames, where n represents the number of channels of which
our frame is composed. To do so we need to use the Core.split function; it needs a source Mat and a List<Mat>
where to put the different channels. Obviously if the frame is in grayscale the list will have just one element.
Before we could calculate the histogram of each channel we have to prepare all the inputs that the calcHist function
needs. The functions calcHist calculates the histogram of one or more arrays. The elements of a tuple used to increment
a histogram bin are taken from the corresponding input arrays at the same location. Parameters:
images Source arrays. They all should have the same depth, CV_8U or CV_32F, and the same size. Each of
them can have an arbitrary number of channels.
channels List of the dims channels used to compute the histogram. The first array channels are numerated
from 0 to images[0].channels()-1, the second array channels are counted from images[0].channels() to im-
ages[0].channels() + images[1].channels()-1, and so on.
mask Optional mask. If the matrix is not empty, it must be an 8-bit array of the same size as images[i]. The
non-zero mask elements mark the array elements counted in the histogram.
hist Output histogram, which is a dense or sparse dims -dimensional array.
histSize Array of histogram sizes in each dimension.
ranges Array of the dims arrays of the histogram bin boundaries in each dimension. When the histogram
is uniform (uniform =true), then for each dimension i it is enough to specify the lower (inclusive) boundary
L_0 of the 0-th histogram bin and the upper (exclusive) boundary U_(histSize[i]-1) for the last histogram bin
histSize[i]-1. That is, in case of a uniform histogram each of ranges[i] is an array of 2 elements. When the his-
togram is not uniform (uniform=false), then each of ranges[i] contains histSize[i]+1 elements: L_0, U_0=L_1,
U_1=L_2,..., U_(histSize[i]-2)=L_(histSize[i]-1), U_(histSize[i]-1). The array elements, that are not between
L_0 and U_(histSize[i]-1), are not counted in the histogram.
accumulate Accumulation flag. If it is set, the histogram is not cleared in the beginning when it is allocated.
This feature enables you to compute a single histogram from several sets of arrays, or to update the histogram
in time.
The image will be our frame, we dont need a mask and the last flag will be false; thus we need to define the channels,
the hist, the histSize and the ranges:
4.5. Calculate a Histogram 27

In the RGB case we will need all of the hist defined, in the grayscale case instead we will use just the hist_b one.
We are now ready to do the histogram calculation:
where gray is the flag we passed to the showHistogram method.
Draw the Histogram
Next step is to draw the calculated histogram in our GUI. Open the fxml file with Scene Builder and add an ImageView
above the Controls text in the right of the BP and set its id:
<ImageView fx:id="histogram" />
Now back to the Controller class. Lets add a global variable to control the just added image view:
@FXML
private ImageView histogram;
and continue to write the showHistogram method. First thing first, lets create an image to display the histogram:
int hist_w = 150;

int hist_h = 150;
int bin_w = (int) Math.round(hist_w / histSize.get(0, 0)[0]);
Mat histImage = new Mat(hist_h, hist_w, CvType.CV_8UC3, new Scalar(0, 0, 0));
before drawing, we first normalize the histogram so its values fall in the range indicated by the parameters entered:
Core.normalize(hist_b, hist_b, 0, histImage.rows(), Core.NORM_MINMAX, -1, new Mat());

if (!gray){
Core.normalize(hist_g, hist_g, 0, histImage.rows(), Core.NORM_MINMAX, -1, new
Mat());
Core.normalize(hist_r, hist_r, 0, histImage.rows(), Core.NORM_MINMAX, -1, new

Mat());
Now we can draw the histogram in our Mat:
for (int i = 1; i < histSize.get(0, 0)[0]; i++){

Imgproc.line(histImage, new Point(bin_w * (i - 1), hist_h - Math.round(hist_b.
get(i - 1, 0)[0])), new Point(bin_w * (i), hist_h - Math.round(hist_b.get(i,
0)[0])), new Scalar(255, 0, 0), 2, 8, 0);
if (!gray){
Imgproc.line(histImage, new Point(bin_w * (i - 1), hist_h - Math.round(hist_g.
get(i - 1, 0)[0])),new Point(bin_w * (i), hist_h - Math.round(hist_g.get(i, 0)[0])),
new Scalar(0, 255, 0), 2, 8, 0);
Imgproc.line(histImage, new Point(bin_w * (i - 1), hist_h - Math.round(hist_r.

get(i - 1, 0)[0])),Math.round(hist_r.get(i, 0)[0])), new Scalar(0, 0, 255), 2, 8,
0);
}
}
Lets convert the obtained Mat to an Image with our method mat2Image and update the ImageView with the returned
Image:
histo = mat2Image(histImage);
histogram.setImage(histo);

4.6. Draw the Histogram 29


CHAPTER 5
Fourier Transform
Goal
In this tutorial we are going to create a JavaFX application where we can load a picture from our file system and apply
to it the DFT and the inverse DFT.
What is the Fourier Transform?
The Fourier Transform will decompose an image into its sinus and cosines components. In other words, it will
transform an image from its spatial domain to its frequency domain. The result of the transformation is complex
numbers. Displaying this is possible either via a real image and a complex image or via a magnitude and a phase
image. However, throughout the image processing algorithms only the magnitude image is interesting as this contains
all the information we need about the images geometric structure. For this tutorial we are going to use basic gray scale
image, whose values usually are between zero and 255. Therefore the Fourier Transform too needs to be of a discrete
type resulting in a Discrete Fourier Transform (DFT). The DFT is the sampled Fourier Transform and therefore does
not contain all frequencies forming an image, but only a set of samples which is large enough to fully describe the
spatial domain image. The number of frequencies corresponds to the number of pixels in the spatial domain image,
i.e. the image in the spatial and Fourier domain are of the same size.
31
Load an image from a file chooser.

Apply the DFT to the loaded image and show it.
Apply the iDFT to the transformed image and show it.
Getting Started
Lets create a new JavaFX project. In Scene Builder set the windows element so that we have a Border Pane with:
on the LEFT an ImageView to show the loaded picture:
<ImageView fx:id="originalImage" />
on the RIGHT two ImageViews one over the other to display the DFT and the iDFT;
<ImageView fx:id="transformedImage" />
<ImageView fx:id="antitransformedImage" />
on the BOTTOM three buttons, the first one to load the picture, the second one to apply the DFT and show it,
and the last one to apply the anti-transform and show it.
<Button alignment="center" text="Load Image" onAction="#loadImage"/>
<Button fx:id="transformButton" alignment="center" text="Apply transformation"
onAction="#transformImage" disable="true" />
<Button fx:id="antitransformButton" alignment="center" text="Apply anti transformation

" onAction="#antitransformImage" disable="true" />
The gui will look something like this one:
Load the file
First of all you need to add to your project a folder resources with two files in it. One of them is a sine function
and the other one is a circular aperture. In the Controller file, in order to load the image to our program, we are going
to use a filechooser:
32 Chapter 5. Fourier Transform

private FileChooser fileChooser;
When we click the load button we have to set the initial directory of the FC and open the dialog. The FC will return
the selected file:
File file = new File("./resources/");
this.fileChooser.setInitialDirectory(file);
file = this.fileChooser.showOpenDialog(this.main.getStage());
Once weve loaded the file we have to make sure that its going to be in grayscale and display the image into the image
view:
this.image = Imgcodecs.imread(file.getAbsolutePath(), Imgcodecs.CV_LOAD_IMAGE_
GRAYSCALE);
this.originalImage.setImage(this.mat2Image(this.image));
Applying the DFT
First of all expand the image to an optimal size. The performance of a DFT is dependent of the image size. It
tends to be the fastest for image sizes that are multiple of the numbers two, three and five. Therefore, to achieve
maximal performance it is generally a good idea to pad border values to the image to get a size with such traits. The
getOptimalDFTSize() returns this optimal size and we can use the copyMakeBorder() function to expand
the borders of an image:
int addPixelRows = Core.getOptimalDFTSize(image.rows());
int addPixelCols = Core.getOptimalDFTSize(image.cols());
Core.copyMakeBorder(image, padded, 0, addPixelRows - image.rows(), 0, addPixelCols -
image.cols(),Imgproc.BORDER_CONSTANT, Scalar.all(0));
The appended pixels are initialized with zeros.

The result of the DFT is complex so we have to make place for both the complex and the real values. We store these
usually at least in a float format. Therefore well convert our input image to this type and expand it with another
channel to hold the complex values:
padded.convertTo(padded, CvType.CV_32F);
this.planes.add(padded);
this.planes.add(Mat.zeros(padded.size(), CvType.CV_32F));
Core.merge(this.planes, this.complexImage);
Now we can apply the DFT and then get the real and the imaginary part from the complex image:
Core.dft(this.complexImage, this.complexImage);
Core.split(complexImage, newPlanes);
Core.magnitude(newPlanes.get(0), newPlanes.get(1), mag);
Unfortunately the dynamic range of the Fourier coefficients is too large to be displayed on the screen. To use the gray
scale values for visualization we can transform our linear scale to a logarithmic one:
Core.add(Mat.ones(mag.size(), CVType.CV_32F), mag);
Core.log(mag, mag);
Remember, that at the first step, we expanded the image? Well, its time to throw away the newly introduced values.
For visualization purposes we may also rearrange the quadrants of the result, so that the origin (zero, zero) corresponds
with the image center:
5.6. Applying the DFT 33

image = image.submat(new Rect(0, 0, image.cols() & -2, image.rows() & -2));

int cx = image.cols() / 2;
int cy = image.rows() / 2;
Mat q0 = new Mat(image, new Rect(0, 0, cx, cy));

Mat q1 = new Mat(image, new Rect(cx, 0, cx, cy));
Mat q2 = new Mat(image, new Rect(0, cy, cx, cy));
Mat q3 = new Mat(image, new Rect(cx, cy, cx, cy));
Mat tmp = new Mat();

q0.copyTo(tmp);
q3.copyTo(q0);
tmp.copyTo(q3);
q1.copyTo(tmp);
q2.copyTo(q1);
tmp.copyTo(q2);
Now we have to normalize our values by using the normalize() function in order to transform the matrix with
float values into a viewable image form:
Core.normalize(mag, mag, 0, 255, Core.NORM_MINMAX);
The last step is to show the magnitude image in the ImageView:
this.transformedImage.setImage(this.mat2Image(magnitude));
Applying the inverse DFT
To apply the inverse DFT we simply use the idft() function, extract the real values from the complex image with
the split() function, and normalize the result with normalize():
Core.idft(this.complexImage, this.complexImage);
Mat restoredImage = new Mat();
Core.split(this.complexImage, this.planes);
Core.normalize(this.planes.get(0), restoredImage, 0, 255, Core.NORM_MINMAX);
Finally we can show the result on the proper ImageView:
this.antitransformedImage.setImage(this.mat2Image(restoredImage));
Analyzing the results
sinfunction.png

The image is a horizontal sine of 4 cycles. Notice that the DFT just has a single component, represented by 2 bright
spots symmetrically placed about the center of the DFT image. The center of the image is the origin of the frequency
coordinate system. The x-axis runs left to right through the center and represents the horizontal component of fre-
quency. The y-axis runs bottom to top through the center and represents the vertical component of frequency. There
is a dot at the center that represents the (0,0) frequency term or average value of the image. Images usually have
a large average value (like 128) and lots of low frequency information so FT images usually have a bright blob of
components near the center. High frequencies in the horizontal direction will cause bright dots away from the center
in the horizontal direction.
circle.png
5.8. Analyzing the results 35

In this case we have a circular aperture, and what is the Fourier transform of a circular aperture? The diffraction disk
and rings. A large aperture produces a compact transform, instead a small one produces a larger Airy pattern; thus the
disk is greater if aperture is smaller; according to Fourier properties, from the center to the middle of the first dark ring
the distance is (1.22 x N) / d; in this case N is the size of the image, and d is the diameter of the circle. An Airy disk is
the bright center of the diffraction pattern created from a circular aperture ideal optical system; nearly half of the light
is contained in a diameter of 1.02 x lamba x f_number.

CHAPTER 6
Face Detection and Tracking
Goal
In this tutorial we are going to use well-known classifiers that have been already trained and distributed by OpenCV
in order to detect and track a moving face into a video stream.
Cascade Classifiers
The object recognition process (in our case, faces) is usually efficient if it is based on the features take-over which
include additional information about the object class to be taken-over. In this tutorial we are going to use the Haar-like
features and the Local Binary Patterns (LBP) in order to encode the contrasts highlighted by the human face and its
spatial relations with the other objects present in the picture. Usually these features are extracted using a Cascade
Classifier which has to be trained in order to recognize with precision different objects: the faces classification is
going to be much different from the cars classification.

Insert a checkbox to select the Haar Classifier, detect and track a face, and draw a green rectangle around
the detected face.
37
Inesrt a checkbox to select the LBP Classifier, detect and track a face, and draw a green rectangle around
the detected face.
Getting Started
on TOP a VBox a HBox and a separator. In the HBox we are goning to need two checkboxes, the first one is to
select the Haar Classifier and the second one is to select the LBP Classifier.
<CheckBox fx:id="haarClassifier" onAction="#haarSelected" text="Haar

Classifier"/>
<CheckBox fx:id="lbpClassifier" onAction="#lbpSelected" text="LBP

Classifier"/>
in the CENTRE we are going to put an ImageView for the web cam stream.
<ImageView fx:id="originalFrame" />
on the BOTTOM we can add the usual button to start/stop the stream
<Button fx:id="cameraButton" alignment="center" text="Start camera"

onAction="#startCamera" disable="true" />
Loading the Classifiers
First of all we need to add a folder resource to our project and put the classifiers in it. In order to use the classifiers
we need to load them from the resource folder, so every time that we check one of the two checkboxes we will load
the correct classifier. To do so, lets implement the OnAction methods we already declared before:
haarSelected inside this method we are going to load the desired Haar Classifier (e.g.
haarcascade_frontalface_alt.xml) as follows:
38 Chapter 6. Face Detection and Tracking

this.checkboxSelection("resources/lbpcascades/haarcascade_frontalface_alt.xml
");
...
private void checkboxSelection(String... classifierPath)
{
// load the classifier(s)
for (String xmlClassifier : classifierPath)
{
this.faceCascade.load(xmlClassifier);
}
// now the capture can start

this.cameraButton.setDisable(false);
}
lbpSelected for the LPB we can use the same method and change the path of the classifier to be loaded:
this.checkboxSelection("resources/lbpcascades/lbpcascade_frontalface.xml");
Detection and Tracking
Once weve loaded the classifiers we are ready to start the detection; we are going to implement the detection in the
detectAndDisplay method. First of all we need to convert the frame in grayscale and equalize the histogram to
improve the results:
Imgproc.cvtColor(frame, grayFrame, Imgproc.COLOR_BGR2GRAY);

Imgproc.equalizeHist(grayFrame, grayFrame);
Then we have to set the minimum size of the face to be detected (this required is need in the actual detection function).
Lets set the minimum size as the 20% of the frame height:
if (this.absoluteFaceSize == 0)
{
int height = grayFrame.rows();
if (Math.round(height * 0.2f) > 0)
{
this.absoluteFaceSize = Math.round(height * 0.2f);
}
}
Now we can start the detection:
this.faceCascade.detectMultiScale(grayFrame, faces, 1.1, 2, 0 | Objdetect.CASCADE_

SCALE_IMAGE, new Size(this.absoluteFaceSize, this.absoluteFaceSize), new Size());
The detectMultiScale function detects objects of different sizes in the input image. The detected objects are
returned as a list of rectangles. The parameters are:
image Matrix of the type CV_8U containing an image where objects are detected.
objects Vector of rectangles where each rectangle contains the detected object.
scaleFactor Parameter specifying how much the image size is reduced at each image scale.
minNeighbors Parameter specifying how many neighbors each candidate rectangle should have to retain it.
6.6. Detection and Tracking 39

flags Parameter with the same meaning for an old cascade as in the function cvHaarDetectObjects. It is not used
for a new cascade.
minSize Minimum possible object size. Objects smaller than that are ignored.
maxSize Maximum possible object size. Objects larger than that are ignored.
So the result of the detection is going to be in the objects parameter or in our case faces.
Lets put this result in an array of rects and draw them on the frame, by doing so we can display the detected face are:
Rect[] facesArray = faces.toArray();

for (int i = 0; i < facesArray.length; i++)
Imgproc.rectangle(frame, facesArray[i].tl(), facesArray[i].br(), new Scalar(0,
255, 0, 255), 3);
As you can see we selected the color green with a transparent background: Scalar(0, 255, 0, 255). .tl()
and .br() stand for top-left and bottom-right and they represents the two opposite vertexes. The last parameter just
set the thickness of the rectangles border.
The tracking part can be implemented by calling the detectAndDisplay method for each frame.

6.6. Detection and Tracking 41


CHAPTER 7
Image Segmentation
Goal
In this tutorial we are going to create a JavaFX application where we can decide to apply to video stream captured
from our web cam either a Canny edge detector or a trivial background removal using the two basic morphological
operations: dilatation and erosion.
Canny edge detector
The Canny edge detector is an edge detection operator that uses a multi-stage algorithm to detect a wide range of
edges in images. It was developed by John F. Canny in 1986.
Canny edge detection is a four step process:
1. A Gaussian blur is applied to clear any speckles and free the image of noise.
2. A gradient operator is applied for obtaining the gradients intensity and direction.
3. Non-maximum suppression determines if the pixel is a better candidate for an edge than its neighbors.
4. Hysteresis thresholding finds where edges begin and end.
The Canny algorithm contains a number of adjustable parameters, which can affect the computation time and effectiveness of t
43
The size of the Gaussian filter: the smoothing filter used in the first stage directly affects the results of the
Canny algorithm. Smaller filters cause less blurring, and allow detection of small, sharp lines. A larger
filter causes more blurring, smearing out the value of a given pixel over a larger area of the image.
Thresholds: A threshold set too high can miss important information. On the other hand, a threshold set
too low will falsely identify irrelevant information (such as noise) as important. It is difficult to give a
generic threshold that works well on all images. No tried and tested approach to this problem yet exists.
For our purpose we are going to set the filter size to 3 and the threshold editable by a Slider.
Dilatation and Erosion
Dilation and erosion are the most basic morphological operations. Dilation adds pixels to the boundaries of objects in
an image, while erosion removes pixels on object boundaries. The number of pixels added or removed from the objects
in an image depends on the size and shape of the structuring element used to process the image. In the morphological
dilation and erosion operations, the state of any given pixel in the output image is determined by applying a rule to the
corresponding pixel and its neighbors in the input image. The rule used to process the pixels defines the operation as
a dilation or an erosion.
Dilatation: the value of the output pixel is the maximum value of all the pixels in the input pixels neighborhood. In
a binary image, if any of the pixels is set to the value 1, the output pixel is set to 1.
Erosion: the value of the output pixel is the minimum value of all the pixels in the input pixels neighborhood. In a
binary image, if any of the pixels is set to 0, the output pixel is set to 0.

Add a checkbox and a Slider to select and control the Canny edge detector.
Use the Canny edge function provided by OpenCV.
Use the Dilatation and erosion functions provided by OpenCV.
Create a trivial background removal function.
Getting Started
on TOP a VBox containing two HBox, each one followed by a separator.
In the first HBox we are goning to need a checkbox and a slider, the first one is to select the Canny e.d. mode and
the second one is going to be used to control the value of the threshold to be passed to the Canny e.d. function.
<CheckBox fx:id="canny" onAction="#cannySelected" text="Edge detection"/>

<Label text="Canny Threshold" />
<Slider fx:id="threshold" disable="true" />
In the second HBox we need two checkboxes, the first one to select the Backgrond removal mode and the second
one to say if we want to invert the algorithm (a sort of foreground removal).
44 Chapter 7. Image Segmentation

<CheckBox fx:id="dilateErode" onAction="#dilateErodeSelected" text=

"Background removal"/>
<CheckBox fx:id="inverse" text="Invert" disable="true"/>
in the CENTRE we are going to put an ImageView for the web cam stream.
on the BOTTOM we can add the usual button to start/stop the stream
<Button fx:id="cameraButton" alignment="center" text="Start camera" onAction="

#startCamera" disable="true" />
Using the Canny edge detection
If we selected the Canny checkbox we can perform the method doCanny.
if (this.canny.isSelected()){
frame = this.doCanny(frame);
}
doCanny is a method that we define to execute the edge detection. First, we convert the image into a grayscale one
and blur it with a filter of kernel size 3:
Imgproc.cvtColor(frame, grayImage, Imgproc.COLOR_BGR2GRAY);

Imgproc.blur(grayImage, detectedEdges, new Size(3, 3));
Second, we apply the OpenCV function Canny:
Imgproc.Canny(detectedEdges, detectedEdges, this.threshold.getValue(), this.threshold.

getValue() * 3, 3, false);
7.6. Using the Canny edge detection 45

where the arguments are:

detectedEdges: Source image, grayscale
detectedEdges: Output of the detector (can be the same as the input)
this.threshold.getValue(): The value entered by the user moving the Slider
this.threshold.getValue() * 3: Set in the program as three times the lower threshold (follow-
ing Cannys recommendation)
3: The size of the Sobel kernel to be used internally
false: a flag, indicating whether to use a more accurate calculation of the magnitude gradient.
Then we fill a dest image with zeros (meaning the image is completely black).
Mat dest = new Mat();

Core.add(dest, Scalar.all(0), dest);
Finally, we will use the function copyTo to map only the areas of the image that are identified as edges (on a black
background).
frame.copyTo(dest, detectedEdges);
copyTo copies the src image onto dest. However, it will only copy the pixels in the locations where they have
non-zero values.
Canny Result
Using the Background Removal
If we selected the background removal checkbox we can perform the method doBackgroundRemoval
else if (this.dilateErode.isSelected())
{
frame = this.doBackgroundRemoval(frame);
}

doBackgroundRemoval is a method that we define to execute the background removal.

First we need to convert the current frame in HSV:
hsvImg.create(frame.size(), CvType.CV_8U);
Imgproc.cvtColor(frame, hsvImg, Imgproc.COLOR_BGR2HSV);
Now let's split the three channels of the image:
Core.split(hsvImg, hsvPlanes);
Calculate the Hue component mean value:
Imgproc.calcHist(hue, new MatOfInt(0), new Mat(), hist_hue, histSize, new

MatOfFloat(0, 179));
for (int h = 0; h < 180; h++)

average += (hist_hue.get(h, 0)[0] * h);
average = average / hsvImg.size().height / hsvImg.size().width;
If the background is uniform and fills most of the frame, its value should be close to mean just calculated. Then we
can use the mean as the threshold to separate the background from the foreground, depending on the invert checkbox
we need to perform a back(fore)ground removal:
if (this.inverse.isSelected())
Imgproc.threshold(hsvPlanes.get(0), thresholdImg, threshValue, 179.0, Imgproc.
THRESH_BINARY_INV);
else
Imgproc.threshold(hsvPlanes.get(0), thresholdImg, threshValue, 179.0, Imgproc.
THRESH_BINARY);
Now we apply a low pass filter (blur) with a 5x5 kernel mask to enhance the result:
Imgproc.blur(thresholdImg, thresholdImg, new Size(5, 5));
Finally apply the dilatation then the erosion (closing) to the image:
Imgproc.dilate(thresholdImg, thresholdImg, new Mat(), new Point(-1, -1), 1);

Imgproc.erode(thresholdImg, thresholdImg, new Mat(), new Point(-1, -1), 3);
The functions take these parameters:

thresholdImg input image;
thresholdImg output image of the same size and type as thresholdImg;
new Mat() a kernel;
new Point(-1, -1) position of the anchor within the element; default value ( -1, -1 ) means that the
anchor is at the element center.
6 number of times the operation is applied.
After the closing we need to do a new binary threshold:
Imgproc.threshold(thresholdImg, thresholdImg, threshValue, 179.0, Imgproc.THRESH_

BINARY);
At last, we can apply the image weve just obtained as a mask to the original frame:
Mat foreground = new Mat(frame.size(), CvType.CV_8UC3, new Scalar(255, 255, 255));

frame.copyTo(foreground, thresholdImg);
7.8. Using the Background Removal 47

Background Removal Result

CHAPTER 8
Object Detection
Goal
In this tutorial we are going to identify and track one or more tennis balls. It performs the detection of the tennis balls
upon a webcam video stream by using the color range of the balls, erosion and dilation, and the findContours method.
Morphological Image Processing
Is a collection of non-linear operations related to the morphology of features in an image. The morphological oper-
ations rely only on the relative ordering of pixel values and not on their numerical values. Some of the fundamental
morphological operations are dilation and erosion. Dilation causes objects to dilate or grow in size adding pixels to
the boundaries of objects in an image and therefore the holes within different regions become smaller. The dilation
allows, for example, to join parts of an object that appear separated. Erosion causes objects to shrink by stripping away
layers of pixels from the boundaries of objects in an image and therefore the holes within different regions become
larger. The erosion can be used to remove noise or small errors from the image due to the scanning process. The
opening is a compound operation that consist in an erosion followed by a dilation using the same structuring element
for both operations. This operation removes small objects from the foreground of an image and can be used to find
things into which a specific structuring element can fit. The opening can open up a gap between objects connected by
a thin bridge of pixels. Any regions that have survived the erosion are restored to their original size by the dilation.
49

Insert 3 groups of sliders to control the quantity of HSV (Hue, Saturation and Value) of the image.
Capture and process the image from the web cam removing noise in order to facilitate the object recognition.
Finally using morphological operator such as erosion and dilation we can identify the objects using the con-
tornous obtained after the image processing.
Getting Started
Lets create a new JavaFX project. In Scene Builder we set the window elements so that we have a Border Pane with:
on RIGHT CENTER we can add a VBox. In this one we are going to need 6 sliders, the first couple will control
hue, the next one saturation and finally brightness, with these sliders is possible to change the values of the HSV
image.
<Label text="Hue Start" />

<Slider fx:id="hueStart" min="0" max="180" value="20" blockIncrement="1" /
>
<Label text="Hue Stop" />

<Slider fx:id="hueStop" min="0" max="180" value="50" blockIncrement="1" />
<Label text="Saturation Start" />
<Slider fx:id="saturationStart" min="0" max="255" value="60"
blockIncrement="1" />
<Label text="Saturation Stop" />

<Slider fx:id="saturationStop" min="0" max="255" value="200"
blockIncrement="1" />
<Label text="Value Start" />

<Slider fx:id="valueStart" min="0" max="255" value="50" blockIncrement="1
" />
<Label text="Value Stop" />

<Slider fx:id="valueStop" min="0" max="255" value="255" blockIncrement="1
" />
in the CENTER. we are going to put three ImageViews, the first one shows normal image from the web cam
stream, the second one will show mask image and the last one will show morph image. The HBox is used to
normal image and VBox to put the other ones.
<HBox alignment="CENTER" spacing="5">

<padding>
<Insets right="10" left="10" />
</padding>
<VBox alignment="CENTER" spacing="5">
<ImageView fx:id="maskImage" />
<ImageView fx:id="morphImage" />
</VBox>
</HBox>
on the BOTTOM we can add the usual button to start/stop the stream and the current values HSV selected with
the sliders.
50 Chapter 8. Object Detection

<Button fx:id="cameraButton" alignment="center" text="Start camera"

onAction="#startCamera" />
<Separator />
<Label fx:id="hsvCurrentValues" />
Image processing
In order to use the morphological operators and obtain good results we need to process the image and remove the
noise, change the image to HSV allows to get the contours easily.
Remove noise We can remove some noise of the image using the method blur of the Imgproc class and then
apply a conversion to HSV in order to facilitate the process of object recognition.
Mat blurredImage = new Mat();

Mat hsvImage = new Mat();
Mat mask = new Mat();
Mat morphOutput = new Mat();
// remove some noise

Imgproc.blur(frame, blurredImage, new Size(7, 7));
// convert the frame to HSV

Imgproc.cvtColor(blurredImage, hsvImage, Imgproc.COLOR_BGR2HSV);
8.5. Image processing 51

Values of HSV image With the sliders we can modify the values of the HSV Image, the image will be
updtated in real time, that allows to increase or decrease the capactity to recognize object into the image. .
// get thresholding values from the UI

// remember: H ranges 0-180, S and V range 0-255
Scalar minValues = new Scalar(this.hueStart.getValue(), this.saturationStart.
getValue(),
this.valueStart.getValue());
Scalar maxValues = new Scalar(this.hueStop.getValue(), this.saturationStop.
getValue(),
this.valueStop.getValue());
// show the current selected HSV range

String valuesToPrint = "Hue range: " + minValues.val[0] + "-" + maxValues.
val[0]
+ "\tSaturation range: " + minValues.val[1] + "-" + maxValues.val[1] +

"\tValue range: "
+ minValues.val[2] + "-" + maxValues.val[2];

this.onFXThread(this.hsvValuesProp, valuesToPrint);
// threshold HSV image to select tennis balls

Core.inRange(hsvImage, minValues, maxValues, mask);
// show the partial output
this.onFXThread(maskProp, this.mat2Image(mask));
Morphological Operators
First of all we need to define the two matrices of morphological operator dilation and erosion, then with the methods
erode and dilate of the class Imgproc we process the image twice in each operation, the result is the matrix morphOut-
put that will be the partial output.
// morphological operators
// dilate with large element, erode with small ones
Mat dilateElement = Imgproc.getStructuringElement(Imgproc.MORPH_RECT, new
Size(24, 24));
Mat erodeElement = Imgproc.getStructuringElement(Imgproc.MORPH_RECT, new

Size(12, 12));
Imgproc.erode(mask, morphOutput, erodeElement);

Imgproc.erode(mask, morphOutput, erodeElement);
Imgproc.dilate(mask, morphOutput, dilateElement);

Imgproc.dilate(mask, morphOutput, dilateElement);
// show the partial output

this.onFXThread(this.morphProp, this.mat2Image(morphOutput));
Object tracking
With the partial output obtained before we can use the method findContours of the class Imgpoc to get a matrix with
the mapping of the objects recognized, then we draw the contours of these objects.

// init
List<MatOfPoint> contours = new ArrayList<>();
Mat hierarchy = new Mat();
// find contours
Imgproc.findContours(maskedImage, contours, hierarchy, Imgproc.RETR_CCOMP,
Imgproc.CHAIN_APPROX_SIMPLE);
// if any contour exist...

if (hierarchy.size().height > 0 && hierarchy.size().width > 0)
{
// for each contour, display it in blue
for (int idx = 0; idx >= 0; idx = (int) hierarchy.get(0, idx)[0])
{
Imgproc.drawContours(frame, contours, idx, new Scalar(250, 0,
0));
}
}
Finally we can get this results:
8.7. Object tracking 53


CHAPTER 9
Camera Calibration
Warning: This tutorial is not updated to OpenCV 3.0.
Goal
The goal of this tutorial is to learn how to calibrate a camera given a set of chessboard images.
What is the camera calibration?
The camera calibration is the process with which we can obtain the camera parameters such as intrinsic and extrinsic
parameters, distortions and so on. The calibration of the camera is often necessary when the alignment between the
lens and the optic sensors chip is not correct; the effect produced by this wrong alignment is usually more visible in
low quality cameras.
Calibration Pattern
As we said earlier we are going to need some sort of pattern that the program can recognize in order to make the
calibration work. The pattern that we are going to use is a chessboard image.
55
The reason why we use this image is because there are some OpenCV functions that can recognize this pattern and
draw a scheme which highlights the intersections between each block. To make the calibration work you need to print
the chessboard image and show it to the cam; it is important to maintain the sheet still, better if stick to a surface. In
order to make a good calibration, we need to have about 20 samples of the pattern taken from different angles and
distances.

Create some TextEdit field to give some inputs to our program
Recognize the pattern using some OpenCV functions
Calibrate and show the video stream.
Getting Started
Create a new JavaFX project (e.g. CameraCalibration) with the usual OpenCV user library. Open Scene Builder
and add a Border Pane with:
on TOP we need to have the possibility to set the number of samples for the calibration, the number of horizontal
corners we have in the test image, the number of vertical corners we have in the test image and a button to update
this data. To make things cleaner lets put all these elements inside a HBox.
56 Chapter 9. Camera Calibration

<HBox alignment="CENTER" spacing="10">
Lets also add some labels before each text fields. Each text field is going to need an id, and lets put a standard value
for them already.
<Label text="Boards #" />

<TextField fx:id="numBoards" text="20" maxWidth="50" />
<Label text="Horizontal corners #" />
<TextField fx:id="numHorCorners" text="9" maxWidth="50" />
<Label text="Vertical corners #" />
<TextField fx:id="numVertCorners" text="6" maxWidth="50" />
For the button instead, set the id and a method for the onAction field:
<Button fx:id="applyButton" alignment="center" text="Apply" onAction="#updateSettings

" />
on the LEFT add an ImageView inside a VBox for the normal cam stream; set an id for it.
on the RIGHT add an ImageView inside a VBox for the calibrated cam stream; set an id for it.
in the BOTTOM add a start/stop cam stream button and a snapshot button inside a HBox; set an id and a action
method for each one.
<Button fx:id="cameraButton" alignment="center" text="Start camera" onAction="

#startCamera" disable="true" />
<Button fx:id="snapshotButton" alignment="center" text="Take snapshot" onAction="

#takeSnapshot" disable="true" />
Your GUI will look something like this:
9.5. Getting Started 57

Pattern Recognition
The calibration process consists on showing to the cam the chessboard pattern from different angles, depth and points
of view. For each recognized pattern we need to track:
some reference systems 3D point where the chessboard is located (lets assume that the Z axe is always 0):
for (int j = 0; j < numSquares; j++)

obj.push_back(new MatOfPoint3f(new Point3(j / this.numCornersHor, j %
this.numCornersVer, 0.0f)));
the images 2D points (operation made by OpenCV with findChessboardCorners):
boolean found = Calib3d.findChessboardCorners(grayImage, boardSize,

imageCorners, Calib3d.CALIB_CB_ADAPTIVE_THRESH + Calib3d.CALIB_CB_
NORMALIZE_IMAGE + Calib3d.CALIB_CB_FAST_CHECK);
The findChessboardCorners function attempts to determine whether the input image is a view of the chessboard
pattern and locate the internal chessboard corners. Its parameters are:
image Source chessboard view. It must be an 8-bit grayscale or color image.
patternSize Number of inner corners per a chessboard row and column
corners Output array of detected corners.
flags Various operation flags that can be zero or a combination of the following values:
CV_CALIB_CB_ADAPTIVE_THRESH Use adaptive thresholding to convert the image to black and
white, rather than a fixed threshold level (computed from the average image brightness).
CV_CALIB_CB_NORMALIZE_IMAGE Normalize the image gamma with equalizeHist before ap-
plying fixed or adaptive thresholding.
CV_CALIB_CB_FILTER_QUADS Use additional criteria (like contour area, perimeter, square-like
shape) to filter out false quads extracted at the contour retrieval stage.
CALIB_CB_FAST_CHECK Run a fast check on the image that looks for chessboard corners, and
shortcut the call if none is found. This can drastically speed up the call in the degenerate condition
when no chessboard is observed.
Warning: Before doing the findChessboardCorners convert the image to grayscale and save the board
size into a Size variable:
Imgproc.cvtColor(frame, grayImage, Imgproc.COLOR_BGR2GRAY);
Size boardSize = new Size(this.numCornersHor, this.numCornersVer);
If the recognition went well found should be true.

For square images the positions of the corners are only approximate. We may improve this by calling the
cornerSubPix function. It will produce better calibration result.
TermCriteria term = new TermCriteria(TermCriteria.EPS | TermCriteria.MAX_ITER, 30, 0.

1);
Imgproc.cornerSubPix(grayImage, imageCorners, new Size(11, 11), new Size(-1, -1),

term);
We can now highlight the found points on stream:

Calib3d.drawChessboardCorners(frame, boardSize, imageCorners, found);
The function draws individual chessboard corners detected either as red circles if the board was not found, or as
colored corners connected with lines if the board was found.
Its parameters are:
image Destination image. It must be an 8-bit color image.
patternSize Number of inner corners per a chessboard row and column.
corners Array of detected corners, the output of findChessboardCorners.
patternWasFound Parameter indicating whether the complete board was found or not. The return value of
findChessboardCorners should be passed here.
Now we can activate the Snapshot button to save the data.
this.snapshotButton.setDisable(false);
9.6. Pattern Recognition 59

We should take the set number of snapshots from different angles and depth, in order to make the calibration.
Note: We dont actually save the image but just the data we need.
Saving Data
By clicking on the snapshot button we call the takeSnapshot method. Here we need to save the data (2D and 3D
points) if we did not make enough sample:
this.imagePoints.add(imageCorners);
this.objectPoints.add(obj);
this.successes++;
Otherwise we can calibrate the camera.
Camera Calibration
For the camera calibration we should create initiate some needed variable and then call the actual calibration function:

List<Mat> rvecs = new ArrayList<>();

List<Mat> tvecs = new ArrayList<>();
intrinsic.put(0, 0, 1);
intrinsic.put(1, 1, 1);
Calib3d.calibrateCamera(objectPoints, imagePoints, savedImage.size(), intrinsic,

distCoeffs, rvecs, tvecs);
The calibrateCamera function estimates the intrinsic camera parameters and extrinsic parameters for each of the
views. The algorithm is based on [Zhang2000] and [BouguetMCT]. The coordinates of 3D object points and their
corresponding 2D projections in each view must be specified. Its parameters are:
objectPoints In the new interface it is a vector of vectors of calibration pattern points in the calibration pattern
coordinate space. The outer vector contains as many elements as the number of the pattern views. The points
are 3D, but since they are in a pattern coordinate system, then, if the rig is planar, it may make sense to put the
model to a XY coordinate plane so that Z-coordinate of each input object point is 0.
imagePoints It is a vector of vectors of the projections of calibration pattern points.
imageSize Size of the image used only to initialize the intrinsic camera matrix.
cameraMatrix Output 3x3 floating-point camera matrix A = |fx 0 cx| |0 fy cy| |0 0 1|. If
CV_CALIB_USE_INTRINSIC_GUESS and/or CV_CALIB_FIX_ASPECT_RATIO are specified, some or
all of fx, fy, cx, cy must be initialized before calling the function.
distCoeffs Output vector of distortion coefficients of 4, 5, or 8 elements.
rvecs Output vector of rotation vectors estimated for each pattern view. That is, each k-th rotation vector together
with the corresponding k-th translation vector.
tvecs Output vector of translation vectors estimated for each pattern view.
We ran calibration and got cameras matrix with the distortion coefficients we may want to correct the image using
undistort function:
if (this.isCalibrated)
{
// prepare the undistored image
Mat undistored = new Mat();
Imgproc.undistort(frame, undistored, intrinsic, distCoeffs);
undistoredImage = mat2Image(undistored);
}
The undistort function transforms an image to compensate radial and tangential lens distortion.
9.8. Camera Calibration 61


CHAPTER 10
Indices and tables
genindex
modindex
search
63

Opencv Java Tutorials

Enviado por

Dados do documento

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

Opencv Java Tutorials

Enviado por

Direitos autorais:

Formatos disponíveis

OpenCV Java Tutorials Documentation

Luigi De Russis, Alberto Sacco

Oct 26, 2017

1 Installing OpenCV for Java 3

2 Your First Java Application with OpenCV 9

3 Your First JavaFX Application with OpenCV 13

6 Face Detection and Tracking 37

10 Indices and tables 63

Installing OpenCV for Java

Introduction to OpenCV for Java

Install the latest Java version

Install the latest Eclipse version

Install OpenCV 3.x under Windows

Install OpenCV 3.x under macOS

Install OpenCV 3.x under Linux

4 Chapter 1. Installing OpenCV for Java

1.6. Install OpenCV 3.x under Linux 5

insert the alternative Java include path (e.g., /usr/lib/jvm/java-1.8.0/include/linux)

6 Chapter 1. Installing OpenCV for Java

Set up OpenCV for Java in Eclipse

1.7. Set up OpenCV for Java in Eclipse 7

Set up OpenCV for Java in other IDEs (experimental)

8 Chapter 1. Installing OpenCV for Java

Your First Java Application with OpenCV

A Java application with OpenCV

What we will do in this tutorial

In this guide, we will:

Create a New Project

Add a User Library

10 Chapter 2. Your First Java Application with OpenCV

Create a simple application

public class HelloCV {

Then we can define a new Mat.

Mat mat = Mat.eye(3, 3, CvType.CV_8UC1);

2.5. Create a simple application 11

The whole source code is available on GitHub.

12 Chapter 2. Your First Java Application with OpenCV

Your First JavaFX Application with OpenCV

A JavaFX application with OpenCV

What we will do in this tutorial

In this guide, we will:

Your First Application in JavaFX

Installing e(fx)clipse plugin and Scene Builder

In Eclipse, install the e(fx)clipse plugin, by following the guide at http://www.eclipse.org/efxclipse/install.

Choose a name for your project and click Next.

14 Chapter 3. Your First JavaFX Application with OpenCV

3.4. Installing e(fx)clipse plugin and Scene Builder 15

Working with Scene Builder

16 Chapter 3. Your First JavaFX Application with OpenCV

3.5. Working with Scene Builder 17

18 Chapter 3. Your First JavaFX Application with OpenCV

3.5. Working with Scene Builder 19

Key Concepts in JavaFX

public void start(Stage primaryStage)

FXMLLoader loader = new FXMLLoader(getClass().getResource("FXHelloCV.fxml"));

Managing GUI Interactions With the Controller Class

<Button fx:id="button" mnemonicParsing="false" onAction="#startCamera" text="Start

private VideoCapture capture = new VideoCapture();

20 Chapter 3. Your First JavaFX Application with OpenCV

Runnable frameGrabber = new Runnable() { ... }

Mat frame = new Mat();

Imgproc.cvtColor(frame, frame, Imgproc.COLOR_BGR2GRAY);

As you can see, cvtColor takes as arguments:

MatOfByte buffer = new MatOfByte();