Introduction | Lab 1: Migrate OpenCV to xfOpenCV | Lab 2: Build the SDSoC Acceleration Project |
This lab helps you understand how to take an OpenCV program written for a CPU and migrate it to the xfOpenCV library, making use of hardware accelerated functions on the reVision platform.
📌 NOTE: Not all OpenCV functions are replicated in the xfOpenCV library, and some functions were written to simplify certain procedures.**
This tutorial includes source files and a test image for your use in the source.zip file. Download and extract the file to a location of your choosing. This includes the following files and folders:
colordetect.cpp
- C++ source file for you to edit in this tutorial.rock_landscape.jpg
- Test image for your use in testing the application.solution
folder - Source files for a completed conversion for your reference.
The source code is a very simple color detector for Blue, Green, and Orange on an input image of 1920x1080, such as the test image shown below. With the provided code and image, you will look at how to migrate the OpenCV functions and application flow using the xfOpenCV functions based on the ZCU102 reVISION platform. You should learn the following from this tutorial:
- Migrating OpenCV functions and flows to the xfOpenCV functions and flows.
- Identify functions for hardware acceleration.
- Setup the build environment for compilation of xfOpenCV code.
- Modifying code to be used on the ZCU102 reVISION platform.
With the source code open in a code editor, you can begin to convert it to run on the ZCU102 board.
-
After downloading and extracting the source files, open the
colordetect.cpp
source file in a code editor of your choice.The
colordetect.cpp
file contains the thecolordetect()
function as described above, and themain()
function for the application. In this tutorial you will create a hardware accelerated version of the colordetect function,colordetect_accel()
, in a separate file and header file. -
At the top of the
colordetect.cpp
file, add the xfOpenCV include statements needed to support the reVISION platform. Replace the following three lines right below#include <iostream>
:#include <opencv2/opencv.hpp> #include <opencv2/highgui.hpp> #include <opencv2/imgproc.hpp>
With the following lines of code:
#if __SDSCC__ #undef __ARM_NEON__ #undef __ARM_NEON #include "opencv2/highgui/highgui.hpp" #include "opencv2/imgproc/imgproc.hpp" #include <opencv2/core/core.hpp> #define __ARM_NEON__ #define __ARM_NEON #else #include "opencv2/highgui/highgui.hpp" #include "opencv2/imgproc/imgproc.hpp" #include <opencv2/core/core.hpp> #endif
The first part related to
ARM_NEON
is to handle compilation of the OpenCV includes/libraries for the Arm processor on the ZCU102. Below theARM_NEON
section is where you are defining the xfOpenCV includes to tool is going to use. Including these files will insure the compiler has access to all functions and datatypes needed.📌 NOTE:
__SDSCC__
is automatically set by the sds++ compiler. -
In the same code editor you are using to edit
colordetect.cpp
, create two new files:colordetect_accel.cpp
andcolordetect_accel.hpp
. -
In the
colordetect_accel.cpp
file define the function signature for the new hardware accelerated function. Type the following at the top of the file:void colordetect_accel() {}
-
In the
colordetect_accel.hpp
header file add the following lines of code:#include "hls_stream.h" #include "ap_int.h" #include "common/xf_common.h" #include "common/xf_utility.h" #include "imgproc/xf_inrange.hpp" #include "imgproc/xf_rgb2hsv.hpp" #include "imgproc/xf_erosion.hpp" #include "imgproc/xf_dilation.hpp"
-
To make things more readable, and to support templating some of the xfOpenCV function calls and object instantiations, you will add the following macros to the
colordetect_accel.hpp
file, after the prior#include
statements:#define MAXCOLORS 3 #define WIDTH 1920 #define HEIGHT 1080
-
Save all the files, but keep them open for further editing.
-
When migrating OpenCV code to a hardware accelerated platform you should identify all the objects being used in the conversion code. Look in the
main()
andcolordetect()
functions and identify the followingcv::Mat
objects:in_img
,out_img
,mask1
,mask2
,mask3
,_imgrange
, and_imghsv
.When running on a CPU, these objects allow for dynamic allocation and deallocation of memory as needed. However, when targeting hardware acceleration, memory requirements need to be determined at compile time. To solve this, the xfOpenCV library provides the
xf::Mat
object which facilates memory allocatation in the FPGA device. Anycv::Mat
object used by a hardware accelerated function, whether user defined or coming from the xfOpenCV library, should be replaced with axf::Mat
equivalent object.In the
main()
function, you can continue to usecv::Mat
for input/output to the Linux system, because the code in this function is running on the CPU. However, you must convert the data from thecv::Mat
object(s) to thexf::Mat
object(s) for the accelerated function. -
In the
main()
function, create input and outputxf::Mat
objects for the accelerated function, after the following if-statement:if (!in_img.data) { return -1; }
Add the following code:
xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> xfIn(in_img.rows, in_img.cols); xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> xfOut(in_img.rows, in_img.cols);
These statements create templated
xf::Mat
objects for the input and output of the hardware accelerated function. General information related to the templated object can be found in the Xilinx OpenCV User Guide (UG1233).The
xfIn
andxfOut
objects you created have the following attributes:XF_8UC4
- Defines an 8-bit unsigned-char 4-channel datatype.XF_8UC1
- Defines an 8-bit unsigned-char 1-channel datatype.The input
xf:Mat
is declared as a 4-channel object to allow thecolorthresholding()
function to operate on the three color components (channels) of the image. The output object does not require three channels.HEIGHT
/WIDTH
- Defines the size of the input image. In thecolordetect_accel.hpp
header file, you have defined the HEIGHT and WIDTH macros for the image as 1920x1080. Alternatively, you could statically define the maximum image size, and process variable image sizes using function parameters.XF_NPPC1
- This template parameter tells the compiler that the number of pixels processed per clock cycle is one.
With the main()
function creating the xf::Mat
objects needed as inputs and outputs to the accelerated function, you are ready to write the accelerated function. The hardware accelerated colordetect_accel()
function will duplicate the colordetect()
function using the xfOpenCV functions that are accelerated on hardware.
With this accelerated function, you need to pass everything by reference, and not by value. Passing by reference lets these parameters act as either streaming inputs or outputs to the function, but not as both at the same time.
The main()
function will call both the colordetect
function and the accelerated function, colordetect_accel
, so that you can compare the performance between Arm processing and FPGA acceleration.
-
In the
colordetect_accel.cpp
file, edit the function signature to add parameters specific to thexf::Mat
objects you created earlier:void colordetect_accel( xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> &_src, xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> &_dst) {}
Like any other datatype, you must match the template values with what you pass into the function. If you mistype a parameter, the compiler will error out.
-
Since the
colordetect_accel()
function uses the HSV colorspace, you also need to pass the proper thresholds. AddnLowThreshold
andnHighThreshold
to the function signature as shown below:void colordetect_accel( xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> &_src, xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> &_dst, unsigned char nLowThresh[3][3], unsigned char nHighThresh[3][3]) {}
Using acceleration with programmable logic, you need definitive array sizes at compile time, because the code defines an actual circuit with a fixed size. Here, you know that the threshold data is a 2D array of 3 elements in each dimension, and you will define it as such.
-
Since
xf::Mat
objects are not dynamic, you will need to declare thexf::Mat
objects used to pass the data stream through the function. In the body of the function add the following objects:xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> _hsv; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _range; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _erode; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _dilate1; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _dilate2;
-
Now you will begin defining the operations of the
colordetect_accel
function. In the originalcolordetect()
function, the first operation is the color conversion (cv::cvtColor
) of the input image from the RGB color space to HSV. Looking through the Xilinx OpenCV User Guide (UG1233) you will not find acvtColor
function. However, in the "Color Detection" section you can see that the color detection algorithm uses four hardware functions from the xfOpenCV library, beginning with thexf::RGB2HSV
function. You will use that function in thecolordetect_accel
function to perform the color conversion.Add the
xf::RGB2HSV
function with the proper parameters tocolordetect_accel
after the otherxf::Mat
object definitions as follows:xf::RGB2HSV<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1>(_src, _hsv);
As you have probably noticed, functions and objects need to be templated with specific information. This is key to making sure the right datatypes/sizes are being used. The function template must match the
xf::Mat
templates, or you will hit an assert at runtime. -
Next in the
colordetect()
function code is thresholding and combining the resulting masks together. Again, in xfOpenCV, there is nocv::inRange
function; however, thexf::colorthresholding
is created for just this purpose. Referring to the "Color Thresholding" section of Xilinx OpenCV User Guide (UG1233), there are a few things to notice:- The template parameter MAXCOLORS
low_thresh
andhigh_thresh
are one-dimensional arrays- The source template has to be
XF_8UC4
.
For this function, the input
xf::Mat
object needs to beXF_8UC4
because you are working with 3-channel data; theMAXCOLORS
determines how many colors you're thresholding, and in the case of this code you are using 3. Lastly, thelow_thresh
and thehigh_thresh
are needed to be a 1D array. This is because depending on how big of aMAXCOLORS
you are using, you can tailor the array for the appropriate low and high values.There are two ways to convert a 2D array to one dimension: at instantiation time, or via for loops. For simplicity, you will use the for loop method here. Before the
xf::RGB2HSV
function that you just added, insert the following code:unsigned char low_thresh[9]; unsigned char high_thresh[9]; for(int i = 0; i < 3; ++i) for(int j = 0; j < 3; ++j) { low_thresh[i*3+j] = nLowThresh[i][j]; high_thresh[i*3+j] = nHighThresh[i][j]; }
Your
colordetect_accel
function should currently look something like this:void colordetect_accel( xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> &_src, xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> &_dst, unsigned char nLowThresh[3][3], unsigned char nHighThresh[3][3]) { //Define Matrix objects for operations xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> _hsv; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _range; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _erode; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _dilate1; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _dilate2; //Convert 2d Array to 1d unsigned char low_thresh[9]; unsigned char high_thresh[9]; for(int i = 0; i < 3; ++i) for(int j = 0; j < 3; ++j) { low_thresh[i*3+j] = nLowThresh[i][j]; high_thresh[i*3+j] = nHighThresh[i][j]; } //Begin color conversion... xf::RGB2HSV<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1>(_src, _hsv); }
Now that you have one-dimensional arrays for the thresholds, you need to add the
xf::colorthresholding
function. Add the following code after thexf::RGB2HSV
function:xf::colorthresholding<XF_8UC4, XF_8UC1, MAXCOLORS, HEIGHT, WIDTH, XF_NPPC1>(_hsv, _range, low_thresh, high_thresh);
With this function, you are converting the input of
XF_8UC4
dataype to the output ofXF_8UC1
(grayscale) datatype, and you are thresholding forMAXCOLORS
of 3. In comparison to the originalcolordetect()
code, you are essentially doing the same set ofcv::inRange
and bitwiseOR
operations for the mask. In more complex code where more or less colors are being detected, this will be able to scale, just adjustMAXCOLORS
and thelow_thresh
andhigh_thresh
arrays as needed. -
With the
cv::erode
andcv::dilate
functions, you have to determine what morphological type is being used, and what pixel area to take into account. In this function,xf::erode
andxf::dilate
useMORPH_RECT
and a 3x3 pixel.To add these functions similar to the OpenCV version, insert the following lines after the
colorthresholding
function:xf::erode<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_range, _erode); xf::dilate<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_erode, _dilate1); xf::dilate<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_dilate1, _dilate2); xf::erode<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_dilate2, _dst);
Notice that there is a new template parameter:
XF_BORDER_CONSTANT
. In thecv::erode
andcv::dilate
this parameter defines the border size, which has a default value ofBORDER_CONSTANT
. In xfOpenCV, you need to know by compile time if the border of an image is being adjusted.Finally, you need to make sure that all the template values in this function match the data input and output. Any variations will either cause compilation errors, or runtime errors.
Your completed function should look like this:
void colordetect_accel( xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> &_src, xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> &_dst, unsigned char nLowThresh[3][3], unsigned char nHighThresh[3][3]) { //Define Matrix objects for operations xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> _hsv; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _range; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _erode; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _dilate1; xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> _dilate2; //Convert 2d Array to 1d unsigned char low_thresh[9]; unsigned char high_thresh[9]; for(int i = 0; i < 3; ++i) for(int j = 0; j < 3; ++j) { low_thresh[i*3+j] = nLowThresh[i][j]; high_thresh[i*3+j] = nHighThresh[i][j]; } //Begin color conversion... xf::RGB2HSV<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1>(_src, _hsv); xf::colorthresholding<XF_8UC4, XF_8UC1, MAXCOLORS, HEIGHT, WIDTH, XF_NPPC1>(_hsv, _range, low_thresh, high_thresh); xf::erode<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_range, _erode); xf::dilate<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_erode, _dilate1); xf::dilate<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_dilate1, _dilate2); xf::erode<XF_BORDER_CONSTANT, XF_8UC1, HEIGHT, WIDTH, XF_NPPC1>(_dilate2, _dst); }
-
With the accelerator completed, you can add the function definition to the
colordetect_accel.hpp
by adding this line after the#define
:void colordetect_accel( xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> &_src, xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> &_dst, unsigned char nLowThresh[3][3], unsigned char nHighThresh[3][3]);
-
Include the
colordetect_accel.hpp
header file in thecolordetect_accel.cpp
code file. Insert the following at the top of the file:#include "colordetect_accel.hpp"
This completes the definition of the colordetect_accel
function and header file.
It is time to add colordetect_accel
to the main
function.
-
You will need to include the
colordetect_accel.hpp
to thecolordetect.cpp
. Add this at the top of the file, after the various OpenCV include statements:#include "colordetect_accel.hpp"
Before calling the
colordetect_accel
function, you need to copy the input image,in_img
, to thexfIn
input object. Use anxf::Mat
member function calledcopyTo
which will copy the data from thecv::Mat
object to thexf::Mat
object. Do this after the instantiation ofxfIn
. -
Add the following line into
main()
after the creation of thexfIn
object:xfIn.copyTo(in_img.data);
-
Now you can instantiate the hardware accelerated function below the
colordetect()
function:colordetect_accel(xfIn,xfOut,nLowThresh,nHighThresh);
-
In order to visually compare the outputs of processing on the ARM and programmable logic, you can copy the data from
xfOut
to a separatecv::Mat
object using thecopyFrom
method. Add the following lines right after thecolordetect_accel
function:cv::Mat accel_out(height, width, CV_8U); accel_out.data = xfOut.copyFrom();
-
Save the output image using the
cv::imwrite
function. Add the following right after the othercv::imwrite
functions:cv::imwrite("accel_out.png",accel_out);
Your final changes should make the
main()
function look similar to the following:int main(int argc, char **argv) { //Create the input/output cv::Mat objects cv::Mat in_img, out_img; cv::Mat imghsv, imgrange, imgerode, imgdilate1, imgdilate2; // Define the low and high thresholds // Want to grab 3 colors (Blue, Green, Orange) for teh input image unsigned char nLowThresh[3][3] = { { 110, 150, 20 }, // Lower boundary for Blue { 38, 0, 20 }, // Lower boundary for Green { 10, 150, 20 } }; // Lower boundary for Orange unsigned char nHighThresh[3][3] = { { 130, 255, 255 }, // Upper boundary for Blue { 75, 125, 255 }, // Upper boundary for Green { 25, 255, 255 } }; // Upper boundary for Orange // Read in the commandline for an image in_img = cv::imread(argv[1], 1); if (!in_img.data) { return -1; } // Create input output images for the hardware function xf::Mat<XF_8UC4, HEIGHT, WIDTH, XF_NPPC1> xfIn(in_img.rows, in_img.cols); xfIn.copyTo(in_img.data); xf::Mat<XF_8UC1, HEIGHT, WIDTH, XF_NPPC1> xfOut(in_img.rows, in_img.cols); // Create the output image to match the input image (CV_8U) int height = in_img.rows; int width = in_img.cols; out_img.create(height, width, CV_8U); // Run the input and thresholds into the colordect function colordetect(in_img, out_img, nLowThresh, nHighThresh); // Call the hardware accelerated function colordetect_accel(xfIn,xfOut,nLowThresh,nHighThresh); //Copy xfOut image to cv::Mat object cv::Mat accel_out(height, width, CV_8U); accel_out.data = xfOut.copyFrom(); // Write out the input image and the output image cv::imwrite("output.png", out_img); cv::imwrite("input.png", in_img); cv::imwrite("accel_out.png",accel_out); return 0; }
-
Save and close your files.
By looking at the OpenCV objects being used, and understanding how xfOpenCV templating works, you can easily identify what objects should be migrated from cv::Mat
to xf::Mat
, and how general functions like xf::erode
, and complex functions like xf::RGB2HSV
and xf::colorthresholding
can be used to recreate the design and flow of the original OpenCV code.
Look at the original colordetect()
and the new colordetect_accel()
and notice that the flow is the same, but the functions used are different.
In this lab, you have taken an OpenCV function, colordetect()
, and converted it to a hardware accelerated function, colordetect_accel()
, using xfOpenCV functions. You have:
- Identified and modified OpenCV code to utilize the optimized xfOpenCV
- Transitioned the dynamic cv::Mat object to a templated xf:Mat object
- Call the templated xfOpenCV functions
Copyright© 2018 Xilinx