Escolar Documentos
Profissional Documentos
Cultura Documentos
highly scalable and massively parallel codes that run according to the GPU structures,
specifically NVIDIAs. This means that a code can run in thousands of threads and hundreds of
cores at the same time [3]. CUDA is advantageous for data parallelism because it is intended for
process distribution corresponding with the GPU architecture. Thanks to this, and the Basic
Linear Algebra Subprograms (BLAS) library, covariance and transformation matrices used in
ICP and PCA can solved with ease. For further efficiency, matrices should be saved transposed in
the memory of the GPU to reduce the time spent accessing the memory while solving them [3].
The ICP and PCA Registration algorithms will be implemented while utilizing the architecture of
the GPU through CUDA. It will be possible to parallelize these algorithms by distributing their
independent processes through different threads and cores, thereby reducing their execution time.
The acceleration of the registration process may open its use to real time applications. Some
expected problems may include processes that cannot be implemented due to the GPU structure
or CUDA limitations. Nevertheless, a large workload can be divided on all of the GPUs many
threads and cores with only the knowledge of a few abstractions and C/C++. At the end, a library
of support algorithms for point cloud operations will be delivered to the scientific community.
References:
[1] B. Bellekens et al., A survey of Rigid 3D Point Cloud Registration Algorithms, in Fourth
International Conference on Ambient Computing, Applications, Services and Technologies,
Services and Technologies, Rome, Italy, 2014, pp. 8 - 13.
[2] J. Rajput and B. Limbasiya, Image Registration using combination of PCA and GPOF
Method for Multiframe Super-Resolution. International Journal of Computer Applications
120(12):1-5, June 2015.
[3] M. Garland et al., "Parallel Computing Experiences with CUDA," in IEEE Micro, vol. 28,
no. 4, pp. 13-27, July-Aug. 2008.