
k-d tree speedup (nanoflann / CUDA) #5299

Open · wants to merge 5 commits into master

Conversation

@yasamoka (Contributor) commented Jun 17, 2022

This pull request provides tested k-d tree implementations using nanoflann (CPU) and FLANN (CUDA), and adds the ability to set the maximum leaf size for any k-d tree implementation.
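
At the library level, the maximum leaf size is the knob that trades tree build time against search time; in plain nanoflann it is passed via KDTreeSingleIndexAdaptorParams. Below is a minimal, self-contained sketch following the adaptor pattern from the nanoflann examples. The Cloud struct, the point data, and the query values are illustrative only; this is not the PCL wrapper added by this pull request.

```cpp
#include <nanoflann.hpp>

#include <array>
#include <cstddef>
#include <vector>

// Minimal dataset adaptor, as in the nanoflann examples (illustrative only).
struct Cloud
{
  std::vector<std::array<float, 3>> pts;

  std::size_t kdtree_get_point_count() const { return pts.size(); }

  float kdtree_get_pt(const std::size_t idx, const std::size_t dim) const
  {
    return pts[idx][dim];
  }

  // Returning false lets nanoflann compute the bounding box itself.
  template <class BBOX>
  bool kdtree_get_bbox(BBOX&) const { return false; }
};

using KdTree = nanoflann::KDTreeSingleIndexAdaptor<
    nanoflann::L2_Simple_Adaptor<float, Cloud>,
    Cloud,
    3,            // dimensionality
    std::size_t>; // index type

int main()
{
  Cloud cloud;
  cloud.pts = {{0.f, 0.f, 0.f}, {1.f, 0.f, 0.f}, {0.f, 1.f, 0.f}};

  // Max leaf size: larger leaves build faster, smaller leaves usually
  // search faster (this is the parameter the PR makes configurable on the PCL side).
  const std::size_t max_leaf_size = 10;
  KdTree index(3, cloud, nanoflann::KDTreeSingleIndexAdaptorParams(max_leaf_size));
  index.buildIndex();

  const float query[3] = {0.1f, 0.1f, 0.1f};
  const std::size_t k = 2;
  std::vector<std::size_t> ret_indices(k);
  std::vector<float> out_sqr_dists(k);
  index.knnSearch(query, k, ret_indices.data(), out_sqr_dists.data());
  return 0;
}
```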

Benchmarks comparing FLANN (CPU), Nanoflann (CPU), and FLANN (CUDA) can be found here: https://yasamoka.github.io/pcl-knn-benchmark/

I am not sure whether there is a better way of modifying the CMake scripts to satisfy the new dependencies. If there is, I would appreciate help with that.

Regarding documentation, I placed the FLANN CUDA implementation under the kdtree module, which gives it good visibility for users of k-d trees. Should I move it to its own module (e.g. cuda/kdtree)? Is it possible to have two levels like that?

Thank you very much!

@themightyoarfish (Contributor) commented:

[image: benchmark plot]

This plot seems to show that nanoflann is slower than FLANN, but your bar graphs further down show the opposite 🤔

@yasamoka (Contributor, Author) commented:

> [image: benchmark plot]
> This plot seems to show that nanoflann is slower than FLANN, but your bar graphs further down show the opposite 🤔

The line graph you're seeing is tree build time.

The bar graphs you see below that are NN search time.

Yes, nanoflann is slower than FLANN at tree building for the same leaf size. For NN search, however, it is faster than FLANN, and its advantage grows as you move toward fewer threads and fewer search points.
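
To make the split concrete, the two phases can be timed separately with the existing PCL FLANN wrapper. A rough sketch, assuming a recent PCL where pcl::Indices is the index type (the toy cloud, query point, and k are illustrative):

```cpp
#include <pcl/kdtree/kdtree_flann.h>
#include <pcl/point_cloud.h>
#include <pcl/point_types.h>

#include <chrono>
#include <iostream>
#include <vector>

int main()
{
  // Toy cloud; the linked benchmarks use much larger inputs.
  pcl::PointCloud<pcl::PointXYZ>::Ptr cloud(new pcl::PointCloud<pcl::PointXYZ>);
  for (int i = 0; i < 10000; ++i)
    cloud->push_back(pcl::PointXYZ(static_cast<float>(i % 100), static_cast<float>(i / 100), 0.0f));

  pcl::KdTreeFLANN<pcl::PointXYZ> tree;

  const auto t0 = std::chrono::steady_clock::now();
  tree.setInputCloud(cloud); // tree build time (the line graph)
  const auto t1 = std::chrono::steady_clock::now();

  const int k = 10;
  pcl::Indices indices(k);          // std::vector<int> on older PCL versions
  std::vector<float> sqr_dists(k);
  const pcl::PointXYZ query(50.0f, 50.0f, 0.0f);
  tree.nearestKSearch(query, k, indices, sqr_dists); // NN search time (the bar graphs)
  const auto t2 = std::chrono::steady_clock::now();

  std::cout << "build:  " << std::chrono::duration<double, std::milli>(t1 - t0).count() << " ms\n"
            << "search: " << std::chrono::duration<double, std::milli>(t2 - t1).count() << " ms\n";
  return 0;
}
```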

@themightyoarfish (Contributor) commented:

This seems useful; could a maintainer take a look?

@themightyoarfish (Contributor) commented:

@mvieth @larshg Could you give some feedback here?

@xiaodong2077 commented Nov 3, 2023:

What about range search? Is nanoflann quicker than FLANN?

@mvieth (Member) commented Dec 28, 2024:

Hi, sorry for not replying earlier! I am not sure if you, @yasamoka, are still interested in working on this pull request and getting it merged (I would understand if not). Either way, here are my thoughts and questions:

  • Where do you see the advantage of adding both nanoflann and CUDA-based FLANN? Is it simply that CUDA-based FLANN is fastest, but CUDA / a GPU is not always available, so nanoflann is the second fastest?
  • This pull request is very large; it would be better to split it into several pull requests (e.g. one for CUDA-based FLANN, one for nanoflann, one for the max-leaf-size change). That would make it easier to review and to get it into a state where it is ready to be merged.
  • There are now unfortunately a few (merge) conflicts that need to be resolved.
  • Why did you choose to add KdTreeBase? KdTree is already the (abstract) base class, with KdTreeFLANN as its subclass.
  • More importantly, in which situations do you wish to use the faster search methods? I am asking because many, if not most, PCL classes expect the user-configurable search method to be a subclass of pcl::search::Search. So adding the two methods in kdtree and cuda/kdtree is not really enough, or am I missing something? Maybe it makes more sense to add them directly in the search module, similar to FlannSearch? (A rough sketch of what such a class could look like follows right after this list.)
  • It is great that you already wrote some tests; however, they are currently not built or run by our CI checks because neither nanoflann nor the CUDA part of FLANN is installed. The dockerfile for that is in .dev/docker/env. I saw that nanoflann is available from apt for most Ubuntu releases, but for CUDA-FLANN it is probably necessary to build FLANN from source. We would actually need a separate pull request that first updates the docker images; only after that one is merged will nanoflann and CUDA-FLANN be available for the tests. It might also make sense to add the new search methods to test/search/test_search.cpp.
  • It is important to keep the new parts optional (that is, PCL still builds if nanoflann/CUDA-FLANN is not available) and to make sure that the behavior of the existing PCL classes does not change. It looks like you already made sure of this.
  • From what I have tested so far, nanoflann seems to be faster than (CPU-)FLANN for knn search (especially for small neighborhoods, with almost no difference for larger k), but for radius search, nanoflann appears to be much slower than (CPU-)FLANN. Do you perhaps know why that is? EDIT: this was only because I left the result sorting for nanoflann on. If I turn that off, nanoflann is also faster than (CPU-)FLANN for radius search.
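
To make the pcl::search::Search point more concrete, here is a rough, purely illustrative skeleton of what a nanoflann-backed search class could look like. NanoflannSearch and everything inside it are hypothetical names, not the code in this pull request; it assumes a recent PCL (1.11+) where pcl::Indices is the index container, and the signatures should be checked against pcl/search/search.h.

```cpp
#include <pcl/search/search.h>

#include <vector>

// Hypothetical nanoflann-backed search class (sketch only, not this PR's code).
template <typename PointT>
class NanoflannSearch : public pcl::search::Search<PointT>
{
public:
  using Base = pcl::search::Search<PointT>;
  using typename Base::PointCloudConstPtr;

  NanoflannSearch() : Base("NanoflannSearch", /*sorted=*/false) {}

  void
  setInputCloud(const PointCloudConstPtr& cloud,
                const pcl::IndicesConstPtr& indices = pcl::IndicesConstPtr()) override
  {
    Base::setInputCloud(cloud, indices);
    // (Re)build the nanoflann index from the cloud here.
  }

  int
  nearestKSearch(const PointT& point, int k, pcl::Indices& k_indices,
                 std::vector<float>& k_sqr_distances) const override
  {
    // Delegate to nanoflann's knnSearch and copy the results into the
    // output vectors; return the number of neighbors found.
    k_indices.clear();
    k_sqr_distances.clear();
    return 0;
  }

  int
  radiusSearch(const PointT& point, double radius, pcl::Indices& k_indices,
               std::vector<float>& k_sqr_distances,
               unsigned int max_nn = 0) const override
  {
    // Delegate to nanoflann's radius search; sorting the results is optional
    // and, as noted in the last bullet point, costs extra time.
    k_indices.clear();
    k_sqr_distances.clear();
    return 0;
  }
};
```

A class along these lines (or adding the new back ends next to FlannSearch in pcl/search) would let any PCL algorithm that accepts a pcl::search::Search pointer use the faster methods without further changes.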

I will have more comments once I start reviewing in detail, but these are the most important high-level things for now.

@mvieth added the changelog: new feature, module: search, and module: kdtree labels on Dec 28, 2024