Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 1 | namespace Eigen { |
| 2 | |
| 3 | /** \page TopicMultiThreading Eigen and multi-threading |
| 4 | |
| 5 | \section TopicMultiThreading_MakingEigenMT Make Eigen run in parallel |
| 6 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 7 | Some %Eigen's algorithms can exploit the multiple cores present in your hardware. |
| 8 | To this end, it is enough to enable OpenMP on your compiler, for instance: |
| 9 | - GCC: \c -fopenmp |
| 10 | - ICC: \c -openmp |
| 11 | - MSVC: check the respective option in the build properties. |
| 12 | |
| 13 | You can control the number of threads that will be used using either the OpenMP API or %Eigen's API using the following priority: |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 14 | \code |
| 15 | OMP_NUM_THREADS=n ./my_program |
| 16 | omp_set_num_threads(n); |
| 17 | Eigen::setNbThreads(n); |
| 18 | \endcode |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 19 | Unless `setNbThreads` has been called, %Eigen uses the number of threads specified by OpenMP. |
| 20 | You can restore this behavior by calling `setNbThreads(0);`. |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 21 | You can query the number of threads that will be used with: |
| 22 | \code |
| 23 | n = Eigen::nbThreads( ); |
| 24 | \endcode |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 25 | You can disable %Eigen's multi threading at compile time by defining the \link TopicPreprocessorDirectivesPerformance EIGEN_DONT_PARALLELIZE \endlink preprocessor token. |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 26 | |
| 27 | Currently, the following algorithms can make use of multi-threading: |
Austin Schuh | 189376f | 2018-12-20 22:11:15 +1100 | [diff] [blame] | 28 | - general dense matrix - matrix products |
| 29 | - PartialPivLU |
| 30 | - row-major-sparse * dense vector/matrix products |
| 31 | - ConjugateGradient with \c Lower|Upper as the \c UpLo template parameter. |
| 32 | - BiCGSTAB with a row-major sparse matrix format. |
| 33 | - LeastSquaresConjugateGradient |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 34 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 35 | \warning On most OS it is <strong>very important</strong> to limit the number of threads to the number of physical cores, otherwise significant slowdowns are expected, especially for operations involving dense matrices. |
| 36 | |
| 37 | Indeed, the principle of hyper-threading is to run multiple threads (in most cases 2) on a single core in an interleaved manner. |
| 38 | However, %Eigen's matrix-matrix product kernel is fully optimized and already exploits nearly 100% of the CPU capacity. |
| 39 | Consequently, there is no room for running multiple such threads on a single core, and the performance would drops significantly because of cache pollution and other sources of overheads. |
| 40 | At this stage of reading you're probably wondering why %Eigen does not limit itself to the number of physical cores? |
| 41 | This is simply because OpenMP does not allow to know the number of physical cores, and thus %Eigen will launch as many threads as <i>cores</i> reported by OpenMP. |
| 42 | |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 43 | \section TopicMultiThreading_UsingEigenWithMT Using Eigen in a multi-threaded application |
| 44 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 45 | In the case your own application is multithreaded, and multiple threads make calls to %Eigen, then you have to initialize %Eigen by calling the following routine \b before creating the threads: |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 46 | \code |
| 47 | #include <Eigen/Core> |
| 48 | |
| 49 | int main(int argc, char** argv) |
| 50 | { |
| 51 | Eigen::initParallel(); |
| 52 | |
| 53 | ... |
| 54 | } |
| 55 | \endcode |
| 56 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 57 | \note With %Eigen 3.3, and a fully C++11 compliant compiler (i.e., <a href="http://en.cppreference.com/w/cpp/language/storage_duration#Static_local_variables">thread-safe static local variable initialization</a>), then calling \c initParallel() is optional. |
Austin Schuh | 189376f | 2018-12-20 22:11:15 +1100 | [diff] [blame] | 58 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 59 | \warning Note that all functions generating random matrices are \b not re-entrant nor thread-safe. Those include DenseBase::Random(), and DenseBase::setRandom() despite a call to `Eigen::initParallel()`. This is because these functions are based on `std::rand` which is not re-entrant. |
| 60 | For thread-safe random generator, we recommend the use of c++11 random generators (\link DenseBase::NullaryExpr(Index, const CustomNullaryOp&) example \endlink) or `boost::random`. |
Austin Schuh | 189376f | 2018-12-20 22:11:15 +1100 | [diff] [blame] | 61 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 62 | In the case your application is parallelized with OpenMP, you might want to disable %Eigen's own parallelization as detailed in the previous section. |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 63 | |
Austin Schuh | c55b017 | 2022-02-20 17:52:35 -0800 | [diff] [blame^] | 64 | \warning Using OpenMP with custom scalar types that might throw exceptions can lead to unexpected behaviour in the event of throwing. |
Brian Silverman | 72890c2 | 2015-09-19 14:37:37 -0400 | [diff] [blame] | 65 | */ |
| 66 | |
| 67 | } |