blob: 7a8ff301f02f961e28d8168389ea0982d59c23b2 [file] [log] [blame]
Brian Silverman72890c22015-09-19 14:37:37 -04001namespace Eigen {
2
3/** \page TopicMultiThreading Eigen and multi-threading
4
5\section TopicMultiThreading_MakingEigenMT Make Eigen run in parallel
6
Austin Schuhc55b0172022-02-20 17:52:35 -08007Some %Eigen's algorithms can exploit the multiple cores present in your hardware.
8To this end, it is enough to enable OpenMP on your compiler, for instance:
9 - GCC: \c -fopenmp
10 - ICC: \c -openmp
11 - MSVC: check the respective option in the build properties.
12
13You can control the number of threads that will be used using either the OpenMP API or %Eigen's API using the following priority:
Brian Silverman72890c22015-09-19 14:37:37 -040014\code
15 OMP_NUM_THREADS=n ./my_program
16 omp_set_num_threads(n);
17 Eigen::setNbThreads(n);
18\endcode
Austin Schuhc55b0172022-02-20 17:52:35 -080019Unless `setNbThreads` has been called, %Eigen uses the number of threads specified by OpenMP.
20You can restore this behavior by calling `setNbThreads(0);`.
Brian Silverman72890c22015-09-19 14:37:37 -040021You can query the number of threads that will be used with:
22\code
23n = Eigen::nbThreads( );
24\endcode
Austin Schuhc55b0172022-02-20 17:52:35 -080025You can disable %Eigen's multi threading at compile time by defining the \link TopicPreprocessorDirectivesPerformance EIGEN_DONT_PARALLELIZE \endlink preprocessor token.
Brian Silverman72890c22015-09-19 14:37:37 -040026
27Currently, the following algorithms can make use of multi-threading:
Austin Schuh189376f2018-12-20 22:11:15 +110028 - general dense matrix - matrix products
29 - PartialPivLU
30 - row-major-sparse * dense vector/matrix products
31 - ConjugateGradient with \c Lower|Upper as the \c UpLo template parameter.
32 - BiCGSTAB with a row-major sparse matrix format.
33 - LeastSquaresConjugateGradient
Brian Silverman72890c22015-09-19 14:37:37 -040034
Austin Schuhc55b0172022-02-20 17:52:35 -080035\warning On most OS it is <strong>very important</strong> to limit the number of threads to the number of physical cores, otherwise significant slowdowns are expected, especially for operations involving dense matrices.
36
37Indeed, the principle of hyper-threading is to run multiple threads (in most cases 2) on a single core in an interleaved manner.
38However, %Eigen's matrix-matrix product kernel is fully optimized and already exploits nearly 100% of the CPU capacity.
39Consequently, there is no room for running multiple such threads on a single core, and the performance would drops significantly because of cache pollution and other sources of overheads.
40At this stage of reading you're probably wondering why %Eigen does not limit itself to the number of physical cores?
41This is simply because OpenMP does not allow to know the number of physical cores, and thus %Eigen will launch as many threads as <i>cores</i> reported by OpenMP.
42
Brian Silverman72890c22015-09-19 14:37:37 -040043\section TopicMultiThreading_UsingEigenWithMT Using Eigen in a multi-threaded application
44
Austin Schuhc55b0172022-02-20 17:52:35 -080045In the case your own application is multithreaded, and multiple threads make calls to %Eigen, then you have to initialize %Eigen by calling the following routine \b before creating the threads:
Brian Silverman72890c22015-09-19 14:37:37 -040046\code
47#include <Eigen/Core>
48
49int main(int argc, char** argv)
50{
51 Eigen::initParallel();
52
53 ...
54}
55\endcode
56
Austin Schuhc55b0172022-02-20 17:52:35 -080057\note With %Eigen 3.3, and a fully C++11 compliant compiler (i.e., <a href="http://en.cppreference.com/w/cpp/language/storage_duration#Static_local_variables">thread-safe static local variable initialization</a>), then calling \c initParallel() is optional.
Austin Schuh189376f2018-12-20 22:11:15 +110058
Austin Schuhc55b0172022-02-20 17:52:35 -080059\warning Note that all functions generating random matrices are \b not re-entrant nor thread-safe. Those include DenseBase::Random(), and DenseBase::setRandom() despite a call to `Eigen::initParallel()`. This is because these functions are based on `std::rand` which is not re-entrant.
60For thread-safe random generator, we recommend the use of c++11 random generators (\link DenseBase::NullaryExpr(Index, const CustomNullaryOp&) example \endlink) or `boost::random`.
Austin Schuh189376f2018-12-20 22:11:15 +110061
Austin Schuhc55b0172022-02-20 17:52:35 -080062In the case your application is parallelized with OpenMP, you might want to disable %Eigen's own parallelization as detailed in the previous section.
Brian Silverman72890c22015-09-19 14:37:37 -040063
Austin Schuhc55b0172022-02-20 17:52:35 -080064\warning Using OpenMP with custom scalar types that might throw exceptions can lead to unexpected behaviour in the event of throwing.
Brian Silverman72890c22015-09-19 14:37:37 -040065*/
66
67}