aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
Diffstat (limited to 'docs/proj/blas-lapack.xml')
-rw-r--r--docs/proj/blas-lapack.xml509
1 files changed, 509 insertions, 0 deletions
diff --git a/docs/proj/blas-lapack.xml b/docs/proj/blas-lapack.xml
new file mode 100644
index 000000000..2b931fc49
--- /dev/null
+++ b/docs/proj/blas-lapack.xml
@@ -0,0 +1,509 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!DOCTYPE guide SYSTEM "/dtd/guide.dtd">
+<!-- $Header: /var/cvsroot/gentoo/xml/htdocs/proj/en/science/blas-lapack.xml,v 1.3 2007/12/27 19:35:25 bicatali Exp $ -->
+
+<guide link="/proj/en/science/blas-lapack.xml">
+<title>Using multiple versions of BLAS and LAPACK with Gentoo/Linux</title>
+
+<author title="Author">
+ <mail link="bicatali@gentoo.org">Sébastien Fabbro</mail>
+</author>
+<author title="Author">
+ <mail link="markusle@gentoo.org">Markus Dittrich</mail>
+</author>
+<author title="Editor">
+ <mail link="nightmorph@gentoo.org">Joshua Saddler</mail>
+</author>
+
+<abstract>
+This guide explains the use of the different implementations of the BLAS
+and LAPACK libraries that are available via Portage.
+</abstract>
+
+<!-- The content of this document is licensed under the CC-BY-SA license -->
+<!-- See http://creativecommons.org/licenses/by-sa/2.5 -->
+<license/>
+
+<version>1.0</version>
+<date>2007-10-22</date>
+
+<chapter>
+<title>Introduction</title>
+<section>
+<body>
+
+<p>
+The Basic Linear Algebra Subroutines (BLAS) and Linear Algebra PACKage (LAPACK)
+are well designed linear algebra software libraries developed by the
+High Performance Computing (HPC) community. BLAS
+implements dense matrix and vectors products, while LAPACK provides routines for
+solving systems of linear equations. Both are widely used in
+many scientific applications and it is, therefore, important to
+have efficient implementations available.
+</p>
+
+<p>
+Originally written in FORTRAN 77, a number of additional language
+wrappers have been developed for languages like C, C++, FORTRAN 95, and others.
+The following legacy libraries are available via the reference ebuilds:
+</p>
+
+<ul>
+ <li>
+ <uri link="http://netlib.org/blas">BLAS</uri>: FORTRAN 77 implementation of
+ BLAS
+ </li>
+ <li>
+ <uri link="http://netlib.org/blas">CBLAS</uri>: C implementation of BLAS
+ </li>
+ <li>
+ <uri link="http://netlib.org/lapack">LAPACK</uri>: FORTRAN 77 implementation
+ of LAPACK
+ </li>
+</ul>
+
+<p>
+In addition, Gentoo provides a number of optimized BLAS and LAPACK implementations
+that will be described below. Different implementations are bundled together with
+Gentoo's <c>eselect</c> system and the widely used <c>pkg-config</c> tool.
+</p>
+
+<p> It is important to note that if you require, e.g., a well performing
+BLAS implementation, simply emerging X over Y often is not enough. Rather, you will have
+to carefully benchmark your applications since performance may depend
+on many factors,
+such as hardware or network.
+If you are simply looking for a well performing and well tested
+implementation, the reference ebuilds will likely be your best choice.
+</p>
+
+
+</body>
+</section>
+</chapter>
+
+<chapter>
+<title>For Users</title>
+<section>
+<title>Installing</title>
+<body>
+
+<p>
+If best possible performance is not of paramount importance for you
+and you simply
+need BLAS and/or LAPACK, just emerge the virtual packages:
+</p>
+
+<pre caption="Installing">
+# <i>emerge blas cblas lapack</i>
+</pre>
+
+<p>
+These will install the reference legacy packages from
+<uri>http://netlib.org</uri>. They are well tested, easy to debug
+implementations. They should satisfy most users; if they're all you need, you're
+done reading.
+</p>
+
+<p>
+However, if:
+</p>
+
+<ul>
+ <li>BLAS/LAPACK are critical for the speed of your applications</li>
+ <li>you absolutely need to build the fastest computer</li>
+ <li>you want to help Gentoo sci project to improve their packages</li>
+</ul>
+
+<p>
+... then read on, and be sure to file bugs both to Gentoo and upstream.
+</p>
+
+<p>
+There is a number of optimized implementations of these libraries in the Portage
+tree:
+</p>
+
+<ul>
+ <li>
+ <uri link="http://math-atlas.sourceforge.net">ATLAS</uri>: Automatically
+ Tuned Linear Algebra Software is an open-source package that empirically
+ tunes the library to the machine it is being compiled on. It provides BLAS
+ (FORTRAN 77 and C), and LAPACK implementations on various architectures.
+ </li>
+ <li>
+ <uri
+ link="http://www.tacc.utexas.edu/resources/software/software.php">GotoBLAS</uri>:
+ Goto BLAS provide open-source, free for academic use, hand-coded
+ machine language, processor optimized versions of the FORTRAN 77 BLAS
+ routines. Claims to be the fastest BLAS.
+ </li>
+ <li>
+ <uri link="http://developer.amd.com/acml.jsp">ACML</uri>: AMD Core Math
+ Library is a closed-source but free package containing BLAS (FORTRAN 77
+ only) and LAPACK for x86 and amd64 architectures, but also other math tools
+ such as statistical libraries and FFTs.
+ </li>
+ <li>
+ <uri link="http://developer.intel.com/software/products/mkl/">MKL</uri>:
+ Intel® Math Kernel Library is a closed-source but free package for
+ non-commercial use containing BLAS (FORTRAN 77 and C), LAPACK optimized for
+ Intel® based architectures: x86, amd64 and ia64.
+ </li>
+</ul>
+
+<p>
+Usually performance gain is noticeable mainly with BLAS, since LAPACK routines
+depend on BLAS kernels.
+</p>
+
+</body>
+</section>
+<section>
+<title>Compiling and linking with installed libraries</title>
+<body>
+
+<p>
+We took great care to make sure that each package provides consistent pkg-config files.
+Hence, compiling and linking with BLAS/LAPACK should be simple and straightforward:
+</p>
+
+<pre caption="Compiling and linking BLAS/LAPACK">
+# <i>pkg-config --libs blas</i> <comment>(To link with FORTRAN 77 BLAS library)</comment>
+# <i>pkg-config --cflags cblas</i> <comment>(To compile against C BLAS library)</comment>
+# <i>pkg-config --libs cblas</i> <comment>(To link with C BLAS library)</comment>
+# <i>pkg-config --libs lapack</i> <comment>(To link with FORTRAN 77 LAPACK library)</comment>
+</pre>
+
+<p>
+<c>pkg-config</c> files are available for whichever implementation you select with <c>eselect</c>.
+More information on using <c>pkg-config</c> can be obtained with <c>man pkg-config</c>.
+</p>
+
+</body>
+</section>
+<section>
+<title>Choosing libraries</title>
+<body>
+
+<p>
+You can easily switch BLAS, CBLAS and LAPACK implementations with
+<c>eselect</c>. At this point, you probably have <c>eselect-blas</c>,
+<c>eselect-cblas</c> and <c>eselect-lapack</c> installed. If you do not,
+install them now! Using eselect, you can view which implementations of CBLAS
+are available.
+</p>
+
+<pre caption="Viewing available implementations">
+# <i>eselect cblas list</i>
+Installed CBLAS for library directory lib64
+[1] atlas
+[2] atlas-threads
+[3] gsl
+[4] mkl-threads *
+[5] reference
+</pre>
+
+<p>
+The implementation marked with an asterisk (*) is the currently
+selected implementation. To switch implementations, run:
+</p>
+
+<pre caption="Switching to the ATLAS implementation of LAPACK">
+# <i>eselect lapack set atlas</i>
+</pre>
+
+<p>
+To learn more about the <c>eselect</c> tool, visit the
+<uri link="http://www.gentoo.org/proj/en/eselect/user-guide.xml">eselect guide</uri>
+</p>
+
+<p>
+When selecting blas, cblas or lapack profiles try to avoid mixing
+different implementations since we don't have any mechanism to enforce
+reasonable profiles. However, here is a list of well performing
+profile combinations that have been used successfully in the past:
+</p>
+<ul>
+ <li> Most CPUs:
+ <ul>
+ <li>blas,cblas: atlas (or atlas-threads with multi-processor)</li>
+ <li>lapack:atlas</li>
+ </ul>
+ </li>
+ <li> Most CPUs:
+ <ul>
+ <li>blas: goto </li>
+ <li>cblas,lapack: reference</li>
+ </ul>
+ </li>
+ <li> AMD based CPUs:
+ <ul>
+ <li>blas,lapack: acml-gfortran (or acml-gfortran-openmp with
+ multi-processors) </li>
+ <li>cblas: reference</li>
+ </ul>
+ </li>
+ <li> Intel based CPUs:
+ <ul>
+ <li>blas,cblas,lapack: mkl-threads</li>
+ </ul>
+ </li>
+</ul>
+
+</body>
+</section>
+
+<section>
+<title>Choosing a compiler</title>
+<body>
+
+<p>
+Most of the libraries can compile with both the GNU compiler collection and the
+Intel® compilers on the x86, amd64 and ia64 architectures. By default, you are
+probably using <c>gcc</c>. You can also compile the <c>*-reference</c> packages
+with <c>ifort</c> and <c>icc</c>. To do this, you need to define the F77
+environment variable and don't forget the FFLAGS.
+</p>
+
+<pre caption="Using non-GCC compilers">
+# <i>F77=ifort FFLAGS="-O2 -mp" emerge blas-reference</i>
+</pre>
+
+<p>
+Depending on your hardware, a small performance gain can be noticed thanks to
+vectorization. The <c>-mp</c> flag maintains floating-point precision, since by
+default ifort is pretty aggressive on floating point arithmetic, and we are
+actually compiling a math package. Try <c>man ifort</c> to see additional flags
+to fit your hardware.
+</p>
+
+<p>
+Some of the implementations let you specify the Intel® C compiler as
+well. Please beware that not all libraries compile with all
+combinations. You should receive an error during the emerge in case you have
+chosen an incompatible combination.
+</p>
+
+<p>
+As usual for Gentoo, there are many combinations of USE flags and
+compilers with which you could compile a package. Unfortunately
+switching compilers between BLAS and LAPACK might not be always
+compatible. For example:
+</p>
+
+<pre caption="Incompatible combinations">
+# <i>F77=ifort FFLAGS="-O2 -mp" USE="openmp" emerge =acml-3.6.0-r1</i>
+# <i>eselect blas set acml-ifort-openmp</i>
+# <i>F77=gfortran FFLAGS="-O2" emerge lapack-reference</i>
+</pre>
+
+<p>
+This will most likely break things or not even compile.
+</p>
+
+<p>
+Try to be consistent in your choice. Stay with the GCC most of the time will
+avoid you some trouble, unless you want to use the MKL, in which case the Intel®
+C and FORTRAN compilers make a good combination.
+</p>
+
+</body>
+</section>
+<section>
+<title>Documentation</title>
+<body>
+
+<p>
+If you need BLAS or LAPACK to develop your own programs, the documentation
+becomes pretty handy. Setting the USE="doc" flag for the corresponding BLAS or
+LAPACK package will install man pages and quick reference sheets from the
+<c>app-doc/blas-docs</c> and <c>app-doc/lapack-docs</c> packages. They are
+standard and valid for all implementations. For optimized packages, the
+USE="doc" flags will usually install extra doc in PDF or HTML format.
+</p>
+
+</body>
+</section>
+</chapter>
+
+<chapter>
+<title>For Developers</title>
+<section>
+<title>Providing new implementations</title>
+<body>
+
+<p>
+The Portage tree contains many ebuilds that depend on the BLAS/CBLAS/LAPACK
+libraries. As there is more than one possible implementation, the Gentoo
+Scientific Project reorganized all the packages to provide <c>virtual/blas</c>,
+<c>virtual/cblas</c>, and <c>virtual/lapack</c>. All ebuilds using BLAS should
+depend on this virtual package, unless it is explicitly known to break with
+different BLAS implementations.
+</p>
+
+<p>
+To work with Gentoo's configuration tools
+<c>app-admin/eselect-{blas,cblas,lapack}</c>, and the virtual, every ebuild that
+installs a BLAS implementation must fulfill following requirements:
+</p>
+
+<ol>
+ <li>
+ The ebuild must install an eselect file for each profile it provides. The
+ libraries should link to the ones in <path>/usr/$(get_libdir)</path>
+ directories and the include files in <path>/usr/include</path>:
+ <ul>
+ <li>
+ <path>libblas.so[.0]</path> - Shared object for FORTRAN BLAS
+ applications
+ </li>
+ <li>
+ <path>libblas.a</path> - Static library for FORTRAN BLAS applications
+ </li>
+ <li>
+ <path>libcblas.so[.0]</path> - Shared object for C/C++ CBLAS applications
+ </li>
+ <li>
+ <path>libcblas.a</path> - Static library for C/C++ CBLAS applications
+ </li>
+ <li><path>cblas.h</path> - Include header for C/C++ applications</li>
+ <li>
+ <path>liblapack.so[.0]</path> - Shared object for FORTRAN LAPACK
+ applications
+ </li>
+ <li>
+ <path>liblapack.a</path> - Static library for FORTRAN LAPACK applications
+ </li>
+ </ul>
+ </li>
+ <li>
+ The ebuild must install a <path>blas.pc</path>, <path>cblas.pc</path> and/or
+ <path>lapack.pc</path> pkg-config file and therefore RDEPEND on
+ <c>dev-util/pkgconfig</c>. They should also be included in the eselect
+ files, and link to the <path>/usr/$(get_libdir)/pkgconfig</path> directory:
+ <ul>
+ <li><path>blas.pc</path> - BLAS pkg-config file</li>
+ <li><path>cblas.pc</path> - CBLAS pkg-config file</li>
+ <li><path>lapack.pc</path> - LAPACK pkg-config file</li>
+ </ul>
+ </li>
+ <li>Be included in the virtual package as a possible provider:
+ <ul>
+ <li><c>virtual/blas</c> - BLAS virtual package</li>
+ <li><c>virtual/cblas</c> - CBLAS virtual package</li>
+ <li><c>virtual/lapack</c> - LAPACK virtual package</li>
+ </ul>
+ </li>
+</ol>
+
+<p>
+The easiest way of understanding all this is probably getting inspiration from
+one of the available packages. Currently the Portage tree provide the following
+virtuals:
+</p>
+
+<table>
+<tr>
+ <th>Package name</th>
+ <th>virtual/blas</th>
+ <th>virtual/cblas</th>
+ <th>virtual/lapack</th>
+</tr>
+<tr>
+ <ti><c>sci-libs/acml</c></ti>
+ <ti>*</ti>
+ <ti></ti>
+ <ti>*</ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/blas-atlas</c></ti>
+ <ti>*</ti>
+ <ti>*</ti>
+ <ti></ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/blas-goto</c></ti>
+ <ti>*</ti>
+ <ti></ti>
+ <ti></ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/blas-reference</c></ti>
+ <ti>*</ti>
+ <ti></ti>
+ <ti></ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/cblas-reference</c></ti>
+ <ti></ti>
+ <ti>*</ti>
+ <ti></ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/gsl</c></ti>
+ <ti></ti>
+ <ti>*</ti>
+ <ti></ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/lapack-atlas</c></ti>
+ <ti></ti>
+ <ti></ti>
+ <ti>*</ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/lapack-reference</c></ti>
+ <ti></ti>
+ <ti></ti>
+ <ti>*</ti>
+</tr>
+<tr>
+ <ti><c>sci-libs/mkl</c></ti>
+ <ti>*</ti>
+ <ti>*</ti>
+ <ti>*</ti>
+</tr>
+</table>
+
+</body>
+</section>
+<section>
+<title>Packages with BLAS/LAPACK dependencies</title>
+<body>
+
+<p>
+Simply use <c>virtual/{blas,cblas,lapack}</c> as a [R]DEPEND. To build some
+packages, you might need to use the pkg-config tool. If you are lucky, the
+package uses autotools together with common BLAS and LAPACK M4 macros.In this
+case, the configuration step becomes simple. For example:
+</p>
+
+<pre caption="Sample package configuration">
+<keyword>econf</keyword> --with-blas="<var>$(pkg-config --libs blas)</var>"
+</pre>
+
+<impo>
+Don't forget to add <c>dev-util/pkgconfig</c> in DEPEND.
+</impo>
+
+</body>
+</section>
+</chapter>
+
+<chapter>
+<title>In the near future</title>
+<section>
+<body>
+
+<p>
+We plan to include more standard BLAS, LAPACK libraries: BLAS 95, LAPACK 95,
+Sparse BLAS, ScaLAPACK. If you feel inclined to write an ebuild for these, you
+are more than welcomed to file it on our <uri
+link="http://bugs.gentoo.org">Bugzilla</uri>.
+</p>
+
+</body>
+</section>
+</chapter>
+</guide>