Changelog for libelpa19-2024.05.001-lp156.3.1.x86_64.rpm :

* Mon Jul 08 2024 Tobias Melson
- Update to release 2024.05.001
  * support of ROCm 6.x and preparation for AMD MI300
  * allow internal matrix redistribution if device pointer API is used
  * do not try to autotune GPU code paths if no GPUs are available
  * implement a patch for a bug in cusolverDnXtrtri_bufferSize for CUDA versions < 12.1
  * PoC RCCL support for AMD GPUs, only for experienced users
  * significantly faster Cholesky decomposition step
  * automatic setting for cuBLAS caching: with CUDA > 12.x a slowdown had been observed since cuBLAS assumed problematic caching values
  * Autoconf >= 2.71 required for building ELPA
  * enable gpu-streams by default for NVIDIA and AMD GPUs
  * updated / improved documentation and man pages
  * fixed compilation error on AMD GPUs
  * fixed SVE 256 compute kernels
  * allow (currently in parts of ELPA) the use of NVIDIA NCCL for device-to-device communication
  * speedup of the GPU version of hermitian_multiply by up to a factor of 4
  * significantly faster full-to-tridiagonal step in ELPA 1stage GPU
  * significantly faster ELPA 2stage solver on Intel GPUs
  * consistent enabling/disabling of SKEW_SYMMETRIC in header files
  * new setup_gpu API function
  * added CITATION.cff file
  * allow test programs to be run with 1 MPI task
  * correct a memory leak in the GPU stream setup
  * better handling of GPU BLAS handles
  * implement the execution of the AMD HIP code path on NVIDIA GPUs
  * implement the execution of the SYCL GPU code path on CPUs (debugging)
  * port generalized routines to SYCL GPU
* Fri Jan 27 2023 Stefan Brüns
- Update to release 2022.11.001
  For details, see https://gitlab.mpcdf.mpg.de/elpa/elpa/-/blob/master/Changelog
- Fix build with GCC < 8.1, add 0001-Avoid-non-standard-initialization-from-integer-expre.patch
* Wed Dec 13 2017 devAATTstellardeath.org
- Update to release 2017.05.003
  Changelog for ELPA 2017.05.003
  - remove bug in invert_triangular, which had been introduced in ELPA 2017.05.002
  Changelog for ELPA 2017.05.002
  Mainly bugfixes for ELPA 2017.05.001:
  - fix memory leak of MPI communicators
  - tests for hermitian_multiply, cholesky decomposition and
  - deal with a problem on Debian (mawk)
  Changelog for ELPA 2017.05.001
  Final release of ELPA 2017.05.001. Since rc2 the following changes have been made:
  - more extensive tests during "make check"
  - distribute missing C headers
  - introduce analytic tests
  - fix stack overflow in some kernels
  Changelog for ELPA 2017.05.001.rc2
  This is release candidate 2 for the ELPA 2017.05.001 version. In addition to the changes from rc1, it fixes some smaller issues:
  - add missing script "manual_cpp"
  - cleanup of code
  Changelog for ELPA 2017.05.001.rc1
  This is release candidate 1 for the ELPA 2017.05.001 version. It provides a first version of the new, more generic API of the ELPA library. Smaller changes to the API might be possible in the upcoming release candidates. For users who would like to use the older API of the ELPA library, the API as defined with release 2016.11.001.pre is frozen in and also supported. Apart from the API change to be more flexible for the future, this release offers the following changes:
  - faster GPU implementation, especially for ELPA 1stage
  - the restriction of the block-cyclic distribution blocksize = 128 in the GPU case is relaxed
  - faster CPU implementation due to better blocking
  - support of already banded matrices (new API only!)
  - improved KNL support
  Changelog for pre-release ELPA 2016.11.001.pre
  This pre-release contains an experimental API which will most likely change in the next stable release.
  - also support of single-precision (real and complex case) eigenvalue problems
  - GPU support in ELPA 1stage and 2stage (real and complex case)
  - change of API (w.r.t. ELPA 2016.05.004) to support runtime-choice of GPU usage
* Tue Oct 25 2016 devAATTstellardeath.org
- Update to release 2016.05.004
  - fix a problem with the private state of module precision
  - distribute test_project with dist tarball
  - generic driver routine for ELPA 1stage and 2stage
  - test case for elpa_mult_at_b_real
  - test case for elpa_mult_ah_b_complex
  - test case for elpa_cholesky_real
  - test case for elpa_cholesky_complex
  - test case for elpa_invert_trm_real
  - test case for elpa_invert_trm_complex
  - fix building of static library
  - better choice of AVX, AVX2, AVX512 kernels
  - make assumed size Fortran arrays default
* Mon Jul 11 2016 devAATTstellardeath.org
- Update to release 2016.05.003
  Changelog for release ELPA 2016.05.003
  - fix a problem with the build of SSE kernels
  - make some (internal) functions public, such that they can be used outside of ELPA
  - add documentation and interfaces for new public functions
  - shorten file names and directory names for test programs in order to bypass the "make argument list too long" error
  Changelog for release ELPA 2016.05.002
  - fix problem with generated *.sh check scripts
  - name library differently if built without MPI support
  - install only public modules
  Changelog for release ELPA 2016.05.001
  - support building without MPI for one node usage
  - doxygen and man pages documentation for ELPA
  - cleanup of documentation
  - introduction of SSE gcc intrinsic kernels
  - remove errors due to unaligned memory
  - removal of Fortran "contains functions"
  - Fortran interfaces for assembly and C kernels
* Mon Jul 11 2016 devAATTstellardeath.org
- Use a _service file to check GPG signature
* Mon Nov 16 2015 devAATTstellardeath.org
- Update to release 2015.11.001
* Fri May 29 2015 devAATTstellardeath.org
- Update to release 2015.05.001
* Wed Mar 18 2015 devAATTstellardeath.org
- Updated to maintenance release 2015.02.002.
  This release was in part caused by problems observed from the OpenSUSE build server results.
* Tue Mar 03 2015 devAATTstellardeath.org
- Update to released version 2015.02.001
  Removed 'skip_openmp_for_missing_mpi_thread_support.patch', was merged upstream
* Wed Oct 22 2014 devAATTstellardeath.org
- Incorporate changes by Dmitry Roshchin, enable OpenMP in my home project.
  This should allow the same spec file to be used for OpenMP support within my home project with an experimental openmpi 1.8 package, while disabling OpenMP by default otherwise.
 