Changelog for
libopenblas-pthreads-gnu-hpc-0.2.20-1.15.x86_64.rpm :
* Tue Jan 30 2018 roAATTsuse.de- add openblas-s390.patch to build on s390 (bsc#1079513).
* Fri Jan 05 2018 eichAATTsuse.com- Switch from gcc6 to gcc7 as additional compiler flavor for HPC on SLES.- Fix library package requires - use HPC macro (boo#1074890).- Fix unexpanded rpm macro in environment module file for HPC (boo#1074897).
* Mon Nov 27 2017 normandAATTlinux.vnet.ibm.com- Add -mvsx option for ppc64 archi (not required for ppc64le) to avoid ./kernel/power/sasum_microk_power8.c:41:3: error: \'__vector\' undeclared (first use in this function); ...
* Tue Oct 17 2017 eichAATTsuse.com- Add magic to limit the number of flavors built in the OBS to non-HPC ones.
* Thu Oct 12 2017 eichAATTsuse.com- Generate baselib.conf dynamically and only for the non-HPC builds: this avoids issues with the source validator.
* Fri Sep 08 2017 eichAATTsuse.com- Convert openblas to multibuild.- Add HPC build using environment modules. (FATE#321708).- fix-arm64-cpuid-return.patch Fix CPUID detection on ARM (From OHPC).
* Wed Aug 09 2017 dmitry_rAATTopensuse.org- Remove migration %post scripts for old library names
* Sat Jul 29 2017 badshah400AATTgmail.com- Update to version 0.2.20:
* common: - Improved CMake support - Fixed several thread race and locking bugs - Fixed default LAPACK optimization level - Updated LAPACK to 3.7.0 - Added ReLAPACK (https://github.com/HPAC/ReLAPACK), make BUILD_RELAPACK=1
* POWER: - Optimizations for Power9 - Fixed several Power8 assembly bugs
* ARM: - New optimized Vulcan and ThunderX2T99 targets - Support for ARMV7 SOFT_FP ABI (make ARM_SOFTFP_ABI=1) - Detect all cpu cores including offline ones - Fix compilation with CLANG - Support building a shared library for Android
* MIPS: - Fixed several threading issues - Fix compilation with CLANG
* x86_64: - Detect Intel Bay Trail and Apollo Lake - Detect Intel Sky Lake and Kaby Lake - Detect Intel Knights Landing - Detect AMD A8, A10, A12 and Ryzen - Support 64bit builds with Visual Studio - Fix building with Intel and PGI compilers - Fix building with MINGW and TDM-GCC - Fix cmake builds for Haswell and related cpus - Fix building for Sandybridge with CLANG 3.9 - Add support for the FLANG compiler
* IBM Z: - New target z13 with BLAS3 optimizations- Drop 0001-Fix-power8-asm.patch; fixed upstream.- Minor rebase of c_xerbla_no-void-return.patch and openblas-noexecstack.patch for updated version.- Remove installed pkgconfig file as it is not adapted to the library names we use.
* Thu May 18 2017 meissnerAATTsuse.com- 0001-Fix-power8-asm.patch: fixed power8 assembly (bsc#1039397)
* Wed Sep 07 2016 idonmezAATTsuse.com- Update to version 0.2.19 POWER:
* Optimize BLAS on Power8
* Fixed Julia+OpenBLAS bugs on Power8 MIPS:
* Optimize BLAS on MIPS P5600 and I6400 ARM:
* Improved on ARM Cortex-A57
* Wed Apr 13 2016 dmitry_rAATTopensuse.org- Update to version 0.2.18 ARM:
* Provide DGEMM 8x4 kernel for Cortex-A57 POWER:
* Optimize S and C BLAS3 on Power8
* Optimize BLAS2/1 on Power8
* Mon Mar 21 2016 dmitry_rAATTopensuse.org- Update to version 0.2.17
* Enable BUILD_LAPACK_DEPRECATED=1 by default.
* Wed Mar 16 2016 idonmezAATTsuse.com- Update to version 0.2.16
* Upgrade LAPACK to 3.6.0 version.
* Disable multi-threading for small size swap and ger.
* Improve small zger, zgemv, ztrmv using stack alloction.
* Let openblas_get_num_threads return the number of active threads.
* Fix LAPACK Dormbr, Dormlq bug.
* Avoid potential getenv segfault.
* Import LAPACK svn bugfix #142-#147,#150-#155 x86/x86_64:
* Optimize trsm kernels for AMD Bulldozer, Piledriver, Steamroller.
* Detect Intel Avoton.
* Detect AMD Trinity, Richland, E2-3200.
* Optimize c/zgemv for AMD Bulldozer, Piledriver, Steamroller
* Fix bug with scipy linalg test. ARM:
* Support and optimize Cortex-A57 AArch64.
* Update ARMV6 kernels.
* Improve DGEMM for ARM Cortex-A57. POWER:
* Fix detection of POWER architecture.
* Optimize D and Z BLAS3 functions for Power8.- Remove openblas-libs.patch, not needed.
* Tue Oct 27 2015 dmitry_rAATTopensuse.org- Update to version 0.2.15
* Enable MAX_STACK_ALLOC flags by default.
* Improve ger and gemv for small matrices.
* Improve gemv parallel with small m and large n case.
* Improve ?imatcopy when lda==ldb
* Add vecLib benchmarks
* Fix LAPACK lantr for row major matrices
* Fix LAPACKE lansy
* Import bug fixes for LAPACKE s/dormlq, c/zunmlq
* Raise the signal when pthread_create fails
* Drop obsolete openblas-arm64-build.patch x86/x86-64:
* Support pure C generic kernels for x86/x86-64.
* Support Intel Boardwell and Skylake by Haswell kernels.
* Support AMD Excavator by Steamroller kernels.
* Optimize s/d/c/zdot for Intel SandyBridge and Haswell.
* Optimize s/d/c/zdot for AMD Piledriver and Steamroller.
* Optimize s/d/c/zapxy for Intel SandyBridge and Haswell.
* Optimize s/d/c/zapxy for AMD Piledriver and Steamroller.
* Optimize d/c/zscal for Intel Haswell, dscal for Intel SandyBridge.
* Optimize d/c/zscal for AMD Bulldozer, Piledriver and Steamroller.
* Optimize s/dger for Intel SandyBridge.
* Optimize s/dsymv for Intel SandyBridge.
* Optimize ssymv for Intel Haswell.
* Optimize dgemv for Intel Nehalem and Haswell.
* Optimize dtrmm for Intel Haswell. ARM:
* Support Android NDK armeabi-v7a-hard ABI (-mfloat-abi=hard)
* Fix lock, rpcc bugs POWER:
* Support ppc64le platform (ELF ABI v2)
* Support POWER7/8 by POWER6 kernels.
* Wed Jul 29 2015 dmitry_rAATTopensuse.org- Change library name suffix
* drop openblas-soname.patch- Add RPM %post script for manual BLAS/LAPACK update-alternatives configuration update- Use update-alternatives mechanism for OpenBLAS variants (serial, openmp, pthreads). pthreads variant is default for x86 and x86_64, OpenMP for other architectures.- Fix build on ARM64
* openblas-arm64-build.patch- Add update-alternatives mechanism for CBLAS- Provide cmake module- Delete info about host cpu from openblas_config.h for dynamic arch- Add update-alternatives to \'preup\' and \'post\' requires list for libraries- Add README.SUSE
* Wed Mar 25 2015 dmitry_rAATTopensuse.org- Update to version 0.2.14
* Improve ger and gemv for small matrices by stack allocation. e.g. make -DMAX_STACK_ALLOC=2048
* Introduce openblas_get_num_threads and openblas_get_num_procs.
* Add ATLAS-style ?geadd function.
* Fix c/zsyr bug with negative incx.
* Fix race condition during shutdown causing a crash in gotoblas_set_affinity(). x86/x86-64:
* Support AMD Streamroller. ARM:
* Add Cortex-A9 and Cortex-A15 targets.
* Wed Dec 03 2014 dmitry_rAATTopensuse.org- Update to version 0.2.13
* Add SYMBOLPREFIX and SYMBOLSUFFIX makefile options for adding a prefix or suffix to all exported symbol names in the shared library.
* Remove openblas-0.1.0-soname.patch
* Add openblas-soname.patch
* Rebase openblas-noexecstack.patch x86/x86-64:
* Add generic kernel files for x86-64. make TARGET=GENERIC
* Fix a bug of sgemm kernel on Intel Sandy Bridge.
* Fix c_check bug on some amd64 systems. ARM:
* Support APM\'s X-Gene 1 AArch64 processors.
* Optimize trmm and sgemm.
* Fri Oct 17 2014 dmitry_rAATTopensuse.org- Update to version 0.2.12
* Added CBLAS interface for ?omatcopy and ?imatcopy.
* Enable ?gemm3m functions.
* Added benchmark for ?gemm3m.
* Optimized multithreading lower limits.
* Disabled SYMM3M and HEMM3M functions because of segment violations. x86/x86-64:
* Improved axpy and symv performance on AMD Bulldozer.
* Improved gemv performance on modern Intel and AMD CPUs.
* Mon Aug 18 2014 dmitry_rAATTopensuse.org- Update to version 0.2.11
* Added some benchmark codes. x86/x86-64:
* Improved s/c/zgemm performance for Intel Haswell.
* Improved s/d/c/zgemv performance.
* Support the big numa machine.(EXPERIMENT) ARM:
* Fix detection when cpuinfo uses \"Processor\".
* Thu Jul 17 2014 dmitry_rAATTopensuse.org- Update to version 0.2.10
* Added BLAS extensions as following. s/d/c/zaxpby, s/d/c/zimatcopy, s/d/c/zomatcopy.
* Added OPENBLAS_CORETYPE environment for dynamic_arch. (a86d34)
* Support outputing the CPU corename on runtime.(#407)
* Patched LAPACK to fix bug 114, 117, 118. (http://www.netlib.org/lapack/bug_list.html)
* Disabled ?gemm3m for a work-around fix. (#400)
* Fixed lots of bugs for optimized kernels on sandybridge,Haswell, bulldozer, and piledriver.
* Remove obsolete openblas-0.2.9-gcc-warnings.patch
* Tue Jun 10 2014 dmitry_rAATTopensuse.org- Update to version 0.2.9
* Update LAPACK to 3.5.0 version
* Fixed compatiable issues with Clang and Pathscale compilers.
* Added OPENBLAS_VERBOSE environment variable.(#338)
* Make OpenBLAS thread-pool resilient to fork via pthread_atfork. (#294)
* Rewrote rotmg
* Fixed sdsdot bug.
* Improved the result for LAPACK testing. (#372) x86/x86-64:
* Optimization on Intel Haswell.
* Enable optimization kernels on AMD Bulldozer and Piledriver.
* Detect Intel Haswell for new Macbook.
* To improve LAPACK testing, we fallback some kernels. (#372) https://github.com/xianyi/OpenBLAS/wiki/Fixed-optimized-kernels-To-do-List ARM:
* Support ARMv6 and ARMv7 ISA.
* Optimization on ARM Cortex-A9.- Update patches:
* openblas-0.2.8-libs.patch
* openblas-0.2.8-noexecstack.patch to
* openblas-libs.patch
* openblas-noexecstack.patch- Fix gcc warnings (#385)
* openblas-0.2.9-gcc-warnings.patch
* Sat Apr 12 2014 dmitry_rAATTopensuse.org- Remove files with problematic licenses
* Fri Apr 04 2014 dmitry_rAATTopensuse.org- Update to version 0.2.8
* Add executable stack markings.
* Respect user\'s LDFLAGS
* Rollback bulldozer and piledriver kernels to barcelona kernels
* update openblas-0.2.6-libs.patch
* update c_xerbla_no-void-return.patch
* update openblas-0.2.7-noexecstack.patch
* Fri Jul 26 2013 scorotAATTfree.fr- version 0.2.7
* Support LSB (Linux Standard Base) 4.1. e.g. make CC=lsbcc
* Include LAPACK 3.4.2 source codes to the repo. Avoid downloading at compile time.
* Add NO_PARALLEL_MAKE flag to disable parallel make.
* Create openblas_get_parallel to retrieve information which parallelization model is used by OpenBLAS. (Thank grisuthedragon)
* Detect LLVM/Clang compiler.
* A walk round for dtrti_U single thread bug. Replace it with LAPACK codes. (#191)
* Optimize c/zgemm, trsm, dgemv_n, ddot, daxpy, dcopy on AMD Bulldozer. (Thank Werner Saar)
* Add Intel Haswell support (using Sandybridge optimizations). (Thank Dan Luu)
* Add AMD Piledriver support (using Bulldozer optimizations).
* Fix the computational error in zgemm avx kernel on Sandybridge. (#237)
* Fix the overflow bug in gemv.
* Fix the overflow bug in multi-threaded BLAS3, getrf when NUM_THREADS is very large.(#214, #221, #246).- rebase patch noexecstack.patch- remove lapack source tarball since lapack sources are included in openblas sources- increase NUM_THREAD from 32 to 64
* Sat Mar 02 2013 scorotAATTfree.fr- version 0.2.6
* Improved OpenMP performance slightly. (d744c9)
* Improved cblas.h compatibility with Intel MKL.(#185)
* Fixed the overflowing bug in single thread cholesky factorization.
* Fixed the overflowing buffer bug of multithreading hbmv and sbmv.(#174)
* Added AMD Bulldozer x86-64 S/DGEMM AVX kernels. (Thank Werner Saar) We will tune the performance in future.
* Auto-detect Intel Xeon E7540.
* Fixed the overflowing buffer bug of gemv. (#173)
* Fixed the bug of s/cdot about invalid reading NAN on x86_64. (#189)- rebase patch0 openblas-0.2.6-libs.patch
* Sun Feb 17 2013 jengelhAATTinai.de- Remove redundant cleaning commands- Do not create .so.0.2.5. SO versions are not package release numbers.
* Mon Jan 21 2013 scorotAATTfree.fr- use Requires(post) and Requires(preun) instead of PreReq- add patch markups in spec file
* Tue Jan 15 2013 scorotAATTfree.fr- add update-alternatives support to allow easy switching between the different blas and lapack implementations
* Fri Nov 30 2012 scorotAATTfree.fr- version 0.2.5
* Export LAPACK 3.4.2 symbols in shared library. (#147)
* Restore the original CPU affinity when calling openblas_set_num_threads(1) (#153)
* Fixed a SEGFAULT bug in dgemv_t when m is very large.(#154)
* Mon Oct 08 2012 scorotAATTfree.fr- version 0.2.4
* Upgraded LAPACK to 3.4.2 version. (#145)
* f77blas.h:compatibility for compilers without C99 complex number support. (#141)
* Added NO_AVX flag. Check OS supporting AVX on runtime. (#139)
* Mon Aug 20 2012 scorotAATTfree.fr- version 0.2.3
* Fixed LAPACK unstable bug about ?laswp. (#130)
* Fixed the shared library bug about unloading the library on Linux (#132).
* Sun Jul 08 2012 scorotAATTfree.fr- version 0.2.2
* Support Intel Sandy Bridge 22nm desktop/mobile CPU
* Mon Jul 02 2012 scorotAATTfree.fr- version 0.2.1
* Fixed the SEGFAULT bug about hyper-theading
* Support AMD Bulldozer by using GotoBLAS2 AMD Barcelona codes
* Removed the limitation (64) of numbers of CPU cores. Now, it supports 256 cores at max.
* Supported clang compiler.
* Fixed some build bugs on FreeBSD
* Optimized Level-3 BLAS on Intel Sandy Bridge x86-64 by AVX instructions.
* Support AMD Bobcat by using GotoBLAS2 AMD Barcelona codes.- update patch3
* Wed May 02 2012 scorotAATTfree.fr- update patch0
* Wed May 02 2012 scorotAATTfree.fr- again fix remaining library file name error in spec file
* Wed May 02 2012 scorotAATTfree.fr- fix wrong library file name version
* Wed May 02 2012 scorotAATTfree.fr- Update to version 0.1.1
* Upgraded LAPACK to 3.4.1 version. (Thank Zaheer Chothia)
* Supported LAPACKE, a C interface to LAPACKE. (Thank Zaheer Chothia)
* Fixed the build bug (MD5 and download) on Mac OSX.
* Auto download CUnit 2.1.2-2 from SF.net with UTEST_CHECK=1. x86/x86_64:
* Auto-detect Intel Sandy Bridge Core i7-3xxx & Xeon E7 Westmere-EX.
* Test alpha=Nan in dscale.
* Fixed a SEGFAULT bug in samax on x86 windows.
* Wed Apr 25 2012 scorotAATTfree.fr- version 0.1.0- update openblas-0.1.0-soname.patch- add openblas-0.1.0-noexecstack.patch- spec file cleanup
* Mon Mar 12 2012 scorotAATTfree.fr- version 0.1alpha2.5