SEARCH
NEW RPMS
DIRECTORIES
ABOUT
FAQ
VARIOUS
BLOG

 
 
Changelog for kokkos-devel-4.3.00-1.1.x86_64.rpm :

* Tue May 07 2024 ming li - 4.3.00- Update to version 4.3.00:
* Features: Add Experimental::sort_by_key(exec, keys, values) algorithm #6801
* Backend and Architecture Enhancements: CUDA: Experimental multi-GPU support (from the same process) #6782 Link against CUDA libraries even with KOKKOS_ENABLE_COMPILE_AS_CMAKE_LANGUAGE #6701 Don\'t use the compiler launcher script if the CMake compile language is CUDA. #6704 nvcc(wrapper): adding \"long\" and \"short\" versions for all flags #6615 HIP: Fix compilation when using amdclang (with ROCm >= 5.7) and RDC #6857 Use rocthrust for sorting, when available #6793 SYCL: We only support OneAPI SYCL implementation: add check during initialization Error out on initialization if the backend is different from ext_oneapi_
* #6784 Filter GPU devices for ext_onapi_
* GPU devices #6758 Performance Improvements Avoid unnecessary zero-memset of the scratch flags in SYCL #6739 Use host-pinned memory to copy reduction/scan result #6500 Address deprecations after oneAPI 2023.2.0 #6577 Make sure to call find_dependency for oneDPL if necessary #6870 OpenMPTarget: Use LLVM extensions for dynamic shared memory #6380 Guard scratch memory usage in ParallelReduce #6585 Update linker flags for Intel GPUs update #6735 Improve handling of printf on Intel GPUs #6652 OpenACC: Add atomics support #6446 Make the OpenACC backend asynchronous #6772 Threads: Add missing broadcast to TeamThreadRange parallel_scan #6601 OpenMP: Improve performance of view initializations and filling with zeros #6573
* General Enhancements Improve performance of random number generation when using a normal distribution on GPUs #6556 Allocate temporary view with the user-provided execution space instance and do not initialize in unique algorithm #6598 Add deduction guide for Kokkos::Array #6373 Provide new public headers and #6687 Fix/improvement to remove_if parallel algorithm: use the provided execution space instance for temporary allocations and drop unnecessaryinitialization + avoid evaluating twice the predicate during final pass #6747 Add runtime function to query the number of devices and make device ID consistent with KOKKOS_VISIBLE_DEVICES #6713 simd: support vector_aligned_tag #6243 Avoid unnecessary allocation when default constructing Bitset #6524 Fix constness for views in std algorithms #6813 Improve error message on unsafe implicit conversion in MDRangePolicy #6855 CTAD (deduction guides) for RangePolicy #6850 CTAD (deduction guides) for MDRangePolicy #5516
* Build System Changes Require Kokkos_ENABLE_ATOMICS_BYPASS option to bypass atomic operation for Serial backend only builds #6692 Add support for RISCV and the Milk-V\'s Pioneer #6773 Add C++26 standard to CMake setup #6733 Fix Makefile when using gnu_generate_makefile.sh and make >= 4.3 #6606 Cuda: Fix configuring with CMake >= 3.28.4 - temporary fallback to internal CudaToolkit.cmake #6898
* Incompatibilities (i.e. breaking changes) Remove all DEPRECATED_CODE_3 option and all code that was guarded by it #6523 Drop guards to accommodate external code defining KOKKOS_ASSERT #6665 Profiling::ProfilingSection(std::string) constructor marked explicit and nodiscard #6690 Add bound check preconditions for RangePolicy and MDRangePolicy #6617 #6726 Add checks for unsafe implicit conversions in RangePolicy #6754 Remove Kokkos::[b]half_t volatile overloads #6579 Remove KOKKOS_IMPL_DO_NOT_USE_PRINTF #6593 Check matching static extents in View constructor #5190 Tools(profiling): fix typo Kokkos_Tools_Optim[i]zationGoal #6642 Remove variadic range policy constructor (disallow passing multiple trailing chunk size arguments) #6845 Improve message on view out of bounds access and always abort #6861 Drop KOKKOS_ENABLE_INTEL_MM_ALLOC macro #6797 Remove Kokkos::Experimental::LogicalMemorySpace (without going through deprecation) #6557 Remove Experimental::HBWSpace and support for linking against memkind #6791 Drop librt TPL and associated KOKKOS_ENABLE_LIBRT macro #6798 Drop support for old CPU architectures (ARCH_BGQ, ARCH_POWER7, ARCH_WSM and associated ARCH_SSE4 macro) #6806 Drop support for deprecated command-line arguments and environment variables #6744
* Deprecations Provide kokkos_swap as part of Core and deprecate Experimental::swap in Algorithms #6697 Deprecate {Cuda,HIP}::detect_device_count() and Cuda::[detect_]device_arch() #6710 Deprecate ExecutionSpace::in_parallel() #6582
* Bug Fixes Fix team-level MDRange reductions: #6511 Fix CUDA and SYCL small value type (16-bit) team reductions #5334 Enable {transform_}exclusive_scan in place #6667 fill_random overload that do not take an execution space instance argument should fence #6658 HIP,Cuda,OpenMPTarget: Fixup use provided execution space when copying host inaccessible reduction result #6777 Fix typo in cuda_func_set_attribute[s]_wrapper preventing proper setting of desired occupancy #6786 Avoid undefined behavior due to conversion between signed and unsigned integers in shift_{right, left}_team_impl #6821 Fix a bug in Makefile.kokkos when using AMD GPU architectures as AMD_GFXYYY #6892
* Sun Mar 10 2024 Sarah Kriesch - Remove constraints file because of improved memory usage- Edit shebang lines for nvcc_wrapper and kokkos_launch_compiler because of rpm errors
* Wed Dec 06 2023 Cezary Skrzyński - 4.2.00- Update to new upstream release 4.2.00: see https://github.com/kokkos/kokkos/blob/4.2.00/CHANGELOG.md
* Wed Jul 05 2023 Vicente Adolfo Bolea Sanchez - 4.1.00- Update to new upstream release 4.1.00: see https://github.com/kokkos/kokkos/blob/4.1.00/CHANGELOG.md- Use a tarball as Source0 instead of an URL.
* Wed May 03 2023 Vicente Adolfo Bolea Sanchez - 4.0.01- Update to new upstream release 4.0.01: see https://github.com/kokkos/kokkos/blob/62d2b6c879b74b6ae7bd06eb3e5e80139c4708e6/CHANGELOG.md- Use Ninja for building the project.
* Tue Apr 04 2023 Cezary Skrzyński - 4.0.00- Update to new upstream release 4.0.00: see https://github.com/kokkos/kokkos/blob/master/CHANGELOG.md#400-2023-02-21- Increase memory limits for build to avoid OpenMP tests running out of memory- Add kokkos-do-not-use-env-as-interpreter.patch to fix rpmlint complaint about `env` being used as interpreter
* Tue Apr 26 2022 Jan Engelhardt - Resolve rpmlint report \"libkokkos3.x86_64: E: shlib-policy-name-error SONAME: libkokkoscontainers.so.3.3.0, expected package suffix: 3_3_0\"
* Tue Oct 12 2021 Stefan Brüns - Remove random constraint adjustments again, and use a global lower limit of 6 GB of memory (minimum for the OMP tests).- Limit the number of parallel jobs and OMP threads in the unit tests, to keep the required memory bounded. This way, even architectures with many cores and comparatively low memory can build successfully.
* Sun Oct 03 2021 Sarah Kriesch - New entries in constraints file because of missing build nodes with 18GB of memory for s390x and PPC: - 2GB for memoryperjob at x86, riscv64 and s390x - aarch64 3GB for memoryperjob because of \"Out of memory\" with 2GB - PPC64 and PPC64le with 1GB for memoryperjob because of missing node capacity
* Sat Dec 19 2020 Christoph Junghans - dropped 3308.patch - merged upstream- Version bump to 3.3.00: - Features: - Require C++14 as minimum C++ standard. C++17 and C++20 are supported too. - HIP backend is nearly feature complete. Kokkos Dynamic Task Graphs are missing. - Major update for OpenMPTarget: many capabilities now work. For details contact us. - Added DPC++/SYCL backend: primary capabilites are working. - Added Kokkos Graph API analogous to CUDA Graphs. - Added parallel_scan support with TeamThreadRange [gh#kokkos/kokkos#3536] - Added Logical Memory Spaces [gh#kokkos/kokkos#3546] - Added initial half precision support [gh#kokkos/kokkos#3439] - Experimental feature: control cuda occupancy [gh#kokkos/kokkos#3379] - Implemented enhancements Backends and Archs: - Add a64fx and fujitsu Compiler support [gh#kokkos/kokkos#3614] - Adding support for AMD gfx908 archictecture [gh#kokkos/kokkos#3375] - SYCL parallel_for MDRangePolicy [gh#kokkos/kokkos#3583] - SYCL add parallel_scan [gh#kokkos/kokkos#3577] - SYCL custom reductions [gh#kokkos/kokkos#3544] - SYCL Enable container unit tests [gh#kokkos/kokkos#3550] - SYCL feature level 5 [gh#kokkos/kokkos#3480] - SYCL Feature level 4 (parallel_for) [gh#kokkos/kokkos#3474] - SYCL feature level 3 [gh#kokkos/kokkos#3451] - SYCL feature level 2 [gh#kokkos/kokkos#3447] - OpenMPTarget: Hierarchial reduction for + operator on scalars [gh#kokkos/kokkos#3504] - OpenMPTarget hierarchical [gh#kokkos/kokkos#3411] - HIP Add Impl::atomic_[store,load] [gh#kokkos/kokkos#3440] - HIP enable global lock arrays [gh#kokkos/kokkos#3418] - HIP Implement multiple occupancy paths for various HIP kernel launchers [gh#kokkos/kokkos#3366] - Implemented enhancements Policies: - MDRangePolicy: Let it be semiregular [gh#kokkos/kokkos#3494] - MDRangePolicy: Check narrowing conversion in construction [gh#kokkos/kokkos#3527] - MDRangePolicy: CombinedReducers support [gh#kokkos/kokkos#3395] - Kokkos Graph: Interface and Default Implementation [gh#kokkos/kokkos#3362] - Kokkos Graph: add Cuda Graph implementation [gh#kokkos/kokkos#3369] - TeamPolicy: implemented autotuning of team sizes and vector lengths [gh#kokkos/kokkos#3206] - RangePolicy: Initialize all data members in default constructor [gh#kokkos/kokkos#3509] - Implemented enhancements BuildSystem: - Auto-generate core test files for all backends [gh#kokkos/kokkos#3488] - Avoid rewriting test files when calling cmake [gh#kokkos/kokkos#3548] - RULE_LAUNCH_COMPILE and RULE_LAUNCH_LINK system for nvcc_wrapper [gh#kokkos/kokkos#3136] - Adding -include as a known argument to nvcc_wrapper [gh#kokkos/kokkos#3434] - Install hpcbind script [gh#kokkos/kokkos#3402] - cmake/kokkos_tribits.cmake: add parsing for args [gh#kokkos/kokkos#3457] - Implemented enhancements Tools: - Changed namespacing of Kokkos::Tools::Impl::Impl::tune_policy [gh#kokkos/kokkos#3455] - Delegate to an impl allocate/deallocate method to allow specifying a SpaceHandle for MemorySpaces [gh#kokkos/kokkos#3530] - Use the Kokkos Profiling interface rather than the Impl interface [gh#kokkos/kokkos#3518] - Runtime option for tuning [gh#kokkos/kokkos#3459] - Dual View Tool Events [gh#kokkos/kokkos#3326] - Implemented enhancements Other: - Abort on errors instead of just printing [gh#kokkos/kokkos#3528] - Enable C++14 macros unconditionally [gh#kokkos/kokkos#3449] - Make ViewMapping trivially copyable [gh#kokkos/kokkos#3436] - Rename struct ViewMapping to class [gh#kokkos/kokkos#3435] - Replace enums in Kokkos_ViewMapping.hpp (removes -Wextra) [gh#kokkos/kokkos#3422] - Use bool for enums representing bools [gh#kokkos/kokkos#3416] - Fence active instead of default execution space instances [gh#kokkos/kokkos#3388] - Refactor parallel_reduce fence usage [gh#kokkos/kokkos#3359] - Moved Space EBO helpers to Kokkos_EBO [gh#kokkos/kokkos#3357] - Add remove_cvref type trait [gh#kokkos/kokkos#3340] - Adding identity type traits and update definition of identity_t alias [gh#kokkos/kokkos#3339] - Add is_specialization_of type trait [gh#kokkos/kokkos#3338] - Make ScratchMemorySpace semi-regular [gh#kokkos/kokkos#3309] - Optimize min/max atomics with early exit on no-op case [gh#kokkos/kokkos#3265] - Refactor Backend Development [gh#kokkos/kokkos#2941] - Fixed bugs: - Fixup MDRangePolicy construction from Kokkos arrays [gh#kokkos/kokkos#3591] - Add atomic functions for unsigned long long using gcc built-in [gh#kokkos/kokkos#3588] - Fixup silent pointless comparison with zero in checked_narrow_cast (compiler workaround) [gh#kokkos/kokkos#3566] - Fixes for ROCm 3.9 [gh#kokkos/kokkos#3565] - Fix windows build issues which crept in for the CUDA build [gh#kokkos/kokkos#3532] - HIP Fix atomics of large data types and clean up lock arrays [gh#kokkos/kokkos#3529] - Pthreads fix exception resulting from 0 grain size [gh#kokkos/kokkos#3510] - Fixup do not require atomic operation to be default constructible [gh#kokkos/kokkos#3503] - Fix race condition in HIP backend [gh#kokkos/kokkos#3467] - Replace KOKKOS_DEBUG with KOKKOS_ENABLE_DEBUG [gh#kokkos/kokkos#3458] - Fix multi-stream team scratch space definition for HIP [gh#kokkos/kokkos#3398] - HIP fix template deduction [gh#kokkos/kokkos#3393] - Fix compiling with HIP and C++17 [gh#kokkos/kokkos#3390] - Fix sigFPE in HIP blocksize deduction [gh#kokkos/kokkos#3378] - Type alias change: replace CS with CTS to avoid conflicts with NVSHMEM [gh#kokkos/kokkos#3348] - Clang compilation of CUDA backend on Windows [gh#kokkos/kokkos#3345] - Fix HBW support [gh#kokkos/kokkos#3343] - Added missing fences to unique token [gh#kokkos/kokkos#3260] - Incompatibilities: - Remove unused utilities (forward, move, and expand_variadic) from Kokkos::Impl [gh#kokkos/kokkos#3535] - Remove unused traits [gh#kokkos/kokkos#3534] - HIP: Remove old HCC code [gh#kokkos/kokkos#3301] - Prepare for deprecation of ViewAllocateWithoutInitializing [gh#kokkos/kokkos#3264] - Remove ROCm backend [gh#kokkos/kokkos#3148]
 
ICM