|
|
|
|
Changelog for dav1d-devel-1.x+git.20231024T202733~48ef395-lp156.21.2.x86_64.rpm :
* Sun Oct 29 2023 obs-service-tar-scmAATTinvalid- Update to version 1.x+git.20231024T202733~48ef395: * CI: Update images * x86: Add 8-bit ipred z3 AVX-512 (Ice Lake) asm * deblock_avx512: convert byte-shifts to gf2p8affineqb * x86: Add 8-bit ipred z1 AVX-512 (Ice Lake) asm * x86: Consolidate some pb_0to31 and pb_0to63 constants * Prepare for release 1.3.0 * fix: various errors in implementation of BTI * Use the correct free() function on dav1d_mem_pool_init() failure * Don\'t hard-code FGS block size * Rename BLOCK_SIZE to FG_BLOCK_SIZE * Fri Mar 18 2022 Luigi Baldoni - Update to version 1.0.0 * Automatic thread management. * Add support for AVX-512 acceleration. * x86 code speedup (from SSE2 to AVX2). * New grain API to ease acceleration on the GPU. * New API call to get information of which frame failed to decode, in error cases. * Numerous small bug fixes.- Bump soversion to 6 * Fri Sep 03 2021 Luigi Baldoni - Update to version 0.9.2 * x86: SSE4 optimizations of inverse transforms for 10bit for all sizes * x86: mc.resize optimizations with AVX2/SSSE3 for 10/12b * x86: SSSE3 optimizations for cdef_filter in 10/12b and mc_w_mask_422/444 in 8b * ARM NEON optimizations for FilmGrain Gen_grain functions * Optimizations for splat_mv in SSE2/AVX2 and NEON * x86: SGR improvements for SSSE3 CPUs * x86: AVX2 optimizations for cfl_ac * Thu Jul 29 2021 Luigi Baldoni - Update to version 0.9.1 * 10/12b SSSE3 optimizations for mc (avg, w_avg, mask, w_mask, emu_edge), prep/put_bilin, prep/put_8tap, ipred (dc/h/v, paeth, smooth, pal, filter), wiener, sgr (10b), warp8x8, deblock, film_grain, cfl_ac/pred for 32bit and 64bit x86 processors * Film grain NEON for fguv 10/12b, fgy/fguv 8b and fgy/fguv 10/12 arm32 * Fixes for filmgrain on ARM * itx 10bit optimizations for 4x4/x8/x16, 8x4/x8/x16 for SSE4 * Misc improvements on SSE2, SSE4 * Sun May 16 2021 Luigi Baldoni - Update to version 0.9.0 * x86 (64bit) AVX2 implementation of most 10b/12b functions, which should provide a large boost for high-bitdepth decoding on modern x86 computers and servers. * ARM64 neon implementation of FilmGrain (4:2:0/4:2:2/4:4:4 8bit) * New API to signal events happening during the decoding process * Wed Mar 24 2021 Luigi Baldoni - Disable LTO (fix boo#1183956) * Mon Feb 22 2021 Luigi Baldoni - Update to version 0.8.2 * ARM32 optimizations for ipred and itx in 10/12bits, completing the 10b/12b work on ARM64 and ARM32 * Give the post-filters their own threads * ARM64: rewrite the wiener functions * Speed up coefficient decoding, 0.5%-3% global decoding gain * x86 optimizations for CDEF_filter and wiener in 10/12bit * x86: rewrite the SGR AVX2 asm * x86: improve msac speed on SSE2+ machines * ARM32: improve speed of ipred and warp * ARM64: improve speed of ipred, cdef_dir, cdef_filter, warp_motion and itx16 * ARM32/64: improve speed of looprestoration * Add seeking, pausing to the player * Update the player for rendering of 10b/12b * Misc speed improvements and fixes on all platforms * Add a xxh3 muxer in the dav1d application * Sat Jan 02 2021 aloisioAATTgmx.com- Update to version 0.8.1 * Keep references to buffers valid after dav1d_close(). Fixes a regression caused by the picture buffer pool added in 0.8.0. * ARM32 optimizations for 10bit bitdepth for SGR * ARM32 optimizations for 16bit bitdepth for blend/w_masl/emu_edge * ARM64 optimizations for 10bit bitdepth for SGR * x86 optimizations for wiener in SSE2/SSSE3/AVX2 * Tue Nov 24 2020 Luigi Baldoni - Update to version 0.8.0 * Improve the performance by using a picture buffer pool; * ARM32 optimizations for 8bit bitdepth for ipred paeth, smooth, cfl * ARM32 optimizations for 10/12/16bit bitdepth for mc_avg/mask/w_avg, put/prep 8tap/bilin, wiener and CDEF filters * ARM64 optimizations for cfl_ac 444 for all bitdepths * x86 optimizations for MC 8-tap, mc_scaled in AVX2 * x86 optimizations for CDEF in SSE and {put/prep}_{8tap/bilin} in SSSE3- Bump soversion to 5- Drop dav1d-nasm-2.15.patch (merged upstream) * Tue Sep 01 2020 Dominique Leuenberger - Add dav1d-nasm-2.15.patch: Fix compilation with nasm 2.15. * Mon Jun 22 2020 aloisioAATTgmx.com- Update to version 0.7.1 * ARM32 NEON optimizations for itxfm, which can give up to 28% speedup, and MSAC * SSE2 optimizations for prep_bilin and prep_8tap * AVX2 optimizations for MC scaled * Fix a clamping issue in motion vector projection * Fix an issue on some specific Haswell CPU on ipred_z AVX2 functions * Improvements on the dav1dplay utility player to support resizing * Wed May 20 2020 Luigi Baldoni - Update to verison 0.7.0 * Faster refmv implementation gaining up to 12% speed while - 25% of RAM (Single Thread) * 10b/12b ARM64 optimizations are mostly complete: + ipred (paeth, smooth, dc, pal, filter, cfl) + itxfm (only 10b) * AVX2/SSSE3 for non-4:2:0 film grain and for mc.resize * AVX2 for cfl4:4:4 * AVX-512 CDEF filter * ARM64 8b improvements for cfl_ac and itxfm * ARM64 implementation for emu_edge in 8b/10b/12b * ARM32 implementation for emu_edge in 8b * Improvements on the dav1dplay utility player to support 10 bit, non-4:2:0 pixel formats and film grain on the GPU * Fri Mar 06 2020 Luigi Baldoni - Update to version 0.6.0 * New ARM64 optimizations for the 10/12bit depth: + mc_avg, mc_w_avg, mc_mask + mc_put/mc_prep 8tap/bilin + mc_warp_8x8 + mc_w_mask + mc_blend + wiener + SGR + loopfilter + cdef * New AVX-512 optimizations for prep_bilin, prep_8tap, cdef_filter, mc_avg/w_avg/mask * New SSSE3 optimizations for film grain * New AVX2 optimizations for msac_adapt16 * Fix rare mismatches against the reference decoder, notably because of clipping * Improvements on ARM64 on msac, cdef and looprestoration optimizations * Improvements on AVX2 optimizations for cdef_filter * Improvements in the C version for itxfm, cdef_filter- Bump sover to 4 * Wed Dec 04 2019 Luigi Baldoni - Update to version 0.5.2 * ARM32 optimizations for loopfilter, ipred_dc|h|v * Add section-5 raw OBU demuxer * Improve the speed by reducing the L2 cache collisions * Fix minor issues * Sat Oct 26 2019 Luigi Baldoni - Update to version 0.5.1 * SSE2 optimizations for CDEF, wiener and warp_affine * NEON optimizations for SGR on ARM32 * Fix mismatch issue in x86 asm in inverse identity transforms * Fix build issue in ARM64 assembly if debug info was enabled * Add a workaround for Xcode 11 -fstack-check bug- Dropped arm64_ipred_symbols_aligned.patch (merged upstream) * Fri Oct 11 2019 Luigi Baldoni - Update to version 0.5.0 Medium release fixing regressions and minor issues, and improving speed significantly: * Export ITU T.35 metadata * Speed improvements on blend_ on ARM * Speed improvements on decode_coef and MSAC * NEON optimizations for blend *, w_mask_, ipred functions for ARM64 * NEON optimizations for CDEF and warp on ARM32 * SSE2 optimizations for MSAC hi_tok decoding * SSSE3 optimizations for deblocking loopfilters and warp_affine * AVX-2 optimizations for film grain and ipred_z2 * SSE4 optimizations for warp_affine * VSX optimizations for wiener * Fix inverse transform overflows in x86 and NEON asm * Fix integer overflows with large frames * Improve film grain generation to match reference code * Improve compatibility with older binutils for ARM * More advanced Player example in tools- Bump soversion to 3- Added arm64_ipred_symbols_aligned.patch to fix aarch64 build * Mon Aug 05 2019 Luigi Baldoni - Update to version 0.4.0 * Fix playback with unknown OBUs * Add an option to limit the maximum frame size * SSE2 and ARM64 optimizations for MSAC * Improve speed on 32bits systems * Optimization in obmc blend * Reduce RAM usage significantly * The initial PPC SIMD code, cdef_filter * NEON optimizations for blend functions on ARM * NEON optimizations for w_mask functions on ARM * NEON optimizations for inverse transforms on ARM64 * Improve handling of malloc failures * Simple Player example in tools- Dropped dav1d.armv6.patch (merged upstream)- Bumped SOVERSION to 2 * Mon May 13 2019 olafAATTaepfle.de- Added dav1d.armv6.patch (disables armv7 asm for armv6 builds) * Sat May 11 2019 Luigi Baldoni - Update to version 0.3.1 * Fix a buffer overflow in frame-threading mode on SSSE3 CPUs * Reduce binary size, notably on Windows * SSSE3 optimizations for ipred_filter * ARM optimizations for MSAC * Mon Apr 29 2019 Luigi Baldoni - Update to version 0.3.0 * Fixes an annoying crash on SSSE3 that happened in the itx functions * Fri Apr 19 2019 Luigi Baldoni - Update to version 0.2.2 * Large improvement on MSAC decoding with SSE, bringing 4-6% speed increase The impact is important on SSSE3, SSE4 and AVX-2 cpus * SSSE3 optimizations for all blocks size in itx * SSSE3 optimizations for ipred_paeth and ipref_cfl (420, 422 and 444) * Speed improvements on CDEF for SSE4 CPUs * NEON optimizations for SGR and loop filter * Minor crashes, improvements and build changes * Tue Apr 02 2019 Dominique Leuenberger - Add baselibs.conf: ffmpeg, which is the main consumer of Dav1d, produces -32bit packages that would be uninstallable otherwise. * Tue Mar 12 2019 Luigi Baldoni - Update to version 0.2.1 * SSSE3 optimization for cdef_dir * AVX-2 improvements of the existing CDEF optimizations * NEON improvements of the existing CDEF and wiener optimizations * Clarification about the numbering/versionning scheme * Mon Mar 04 2019 Luigi Baldoni - Update to version 0.2.0 * ARM64 and ARM optimizations using NEON instructions * SSSE3 optimizations for both 32 and 64bits * More AVX-2 assembly, reaching almost completion * Fix installation of includes * Rewrite inverse transforms to avoid overflows * Snap packaging for Linux * Updated API (ABI and API break) * Fixes for un-decodable samples * Thu Dec 13 2018 Jan Engelhardt - Redo description and mention SIMD acceleration. * Thu Dec 13 2018 Luigi Baldoni - Moved license file to library package * Tue Dec 11 2018 Luigi Baldoni - Initial stable package (v0.1.0)
|
|
|