Hello,
The v23.11 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at:
https://github.com/ARM-software/ComputeLibrary/releases/tag/v23.11
[https://opengraph.githubassets.com/9c6e9733a1038ab714edff3a08a9589bd88d70f30...]https://github.com/ARM-software/ComputeLibrary/releases/tag/v23.11
Release v23.11 · ARM-software/ComputeLibraryhttps://github.com/ARM-software/ComputeLibrary/releases/tag/v23.11
Public major release Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available here: https://arm-software.github.io/ComputeLibrary/v23.11
github.com
Highlights of the release:
- New features
- Add support for input data type U64/S64 in CLCast and NECast.
- Add support for output data type S64 in NEArgMinMaxLayer and CLArgMinMaxLayer
- Port the following kernels in the experimental Dynamic Fusion interface to use the new Compute Kernel Writer interface:
- experimental::dynamic_fusion::GpuCkwResize
- experimental::dynamic_fusion::GpuCkwPool2d
- experimental::dynamic_fusion::GpuCkwDepthwiseConv2d
- experimental::dynamic_fusion::GpuCkwMatMul
- Add support for OpenCL™ comand buffer with mutable dispatch extension.
- Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82.
- Add support for negative axis values and inverted axis values in arm_compute::NEReverse and arm_compute::CLReverse.
- Add new OpenCL™ kernels:
- opencl::kernels::ClMatMulLowpNativeMMULKernel support for QASYMM8 and QASYMM8_SIGNED, with batch support
- Performance optimizations:
- Optimize cpu::CpuReshape
- Optimize opencl::ClTranspose
- Optimize NEStackLayer
- Optimize CLReductionOperation.
- Optimize CLSoftmaxLayer.
- Optimize start-up time of NEConvolutionLayer for some input configurations where GeMM is selected as the convolution algorithm
- Reduce CPU Overhead by optimal flushing of CL kernels.
- Deprecate support for Bfloat16 in cpu::CpuCast.
- Support for U32 axis in arm_compute::NEReverse and arm_compute::CLReverse will be deprecated in 24.02.
- Remove legacy PostOps interface. PostOps was the experimental interface for kernel fusion and is replaced by the new Dynamic Fusion interface.
- Update OpenCL™ API headers to v2023.04.17.
Thanks
ACL
IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.