Hello,
The 23.05 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at:
Release v23.05 · ARM-software/ComputeLibraryhttps://github.com/ARM-software/ComputeLibrary/releases/tag/v23.05
Public major release Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available here: https://arm-software.github.io/ComputeLibrary/v23.05/
Highlights of the release:
- New features:
* Add new Arm® Neon™ kernels / functions: * NEMatMul for QASYMM8, QASYMM8_SIGNED, FP32 and FP16, with batch support. * NEReorderLayer (aarch64 only) * Add new OpenCL™ kernels / functions: * CLMatMul support for QASYMM8, QASYMM8_SIGNED, FP32 and FP16, with batch support. * Add support for the multiple dimensions in the indices parameter for both the Arm® Neon™ and OpenCL™ implementations of the Gather Layer. * Add support for dynamic weights in CLFullyConnectedLayer and NEFullyConnectedLayer for all data types. * Add support for cropping in the Arm® Neon™ and OpenCL™: implementations of the BatchToSpace Layer for all data types. * Add support for quantized data types for the ElementwiseUnary Operators for Arm® Neon™. * Implement RSQRT for quantized data types on OpenCL™. * Add FP16 depthwise convolution kernels for SME2.
- Performance optimizations:
* Improve CLTuner exhaustive mode tuning time.
- Deprecate dynamic block shape in NEBatchToSpaceLayer and CLBatchToSpaceLayer. - Various optimizations and bug fixes.
IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.