Hi, I wonder if the ARM Compute Library can be built and run on ARMv7l
processors, with a subset of the functionality, since SVE is not supported on
v7l? Thanks for the info.
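(Editorial note: the Compute Library build guide documents an armv7a scons target, so a cross-compile along these lines may work for v7l, with the SVE-dependent paths excluded by the 32-bit target. The toolchain name and flag values below are assumptions for illustration, not a verified v7l configuration.)

```shell
# Hypothetical cross-compile for a 32-bit ARMv7-A target (NEON, no SVE).
# Assumes an arm-linux-gnueabihf cross toolchain is on PATH; adjust to your setup.
scons -j8 debug=0 neon=1 opencl=0 os=linux arch=armv7a build=cross_compile
```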
Hello,
The 23.08 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at: https://github.com/ARM-software/ComputeLibrary/releases/tag/v23.08
Public major release. Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available here: https://arm-software.github.io/ComputeLibrary/v23.08/
Highlights of the release:
* Rewrite CLArgMinMaxLayer for axis 0 and enable S64 output.
* Add multi-sketch support for dynamic fusion.
* Break up arm_compute/core/Types.h and utils/Utils.h a bit to reduce unused code in each inclusion of these headers.
* Add Fused Activation to CLMatMul.
* Implement FP32/FP16 opencl::kernels::ClMatMulNativeMMULKernel using the MMUL extension.
* Use MatMul in fully connected layer with dynamic weights when supported.
* Optimize CPU depthwise convolution with channel multiplier.
* Add support in CpuCastKernel for conversion of S64/U64 to F32.
* Add new OpenCL™ kernels:
  * opencl::kernels::ClMatMulNativeMMULKernel support for FP32 and FP16, with batch support.
* Enable transposed convolution with non-square kernels on CPU and GPU.
* Add support for input data type U64/S64 in CLCast.
* Add new Compute Kernel Writer (CKW) subproject that offers a C++ interface to generate tile-based OpenCL code in just-in-time fashion.
* Port the following kernels in the experimental Dynamic Fusion interface to use the new Compute Kernel Writer interface with support for FP16/FP32 only:
  * experimental::dynamic_fusion::GpuCkwActivation
  * experimental::dynamic_fusion::GpuCkwCast
  * experimental::dynamic_fusion::GpuCkwDirectConv2d
  * experimental::dynamic_fusion::GpuCkwElementwiseBinary
  * experimental::dynamic_fusion::GpuCkwStore
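As a rough illustration of what "Fused Activation" in a matmul buys (generic NumPy pseudocode of the idea, not the CLMatMul API):

```python
import numpy as np

def matmul_fused_relu(a, b):
    # In a fused GPU kernel the activation is applied in-register before the
    # result is stored, so the raw product never round-trips through memory.
    # Here we only model the math of the fused operator.
    return np.maximum(a @ b, 0.0)

a = np.array([[1.0, -2.0], [3.0, 4.0]])
b = np.eye(2)
out = matmul_fused_relu(a, b)  # negative products are clamped to 0
```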
IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.
Hello,
The 23.05 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at: https://github.com/ARM-software/ComputeLibrary/releases/tag/v23.05
Public major release. Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available here: https://arm-software.github.io/ComputeLibrary/v23.05/
Highlights of the release:
- New features:
* Add new Arm® Neon™ kernels / functions:
* NEMatMul for QASYMM8, QASYMM8_SIGNED, FP32 and FP16, with batch support.
* NEReorderLayer (aarch64 only)
* Add new OpenCL™ kernels / functions:
* CLMatMul support for QASYMM8, QASYMM8_SIGNED, FP32 and FP16, with batch support.
* Add support for multiple dimensions in the indices parameter for both the Arm® Neon™ and OpenCL™ implementations of the Gather Layer.
* Add support for dynamic weights in CLFullyConnectedLayer and NEFullyConnectedLayer for all data types.
* Add support for cropping in the Arm® Neon™ and OpenCL™ implementations of the BatchToSpace Layer for all data types.
* Add support for quantized data types for the ElementwiseUnary Operators for Arm® Neon™.
* Implement RSQRT for quantized data types on OpenCL™.
* Add FP16 depthwise convolution kernels for SME2.
- Performance optimizations:
* Improve CLTuner exhaustive mode tuning time.
- Deprecate dynamic block shape in NEBatchToSpaceLayer and CLBatchToSpaceLayer.
- Various optimizations and bug fixes.
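To illustrate what multi-dimensional indices in a Gather mean for the output shape (generic NumPy semantics, not the Arm® Neon™/OpenCL™ API itself):

```python
import numpy as np

def gather(params, indices, axis=0):
    # The gathered axis of `params` is replaced by the full shape of
    # `indices`, so 2-D indices on a (3, 4) input along axis 0 yield
    # an output of shape indices.shape + (4,).
    return np.take(params, indices, axis=axis)

p = np.arange(12).reshape(3, 4)
idx = np.array([[0, 2], [1, 1]])   # 2-D indices
out = gather(p, idx, axis=0)       # shape (2, 2, 4)
```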
Hello,
The v23.02.1 patch release of Compute Library is out and comes with several fixes.
Source code and prebuilt binaries are available at: https://github.com/ARM-software/ComputeLibrary/releases/tag/v23.02.1
Highlights of the release:
v23.02.1 Public patch release:
* Allow mismatching data layouts between the source tensor and weights for CpuGemmDirectConv2d with fixed format kernels.
* Fixes for experimental CPU only Bazel and CMake builds.
Hello,
It has come to our attention that the Compute Library v23.02 release contains the following erratum:
* Missing .bazelrc file for experimental Bazel builds
This erratum has now been rectified in the latest commit of the main branch on the GitHub release repository: cfb1c3035cbfc31a2fe8491c7df13e911698e2b6
Please use this commit if you rely on the new experimental Bazel build for Compute Library.
Hello,
The 23.02 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at: https://github.com/ARM-software/ComputeLibrary/releases/tag/v23.02
Highlights of the release:
v23.02 Public major release
* New features:
* Rework the experimental dynamic fusion interface by identifying auxiliary and intermediate tensors, and specifying an explicit output operator.
* Add the following operators to the experimental dynamic fusion API:
* GpuAdd, GpuCast, GpuClamp, GpuDepthwiseConv2d, GpuMul, GpuOutput, GpuPool2d, GpuReshape, GpuResize, GpuSoftmax, GpuSub.
* Add SME/SME2 kernels for GeMM, Winograd convolution, Depthwise convolution and Pooling.
* Add new CPU operator AddMulAdd for float and quantized types.
* Add new flag ITensorInfo::lock_paddings() to tensors to prevent extending tensor paddings.
* Add experimental support for CPU only Bazel and CMake builds.
* Performance optimizations:
* Optimize CPU base-e exponential functions for FP32.
* Optimize CPU StridedSlice by copying first dimension elements in bulk where possible.
* Optimize CPU quantized Subtraction by reusing the quantized Addition kernel.
* Optimize CPU ReduceMean by removing quantization steps and performing the operation in integer domain.
* Optimize GPU Scale and Dynamic Fusion GpuResize by removing quantization steps and performing the operation in integer domain.
* Update the heuristic for CLDepthwiseConvolutionNative kernel.
* Add new optimized OpenCL kernel to compute indirect convolution:
* ClIndirectConv2dKernel
* Add new optimized OpenCL kernel to compute transposed convolution:
* ClTransposedConvolutionKernel
* Update recommended/minimum NDK version to r20b.
* Various optimizations and bug fixes.
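The quantized-Subtraction-via-Addition optimization mentioned above works because affine dequantization is linear in the scale: negating b's scale negates its real value, so the unmodified addition kernel computes a subtraction. A minimal NumPy sketch of the idea (illustrative only, not the actual CPU kernel):

```python
import numpy as np

def quantize(x, scale, zp):
    return np.clip(np.round(x / scale) + zp, 0, 255).astype(np.uint8)

def dequantize(q, scale, zp):
    return scale * (q.astype(np.int32) - zp)

def qadd(qa, pa, qb, pb, pout):
    # Reference quantized addition kernel: dequantize, add, requantize.
    f = dequantize(qa, *pa) + dequantize(qb, *pb)
    return quantize(f, *pout)

def qsub(qa, pa, qb, pb, pout):
    # Subtraction reuses the addition kernel by negating b's scale:
    # dequantize(q, -s, z) == -dequantize(q, s, z), so add(a, b') == a - b.
    sb, zb = pb
    return qadd(qa, pa, qb, (-sb, zb), pout)
```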
Hello,
The 22.11 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at: https://github.com/ARM-software/ComputeLibrary/releases/tag/v22.11
Public major release. Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available here: https://arm-software.github.io/ComputeLibrary/v22.11/
Highlights of the release:
* New features:
* Add new experimental dynamic fusion API.
* Add CPU batch matrix multiplication with adj_x = false and adj_y = false for FP32.
* Add CPU MeanStdDevNorm for QASYMM8.
* Add CPU and GPU GELU activation function for FP32 and FP16.
* Add CPU swish activation function for FP32 and FP16.
* Performance optimizations:
* Optimize CPU bilinear scale for FP32, FP16, QASYMM8, QASYMM8_SIGNED, U8 and S8.
* Optimize CPU activation functions using LUT-based implementation:
* Sigmoid function for QASYMM8 and QASYMM8_SIGNED.
* Hard swish function for QASYMM8_SIGNED.
* Optimize CPU addition for QASYMM8 and QASYMM8_SIGNED using fixed-point arithmetic.
* Optimize CPU multiplication, subtraction and activation layers by considering tensors as 1D.
* Optimize GPU depthwise convolution kernel and heuristic.
* Optimize GPU Conv2d heuristic.
* Optimize CPU MeanStdDevNorm for FP16.
* Optimize CPU tanh activation function for FP16 using rational approximation.
* Improve GPU GeMMLowp start-up time.
* Various optimizations and bug fixes.
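The LUT-based activation optimization above exploits the fact that an 8-bit quantized input has only 256 possible values, so the whole activation collapses to one table lookup per element. A generic sketch of the technique for a QASYMM8 sigmoid (illustrative, not ACL kernel code; the quantization parameters are made up):

```python
import numpy as np

def sigmoid_lut_u8(in_scale, in_zp, out_scale, out_zp):
    # Precompute the quantized output for every possible 8-bit input value.
    q = np.arange(256)
    x = in_scale * (q - in_zp)        # dequantize all 256 inputs
    y = 1.0 / (1.0 + np.exp(-x))      # sigmoid in float, done once
    return np.clip(np.round(y / out_scale) + out_zp, 0, 255).astype(np.uint8)

# Example parameters: input centered at zp=128, output mapped to [0, 1).
lut = sigmoid_lut_u8(in_scale=0.1, in_zp=128, out_scale=1 / 256, out_zp=0)

def sigmoid_q8(tensor):
    return lut[tensor]  # the per-element activation is a single lookup
```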
Hello,
The 22.08 release of Compute Library is out and comes with a collection of improvements and new features.
Source code and prebuilt binaries are available at: https://github.com/ARM-software/ComputeLibrary/releases/tag/v22.08
Public major release. Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available here: https://arm-software.github.io/ComputeLibrary/v22.08/
Highlights of the release:
* Add Dynamic Fusion of Elementwise Operators: Div, Floor, Add.
* Optimize the gemm_reshaped_rhs_only_nt OpenCL kernel using the arm_matrix_multiply extension available for Arm® Mali™-G715 and Arm® Mali™-G615.
* Add support for the arm_matrix_multiply extension in the gemmlowp_mm_reshaped_only_rhs_t OpenCL kernel.
* Expand GPUTarget list with missing Mali™ GPUs product names: G57, G68, G78AE, G610, G510, G310.
* Extend the direct convolution 2d interface to configure the block size.
* Update ClConv2D heuristic to use direct convolution.
* Use official Khronos® OpenCL extensions:
* Add cl_khr_integer_dot_product extension support.
* Add support of OpenCL 3.0 non-uniform workgroup.
* Cpu performance optimizations:
* Add LUT-based implementation of Hard Swish and Leaky ReLU activation function for aarch64 build.
* Optimize Add layer by considering the input tensors as 1D array.
* Add fixed-format BF16, FP16 and FP32 Neon™ GEMM kernels to support variable weights.
* Add experimental support for native builds for Windows on Arm®.
* Build flag interpretation change: arch=armv8.6-a now translates to the -march=armv8.6-a CXX flag instead of -march=armv8.2-a plus explicit selection of feature extensions.
* The armv7a Android build will no longer be tested or maintained.