linaro-toolchain

linaro-toolchain@lists.linaro.org

5 participants
5682 discussions

by Koen Kooi

Hi, We're currently running into issues with the OE builds due to OE-core having moved to 2.22. So what's the plan for glibc-linaro 2.22? -- Koen Kooi Builds and Baselines | Release Manager Linaro.org | Open source software for ARM SoCs

10 years, 3 months

Configuration question

by Bill Fischofer

Hi, This question has arisen in the ODP project and the thought is that a 'best practices' answer would be more likely to be found on this list. We have a component that wants to make use of specialized instructions for performing CRC and/or AES computations and was wondering what is the recommended way for an application to determine whether such instructions are available in the toolchain and whether the user has overruled their use? Thanks for any insight you can provide. Bill

10 years, 3 months

Re: [lng-odp] Runtime inlining

by Ola Liljedahl

I think there are many issues with binary compatibility beyond function inlining. An ODP application cannot expect all ODP implementations to support the same number of ODP queues or classification rules or even which classification terms (fields) are supported (efficiently/in HW) etc. Is there some kind of lowest common denominator an application should expect? Do we want to make guarantees of an ODP implementation stricter? What are the consequences of such strict functional guarantees? I think an application that requires binary compatibility over ARMv8.1 platforms should compile and link against a specific ODP SW implementation (possibly with some well-defined HW offloads where the underlying platform can provide the relevant drivers). I.e. more of a (user-space) Linux architecture than standard ODP (as influenced by OpenGL). The important binary interfaces then becomes the interfaces to these offloads/drivers. On 16 November 2015 at 14:23, Nicolas Morey-Chaisemartin <nmorey(a)kalray.eu> wrote: > > > On 11/11/2015 09:45 AM, Savolainen, Petri (Nokia - FI/Espoo) wrote: >> >>> -----Original Message----- >>> From: lng-odp [mailto:lng-odp-bounces@lists.linaro.org] On Behalf Of >>> EXT Nicolas Morey-Chaisemartin >>> Sent: Tuesday, November 10, 2015 5:13 PM >>> To: Zoltan Kiss; linaro-toolchain(a)lists.linaro.org >>> Cc: lng-odp >>> Subject: Re: [lng-odp] Runtime inlining >>> >>> As I said in the call last week, the problem is wider than that. >>> >>> ODP specifies a lot of types but not their sizes, a lot of >>> enums/defines (things like ODP_PKTIO_INVALID) but not their value >>> either. >>> For our port a lot of those values were changed for >>> performance/implementation reason. So I'm not even compatible between >>> one version of our ODP port and another one. >>> >>> The only way I can see to solve this is for ODP to fix the size of all >>> these types. >>> Default/Invalid values are not that easy, as a pointer would have a >>> completely different behaviour from structs/bitfields >>> >>> Nicolas >>> >> Type sizes do not need to be fixed in general, but only when an application is build for binary compatibility (the use case we are talking here). Binary compatibility and thus the fixed type sizes are defined per ISA. >> >> We can e.g. define a configure target (for our reference implementation == linux-generic) "--binary-compatible=armv8.x" or "--binary-compatible=x86_64". When you build your application with that option, "platform dependent" types and constants would be fixed to pre-defined values specified in (new) ODP API arch files. >> >> So instead of building against odp/platform/linux-generic/include/odp/plat/queue_types.h ... >> >> typedef ODP_HANDLE_T(odp_queue_t); >> #define ODP_QUEUE_INVALID _odp_cast_scalar(odp_queue_t, 0) >> #define ODP_QUEUE_NAME_LEN 32 >> >> >> ... you'd build against odp/arch/armv8.x/include/odp/queue_types.h ... >> >> typedef uintptr_t odp_queue_t; >> #define ODP_QUEUE_INVALID ((uintptr_t)0) >> #define ODP_QUEUE_NAME_LEN 64 >> >> >> ... or odp/arch/x86_64/include/odp/queue_types.h >> >> typedef uint64_t odp_queue_t; >> #define ODP_QUEUE_INVALID ((uint64_t)0xffffffffffffffff) >> #define ODP_QUEUE_NAME_LEN 32 >> >> >> For highest performance on a fixed target platform, you'd still build against the platform directly >> >> odp/platform/<soc_vendor_xyz>/include/odp/plat/queue_types.h >> >> typedef xyz_queue_desc_t * odp_queue_t; >> #define ODP_QUEUE_INVALID ((xyz_queue_desc_t *)0xdeadbeef) >> #define ODP_QUEUE_NAME_LEN 20 >> >> >> -Petri >> > > It still means that you need to enforce a type for all ODP implementation on a given arch. Which could be problematic. > As a precise example: the way handles are used now for odp_packet_t brings some useful features for checks and memory savings, but performance wise, they are a "disaster". One of the first thing I did was to switch them to pointers. And if I wanted a high perf linux x86_64 implementation, I'd probably do the same. > > Nicolas > _______________________________________________ > lng-odp mailing list > lng-odp(a)lists.linaro.org > https://lists.linaro.org/mailman/listinfo/lng-odp

10 years, 3 months

[ACTIVITY] 9 - 13 November 2015

by Omair Javaid

== Progress == LLDB development -- Root Google Nexus devices and read debug module configuration with kernel module [TCWG-429] [7/10] -- Figure out steps to unlock and root Nexus S -- Figure out steps to build kernel and kernel module for Nexus S -- Tried out lldb watchpoints with custom kernel on Nexus S -- Tried out reaching debug co processors without ptrace using kernel module. -- Identify mix-mode debugging problems (ARM & Thumb) [TCWG-229] [2/10] -- Ongoing Initial investigation and indentifying code areas needing changes Miscellaneous [1/10] -- Meetings, emails, discussions etc. == Plan == -- Root Google Nexus devices and read debug module configuration with kernel module [TCWG-429] -- Complete app and kernel module to read debug coprocessor registers. -- Try them out on remaining Android devices. -- Identify mix-mode debugging problems (ARM & Thumb) [TCWG-229] -- Further investigation and testing a mix mode application.

10 years, 3 months

[ACTIVITY]

by Renato Golin

== Progress == * Buildbots (4/10) - Found culprit for self-hosting breakages - Bot didn't get right because of dirty builds - Moving all self-hosting bots to clean builds (~3h) - More work on MIPS patch breaking self-hosting - Several breakages and bisections - Adding first cloud (Scaleway) buildbot to local master - No NEON, so we can't replace the Chromebooks * Infrastructure (4/10) - Power cut in Cambridge Lab, no generator yet - Chromebooks fail at the time of the cut, even with the UPS batteries still holding. I'm guessing the power regulator depends on the internal battery to work (and we removed them) - Bringing all bots up, etc. - Setting up an HiKey/AMD for benchmarks (APMs are too different) - Running EEMBC and SPEC on AMD * Background (2/10) - Code review, meetings, discussions, general support, etc. - Upstreaming -meabi, which may fix builds of kernel, android, bsd - Compiling aarch64-linux-gnu-gcc by hand because Arch pkg didn't work

10 years, 3 months

[ACTIVITY] Week 45

by Yvan Roux

o 1 day off (2/10) == Progress == o Linaro GCC (6/10) * FSF branch merge into linaro GCC 5 branch * Troubleshot various regression after the merge * Delivered GCC 5.2 2015.11 snapshot o Upstream work (1/10) * Sanitizing gfortran testsuite o Misc (1/10) * Various meetings == Plan == o Continue on sanitizing testsuite o Backports, infra, ...

10 years, 3 months

[ACTIVITY] 9th-13th November

by Bernie Ogden

Implement LAVA jobs for microinstance - TCWG-432 [6/10] * Refactoring to permit sharing of code between uinstance & main instance, as far as possible * Further refactoring for sane submission of bundles without inserting LAVA assumptions in the wrong places * Tested as far as possible in main instance, using light hacks and fakebench Jenkins benchmarking job - TCWG-348 [1/10] * Converted pbl hacks into a sane patch for yaml-to-json.py Controlled image builds - TCWG-360 [1/10] * Submitted aarch64 filesystem build for review * Generated armhf and amd64 filesystems * Started learning how to generate hwpack Misc [2/10] =Plan= Review security with shared uinstance/main instance code Expose more data, benchmarks to bundles Create YAML definition for Jenkins benchmarking job Generate (controlled) hwpack for at least one target, or know what the problems are Write up noise control report (if time) Have another at crashdump (if time, if new kexec patches)

10 years, 3 months

[ACTIVITY] 9 - 13 November 2015

by Prathamesh Kulkarni

* TCWG-72 (3/10) - divmod transform approved by Richard - builds cleanly on arm-linux-gnueabihf, aarch64-linux-gnu - Investigating segfault with __bdi64_div.c happens when mode == DImode and libval_mode == TImode - Found another segfault on x86 with TImode, on arm TImode is not supported and compiler aborts. Perhaps we should not do the transform when mode is TImode ? - Had a look at expand_binop_twoval_libfunc(). Wrote a similar function to obtain both results but this resulted in infinite loop in emit_libcall_block_1 - Strangely the bug is reproducible only during the build and doesn't trigger when compiled with preprocessed version of bid64_div.c (passing the same set of options). - waiting for upstream comments * TCWG-319 (1/10) - Submitted jobs for fp benchmark on a53, a57 * Misc: - PR66214 appears to have gone (fixed or became latent), that was blocking firefox LTO build with trunk - PR65837 still appears to be present after r230327 * Public Holidays (6/10) - Diwali festival == Next Week == - Continue with TCWG-72, TCWG-319 benchmarking, target hook conversion - Run SPEC2k6 with LTO

10 years, 3 months

[ACTIVITY] 9-13 November 2015

by Kugan

== Progress == - Widening pass (TCWG-547) - 6/10 * Bootstrapped latest patch on ppc64-linux-gnu, aarch64-linux-gnu and x64-64-linux-gnu. * Regression testing on ppc64-linux-gnu, aarch64-linux-gnu arm64-linux-gnu and x64-64-linux-gnu. * Fixed all of the execution issues * Posted updated patch to the list - Misc (4/10) * Linaro bug 1900 * Continued Looking at LuaJIT code-base * gcc/bug list == Plan == * bug 1900 * Look at implementing LuaJIT for aarch64 * LTO

10 years, 3 months

Runtime inlining

by Zoltan Kiss

Hi, We have a packaging/linking/optimization problem at LNG, I hope you guys can give us some advice on that. (Cc'ing ODP list in case someone want to add something) We have OpenDataPlane (ODP), an API stretching between userspace applications and hardware SDKs. It's defined in the form of C headers, and we already have several implementations to face SDKs (or whathever is actually controlling the hardware), e.g. linux-generic, a DPDK one etc. And we have applications, like Open vSwitch (OVS), which now is able to work with any ODP platform implementation which implements this API When it comes to packaging, the ideal scenario would be to create one package for the application, e.g. openvswitch.deb, and one for each platform, e.g odp-generic.deb, odp-dpdk.deb. The latter would contain the implementations in the form of a libodp.so file, so the application can dynamically load the actually installed platform's library runtime, with all the benefits of dynamic linking. The trouble is that we have several accessor functions in the API which are very short and __very__ frequently used. The best example is "uint32_t odp_packet_len(odp_packet_t pkt)", which returns the length of the packet. odp_packet_t is an opaque type defined by the implementation, often a pointer to the packet's actual metadata, so the actual function call yields to a simple load from that metadata pointer (+offset). Having it wrapped into a function call brings a significant performance decrease: when forwarding 64 byte packets at 10 Gbps, I got 13.2 Mpps with function calls. When I've inlined that function it brought 13.8 Mpps, that's ~5% difference. And there are a lot of other frequently used short accessor functions with the same problem. But obviously if I inline these functions I break the ABI, and I need to compile the application for each platform (and create packages like openvswitch-odp-dpdk.deb, containing the platform statically linked). I've tried to look around on Google and in gcc manual, but I couldn't find a good solution for this kind of problem. I've checked link time optimization (-flto), but it only helps with static linking. Is there any way to keep the ODP application and platform implementation binaries in separate files while having the performance benefit of inlining? Regards, Zoltan

10 years, 4 months

Jump to page:

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

linaro-toolchain