michaelh/zephyr

Author	SHA1	Message	Date
Gerard Marull-Paretas	16811660ee	arch: migrate includes to <zephyr/...> In order to bring consistency in-tree, migrate all arch code to the new prefix <zephyr/...>. Note that the conversion has been scripted, refer to zephyrproject-rtos#45388 for more details. Signed-off-by: Gerard Marull-Paretas <gerard.marull@nordicsemi.no>	2022-05-06 19:57:22 +02:00
Flavio Ceolin	f5a0d4cd26	arch: xtensa: Optimize cache management for pinned threads When building with CONFIG_SCHED_CPU_MASK_PIN_ONLY we can assume that a thread will always be executed in a same CPU and consequently skip the cache invalidation. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2022-05-04 13:46:48 -04:00
Andy Ross	64a3159dee	arch/xtensa: Optimize cache management on context switch Making context switch cache-coherent in SMP is hard. The KERNEL_COHERENCE handling was conservatively invalidating the stack region of a thread that was being switched in. This was because it might have (1) run on this CPU in the past, but (2) run most recently on a different CPU. In that case we might have stale data still in our local dcache! But this has performance impact in the (very common!) case of a thread being switched out briefly and then back in (e.g. k_sleep() for a small duration). It will come back having lost all of its cached stack context, and will have to fetch all that information back from shared SRAM! Treat this by tracking a "last_cpu" for each thread in the arch part of the thread struct. If we're coming back to the same CPU we left, we know we can skip the invalidate. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-04-27 18:54:10 -04:00
Ederson de Souza	c0b7864840	arch/xtensa: Enable backtrace on panic on Intel ADSP platforms Platform specific functions necessary to enable this feature were implemented (z_xtensa_ptr_executable() and z_xtensa_stack_ptr_is_sane() for Intel ADSP platforms. Current implementation just ensures stack pointer and program counter are within relevant areas defined in the linker scripts, without going too fine grained. Also, `.iram1` section, used by the backtrace code, also added to Intel ADSP linker script. Finally, update west manifest to use up-to-date SOF, which contains a patch to fix build issues related to the linker changes. Signed-off-by: Ederson de Souza <ederson.desouza@intel.com>	2022-04-14 11:03:40 -04:00
Nazar Kazakov	f483b1bc4c	everywhere: fix typos Fix a lot of typos Signed-off-by: Nazar Kazakov <nazar.kazakov.work@gmail.com>	2022-03-18 13:24:08 -04:00
Nazar Kazakov	9713f0d47c	everywhere: fix typos Fix a lot of typos Signed-off-by: Nazar Kazakov <nazar.kazakov.work@gmail.com>	2022-03-14 20:22:24 -04:00
Gerard Marull-Paretas	95fb0ded6b	kconfig: remove Enable from boolean prompts According to Kconfig guidelines, boolean prompts must not start with "Enable...". The following command has been used to automate the changes in this patch: sed -i "s/bool \"[Ee]nables\? \(\w\)/bool \"\U\1/g" */Kconfig Signed-off-by: Gerard Marull-Paretas <gerard.marull@nordicsemi.no>	2022-03-09 15:35:54 +01:00
Andy Ross	c174ade4a1	arch/xtensa: Rework irq_offload: automatic config, SMP-safe The Xtensa implementation of arch_irq_offload() required that the user select the correct interrupt manually, and would race with itself if invoked from separate CPUs (it was saved here by the main irq_offload() function which has a semaphore to serialize access). Use the new gen_zsr.py script to automatically detect the highest available software interrupt, and keep a per-CPU set of callback/parameter pointers. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-02-21 22:10:03 -05:00
Daniel Leung	35c1d3615f	xtensa: xcc: add a dummy atexit() Some XCC toolchains do not provide atexit() which results in undefined reference error. So add a weak dummy atexit() for this siutation. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2022-01-25 21:16:32 -05:00
Andy Ross	50a9c29d08	arch/xtensa: Fix xcc regression with ZSR Turns out that xt-xcc will bail when faced with a real core-isa.h (it wants you to rely on the builtins in the compiler). Undefine __XCC__ to force it to actually parse and emit declarations for its own header. (Also adds a newline to the generated one-line C file to silence a warning) Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-20 14:37:13 -05:00
Andy Ross	d175c18cbb	arch/xtensa: Use ZSR assignments for interrupt return We had a similar sequence for interrupt return, where we were selecting (actually only for the benefit of qemu) the highest priority EPCn/EPSn registers for our RFI instruction. That works much better in python the preprocessor. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-20 12:58:00 -05:00
Andy Ross	642fc7ad54	arch/xtensa: Use ZSR assignments for stack flush markers The kernel coherence cache flush code was using a scratch register to mark the top of the stack. Likewise a good candidate for ZSR use. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-20 12:58:00 -05:00
Andy Ross	3c7905b916	arch/xtensa: Use ZSR assignments for the alloca exception This is actually Cadence-authored code, but its use of EXCSAVE1 as a sideband input to the exception handler is very much in the same family of tricks. Use ZSR assignments here too. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-20 12:58:00 -05:00
Andy Ross	ca7024e1d6	arch/xtensa: Use ZSR assignments for the CPU pointer Use the zsr.h assignments for the special register containing the current CPU pointer. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-20 12:58:00 -05:00
Andy Ross	82071be443	arch/xtensa: Add special register allocation generator Zephyr likes to use the various Xtensa scratch registers for its own purposes in several places. Unfortunately, owing to the configurability of the architecture, we have to use different registers for different platforms. This has been done so far with a collection of different tricks, some... less elegant than others. Put it all in one place. This is a python script that emites a "zsr.h" header with register assignments for all the existing users. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-20 12:58:00 -05:00
Daniel Leung	e2e40862c1	xtensa: remove @return doc for void functions For functions returning nothing, there is no need to document with @return, as Doxgen complains about "documented empty return type of ...". Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2022-01-12 16:02:16 -05:00
Andy Ross	97ada8bc04	arch/xtensa: Promote adsp RPO/cache utilities to an arch API This is trick (mapping RAM twice so you can use alternate Region Protection Option addresses to control cacheability) is something any Xtensa hardware designer might productively choose to do. And as it works really well, we should encourage that by making this a generic architecture feature for Zephyr. Now everything works by setting two kconfig values at the soc level defining the cached and uncached regions. As long as these are correct, you can then use the new arch_xtensa_un/cached_ptr() APIs to convert between them and a ARCH_XTENSA_SET_RPO_TLB() macro that provides much smaller initialization code (in C!) than the HAL assembly macros. The conversion routines have been generalized to support conversion between any two regions. Note that full KERNEL_COHERENCE still requires support from the platform linker script, that can't be made generic given the way Zephyr does linkage. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2022-01-11 11:53:53 +01:00
Andy Ross	1a2fecec6d	soc/intel_adsp: Unify Xtensa CPU reset between cores Startup on these devices was sort of a mess, with multiple variants of Xtensa and platform initialization code from multiple ancestries being invoked at different places for different purposes. Just use one code path for everyone. Bootloader entry starts with a minimal assembly stub that simply sets WINDOW{START,BASE}, PS and a stack pointer and then jumps to C code. That then uses the cpu_early_init() implementation from cAVS 2.5's secondary cores to finish Xtensa initialization, and then flows directly into the pre-existing bootloader C code to initialize cache and memory and copy the HP-SRAM image, then it invokes Zephyr via a simple C function call to z_cstart(). Likewise, remove the "reset vector" from Zephyr. This was never a reset vector, reset on these devices goes to a fixed address in a ROM. CPU initialization is handled explicitly and completely in the bootloader now, in a way that can be unified between the main and secondary cores. Entry from the bootloader now goes directly into z_cstart() via a C call (via a single jump instruction placed at the entry point address -- that's going away soon too once we're using a unified link). Now that vector table initialization happens in a uniform way, there's no need to copy the VECBASE value during arch_start_cpu(). Finally note that this also reverts the CONFIG_RESET_VECTOR_IN_BOOTLOADER kconfig variable added for these platforms, because it's no longer a tunable and true always. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-12-14 18:43:05 -06:00
Lauren Murphy	c1711997bc	debug: coredump: add xtensa coredump Adds Xtensa as supported architecture for coredump. Fixes a few typos in documentation, Kconfig and a C file. Dumps minimal set of registers shown by 'info registers' in GDB for the sample_controller and ESP32 SOCs. Updates tests. Signed-off-by: Lauren Murphy <lauren.murphy@intel.com>	2021-12-14 07:40:55 -05:00
Daniel Leung	dc34f6c84d	xtensa: introduce support for GDB stub This adds basic support for GDB stub on Xtensa. Note that this only provides the common bits on the architecture side. SoC support is also required to fully enable GDB stub on each Xtensa SoC. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-11-30 15:24:00 -05:00
Andy Ross	884f1bf39d	arch/xtensa: Add hook for CONFIG_SCHED_THREAD_USAGE accounting in ISRs Call into z_thread_usage_stop() before ISR entry to avoid including interrupt handling totals in thread usage stats. Note that this hook is after the register save and stack swap has happened, so it still incldues some overhead. But calling out from the interrupted stack on Xtensa gets really, really hairy due to the weird intermediate states we leverage (once we've saved enough context to make a C call safely, we've lost the ability to use register windows per the C ABI!). Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-11-08 21:32:20 -05:00
Daniel Leung	88ccb5f8f0	Revert "xtensa: remove unused script" This reverts commit `67d290540e`. The script is actually used to generate the _soc_inthandlers.h file when introducing a new Xtensa SoC. So restore it. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-10-07 16:04:11 -04:00
Daniel Leung	1ec2dbd662	xtensa: fix implicit declaration of _xtensa_handle_one_int* Some Xtensa SoCs may not have that many levels of interrupts. So limit the call to DEF_INT_C_HANDLER() to only supported levels to avoid calling non-existent functions. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-09-28 20:33:56 -04:00
Iuliana Prodan	a6364da1a3	arch: xtensa: add workaround for small vector table entries For some platforms, like NXP's IMX8 or Mediatek's MT8195, the size of an interrupt vector table entry is 0x1C bytes, less than usual (0x30 for Intel's platforms). So, the interrupt handlers don't fit in the vector table entries. I've added a small indirection to bypass this size constraint and moved the default handlers to the end of vector table, renaming them to _Level\LVL\()VectorHelper. For this, I've added a generic configuration - XTENSA_SMALL_VECTOR_TABLE_ENTRY. Signed-off-by: Iuliana Prodan <iuliana.prodan@nxp.com>	2021-09-10 10:59:44 -04:00
Andy Ross	37bbe7aeea	arch/xtensa: Add arch_cpu_idle() workarounds A simple WAITI isn't sufficient in all cases. The cAVS 2.5 hardware uses WAITI as the entry state for per-core power gating, which is very difficult to debug. Provide a fallback that simply spins in the idle loop waiting for interrupts to provide a stable system while this feature stabilizes. Also, the SOF code for those platforms references a known bug with the Xtensa LX6 core IP (or at least some versions), and will prefix the WAIT instruction with 128 NOP.N's followed by an ISYNC and EXTW. This bug hasn't been seen under Zephyr yet, and details are sketchy. But the code is simply enough to import and works correctly. Place both workaround under new kconfig variables and select them both (even though they're actually mutually exclusive -- if you select both CPU_IDLE_SPIN overrides) for cavs_v25. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-09-03 07:19:34 -04:00
Andy Ross	b76bc6c80d	arch/xtensa: Fix outgoing stack flush for dummy threads On CPU startup, When we reach the cache flush code in arch_switch(), the outgoing thread is a dummy. The behavior of the existing code was to leave the existing value in the SR unchanged (probably NULL at startup). Then the context switch would walk from that address up to the top of the outgoing stack, flushing everything in between. That's wrong, because the outgoing stack is a real pointer (generally the interrupt stack of the current CPU), and we're flushing everything in memory underneath it. This also reverts commit `29abc8adc0` ("xtensa: fix booting secondary cores on the dummy thread"), which appears to have been an early attempt to address this issue. It worked (modulo all the extra and potentially incorrect flushing) on cavs v1.5/1.8 because of the way the entry code worked there. But on 2.5 we now hit the first context switch in a case where those extra lines are in address space already marked unwritable by the CPU, so the flush explodes. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-09-03 07:19:34 -04:00
Iuliana Prodan	f9810ccbe1	arch: xtensa: modify asm for interrupt sections For IMX, for timer interrupt, the interrupt handler was not the correct one executed and that’s because the handlers were not at the expected address. For IMX the size constraint of the interrupt vector table entry is 0x1C bytes of code, less than usual. I've added a small indirection to bypass this size constraint and moved the default handlers to the end of vector table, renaming them to _Level\LVL\()VectorHelper. Signed-off-by: Iuliana Prodan <iuliana.prodan@nxp.com>	2021-08-28 23:27:02 -04:00
Guennadi Liakhovetski	29abc8adc0	xtensa: fix booting secondary cores on the dummy thread When secondary cores are booted, they use the dummy thread and the IRQ stack until they switch over to a real thread. Therefore dummy threads shouldn't be skipped when cohering outgoing thread stack, only threads with zero stack size should be skipped. Signed-off-by: Guennadi Liakhovetski <guennadi.liakhovetski@linux.intel.com>	2021-05-03 17:13:01 -04:00
Flavio Ceolin	abb1bbe6b1	arch: xtensa: Fix 10.4 violations Both operands of an operator in which the usual arithmetic conversions are performed shall have the same essential type category. Signed-off-by: Flavio Ceolin <flavio.ceolin@intel.com>	2021-04-10 09:59:37 -04:00
Andy Ross	ae4f7a1a06	arch/xtensa: Remember to spill windows in arch_cohere_stacks() When we reach this code in interrupt context, our upper GPRs contain a cross-stack call that may still include some registers from the interrupted thread. Those need to go out to memory before we can do our cache coherence dance here. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	b28da4a3b7	arch/xtensa: Invalidate bottom of outbound stacks Both new thread creation and context switch had the same mistake in cache management: the bottom of the stack (the "unused" region between the lower memory bound and the live stack pointer) needs to be invalidated before we switch, because otherwise any dirty lines we might have left over can get flushed out on top of the same thread on another CPU that is putting live data there. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	64cf33952d	arch/xtensa: Add non-HAL caching primitives The Xtensa L1 cache layer has straightforward semantics accessible via single-instructions that operate on cache lines via physical addresses. These are very amenable to inlining. Unfortunately the Xtensa HAL layer requires function calls to do this, leading to significant code waste at the calling site, an extra frame on the stack and needless runtime instructions for situations where the call is over a constant region that could elide the loop. This is made even worse because the HAL library is not built with -ffunction-sections, so pulling in even one of these tiny cache functions has the effect of importing a 1500-byte object file into the link! Add our own tiny cache layer to include/arch/xtensa/cache.h and use that instead. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	d0c538e9a2	arch/xtensa: Add an arch-internal README on register windows Back when I started work on this stuff, I had a set of notes on register windows that slowly evolved into something that looks like formal documentation. There really isn't any overview-style documentation of this stuff on the public internet, so it couldn't hurt to commit it here for posterity. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	a230fafde5	arch/xtensa: soc/intel_adsp: Rework MP code entry Instead of passing the crt1 _start function as the entry code for auxiliary CPUs, use a tiny assembly stub instead which can avoid the runtime testing needed to skip the work in _start. All the crt1 code was doing was clearing BSS (which must not happen on a second CPU) and setting the stack pointer (which is wrong on the second CPU). This allows us to clean out the SMP code in crt1. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	613594e68c	soc/intel_adsp: Use the correct MP stack pointer The kernel passes the CPU's interrupt stack expected that it will start on that, so do it. Pass the initial stack pointer from the SOC layer in the variable "z_mp_stack_top" and set it in the assembly startup before calling z_mp_entry(). Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	820c94e5dd	arch/xtensa: Inline atomics The xtensa atomics layer was written with hand-coded assembly that had to be called as functions. That's needlessly slow, given that the low level primitives are a two-instruction sequence. Ideally the compiler should see this as an inline to permit it to better optimize around the needed barriers. There was also a bug with the atomic_cas function, which had a loop internally instead of returning the old value synchronously on a failed swap. That's benign right now because our existing spin lock does nothing but retry it in a tight loop anyway, but it's incorrect per spec and would have caused a contention hang with more elaborate algorithms (for example a spinlock with backoff semantics). Remove the old implementation and replace with a much smaller inline C one based on just two assembly primitives. This patch also contains a little bit of refactoring to address the scheme has been split out into a separate header for each, and the ATOMIC_OPERATIONS_CUSTOM kconfig has been renamed to ATOMIC_OPERATIONS_ARCH to better capture what it means. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Andy Ross	eb1ef50b6b	arch/xtensa: General cleanup, remove dead code There was a bunch of dead historical cruft floating around in the arch/xtensa tree, left over from older code versions. It's time to do a cleanup pass. This is entirely refactoring and size optimization, no behavior changes on any in-tree devices should be present. Among the more notable changes: + xtensa_context.h offered an elaborate API to deal with a stack frame and context layout that we no longer use. + xtensa_rtos.h was entirely dead code + xtensa_timer.h was a parallel abstraction layer implementing in the architecture layer what we're already doing in our timer driver. + The architecture thread structs (_callee_saved and _thread_arch) aren't used by current code, and had dead fields that were removed. Unfortunately for standards compliance and C++ compatibility it's not possible to leave an empty struct here, so they have a single byte field. + xtensa_api.h was really just some interrupt management inlines used by irq.h, so fold that code into the outer header. + Remove the stale assembly offsets. This architecture doesn't use that facility. All told, more than a thousand lines have been removed. Not bad. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-03-08 11:14:27 -05:00
Shubham Kulkarni	26efef4f94	arch: xtensa: Fix backtrace from ISR a0 is used as scratch register. Restore value of a0 (return address) from stack frame before spilling registers on stack Signed-off-by: Shubham Kulkarni <shubham.kulkarni@espressif.com>	2021-03-03 13:02:57 +01:00
Andy Ross	746c65acb7	soc/intel_adsp: Move KERNEL_COHERENCE to cavs15 Only the CAVS 1.5 linker script has full support for the coherence features, don't advertise it on the other SoC's yet. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2021-02-11 14:47:40 -05:00
Anas Nashif	5d1c535fc8	license: add missing SPDX headers Add SPDX header to files with existing license. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-02-11 08:05:16 -05:00
Anas Nashif	67d290540e	xtensa: remove unused script While fixing license headers, identified this script as orphan and not being used anywhere, so remove. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2021-02-11 08:05:16 -05:00
Daniel Leung	5a11caba33	xtensa: fix rsr/wsr assembly for XCC XCC doesn't like the "rsr.<reg name>" style assembly so fix that to the other style. Also, XCC doesn't like _CONCAT() with the EPC/EPS registers so need to spell out all of them. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-05 07:45:07 -05:00
Daniel Leung	92c93b1b7f	xtensa: fix hard-coded interrupt value for PS register There is a hard-coded value of PS_INTLEVEL(15) to set the PS register. The correct way is actually to use XCHAL_EXCM_LEVEL with PS_INTLEVEL() to setup the register. So fix it. Fixes #31858 Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-02-04 20:58:56 -05:00
Shubham Kulkarni	8b7da334d5	arch: xtensa: Print backtrace from panic handler This change uses stack frame to print backtrace once exception occurs Printing backtrace helps to identify the cause of exception Signed-off-by: Shubham Kulkarni <shubham.kulkarni@espressif.com>	2021-01-23 08:43:10 -05:00
Guennadi Liakhovetski	ca0e5df219	xtensa: don't build and run the reset handler twice Currently Zephyr links reset-vector.S twice in xtensa builds: into the bootloader and the main image. It is run at the end of the boot loader execution and immediately after that again in the beginning of the main code. This patch adds a configuration option to select whether to link the file to the bootloader or to the application. The default is to the application, as needed e.g. for QEMU, SOF links it to the bootloader like in native builds. Signed-off-by: Guennadi Liakhovetski <guennadi.liakhovetski@linux.intel.com>	2021-01-13 18:17:40 -05:00
Daniel Leung	0d7bdbc876	xtensa: use highest available EPC/EPS regs in restore context There may be Xtensa SoCs which don't have high enough interrupt levels for EPC6/EPS6 to exist in _restore_context. So changes these to those which should be available according to the ISA config file. Fixes #30126 Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2021-01-05 10:31:45 -08:00
Krzysztof Chruscinski	3ed8083dc1	kernel: Cleanup logger setup in kernel files Most of kernel files where declaring os module without providing log level. Because of that default log level was used instead of CONFIG_KERNEL_LOG_LEVEL. Signed-off-by: Krzysztof Chruscinski <krzysztof.chruscinski@nordicsemi.no>	2020-11-27 09:56:34 -05:00
Daniel Leung	11e6b43090	tracing: roll thread switch in/out into thread stats functions Since the tracing of thread being switched in/out has the same instrumentation points, we can roll the tracing function calls into the one for thread stats gathering functions. This avoids duplicating code to call another function. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2020-11-11 23:55:49 -05:00
Daniel Leung	f8a909dad1	xtensa: add support for thread local storage Adds the necessary bits to initialize TLS in the stack area and sets up CPU registers during context switch. Note that this does not enable TLS for all Xtensa SoC. This is because Xtensa SoCs are highly configurable so that each SoC can be considered a whole architecture. So TLS needs to be enabled on the SoC level, instead of at the arch level. Signed-off-by: Daniel Leung <daniel.leung@intel.com>	2020-10-24 10:52:00 -07:00
Andy Ross	a8d5437799	soc/xtensa: Misc. checkpatch fixups Code style fixes. Kept separate from the original changes to permit easier rebasing. Signed-off-by: Andy Ross <andrew.j.ross@intel.com>	2020-10-21 06:38:53 -04:00

... 5 6 7 8 9 ...

642 commits