michaelh/zephyr

Author	SHA1	Message	Date
Krzysztof Chruściński	66daaf6ba3	kernel: sched: sleep: Use value returned by z_add_timeout z_tick_sleep function needs to calculate expected wakeup tick. It required reading current system clock tick. It can be costly since it is a register access. When timeout is added this value is calculated and z_add_timeout returns it. It can be used instead to optimize sleep performance. Signed-off-by: Krzysztof Chruściński <krzysztof.chruscinski@nordicsemi.no>	2025-04-15 19:09:33 +02:00
Krzysztof Chruściński	6d35969a55	kernel: sched: Optimize sleeping function Accessing system timer registers can be costly and it shall be avoided if possible. When thread is waken up in z_tick_sleep it may be because timeout expired or because thread was waken up before sleeping period passed. Add function to detect if timeout is aborted (before it was expired). Use it in the sleep function and avoid reading system ticks if timeout was not aborted. Signed-off-by: Krzysztof Chruściński <krzysztof.chruscinski@nordicsemi.no>	2025-04-15 19:09:33 +02:00
Josh DeWitt	c05cfbf15e	kernel/sched: Re-sort waitq on priority change k_thread_priority_set() on a pended thread wasn't re-inserting into the waitq, causing the incorrect thread to run based on priority. When using the scalable waitq config, this can also break assumptions of the tree and leave the owner of a waitq still being in the waitq tree, cycles in the tree, or a crash. Remove and re-add a thread to a waitq to ensure the waitq remains in order and the tree's assumptions are not violated. To illustrate the issue, consider 4 threads in decreasing priority order: A, B, C, and D along with two mutexes, m0 and m1. This is implemented in the new complex_inversion mutex_api test. 1. D locks m1 2. C locks m0 3. C pends on m1 4. B pends on m1 5. A pends on m0, boosts C's priority, now tree on m1 is not sorted 6. D unlocks m1, left-most thread on tree is B. When removing B from tree it cannot be found because it searches to the right of C due to C's boosted priority when the node is actually on the left. rb_remove silently fails. 7. B unlocks m1, left-most thread on tree is still B and it tries to unpend itself, resulting in a NULL pointer dereference on B->base.pended_on. Signed-off-by: Josh DeWitt <josh.dewitt@garmin.com>	2025-03-24 07:58:36 +01:00
Peter Mitsis	701aab92e2	kernel: Add Z_IS_TIMEOUT_RELATIVE() macro Introduces the Z_IS_TIMEOUT_RELATIVE() macro to help ensure that checking for relative/absolute timeouts is consistent. Using this macro also helps ensure that we get the correct behavior when using 32-bit timeouts (CONFIG_TIMEOUT_64BIT=n). Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-03-17 02:21:02 +01:00
Andy Ross	8b4ed6655a	kernel: Clamp k_sleep() return value on overflow k_sleep() returns a 32 bit count of milliseconds, as that was its historical API. But it now accepts a potentially 64 bit tick count as an argument, leading to situations where an early wakeup will produce sleep times that aren't representable. Clamp this instead of truncating to an arbitrary value. Naive code will likely do the right thing with the large return (just sleeping an extra round), and sophisticated apps can detect INT_MAX to enable more elaborate retry logic. (Also fixes a somewhat unfortunate puncutation error in the docs that implied that it returns zero on early wakeup!) Fixes: #84669 Signed-off-by: Andy Ross <andyross@google.com>	2025-03-07 20:20:25 +01:00
Andy Ross	ddee1ab4cf	kernel/sched: Correct locking in essential thread panic Calling a (handled/ignored) panic with the scheduler lock held produces spinlock errors in some circumstances, depending on whether or not the swap gets reached before another context switch. Release the lock around the call, we don't touch any scheduler state on the path to z_swap(), so this is safe. Signed-off-by: Andy Ross <andyross@google.com>	2025-02-26 10:10:29 +00:00
Andy Ross	f6239c52ae	kernel/sched: Panic after aborting essential thread, not before The essential thread check and panic happens at the top of k_thread_abort(). This is arguably a performance bug: the system is going to blow up anyway no matter where we put the test, we shouldn't add instructions to the path taken by systems that DON'T blow up. But really it's more of a testability/robustness glitch: if you have a fatal error handler that wants to catch this panic (say, a test using ztest_set_fault_valid()), then the current code will panic and early-exit BEFORE THE THREAD IS DEAD. And so it won't actually die, and will continue on causing mayhem when presumably the handler code expected it to have been aborted. It's sort of an unanswerable question as to what the "right" behavior is here (the system is, after all, supposed to have panicked!). But this seems preferable for definable practical reasons. Kill the thread, then panic. Unless it's _current, in which case panic as late as possible for maximum coverage of the abort path. Fixes: #84460 Signed-off-by: Andy Ross <andyross@google.com>	2025-02-10 22:26:10 +01:00
Peter Mitsis	e55ac3ef65	kernel: Improve ordering in SMP k_thread_suspend() The routine k_thread_suspend() has a fast path for non-SMP when suspending the current thread. When SMP is enabled, it is expected that the compiler drop the entire fast path checks because the whole expression would always evaluate to false. However, the compiler has been observed to only drop whole fast path check when the "!IS_ENABLED(CONFIG_SMP)" condition appears at the beginning of the fast path check. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-02-07 02:23:45 +01:00
Peter Mitsis	c63b42d478	kernel: Fix k_wakeup() exit paths z_reschedule() already has a check to determine if it is called from the context of an ISR--no need to duplicate it in k_wakeup(). Furthermore, if the target thread is not sleeping, there is no need to reschedule and we can do a fast return. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-02-03 19:51:20 +01:00
Peter Mitsis	995ad43851	kernel: Streamline z_is_thread_ready() The check for an active timeout in z_is_thread_ready() was originally added to cover the case of a sleeping thread. However, since there is now a bit in the thread state that indicates if the thread is sleeping we can drop that superfluous check. Making this change necessitates moving k_wakeup()'s call to z_abort_thread_timeout() so that it is within the locked _sched_spinlock section to ensure that we do not end up with a stray thread timeout in the timeout list. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-28 18:14:22 +01:00
Peter Mitsis	bfe0b74aad	kernel: Do not mark thread as queued in k_yield() SMP does not need to mark the current thread as queued in k_yield() as that will naturally get done in do_swap(). Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-28 07:57:20 +01:00
Kalle Kietäväinen	a2eb78c03b	kernel: sched: Fix meta-IRQ preemption tracking for the idle thread When the PM subsystem is enabled, the idle thread locks the scheduler for the duration the system is suspended. If a meta-IRQ preempts the idle thread in this state, the idle thread is tracked in `metairq_preempted`. However, when returning from the preemption, the idle thread is not removed from `metairq_preempted`, unlike all the other threads. As a result, the scheduler keeps running the idle thread even if there are higher priority threads ready to run. This change treats the idle thread the same way as all other threads when returning from a meta-IRQ preemption. Fixes #64705 Signed-off-by: Kalle Kietäväinen <kalle.kietavainen@silabs.com>	2025-01-27 13:26:20 +01:00
Tom Hughes	0ec126c6d1	kernel: Add missing marshalling header for k_reschedule Without this header, compiling the kernel.poll test with -Werror=unused-function fails. Signed-off-by: Tom Hughes <tomhughes@chromium.org>	2025-01-13 20:24:16 +01:00
Nicolas Pitre	7a3124d866	kernel: move current thread pointer management to core code Define the generic _current directly and get rid of the generic arch_current_get(). The SMP default implementation is now known as z_smp_current_get(). It is no longer inlined which saves significant binary size (about 10% for some random test case I checked). Introduce z_current_thread_set() and use it in place of arch_current_thread_set() for updating the current thread pointer given this is not necessarily an architecture specific operation. The architecture specific optimization, when enabled, should only care about its own things and not have to also update the generic _current_cpu->current copy. Signed-off-by: Nicolas Pitre <npitre@baylibre.com>	2025-01-10 07:49:08 +01:00
Nicolas Pitre	46aa6717ff	Revert "arch: deprecate `_current`" Mostly a revert of commit `b1def7145f` ("arch: deprecate `_current`"). This commit was part of PR #80716 whose initial purpose was about providing an architecture specific optimization for _current. The actual deprecation was sneaked in later on without proper discussion. The Zephyr core always used _current before and that was fine. It is quite prevalent as well and the alternative is proving rather verbose. Furthermore, as a concept, the "current thread" is not something that is necessarily architecture specific. Therefore the primary abstraction should not carry the arch_ prefix. Hence this revert. Signed-off-by: Nicolas Pitre <npitre@baylibre.com>	2025-01-10 07:49:08 +01:00
Peter Mitsis	bdb04dbfba	kernel: Alter z_abort_thread_timeout() return type No caller of the internal kernel routine z_abort_thread_timeout() uses its return value anymore. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-09 04:04:36 +01:00
Peter Mitsis	f4f3b9378f	kernel: Inline halt_thread() and z_thread_halt() Inlining these routines helps to improve the performance of k_thread_suspend() Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-07 18:24:09 +01:00
Peter Mitsis	d774594547	kernel: thread suspend/resume bail paths are unlikely Gives a hint to the compiler that the bail-out paths in both k_thread_suspend() and k_thread_resume() are unlikely events. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-07 18:24:09 +01:00
Peter Mitsis	af14e120a5	kernel: Simplify clear_halting() on UP systems There is no need for clear_halting() to do anything on UP systems. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-07 18:24:09 +01:00
Peter Mitsis	ea6adb6726	kernel: Add custom scheduler yield routines Adds customized yield implementations based upon the selected scheduler (dumb, multiq or scalable). Although each follows the same broad outline, some of them allow for additional tweaking to extract maximal performance. For example, the multiq variant improves the performance of k_yield() by about 20%. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-07 18:24:09 +01:00
Peter Mitsis	30f667bceb	kernel: Add routines for _THREAD_QUEUED bit Adds routines for setting and clearing the _THREAD_QUEUED thread_state bit. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-07 18:24:09 +01:00
Peter Mitsis	d1c2fc0667	kernel: inline z_sched_prio_cmp() Inlines z_sched_prio_cmp() to get better performance. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2025-01-07 18:24:09 +01:00
Peter Mitsis	669f2c489a	kernel: Add k_reschedule() The routine k_reschedule() allows an application to manually force a schedule point. Although similar to k_yield(), it has different properties. The most significant difference is that k_yield() if invoked from a cooperative thread will voluntarily give up execution control to the next thread of equal or higher priority while k_reschedule() will not. Applications that play with EDF deadlines via k_thread_deadline_set() may need to use k_reschedule() to force a reschedule. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-12-24 04:22:51 +01:00
Peter Mitsis	35435928c2	kernel: Decouple sleep from suspend Sleeping and suspended are now orthogonal states. That is, a thread may be both sleeping and suspended and the two do not interact. One repercussion of this is that suspending a thread will no longer abort its timeout. Threads are now created in the 'sleeping' state instead of a 'suspended' state. This dovetails nicely with the start delay that can be given to a newly created thread--it is as though the very first operation that a thread with a start delay is a sleep. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-12-18 18:17:03 +01:00
Akaiwa Wataru	feefb7dd35	kernel/sched: correct k_sleep() return value calculation Fix the issue: https://github.com/zephyrproject-rtos/zephyr/issues/79863 The expected_wakeup_ticks and sys_clock_tick_get_32() are uint32_t values, and may wrap around individually. If the expected_wakeup_ticks has a wraparound and sys_clock_tick_get_32() doesn't, so expected_wakeup_ticks < sys_clock_tick_get_32(), the API return value will be corrupted. The API return value, that is the remaining time, should be calculated in 32bit-unsigned-integer manner, and any wraparound will be treated properly. Signed-off-by: Akaiwa Wataru <akaiwa@sonas.co.jp>	2024-12-03 02:37:03 +01:00
Andy Ross	7cdf40541b	kernel/sched: Eliminate PRESTART thread state Traditionally threads have been initialized with a PRESTART flag set, which gets cleared when the thread runs for the first time via either its timeout or the k_thread_start() API. But if you think about it, this is no different, semantically, than SUSPENDED: the thread is prevented from running until the flag is cleared. So unify the two. Start threads in the SUSPENDED state, point everyone looking at the PRESTART bit to the SUSPENDED flag, and make k_thread_start() be a synonym for k_thread_resume(). There is some mild code size savings from the eliminated duplication, but the real win here is that we make space in the thread flags byte, which had run out. Signed-off-by: Andy Ross <andyross@google.com>	2024-11-27 10:38:05 -05:00
Anas Nashif	f90ce01d4a	kernel: sched: use arch_current_thread instead of _current _current is deprecated, use arch_current_thread() Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2024-11-25 19:06:57 -05:00
Andy Ross	b3ff9ae82b	kernel/sched: Optimize handling for suspend(_current) k_thread_suspend() is an async API intended to stop any thread in any state from any context. Some apps just want to use it to "suspend myself", which is a much (!) simpler operation. Detect that specific usage as a performance case. Signed-off-by: Andy Ross <andyross@google.com>	2024-11-26 00:12:28 +01:00
Yong Cong Sin	b1def7145f	arch: deprecate `_current` `_current` is now functionally equals to `arch_curr_thread()`, remove its usage in-tree and deprecate it instead of removing it outright, as it has been with us since forever. Signed-off-by: Yong Cong Sin <ycsin@meta.com> Signed-off-by: Yong Cong Sin <yongcong.sin@gmail.com>	2024-11-23 20:12:24 -05:00
Yong Cong Sin	d26c712258	arch: add new interfaces to set/get the current thread of current CPU Add the following arch-specific APIs: - arch_curr_thread() - arch_set_curr_thread() which allow SMP architectures to implement a faster "get current thread pointer" than the default provided by the kernel. The 'set' function is required for the 'get' to work, more on that later. When `CONFIG_ARCH_HAS_CUSTOM_CURRENT_IMPL` is selected, calls to `_current` & `k_sched_current_thread_query()` will be redirected to `arch_curr_thread()`, which ideally should translate into a single instruction read, avoiding the current "lock > read CPU > read current thread > unlock" path in SMP architectures and thus greatly improves the read performance. However, since the kernel relies on a copy of the "current thread"s on every CPU for certain operations (i.e. to compare the priority of the currently scheduled thread on another CPU to determine if IPI should be sent), we can't eliminate the copy of "current thread" (`current`) from the `struct _cpu` and therefore the kernel now has to invoke `arch_set_curr_thread()` in addition to what it has been doing. This means that it will take slightly longer (most likely one instruction write) to change the current thread pointer on the current CPU. Signed-off-by: Yong Cong Sin <ycsin@meta.com> Signed-off-by: Yong Cong Sin <yongcong.sin@gmail.com>	2024-11-23 20:12:24 -05:00
Tom Burdick	2b5012a5d9	kernel: Move run queue initialization Move the initialization of the priority q for running out of sched.c to remove one more ifdef from sched.c. No change in functionality but better matches the rest of sched.c and priority_q.h such that the ifdefry needed is done in in priority_q.h. Signed-off-by: Tom Burdick <thomas.burdick@intel.com>	2024-11-16 15:20:15 -05:00
Peter Mitsis	f6a76c32b7	kernel: inline z_unpend_first_thread() Inlining z_unpend_first_thread() has been observed to give a +8% and +16% performance boost to the thread_metric benchmark's message processing and synchronization tests respectively. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-10-21 18:38:00 -05:00
Peter Mitsis	c70a619a2f	kernel: Remove unused z_ready_thread_locked() Removing the routine z_ready_thread_locked() as it is not used anywhere. It was a leftover artefact from development that previously escaped cleanup. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-10-15 19:08:30 -04:00
Peter Mitsis	cc415bc139	kernel: Apply 'unlikely' attribute Applies the 'unlikely' attribute to various kernel objects that use z_unpend_first_thread() to optimize for the non-blocking path. This boosts the thread_metric synchronization benchmark numbers on the frdm_k64f board by about 10%. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-10-15 04:06:32 -04:00
Anas Nashif	121cb49a46	kernel: sched: inline update_cache This improves context switching by 7% when measured using the thread_metric benchmark. Before: ** Thread-Metric Preemptive Scheduling Test Relative Time: 120 Time Period Total: 5451879 After: Thread-Metric Preemptive Scheduling Test ** Relative Time: 30 Time Period Total: 5853535 Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2024-10-10 20:21:04 -04:00
Peter Mitsis	318b49570a	tests: scheduler queue benchmarks Implements a set of tests designed to show how the performance of the three scheduler queue implementations (DUMB, SCALABLE and MULTIQ) varies with respect to the number of threads in the ready queue. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-10-07 20:16:20 -04:00
Anas Nashif	ca09a4b91c	doc: kernel/arch: fix some wrong doxygen references Remove non-existing references and document parameters. Signed-off-by: Anas Nashif <anas.nashif@intel.com>	2024-09-17 05:24:09 -04:00
Peter Mitsis	8c48665868	kernel: Remove k_thread_priority_set() restriction Removes the ISR restriction from k_thread_priority_set(). Since the first commit, the routine k_thread_priority_set() has had a restriction preventing it from being called from within the context of an ISR. As there does not (any longer) appear to be a reason why the restriction exists, it is being removed. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-08-02 03:29:55 -04:00
Peter Mitsis	20dee1a6e9	kernel: Tweak z_unpend_thread_no_timeout() API Removes the ALWAYS_INLINE attribute from the definition of the routine z_unpend_thread_no_timeout() to fix an unresolved symbol error that was occurring with some compilers. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-07-27 10:47:41 +03:00
Hess Nathan	980d3f4c4f	kernel: corrected parameter names - applied the exact parameter names of the interface to implementation Signed-off-by: Hess Nathan <nhess@baumer.com>	2024-07-08 12:18:31 -04:00
Pisit Sawangvonganan	5ed3cd4bc9	kernel: fix typo Utilize a code spell-checking tool to scan for and correct spelling errors in all files within the `kernel` directory. Signed-off-by: Pisit Sawangvonganan <pisit@ndrsolution.com>	2024-07-08 15:51:37 +02:00
Peter Mitsis	ada3c90f8f	kernel: Add CONFIG_SCHED_IPI_CASCADE When this new Kconfig option is enabled, preempted threads on an SMP system may generate additional IPIs when they are switched out. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-06-21 20:49:11 -04:00
TaiJu Wu	555c07ef08	sched: Limit deadline scheduler parameter The deadline of deadline scheduler should lager than zero because if deadline is negative, it menas the task should be finished in past. Signed-off-by: TaiJu Wu <tjwu1217@gmail.com>	2024-06-15 05:13:41 -04:00
Peter Mitsis	0bcdae2c62	kernel: Add CONFIG_ARCH_HAS_DIRECTED_IPIS Platforms that support IPIs allow them to be broadcast via the new arch_sched_broadcast_ipi() routine (replacing arch_sched_ipi()). Those that also allow IPIs to be directed to specific CPUs may use arch_sched_directed_ipi() to do so. As the kernel has the capability to track which CPUs may need an IPI (see CONFIG_IPI_OPTIMIZE), this commit updates the signalling of tracked IPIs to use the directed version if supported; otherwise they continue to use the broadcast version. Platforms that allow directed IPIs may see a significant reduction in the number of IPI related ISRs when CONFIG_IPI_OPTIMIZE is enabled and the number of CPUs increases. These platforms can be identified by the Kconfig option CONFIG_ARCH_HAS_DIRECTED_IPIS. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-06-04 22:35:54 -04:00
Peter Mitsis	d8a4c8a90c	kernel: Add CONFIG_IPI_OPTIMIZE The CONFIG_IPI_OPTIMIZE configuration option allows for the flagging and subsequent signaling of IPIs to be optimized. It does this by making each bit in the kernel's pending_ipi field a flag that indicates whether the corresponding CPU might need an IPI to trigger the scheduling of a new thread on that CPU. When a new thread is made ready, we compare that thread against each of the threads currently executing on the other CPUs. If there is a chance that that thread should preempt the thread on the other CPU then we flag that an IPI is needed for that CPU. That is, a clear bit indicates that the CPU absolutely will not need to reschedule, while a set bit indicates that the target CPU must make that determination for itself. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-06-04 22:35:54 -04:00
Peter Mitsis	9ff5221d23	kernel: Update IPI usage in k_thread_priority_set() 1. The flagging of IPIs is moved out of k_thread_priority_set() into z_thread_prio_set(). This allows for an IPI to be done for a thread that had its priority bumped due to the handling of priority inheritance from a mutex. 2. k_thread_priority_set()'s check for sched_locked only applies to non-SMP builds that are using the old arch_swap() framework to switch between threads. Incidentally, nearly all calls to flag_ipi() are now performed with sched_spinlock being locked. The only exception is in slice_timeout(). Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>	2024-06-04 22:35:54 -04:00
Yong Cong Sin	bbe5e1e6eb	build: namespace the generated headers with `zephyr/` Namespaced the generated headers with `zephyr` to prevent potential conflict with other headers. Introduce a temporary Kconfig `LEGACY_GENERATED_INCLUDE_PATH` that is enabled by default. This allows the developers to continue the use of the old include paths for the time being until it is deprecated and eventually removed. The Kconfig will generate a build-time warning message, similar to the `CONFIG_TIMER_RANDOM_GENERATOR`. Updated the includes path of in-tree sources accordingly. Most of the changes here are scripted, check the PR for more info. Signed-off-by: Yong Cong Sin <ycsin@meta.com>	2024-05-28 22:03:55 +02:00
Hess Nathan	20b55425d3	coding guidelines: comply with MISRA Rule 13.4 avoid the direct use of assignment expression values for conditions Signed-off-by: Hess Nathan <nhess@baumer.com>	2024-05-28 10:07:31 +02:00
Hess Nathan	6d417d52c2	coding guidelines: comply with MISRA Rule 12.1. added parentheses verifying lack of ambiguities Signed-off-by: Hess Nathan <nhess@baumer.com>	2024-05-12 13:37:27 -04:00
Andy Ross	dec022a848	kernel/sched: Fix edge^2 case in abort/join The previous abort-lifecycle fix missed a case: other threads can enter k_thread_join(), see that the thread is already dead, and then need to call z_thread_switch_spin() to wait for a context switch. But the new "dummification" code was (by design!) terminating the thread such that no context would be saved to it. So switch_handle stayed NULL and if you hit that timing case correctly[1] you'd deadlock waiting for a switch that would never come. Fix is just to set switch_handle when dummifying to any non-NULL value. Also add an assertion to catch the obvious case that a thread is actually dead on the exit path of k_thread_abort() to make sure the variant path continues to set flags correctly [1] CI was doing it fairly reliably via tests/kernel/smp_abort on qemu_cortex_a53 only. Only one of my dev systems could see it, and then only about 15% of the time. Signed-off-by: Andy Ross <andyross@google.com>	2024-05-02 13:55:03 -04:00

1 2 3 4 5 ...

399 commits