On Thu, Sep 08, 2022 at 05:35:21PM +0200, Jiri Wiesner wrote:
> There is an alternative design where you could increase kd->est_max_count for
> all kthreads once all of the available kthreads have kd->est_max_count
> estimators. Nevertheless, there would also have to be a limit to the value of
> kd->est_max_count. Imagine the estimation during a single tick would take so
> long that the gap variable in ip_vs_estimation_kthread() would become
> negative. You would need to have circa 250,000 estimators per kthread. Since
> you are already measuring the timeout you need for schedule_timeout() in
> ip_vs_estimation_kthread(), it should be possible to set the
> kd->est_max_count limit based on the maximum processing time per chain. It
> could be half a IPVS_EST_TICK, for example.
>
> But it seems to me that the alternative design - increasing kd->est_max_count
> - should have some support in what is used in production. Are there servers
> with more than 983,040 estimators (which would be IPVS_EST_MAX_COUNT * 30
> kthreads) or even one third of that?
I did some profiling (but could have just looked at top, actually) of a kthread
with IPVS_EST_MAX_COUNT estimators for 100 seconds:
# Samples: 4K of event 'bus-cycles'
# Event count (approx.): 125024900
# Overhead Period Command Shared Object Symbol
# ........ ............ ............... .................
.........................................
#
76.44% 95570475 ipvs-e:0:0 [kernel.kallsyms] [k]
ip_vs_estimation_kthread
8.75% 10935925 ipvs-e:0:0 [kernel.kallsyms] [k] _find_next_bit
3.18% 3978975 swapper [kernel.kallsyms] [k] intel_idle
1.00% 1251250 ipvs-e:0:0 [kernel.kallsyms] [k]
_raw_spin_lock_bh
0.36% 450450 swapper [kernel.kallsyms] [k] _raw_spin_lock
0.36% 450450 swapper [kernel.kallsyms] [k]
update_rq_clock
The bus-cycles event on this particular machine makes 25,000,000 events per
second. Based on the period in the profile, the CPU utilization for various
functions is:
ip_vs_estimation_kthread: 95570475/100/25000000*100 = 3.82%
_find_next_bit: 10935925/100/25000000*100 = 0.44%
_raw_spin_lock_bh: 1251250/100/25000000*100 = 0.05%
The kthread could definitely utilize the CPU more, which is an argument for
increasing kd->est_max_count.
--
Jiri Wiesner
SUSE Labs
|