On Thu, Nov 28, 2024 at 01:18:39PM +0200, Julian Anastasov wrote:
>
> Hello,
>
> On Thu, 28 Nov 2024, Paolo Abeni wrote:
>
> > On 11/23/24 10:42, Jinghao Jia wrote:
> > > Under certain kernel configurations when building with Clang/LLVM, the
> > > compiler does not generate a return or jump as the terminator
> > > instruction for ip_vs_protocol_init(), triggering the following objtool
> > > warning during build time:
> > >
> > > vmlinux.o: warning: objtool: ip_vs_protocol_init() falls through to
> > > next function __initstub__kmod_ip_vs_rr__935_123_ip_vs_rr_init6()
> > >
> > > At runtime, this either causes an oops when trying to load the ipvs
> > > module or a boot-time panic if ipvs is built-in. This same issue has
> > > been reported by the Intel kernel test robot previously.
> > >
> > > Digging deeper into both LLVM and the kernel code reveals this to be a
> > > undefined behavior problem. ip_vs_protocol_init() uses a on-stack buffer
> > > of 64 chars to store the registered protocol names and leaves it
> > > uninitialized after definition. The function calls strnlen() when
> > > concatenating protocol names into the buffer. With CONFIG_FORTIFY_SOURCE
> > > strnlen() performs an extra step to check whether the last byte of the
> > > input char buffer is a null character (commit 3009f891bb9f ("fortify:
> > > Allow strlen() and strnlen() to pass compile-time known lengths")).
> > > This, together with possibly other configurations, cause the following
> > > IR to be generated:
> > >
> > > define hidden i32 @ip_vs_protocol_init() local_unnamed_addr #5 section
> > > ".init.text" align 16 !kcfi_type !29 {
> > > %1 = alloca [64 x i8], align 16
> > > ...
> > >
> > > 14: ; preds = %11
> > > %15 = getelementptr inbounds i8, ptr %1, i64 63
> > > %16 = load i8, ptr %15, align 1
> > > %17 = tail call i1 @llvm.is.constant.i8(i8 %16)
> > > %18 = icmp eq i8 %16, 0
> > > %19 = select i1 %17, i1 %18, i1 false
> > > br i1 %19, label %20, label %23
> > >
> > > 20: ; preds = %14
> > > %21 = call i64 @strlen(ptr noundef nonnull dereferenceable(1) %1) #23
> > > ...
> > >
> > > 23: ; preds = %14, %11,
> > > %20
> > > %24 = call i64 @strnlen(ptr noundef nonnull dereferenceable(1) %1,
> > > i64 noundef 64) #24
> > > ...
> > > }
> > >
> > > The above code calculates the address of the last char in the buffer
> > > (value %15) and then loads from it (value %16). Because the buffer is
> > > never initialized, the LLVM GVN pass marks value %16 as undefined:
> > >
> > > %13 = getelementptr inbounds i8, ptr %1, i64 63
> > > br i1 undef, label %14, label %17
> > >
> > > This gives later passes (SCCP, in particular) more DCE opportunities by
> > > propagating the undef value further, and eventually removes everything
> > > after the load on the uninitialized stack location:
> > >
> > > define hidden i32 @ip_vs_protocol_init() local_unnamed_addr #0 section
> > > ".init.text" align 16 !kcfi_type !11 {
> > > %1 = alloca [64 x i8], align 16
> > > ...
> > >
> > > 12: ; preds = %11
> > > %13 = getelementptr inbounds i8, ptr %1, i64 63
> > > unreachable
> > > }
> > >
> > > In this way, the generated native code will just fall through to the
> > > next function, as LLVM does not generate any code for the unreachable IR
> > > instruction and leaves the function without a terminator.
> > >
> > > Zero the on-stack buffer to avoid this possible UB.
> > >
> > > Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
> > > Reported-by: kernel test robot <lkp@xxxxxxxxx>
> > > Closes:
> > > https://lore.kernel.org/oe-kbuild-all/202402100205.PWXIz1ZK-lkp@xxxxxxxxx/
> > > Co-developed-by: Ruowen Qin <ruqin@xxxxxxxxxx>
> > > Signed-off-by: Ruowen Qin <ruqin@xxxxxxxxxx>
> > > Signed-off-by: Jinghao Jia <jinghao7@xxxxxxxxxxxx>
> >
> > @Pablo, @Simon, @Julian: recent ipvs patches landed either on the
> > net(-next) trees or the netfiler trees according to a random (?) pattern.
> >
> > What is your preference here? Should such patches go via netfilter or
> > net? Or something else. FTR, I *think* netfilter should be the
> > preferable target, but I'm open to other options.
>
> IPVS patches should go always via Netfilter trees.
> It is my fault to tell people to use the 'net' tag, I'll
> recommend the proper nf tree the next time. Sorry for the
> confusion.
No issue, I have applied this to nf.git, thanks for the clarification.
|