commit ffdda93091577191b4b91168d1fabeb004d1fa89
Author: Alexandre Frade <kernel@xanmod.org>
Date:   Sat Feb 5 23:42:05 2022 +0000

    Linux 5.16.7-xanmod1
    
    Signed-off-by: Alexandre Frade <kernel@xanmod.org>

commit 84cdec9f11c2304380a8254851876f539a5d215d
Merge: df507e82507e a8d80f1f3cb7
Author: Alexandre Frade <kernel@xanmod.org>
Date:   Sat Feb 5 23:35:26 2022 +0000

    Merge tag 'v5.16.7' into 5.16
    
    This is the 5.16.7 stable release

commit a8d80f1f3cb77a229cfbefea6220e60e70d5e47a
Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Sat Feb 5 19:22:06 2022 +0100

    Linux 5.16.7
    
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit aec5a6ab2648dcf0f448359084d314a3ae0d6607
Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Sat Feb 5 19:01:29 2022 +0100

    Revert "drm/vc4: hdmi: Make sure the device is powered with CEC" again
    
    This reverts commit 9b0d360fd783c711fc1cafa51f3e03bdf8ca5518 which is
    commit 20b0dfa86bef0e80b41b0e5ac38b92f23b6f27f9 upstream.
    
    It wasn't applied correctly, something went wrong with an attempt to fix
    it up again, so just revert the whole thing to be back at a clean state.
    
    Reported-by: Guenter Roeck <linux@roeck-us.net>
    Link: https://lore.kernel.org/r/20220205171238.GA3073350@roeck-us.net
    Reported-by: Alexey Khoroshilov <khoroshilov@ispras.ru>
    Link: https://lore.kernel.org/r/Yf5lNIJnvhP4ajam@kroah.com
    Cc: Dave Stevenson <dave.stevenson@raspberrypi.com>
    Cc: Maxime Ripard <maxime@cerno.tech>
    Cc: Michael Stapelberg <michael+drm@stapelberg.ch>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 920d84d2c7ddc2ac2e272a5338f5fb766b3e4fe6
Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Sat Feb 5 19:01:20 2022 +0100

    Revert "drm/vc4: hdmi: Make sure the device is powered with CEC"
    
    This reverts commit 3a63b718200be5b1997f6eb28e5e688ec58ec41b which is
    commit 20b0dfa86bef0e80b41b0e5ac38b92f23b6f27f9 upstream.
    
    It wasn't applied correctly, something went wrong with an attempt to fix
    it up again, so just revert the whole thing to be back at a clean state.
    
    Reported-by: Guenter Roeck <linux@roeck-us.net>
    Link: https://lore.kernel.org/r/20220205171238.GA3073350@roeck-us.net
    Reported-by: Alexey Khoroshilov <khoroshilov@ispras.ru>
    Link: https://lore.kernel.org/r/Yf5lNIJnvhP4ajam@kroah.com
    Cc: Dave Stevenson <dave.stevenson@raspberrypi.com>
    Cc: Maxime Ripard <maxime@cerno.tech>
    Cc: Michael Stapelberg <michael+drm@stapelberg.ch>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 20e6a329f12d6f8862602eaf5e74544a768c0723
Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Sat Feb 5 12:39:58 2022 +0100

    Linux 5.16.6
    
    Link: https://lore.kernel.org/r/20220204091917.166033635@linuxfoundation.org
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Ronald Warsow <rwarsow@gmx.de>
    Tested-by: Florian Fainelli <f.fainelli@gmail.com>
    Tested-by: Shuah Khan <skhan@linuxfoundation.org>
    Tested-by: Justin M. Forbes <jforbes@fedoraproject.org>
    Tested-by: Guenter Roeck <linux@roeck-us.net>
    Tested-by: Rudi Heitbaum <rudi@heitbaum.com>
    Tested-by: Slade Watkins <slade@sladewatkins.com>
    Tested-by: Linux Kernel Functional Testing <lkft@linaro.org>
    Tested-by: Scott Bruce <smbruce@gmail.com>
    Tested-by: Bagas Sanjaya <bagasdotme@gmail.com>
    Tested-by: Ron Economos <re@w6rz.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 9c7f8a35c5a83740c0e3ea540b6ad145c50d79aa
Author: Christoph Fritz <chf.fritz@googlemail.com>
Date:   Wed Jan 12 19:33:21 2022 +0100

    ovl: fix NULL pointer dereference in copy up warning
    
    commit 4ee7e4a6c9b298da44029ed9ec8ed23ae49cc209 upstream.
    
    This patch is fixing a NULL pointer dereference to get a recently
    introduced warning message working.
    
    Fixes: 5b0a414d06c3 ("ovl: fix filattr copy-up failure")
    Signed-off-by: Christoph Fritz <chf.fritz@googlemail.com>
    Cc: <stable@vger.kernel.org> # v5.15
    Signed-off-by: Miklos Szeredi <mszeredi@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 5afee7b2b1b27b9b001c11b671ba1bad57c4ff4f
Author: Eric Dumazet <edumazet@google.com>
Date:   Tue Feb 1 10:46:40 2022 -0800

    tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
    
    commit b67985be400969578d4d4b17299714c0e5d2c07b upstream.
    
    tcp_shift_skb_data() might collapse three packets into a larger one.
    
    P_A, P_B, P_C  -> P_ABC
    
    Historically, it used a single tcp_skb_can_collapse_to(P_A) call,
    because it was enough.
    
    In commit 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions"),
    this call was replaced by a call to tcp_skb_can_collapse(P_A, P_B)
    
    But the now needed test over P_C has been missed.
    
    This probably broke MPTCP.
    
    Then later, commit 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
    added an extra condition to tcp_skb_can_collapse(), but the missing call
    from tcp_shift_skb_data() is also breaking TCP zerocopy, because P_A and P_C
    might have different skb_zcopy_pure() status.
    
    Fixes: 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions")
    Fixes: 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Mat Martineau <mathew.j.martineau@linux.intel.com>
    Cc: Talal Ahmad <talalahmad@google.com>
    Cc: Arjun Roy <arjunroy@google.com>
    Cc: Willem de Bruijn <willemb@google.com>
    Acked-by: Soheil Hassas Yeganeh <soheil@google.com>
    Acked-by: Paolo Abeni <pabeni@redhat.com>
    Link: https://lore.kernel.org/r/20220201184640.756716-1-eric.dumazet@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 0c6e3cd1d5ed64e5aa740cd476ddd0dcf76f873b
Author: Eric Dumazet <edumazet@google.com>
Date:   Mon Jan 31 22:52:54 2022 -0800

    tcp: fix mem under-charging with zerocopy sendmsg()
    
    commit 479f5547239d970d3833f15f54a6481fffdb91ec upstream.
    
    We got reports of following warning in inet_sock_destruct()
    
            WARN_ON(sk_forward_alloc_get(sk));
    
    Whenever we add a non zero-copy fragment to a pure zerocopy skb,
    we have to anticipate that whole skb->truesize will be uncharged
    when skb is finally freed.
    
    skb->data_len is the payload length. But the memory truesize
    estimated by __zerocopy_sg_from_iter() is page aligned.
    
    Fixes: 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Talal Ahmad <talalahmad@google.com>
    Cc: Arjun Roy <arjunroy@google.com>
    Cc: Willem de Bruijn <willemb@google.com>
    Acked-by: Soheil Hassas Yeganeh <soheil@google.com>
    Link: https://lore.kernel.org/r/20220201065254.680532-1-eric.dumazet@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c074e2c8d67af5f0b1fb953d4ff960a8dc854eab
Author: Eric Dumazet <edumazet@google.com>
Date:   Mon Jan 31 18:23:58 2022 -0800

    af_packet: fix data-race in packet_setsockopt / packet_setsockopt
    
    commit e42e70ad6ae2ae511a6143d2e8da929366e58bd9 upstream.
    
    When packet_setsockopt( PACKET_FANOUT_DATA ) reads po->fanout,
    no lock is held, meaning that another thread can change po->fanout.
    
    Given that po->fanout can only be set once during the socket lifetime
    (it is only cleared from fanout_release()), we can use
    READ_ONCE()/WRITE_ONCE() to document the race.
    
    BUG: KCSAN: data-race in packet_setsockopt / packet_setsockopt
    
    write to 0xffff88813ae8e300 of 8 bytes by task 14653 on cpu 0:
     fanout_add net/packet/af_packet.c:1791 [inline]
     packet_setsockopt+0x22fe/0x24a0 net/packet/af_packet.c:3931
     __sys_setsockopt+0x209/0x2a0 net/socket.c:2180
     __do_sys_setsockopt net/socket.c:2191 [inline]
     __se_sys_setsockopt net/socket.c:2188 [inline]
     __x64_sys_setsockopt+0x62/0x70 net/socket.c:2188
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x44/0xd0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    
    read to 0xffff88813ae8e300 of 8 bytes by task 14654 on cpu 1:
     packet_setsockopt+0x691/0x24a0 net/packet/af_packet.c:3935
     __sys_setsockopt+0x209/0x2a0 net/socket.c:2180
     __do_sys_setsockopt net/socket.c:2191 [inline]
     __se_sys_setsockopt net/socket.c:2188 [inline]
     __x64_sys_setsockopt+0x62/0x70 net/socket.c:2188
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x44/0xd0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    
    value changed: 0x0000000000000000 -> 0xffff888106f8c000
    
    Reported by Kernel Concurrency Sanitizer on:
    CPU: 1 PID: 14654 Comm: syz-executor.3 Not tainted 5.16.0-syzkaller #0
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
    
    Fixes: 47dceb8ecdc1 ("packet: add classic BPF fanout mode")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Willem de Bruijn <willemb@google.com>
    Reported-by: syzbot <syzkaller@googlegroups.com>
    Link: https://lore.kernel.org/r/20220201022358.330621-1-eric.dumazet@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 0d208ee32f5b3729ac4e115c6826bdf69105b4f6
Author: Sasha Neftin <sasha.neftin@intel.com>
Date:   Tue Dec 7 13:23:42 2021 +0200

    e1000e: Handshake with CSME starts from ADL platforms
    
    commit cad014b7b5a6897d8c4fad13e2888978bfb7a53f upstream.
    
    Handshake with CSME/AMT on none provisioned platforms during S0ix flow
    is not supported on TGL platform and can cause to HW unit hang. Update
    the handshake with CSME flow to start from the ADL platform.
    
    Fixes: 3e55d231716e ("e1000e: Add handshake with the CSME to support S0ix")
    Signed-off-by: Sasha Neftin <sasha.neftin@intel.com>
    Tested-by: Nechama Kraus <nechamax.kraus@linux.intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit f0c1ed004ea8845c8f2d69e58641f87a4c4b1408
Author: Tianchen Ding <dtcccc@linux.alibaba.com>
Date:   Tue Jan 18 18:05:18 2022 +0800

    cpuset: Fix the bug that subpart_cpus updated wrongly in update_cpumask()
    
    commit c80d401c52a2d1baf2a5afeb06f0ffe678e56d23 upstream.
    
    subparts_cpus should be limited as a subset of cpus_allowed, but it is
    updated wrongly by using cpumask_andnot(). Use cpumask_and() instead to
    fix it.
    
    Fixes: ee8dde0cd2ce ("cpuset: Add new v2 cpuset.sched.partition flag")
    Signed-off-by: Tianchen Ding <dtcccc@linux.alibaba.com>
    Reviewed-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit ed1707d0a64d92432b9fac368983c1b4465220f0
Author: He Fengqing <hefengqing@huawei.com>
Date:   Sat Jan 22 10:29:36 2022 +0000

    bpf: Fix possible race in inc_misses_counter
    
    commit 0e3135d3bfa5dfb658145238d2bc723a8e30c3a3 upstream.
    
    It seems inc_misses_counter() suffers from same issue fixed in
    the commit d979617aa84d ("bpf: Fixes possible race in update_prog_stats()
    for 32bit arches"):
    As it can run while interrupts are enabled, it could
    be re-entered and the u64_stats syncp could be mangled.
    
    Fixes: 9ed9e9ba2337 ("bpf: Count the number of times recursion was prevented")
    Signed-off-by: He Fengqing <hefengqing@huawei.com>
    Acked-by: John Fastabend <john.fastabend@gmail.com>
    Link: https://lore.kernel.org/r/20220122102936.1219518-1-hefengqing@huawei.com
    Signed-off-by: Alexei Starovoitov <ast@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 577f932216d02157637c621dc318014bcfa5ed82
Author: Alex Elder <elder@linaro.org>
Date:   Tue Feb 1 09:02:05 2022 -0600

    net: ipa: request IPA register values be retained
    
    commit 34a081761e4e3c35381cbfad609ebae2962fe2f8 upstream.
    
    In some cases, the IPA hardware needs to request the always-on
    subsystem (AOSS) to coordinate with the IPA microcontroller to
    retain IPA register values at power collapse.  This is done by
    issuing a QMP request to the AOSS microcontroller.  A similar
    request ondoes that request.
    
    We must get and hold the "QMP" handle early, because we might get
    back EPROBE_DEFER for that.  But the actual request should be sent
    while we know the IPA clock is active, and when we know the
    microcontroller is operational.
    
    Fixes: 1aac309d3207 ("net: ipa: use autosuspend")
    Signed-off-by: Alex Elder <elder@linaro.org>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 36a9a0aee881940476b254e0352581401b23f210
Author: Eric Dumazet <edumazet@google.com>
Date:   Mon Jan 31 17:21:06 2022 -0800

    rtnetlink: make sure to refresh master_dev/m_ops in __rtnl_newlink()
    
    commit c6f6f2444bdbe0079e41914a35081530d0409963 upstream.
    
    While looking at one unrelated syzbot bug, I found the replay logic
    in __rtnl_newlink() to potentially trigger use-after-free.
    
    It is better to clear master_dev and m_ops inside the loop,
    in case we have to replay it.
    
    Fixes: ba7d49b1f0f8 ("rtnetlink: provide api for getting and setting slave info")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Jiri Pirko <jiri@nvidia.com>
    Link: https://lore.kernel.org/r/20220201012106.216495-1-eric.dumazet@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 95e34f61b58a152656cbe8d6e19843cc343fb089
Author: Eric Dumazet <edumazet@google.com>
Date:   Mon Jan 31 09:20:18 2022 -0800

    net: sched: fix use-after-free in tc_new_tfilter()
    
    commit 04c2a47ffb13c29778e2a14e414ad4cb5a5db4b5 upstream.
    
    Whenever tc_new_tfilter() jumps back to replay: label,
    we need to make sure @q and @chain local variables are cleared again,
    or risk use-after-free as in [1]
    
    For consistency, apply the same fix in tc_ctl_chain()
    
    BUG: KASAN: use-after-free in mini_qdisc_pair_swap+0x1b9/0x1f0 net/sched/sch_generic.c:1581
    Write of size 8 at addr ffff8880985c4b08 by task syz-executor.4/1945
    
    CPU: 0 PID: 1945 Comm: syz-executor.4 Not tainted 5.17.0-rc1-syzkaller-00495-gff58831fa02d #0
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:88 [inline]
     dump_stack_lvl+0xcd/0x134 lib/dump_stack.c:106
     print_address_description.constprop.0.cold+0x8d/0x336 mm/kasan/report.c:255
     __kasan_report mm/kasan/report.c:442 [inline]
     kasan_report.cold+0x83/0xdf mm/kasan/report.c:459
     mini_qdisc_pair_swap+0x1b9/0x1f0 net/sched/sch_generic.c:1581
     tcf_chain_head_change_item net/sched/cls_api.c:372 [inline]
     tcf_chain0_head_change.isra.0+0xb9/0x120 net/sched/cls_api.c:386
     tcf_chain_tp_insert net/sched/cls_api.c:1657 [inline]
     tcf_chain_tp_insert_unique net/sched/cls_api.c:1707 [inline]
     tc_new_tfilter+0x1e67/0x2350 net/sched/cls_api.c:2086
     rtnetlink_rcv_msg+0x80d/0xb80 net/core/rtnetlink.c:5583
     netlink_rcv_skb+0x153/0x420 net/netlink/af_netlink.c:2494
     netlink_unicast_kernel net/netlink/af_netlink.c:1317 [inline]
     netlink_unicast+0x539/0x7e0 net/netlink/af_netlink.c:1343
     netlink_sendmsg+0x904/0xe00 net/netlink/af_netlink.c:1919
     sock_sendmsg_nosec net/socket.c:705 [inline]
     sock_sendmsg+0xcf/0x120 net/socket.c:725
     ____sys_sendmsg+0x331/0x810 net/socket.c:2413
     ___sys_sendmsg+0xf3/0x170 net/socket.c:2467
     __sys_sendmmsg+0x195/0x470 net/socket.c:2553
     __do_sys_sendmmsg net/socket.c:2582 [inline]
     __se_sys_sendmmsg net/socket.c:2579 [inline]
     __x64_sys_sendmmsg+0x99/0x100 net/socket.c:2579
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    RIP: 0033:0x7f2647172059
    Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b8 ff ff ff f7 d8 64 89 01 48
    RSP: 002b:00007f2645aa5168 EFLAGS: 00000246 ORIG_RAX: 0000000000000133
    RAX: ffffffffffffffda RBX: 00007f2647285100 RCX: 00007f2647172059
    RDX: 040000000000009f RSI: 00000000200002c0 RDI: 0000000000000006
    RBP: 00007f26471cc08d R08: 0000000000000000 R09: 0000000000000000
    R10: 9e00000000000000 R11: 0000000000000246 R12: 0000000000000000
    R13: 00007fffb3f7f02f R14: 00007f2645aa5300 R15: 0000000000022000
     </TASK>
    
    Allocated by task 1944:
     kasan_save_stack+0x1e/0x40 mm/kasan/common.c:38
     kasan_set_track mm/kasan/common.c:45 [inline]
     set_alloc_info mm/kasan/common.c:436 [inline]
     ____kasan_kmalloc mm/kasan/common.c:515 [inline]
     ____kasan_kmalloc mm/kasan/common.c:474 [inline]
     __kasan_kmalloc+0xa9/0xd0 mm/kasan/common.c:524
     kmalloc_node include/linux/slab.h:604 [inline]
     kzalloc_node include/linux/slab.h:726 [inline]
     qdisc_alloc+0xac/0xa10 net/sched/sch_generic.c:941
     qdisc_create.constprop.0+0xce/0x10f0 net/sched/sch_api.c:1211
     tc_modify_qdisc+0x4c5/0x1980 net/sched/sch_api.c:1660
     rtnetlink_rcv_msg+0x413/0xb80 net/core/rtnetlink.c:5592
     netlink_rcv_skb+0x153/0x420 net/netlink/af_netlink.c:2494
     netlink_unicast_kernel net/netlink/af_netlink.c:1317 [inline]
     netlink_unicast+0x539/0x7e0 net/netlink/af_netlink.c:1343
     netlink_sendmsg+0x904/0xe00 net/netlink/af_netlink.c:1919
     sock_sendmsg_nosec net/socket.c:705 [inline]
     sock_sendmsg+0xcf/0x120 net/socket.c:725
     ____sys_sendmsg+0x331/0x810 net/socket.c:2413
     ___sys_sendmsg+0xf3/0x170 net/socket.c:2467
     __sys_sendmmsg+0x195/0x470 net/socket.c:2553
     __do_sys_sendmmsg net/socket.c:2582 [inline]
     __se_sys_sendmmsg net/socket.c:2579 [inline]
     __x64_sys_sendmmsg+0x99/0x100 net/socket.c:2579
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    
    Freed by task 3609:
     kasan_save_stack+0x1e/0x40 mm/kasan/common.c:38
     kasan_set_track+0x21/0x30 mm/kasan/common.c:45
     kasan_set_free_info+0x20/0x30 mm/kasan/generic.c:370
     ____kasan_slab_free mm/kasan/common.c:366 [inline]
     ____kasan_slab_free+0x130/0x160 mm/kasan/common.c:328
     kasan_slab_free include/linux/kasan.h:236 [inline]
     slab_free_hook mm/slub.c:1728 [inline]
     slab_free_freelist_hook+0x8b/0x1c0 mm/slub.c:1754
     slab_free mm/slub.c:3509 [inline]
     kfree+0xcb/0x280 mm/slub.c:4562
     rcu_do_batch kernel/rcu/tree.c:2527 [inline]
     rcu_core+0x7b8/0x1540 kernel/rcu/tree.c:2778
     __do_softirq+0x29b/0x9c2 kernel/softirq.c:558
    
    Last potentially related work creation:
     kasan_save_stack+0x1e/0x40 mm/kasan/common.c:38
     __kasan_record_aux_stack+0xbe/0xd0 mm/kasan/generic.c:348
     __call_rcu kernel/rcu/tree.c:3026 [inline]
     call_rcu+0xb1/0x740 kernel/rcu/tree.c:3106
     qdisc_put_unlocked+0x6f/0x90 net/sched/sch_generic.c:1109
     tcf_block_release+0x86/0x90 net/sched/cls_api.c:1238
     tc_new_tfilter+0xc0d/0x2350 net/sched/cls_api.c:2148
     rtnetlink_rcv_msg+0x80d/0xb80 net/core/rtnetlink.c:5583
     netlink_rcv_skb+0x153/0x420 net/netlink/af_netlink.c:2494
     netlink_unicast_kernel net/netlink/af_netlink.c:1317 [inline]
     netlink_unicast+0x539/0x7e0 net/netlink/af_netlink.c:1343
     netlink_sendmsg+0x904/0xe00 net/netlink/af_netlink.c:1919
     sock_sendmsg_nosec net/socket.c:705 [inline]
     sock_sendmsg+0xcf/0x120 net/socket.c:725
     ____sys_sendmsg+0x331/0x810 net/socket.c:2413
     ___sys_sendmsg+0xf3/0x170 net/socket.c:2467
     __sys_sendmmsg+0x195/0x470 net/socket.c:2553
     __do_sys_sendmmsg net/socket.c:2582 [inline]
     __se_sys_sendmmsg net/socket.c:2579 [inline]
     __x64_sys_sendmmsg+0x99/0x100 net/socket.c:2579
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    
    The buggy address belongs to the object at ffff8880985c4800
     which belongs to the cache kmalloc-1k of size 1024
    The buggy address is located 776 bytes inside of
     1024-byte region [ffff8880985c4800, ffff8880985c4c00)
    The buggy address belongs to the page:
    page:ffffea0002617000 refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x985c0
    head:ffffea0002617000 order:3 compound_mapcount:0 compound_pincount:0
    flags: 0xfff00000010200(slab|head|node=0|zone=1|lastcpupid=0x7ff)
    raw: 00fff00000010200 0000000000000000 dead000000000122 ffff888010c41dc0
    raw: 0000000000000000 0000000000100010 00000001ffffffff 0000000000000000
    page dumped because: kasan: bad access detected
    page_owner tracks the page as allocated
    page last allocated via order 3, migratetype Unmovable, gfp_mask 0x1d20c0(__GFP_IO|__GFP_FS|__GFP_NOWARN|__GFP_NORETRY|__GFP_COMP|__GFP_NOMEMALLOC|__GFP_HARDWALL), pid 1941, ts 1038999441284, free_ts 1033444432829
     prep_new_page mm/page_alloc.c:2434 [inline]
     get_page_from_freelist+0xa72/0x2f50 mm/page_alloc.c:4165
     __alloc_pages+0x1b2/0x500 mm/page_alloc.c:5389
     alloc_pages+0x1aa/0x310 mm/mempolicy.c:2271
     alloc_slab_page mm/slub.c:1799 [inline]
     allocate_slab mm/slub.c:1944 [inline]
     new_slab+0x28a/0x3b0 mm/slub.c:2004
     ___slab_alloc+0x87c/0xe90 mm/slub.c:3018
     __slab_alloc.constprop.0+0x4d/0xa0 mm/slub.c:3105
     slab_alloc_node mm/slub.c:3196 [inline]
     slab_alloc mm/slub.c:3238 [inline]
     __kmalloc+0x2fb/0x340 mm/slub.c:4420
     kmalloc include/linux/slab.h:586 [inline]
     kzalloc include/linux/slab.h:715 [inline]
     __register_sysctl_table+0x112/0x1090 fs/proc/proc_sysctl.c:1335
     neigh_sysctl_register+0x2c8/0x5e0 net/core/neighbour.c:3787
     devinet_sysctl_register+0xb1/0x230 net/ipv4/devinet.c:2618
     inetdev_init+0x286/0x580 net/ipv4/devinet.c:278
     inetdev_event+0xa8a/0x15d0 net/ipv4/devinet.c:1532
     notifier_call_chain+0xb5/0x200 kernel/notifier.c:84
     call_netdevice_notifiers_info+0xb5/0x130 net/core/dev.c:1919
     call_netdevice_notifiers_extack net/core/dev.c:1931 [inline]
     call_netdevice_notifiers net/core/dev.c:1945 [inline]
     register_netdevice+0x1073/0x1500 net/core/dev.c:9698
     veth_newlink+0x59c/0xa90 drivers/net/veth.c:1722
    page last free stack trace:
     reset_page_owner include/linux/page_owner.h:24 [inline]
     free_pages_prepare mm/page_alloc.c:1352 [inline]
     free_pcp_prepare+0x374/0x870 mm/page_alloc.c:1404
     free_unref_page_prepare mm/page_alloc.c:3325 [inline]
     free_unref_page+0x19/0x690 mm/page_alloc.c:3404
     release_pages+0x748/0x1220 mm/swap.c:956
     tlb_batch_pages_flush mm/mmu_gather.c:50 [inline]
     tlb_flush_mmu_free mm/mmu_gather.c:243 [inline]
     tlb_flush_mmu+0xe9/0x6b0 mm/mmu_gather.c:250
     zap_pte_range mm/memory.c:1441 [inline]
     zap_pmd_range mm/memory.c:1490 [inline]
     zap_pud_range mm/memory.c:1519 [inline]
     zap_p4d_range mm/memory.c:1540 [inline]
     unmap_page_range+0x1d1d/0x2a30 mm/memory.c:1561
     unmap_single_vma+0x198/0x310 mm/memory.c:1606
     unmap_vmas+0x16b/0x2f0 mm/memory.c:1638
     exit_mmap+0x201/0x670 mm/mmap.c:3178
     __mmput+0x122/0x4b0 kernel/fork.c:1114
     mmput+0x56/0x60 kernel/fork.c:1135
     exit_mm kernel/exit.c:507 [inline]
     do_exit+0xa3c/0x2a30 kernel/exit.c:793
     do_group_exit+0xd2/0x2f0 kernel/exit.c:935
     __do_sys_exit_group kernel/exit.c:946 [inline]
     __se_sys_exit_group kernel/exit.c:944 [inline]
     __x64_sys_exit_group+0x3a/0x50 kernel/exit.c:944
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    
    Memory state around the buggy address:
     ffff8880985c4a00: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
     ffff8880985c4a80: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
    >ffff8880985c4b00: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
                          ^
     ffff8880985c4b80: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
     ffff8880985c4c00: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    
    Fixes: 470502de5bdb ("net: sched: unlock rules update API")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Vlad Buslov <vladbu@mellanox.com>
    Cc: Jiri Pirko <jiri@mellanox.com>
    Cc: Cong Wang <xiyou.wangcong@gmail.com>
    Reported-by: syzbot <syzkaller@googlegroups.com>
    Link: https://lore.kernel.org/r/20220131172018.3704490-1-eric.dumazet@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit dea4fec0d87d4401b5d2717aa7c6c6cad050fb62
Author: Dan Carpenter <dan.carpenter@oracle.com>
Date:   Fri Jan 28 22:57:01 2022 +0300

    fanotify: Fix stale file descriptor in copy_event_to_user()
    
    commit ee12595147ac1fbfb5bcb23837e26dd58d94b15d upstream.
    
    This code calls fd_install() which gives the userspace access to the fd.
    Then if copy_info_records_to_user() fails it calls put_unused_fd(fd) but
    that will not release it and leads to a stale entry in the file
    descriptor table.
    
    Generally you can't trust the fd after a call to fd_install().  The fix
    is to delay the fd_install() until everything else has succeeded.
    
    Fortunately it requires CAP_SYS_ADMIN to reach this code so the security
    impact is less.
    
    Fixes: f644bc449b37 ("fanotify: fix copy_event_to_user() fid error clean up")
    Link: https://lore.kernel.org/r/20220128195656.GA26981@kili
    Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
    Reviewed-by: Mathias Krause <minipli@grsecurity.net>
    Signed-off-by: Jan Kara <jack@suse.cz>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit e8f73f620fee5f52653ed2da360121e4446575c5
Author: Shyam Sundar S K <Shyam-sundar.S-k@amd.com>
Date:   Thu Jan 27 14:50:03 2022 +0530

    net: amd-xgbe: Fix skb data length underflow
    
    commit 5aac9108a180fc06e28d4e7fb00247ce603b72ee upstream.
    
    There will be BUG_ON() triggered in include/linux/skbuff.h leading to
    intermittent kernel panic, when the skb length underflow is detected.
    
    Fix this by dropping the packet if such length underflows are seen
    because of inconsistencies in the hardware descriptors.
    
    Fixes: 622c36f143fc ("amd-xgbe: Fix jumbo MTU processing on newer hardware")
    Suggested-by: Tom Lendacky <thomas.lendacky@amd.com>
    Signed-off-by: Shyam Sundar S K <Shyam-sundar.S-k@amd.com>
    Acked-by: Tom Lendacky <thomas.lendacky@amd.com>
    Link: https://lore.kernel.org/r/20220127092003.2812745-1-Shyam-sundar.S-k@amd.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit dc2c6fdbfc1af540eb0e8a81bf330a747846931f
Author: Raju Rangoju <Raju.Rangoju@amd.com>
Date:   Thu Jan 27 11:32:22 2022 +0530

    net: amd-xgbe: ensure to reset the tx_timer_active flag
    
    commit 7674b7b559b683478c3832527c59bceb169e701d upstream.
    
    Ensure to reset the tx_timer_active flag in xgbe_stop(),
    otherwise a port restart may result in tx timeout due to
    uncleared flag.
    
    Fixes: c635eaacbf77 ("amd-xgbe: Remove Tx coalescing")
    Co-developed-by: Sudheesh Mavila <sudheesh.mavila@amd.com>
    Signed-off-by: Sudheesh Mavila <sudheesh.mavila@amd.com>
    Signed-off-by: Raju Rangoju <Raju.Rangoju@amd.com>
    Acked-by: Tom Lendacky <thomas.lendacky@amd.com>
    Link: https://lore.kernel.org/r/20220127060222.453371-1-Raju.Rangoju@amd.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2754d83160c96ae22afff8687ddb575d3b790587
Author: Karen Sornek <karen.sornek@intel.com>
Date:   Wed Jan 12 10:19:47 2022 +0100

    i40e: Fix reset path while removing the driver
    
    commit 6533e558c6505e94c3e0ed4281ed5e31ec985f4d upstream.
    
    Fix the crash in kernel while dereferencing the NULL pointer,
    when the driver is unloaded and simultaneously the VSI rings
    are being stopped.
    
    The hardware requires 50msec in order to finish RX queues
    disable. For this purpose the driver spins in mdelay function
    for the operation to be completed.
    
    For example changing number of queues which requires reset would
    fail in the following call stack:
    
    1) i40e_prep_for_reset
    2) i40e_pf_quiesce_all_vsi
    3) i40e_quiesce_vsi
    4) i40e_vsi_close
    5) i40e_down
    6) i40e_vsi_stop_rings
    7) i40e_vsi_control_rx -> disable requires the delay of 50msecs
    8) continue back in i40e_down function where
       i40e_clean_tx_ring(vsi->tx_rings[i]) is going to crash
    
    When the driver was spinning vsi_release called
    i40e_vsi_free_arrays where the vsi->tx_rings resources
    were freed and the pointer was set to NULL.
    
    Fixes: 5b6d4a7f20b0 ("i40e: Fix crash during removing i40e driver")
    Signed-off-by: Slawomir Laba <slawomirx.laba@intel.com>
    Signed-off-by: Sylwester Dziedziuch <sylwesterx.dziedziuch@intel.com>
    Signed-off-by: Karen Sornek <karen.sornek@intel.com>
    Tested-by: Gurucharan G <gurucharanx.g@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c448999f0e8685fff4cb23f059a2daae72d2433d
Author: Jedrzej Jagielski <jedrzej.jagielski@intel.com>
Date:   Tue Dec 14 10:08:22 2021 +0000

    i40e: Fix reset bw limit when DCB enabled with 1 TC
    
    commit 3d2504663c41104b4359a15f35670cfa82de1bbf upstream.
    
    There was an AQ error I40E_AQ_RC_EINVAL when trying
    to reset bw limit as part of bw allocation setup.
    This was caused by trying to reset bw limit with
    DCB enabled. Bw limit should not be reset when
    DCB is enabled. The code was relying on the pf->flags
    to check if DCB is enabled but if only 1 TC is available
    this flag will not be set even though DCB is enabled.
    Add a check for number of TC and if it is 1
    don't try to reset bw limit even if pf->flags shows
    DCB as disabled.
    
    Fixes: fa38e30ac73f ("i40e: Fix for Tx timeouts when interface is brought up if DCB is enabled")
    Suggested-by: Alexander Lobakin <alexandr.lobakin@intel.com> # Flatten the condition
    Signed-off-by: Sylwester Dziedziuch <sylwesterx.dziedziuch@intel.com>
    Signed-off-by: Jedrzej Jagielski <jedrzej.jagielski@intel.com>
    Reviewed-by: Alexander Lobakin <alexandr.lobakin@intel.com>
    Tested-by: Imam Hassan Reza Biswas <imam.hassan.reza.biswas@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 7bf02718b736e66609fc1002a358727a945f46a6
Author: Georgi Valkov <gvalkov@abv.bg>
Date:   Tue Feb 1 08:16:18 2022 +0100

    ipheth: fix EOVERFLOW in ipheth_rcvbulk_callback
    
    commit 63e4b45c82ed1bde979da7052229a4229ce9cabf upstream.
    
    When rx_buf is allocated we need to account for IPHETH_IP_ALIGN,
    which reduces the usable size by 2 bytes. Otherwise we have 1512
    bytes usable instead of 1514, and if we receive more than 1512
    bytes, ipheth_rcvbulk_callback is called with status -EOVERFLOW,
    after which the driver malfunctiones and all communication stops.
    
    Resolves ipheth 2-1:4.2: ipheth_rcvbulk_callback: urb status: -75
    
    Fixes: f33d9e2b48a3 ("usbnet: ipheth: fix connectivity with iOS 14")
    Signed-off-by: Georgi Valkov <gvalkov@abv.bg>
    Tested-by: Jan Kiszka <jan.kiszka@siemens.com>
    Link: https://lore.kernel.org/all/B60B8A4B-92A0-49B3-805D-809A2433B46C@abv.bg/
    Link: https://lore.kernel.org/all/24851bd2769434a5fc24730dce8e8a984c5a4505.1643699778.git.jan.kiszka@siemens.com/
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit e23a2f4e43a4a18de4137a446705281a8969b375
Author: Roi Dayan <roid@nvidia.com>
Date:   Tue Feb 1 15:27:48 2022 +0200

    net/mlx5e: Avoid implicit modify hdr for decap drop rule
    
    commit 5b209d1a22afabfb7d644abb10510c5713a3e569 upstream.
    
    Currently the driver adds implicit modify hdr action for
    decap rules on tunnel devices if the port is an ovs port.
    This is also done if the action is drop and makes the modify
    hdr redundant and also the FW doesn't support it and will generate
    a syndrome.
    
    kernel: mlx5_core 0000:08:00.0: mlx5_cmd_check:777:(pid 102063): SET_FLOW_TABLE_ENTRY(0x936) op_mod(0x0) failed, status bad parameter(0x3), syndrome (0x8708c3)
    
    Fix it by adding the implicit modify hdr only for fwd actions.
    
    Fixes: b16eb3c81fe2 ("net/mlx5: Support internal port as decap route device")
    Fixes: 077cdda764c7 ("net/mlx5e: TC, Fix memory leak with rules with internal port")
    Signed-off-by: Roi Dayan <roid@nvidia.com>
    Reviewed-by: Ariel Levkovich <lariel@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 28929e393da4947f4993ebfa9dde5fb26925783d
Author: Maor Dickman <maord@nvidia.com>
Date:   Sun Jan 30 16:00:41 2022 +0200

    net/mlx5: E-Switch, Fix uninitialized variable modact
    
    commit d8e5883d694bb053b19c4142a2d1f43a34f6fe2c upstream.
    
    The variable modact is not initialized before used in command
    modify header allocation which can cause command to fail.
    
    Fix by initializing modact with zeros.
    
    Addresses-Coverity: ("Uninitialized scalar variable")
    Fixes: 8f1e0b97cc70 ("net/mlx5: E-Switch, Mark miss packets with new chain id mapping")
    Signed-off-by: Maor Dickman <maord@nvidia.com>
    Reviewed-by: Roi Dayan <roid@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit a4161e4861132d5b324746a260283e87f2d65daf
Author: Khalid Manaa <khalidm@nvidia.com>
Date:   Wed Jan 26 14:25:55 2022 +0200

    net/mlx5e: Fix broken SKB allocation in HW-GRO
    
    commit 7957837b816f11eecb9146235bb0715478f4c81f upstream.
    
    In case the HW doesn't perform header-data split, it will write the whole
    packet into the data buffer in the WQ, in this case the SHAMPO CQE handler
    couldn't use the header entry to build the SKB, instead it should allocate
    a new memory to build the SKB using the function:
    mlx5e_skb_from_cqe_mpwrq_nonlinear.
    
    Fixes: f97d5c2a453e ("net/mlx5e: Add handle SHAMPO cqe support")
    Signed-off-by: Khalid Manaa <khalidm@nvidia.com>
    Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 66c6dbfc8493fc1b0192f74c12d417d911ee632b
Author: Khalid Manaa <khalidm@nvidia.com>
Date:   Wed Jan 26 14:14:58 2022 +0200

    net/mlx5e: Fix wrong calculation of header index in HW_GRO
    
    commit b8d91145ed7cfa046cc07bcfb277465b9d45da73 upstream.
    
    The HW doesn't wrap the CQE.shampo.header_index field according to the
    headers buffer size, instead it always increases it until reaching overflow
    of u16 size.
    
    Thus the mlx5e_handle_rx_cqe_mpwrq_shampo handler should mask the
    CQE header_index field to find the actual header index in the headers buffer.
    
    Fixes: f97d5c2a453e ("net/mlx5e: Add handle SHAMPO cqe support")
    Signed-off-by: Khalid Manaa <khalidm@nvidia.com>
    Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 8fbdf8c8b8ab82beab882175157650452c46493e
Author: Kees Cook <keescook@chromium.org>
Date:   Mon Jan 24 09:20:28 2022 -0800

    net/mlx5e: Avoid field-overflowing memcpy()
    
    commit ad5185735f7dab342fdd0dd41044da4c9ccfef67 upstream.
    
    In preparation for FORTIFY_SOURCE performing compile-time and run-time
    field bounds checking for memcpy(), memmove(), and memset(), avoid
    intentionally writing across neighboring fields.
    
    Use flexible arrays instead of zero-element arrays (which look like they
    are always overflowing) and split the cross-field memcpy() into two halves
    that can be appropriately bounds-checked by the compiler.
    
    We were doing:
    
            #define ETH_HLEN  14
            #define VLAN_HLEN  4
            ...
            #define MLX5E_XDP_MIN_INLINE (ETH_HLEN + VLAN_HLEN)
            ...
            struct mlx5e_tx_wqe      *wqe  = mlx5_wq_cyc_get_wqe(wq, pi);
            ...
            struct mlx5_wqe_eth_seg  *eseg = &wqe->eth;
            struct mlx5_wqe_data_seg *dseg = wqe->data;
            ...
            memcpy(eseg->inline_hdr.start, xdptxd->data, MLX5E_XDP_MIN_INLINE);
    
    target is wqe->eth.inline_hdr.start (which the compiler sees as being
    2 bytes in size), but copying 18, intending to write across start
    (really vlan_tci, 2 bytes). The remaining 16 bytes get written into
    wqe->data[0], covering byte_count (4 bytes), lkey (4 bytes), and addr
    (8 bytes).
    
    struct mlx5e_tx_wqe {
            struct mlx5_wqe_ctrl_seg   ctrl;                 /*     0    16 */
            struct mlx5_wqe_eth_seg    eth;                  /*    16    16 */
            struct mlx5_wqe_data_seg   data[];               /*    32     0 */
    
            /* size: 32, cachelines: 1, members: 3 */
            /* last cacheline: 32 bytes */
    };
    
    struct mlx5_wqe_eth_seg {
            u8                         swp_outer_l4_offset;  /*     0     1 */
            u8                         swp_outer_l3_offset;  /*     1     1 */
            u8                         swp_inner_l4_offset;  /*     2     1 */
            u8                         swp_inner_l3_offset;  /*     3     1 */
            u8                         cs_flags;             /*     4     1 */
            u8                         swp_flags;            /*     5     1 */
            __be16                     mss;                  /*     6     2 */
            __be32                     flow_table_metadata;  /*     8     4 */
            union {
                    struct {
                            __be16     sz;                   /*    12     2 */
                            u8         start[2];             /*    14     2 */
                    } inline_hdr;                            /*    12     4 */
                    struct {
                            __be16     type;                 /*    12     2 */
                            __be16     vlan_tci;             /*    14     2 */
                    } insert;                                /*    12     4 */
                    __be32             trailer;              /*    12     4 */
            };                                               /*    12     4 */
    
            /* size: 16, cachelines: 1, members: 9 */
            /* last cacheline: 16 bytes */
    };
    
    struct mlx5_wqe_data_seg {
            __be32                     byte_count;           /*     0     4 */
            __be32                     lkey;                 /*     4     4 */
            __be64                     addr;                 /*     8     8 */
    
            /* size: 16, cachelines: 1, members: 3 */
            /* last cacheline: 16 bytes */
    };
    
    So, split the memcpy() so the compiler can reason about the buffer
    sizes.
    
    "pahole" shows no size nor member offset changes to struct mlx5e_tx_wqe
    nor struct mlx5e_umr_wqe. "objdump -d" shows no meaningful object
    code changes (i.e. only source line number induced differences and
    optimizations).
    
    Fixes: b5503b994ed5 ("net/mlx5e: XDP TX forwarding support")
    Signed-off-by: Kees Cook <keescook@chromium.org>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit aff510fd826568f46f388340581689baa309be96
Author: Roi Dayan <roid@nvidia.com>
Date:   Mon Jan 24 13:56:26 2022 +0200

    net/mlx5: Bridge, Fix devlink deadlock on net namespace deletion
    
    commit 880b517691908fb753019b9b27cd082e7617debd upstream.
    
    When changing mode to switchdev, rep bridge init registered to netdevice
    notifier holds the devlink lock and then takes pernet_ops_rwsem.
    At that time deleting a netns holds pernet_ops_rwsem and then takes
    the devlink lock.
    
    Example sequence is:
    $ ip netns add foo
    $ devlink dev eswitch set pci/0000:00:08.0 mode switchdev &
    $ ip netns del foo
    
    deleting netns trace:
    
    [ 1185.365555]  ? devlink_pernet_pre_exit+0x74/0x1c0
    [ 1185.368331]  ? mutex_lock_io_nested+0x13f0/0x13f0
    [ 1185.370984]  ? xt_find_table+0x40/0x100
    [ 1185.373244]  ? __mutex_lock+0x24a/0x15a0
    [ 1185.375494]  ? net_generic+0xa0/0x1c0
    [ 1185.376844]  ? wait_for_completion_io+0x280/0x280
    [ 1185.377767]  ? devlink_pernet_pre_exit+0x74/0x1c0
    [ 1185.378686]  devlink_pernet_pre_exit+0x74/0x1c0
    [ 1185.379579]  ? devlink_nl_cmd_get_dumpit+0x3a0/0x3a0
    [ 1185.380557]  ? xt_find_table+0xda/0x100
    [ 1185.381367]  cleanup_net+0x372/0x8e0
    
    changing mode to switchdev trace:
    
    [ 1185.411267]  down_write+0x13a/0x150
    [ 1185.412029]  ? down_write_killable+0x180/0x180
    [ 1185.413005]  register_netdevice_notifier+0x1e/0x210
    [ 1185.414000]  mlx5e_rep_bridge_init+0x181/0x360 [mlx5_core]
    [ 1185.415243]  mlx5e_uplink_rep_enable+0x269/0x480 [mlx5_core]
    [ 1185.416464]  ? mlx5e_uplink_rep_disable+0x210/0x210 [mlx5_core]
    [ 1185.417749]  mlx5e_attach_netdev+0x232/0x400 [mlx5_core]
    [ 1185.418906]  mlx5e_netdev_attach_profile+0x15b/0x1e0 [mlx5_core]
    [ 1185.420172]  mlx5e_netdev_change_profile+0x15a/0x1d0 [mlx5_core]
    [ 1185.421459]  mlx5e_vport_rep_load+0x557/0x780 [mlx5_core]
    [ 1185.422624]  ? mlx5e_stats_grp_vport_rep_num_stats+0x10/0x10 [mlx5_core]
    [ 1185.424006]  mlx5_esw_offloads_rep_load+0xdb/0x190 [mlx5_core]
    [ 1185.425277]  esw_offloads_enable+0xd74/0x14a0 [mlx5_core]
    
    Fix this by registering rep bridges for per net netdev notifier
    instead of global one, which operats on the net namespace without holding
    the pernet_ops_rwsem.
    
    Fixes: 19e9bfa044f3 ("net/mlx5: Bridge, add offload infrastructure")
    Signed-off-by: Roi Dayan <roid@nvidia.com>
    Reviewed-by: Vlad Buslov <vladbu@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c65d61fc54b243c99f100d72b768ca474a51d4ad
Author: Maxim Mikityanskiy <maximmi@nvidia.com>
Date:   Tue Jan 18 13:31:54 2022 +0200

    net/mlx5e: Don't treat small ceil values as unlimited in HTB offload
    
    commit 736dfe4e68b868829a1e89dfef4a44c1580d4478 upstream.
    
    The hardware spec defines max_average_bw == 0 as "unlimited bandwidth".
    max_average_bw is calculated as `ceil / BYTES_IN_MBIT`, which can become
    0 when ceil is small, leading to an undesired effect of having no
    bandwidth limit.
    
    This commit fixes it by rounding up small values of ceil to 1 Mbit/s.
    
    Fixes: 214baf22870c ("net/mlx5e: Support HTB offload")
    Signed-off-by: Maxim Mikityanskiy <maximmi@nvidia.com>
    Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit d7080574a77d898fb4de28c0e49026ee15188104
Author: Dima Chumak <dchumak@nvidia.com>
Date:   Mon Jan 17 15:32:16 2022 +0200

    net/mlx5: Fix offloading with ESWITCH_IPV4_TTL_MODIFY_ENABLE
    
    commit 55b2ca702cfa744a9eb108915996a2294da47e71 upstream.
    
    Only prio 1 is supported for nic mode when there is no ignore flow level
    support in firmware. But for switchdev mode, which supports fixed number
    of statically pre-allocated prios, this restriction is not relevant so
    it can be relaxed.
    
    Fixes: d671e109bd85 ("net/mlx5: Fix tc max supported prio for nic mode")
    Signed-off-by: Dima Chumak <dchumak@nvidia.com>
    Reviewed-by: Roi Dayan <roid@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c5a4a32fbe4bb93740843828421cea08c47f5bdb
Author: Roi Dayan <roid@nvidia.com>
Date:   Mon Jan 17 15:00:30 2022 +0200

    net/mlx5e: TC, Reject rules with forward and drop actions
    
    commit 5623ef8a118838aae65363750dfafcba734dc8cb upstream.
    
    Such rules are redundant but allowed and passed to the driver.
    The driver does not support offloading such rules so return an error.
    
    Fixes: 03a9d11e6eeb ("net/mlx5e: Add TC drop and mirred/redirect action parsing for SRIOV offloads")
    Signed-off-by: Roi Dayan <roid@nvidia.com>
    Reviewed-by: Oz Shlomo <ozsh@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 59879f296a84428fc999e5686b695a6c53f2e2d1
Author: Gal Pressman <gal@nvidia.com>
Date:   Sun Jan 16 09:07:22 2022 +0200

    net/mlx5e: Fix module EEPROM query
    
    commit 4a08a131351e375a2969b98e46df260ed04dcba7 upstream.
    
    When querying the module EEPROM, there was a misusage of the 'offset'
    variable vs the 'query.offset' field.
    Fix that by always using 'offset' and assigning its value to
    'query.offset' right before the mcia register read call.
    
    While at it, the cross-pages read size adjustment was changed to be more
    intuitive.
    
    Fixes: e19b0a3474ab ("net/mlx5: Refactor module EEPROM query")
    Reported-by: Wang Yugui <wangyugui@e16-tech.com>
    Signed-off-by: Gal Pressman <gal@nvidia.com>
    Reviewed-by: Maxim Mikityanskiy <maximmi@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2a038dd1d942f8fbc495c58fa592ff24af05f1c2
Author: Maher Sanalla <msanalla@nvidia.com>
Date:   Thu Jan 13 15:48:48 2022 +0200

    net/mlx5: Use del_timer_sync in fw reset flow of halting poll
    
    commit 3c5193a87b0fea090aa3f769d020337662d87b5e upstream.
    
    Substitute del_timer() with del_timer_sync() in fw reset polling
    deactivation flow, in order to prevent a race condition which occurs
    when del_timer() is called and timer is deactivated while another
    process is handling the timer interrupt. A situation that led to
    the following call trace:
            RIP: 0010:run_timer_softirq+0x137/0x420
            <IRQ>
            recalibrate_cpu_khz+0x10/0x10
            ktime_get+0x3e/0xa0
            ? sched_clock_cpu+0xb/0xc0
            __do_softirq+0xf5/0x2ea
            irq_exit_rcu+0xc1/0xf0
            sysvec_apic_timer_interrupt+0x9e/0xc0
            asm_sysvec_apic_timer_interrupt+0x12/0x20
            </IRQ>
    
    Fixes: 38b9f903f22b ("net/mlx5: Handle sync reset request event")
    Signed-off-by: Maher Sanalla <msanalla@nvidia.com>
    Reviewed-by: Moshe Shemesh <moshe@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 4fad499d7fece448e7230d5e5b92f6d8a073e0bb
Author: Maor Dickman <maord@nvidia.com>
Date:   Thu Jan 13 15:11:42 2022 +0200

    net/mlx5e: Fix handling of wrong devices during bond netevent
    
    commit ec41332e02bd0acf1f24206867bb6a02f5877a62 upstream.
    
    Current implementation of bond netevent handler only check if
    the handled netdev is VF representor and it missing a check if
    the VF representor is on the same phys device of the bond handling
    the netevent.
    
    Fix by adding the missing check and optimizing the check if
    the netdev is VF representor so it will not access uninitialized
    private data and crashes.
    
    BUG: kernel NULL pointer dereference, address: 000000000000036c
    PGD 0 P4D 0
    Oops: 0000 [#1] SMP NOPTI
    Workqueue: eth3bond0 bond_mii_monitor [bonding]
    RIP: 0010:mlx5e_is_uplink_rep+0xc/0x50 [mlx5_core]
    RSP: 0018:ffff88812d69fd60 EFLAGS: 00010282
    RAX: 0000000000000000 RBX: ffff8881cf800000 RCX: 0000000000000000
    RDX: ffff88812d69fe10 RSI: 000000000000001b RDI: ffff8881cf800880
    RBP: ffff8881cf800000 R08: 00000445cabccf2b R09: 0000000000000008
    R10: 0000000000000004 R11: 0000000000000008 R12: ffff88812d69fe10
    R13: 00000000fffffffe R14: ffff88820c0f9000 R15: 0000000000000000
    FS:  0000000000000000(0000) GS:ffff88846fb00000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 000000000000036c CR3: 0000000103d80006 CR4: 0000000000370ea0
    DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Call Trace:
     mlx5e_eswitch_uplink_rep+0x31/0x40 [mlx5_core]
     mlx5e_rep_is_lag_netdev+0x94/0xc0 [mlx5_core]
     mlx5e_rep_esw_bond_netevent+0xeb/0x3d0 [mlx5_core]
     raw_notifier_call_chain+0x41/0x60
     call_netdevice_notifiers_info+0x34/0x80
     netdev_lower_state_changed+0x4e/0xa0
     bond_mii_monitor+0x56b/0x640 [bonding]
     process_one_work+0x1b9/0x390
     worker_thread+0x4d/0x3d0
     ? rescuer_thread+0x350/0x350
     kthread+0x124/0x150
     ? set_kthread_struct+0x40/0x40
     ret_from_fork+0x1f/0x30
    
    Fixes: 7e51891a237f ("net/mlx5e: Use netdev events to set/del egress acl forward-to-vport rule")
    Signed-off-by: Maor Dickman <maord@nvidia.com>
    Reviewed-by: Roi Dayan <roid@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit ee719d38eca38d1af4e4a74ca7ad84f87fb56bb5
Author: Vlad Buslov <vladbu@nvidia.com>
Date:   Thu Jan 6 18:45:26 2022 +0200

    net/mlx5: Bridge, ensure dev_name is null-terminated
    
    commit 350d9a823734b5a7e767cddc3bdde5f0bcbb7ff4 upstream.
    
    Even though net_device->name is guaranteed to be null-terminated string of
    size<=IFNAMSIZ, the test robot complains that return value of netdev_name()
    can be larger:
    
    In file included from include/trace/define_trace.h:102,
                        from drivers/net/ethernet/mellanox/mlx5/core/esw/diag/bridge_tracepoint.h:113,
                        from drivers/net/ethernet/mellanox/mlx5/core/esw/bridge.c:12:
       drivers/net/ethernet/mellanox/mlx5/core/esw/diag/bridge_tracepoint.h: In function 'trace_event_raw_event_mlx5_esw_bridge_fdb_template':
    >> drivers/net/ethernet/mellanox/mlx5/core/esw/diag/bridge_tracepoint.h:24:29: warning: 'strncpy' output may be truncated copying 16 bytes from a string of length 20 [-Wstringop-truncation]
          24 |                             strncpy(__entry->dev_name,
             |                             ^~~~~~~~~~~~~~~~~~~~~~~~~~
          25 |                                     netdev_name(fdb->dev),
             |                                     ~~~~~~~~~~~~~~~~~~~~~~
          26 |                                     IFNAMSIZ);
             |                                     ~~~~~~~~~
    
    This is caused by the fact that default value of IFNAMSIZ is 16, while
    placeholder value that is returned by netdev_name() for unnamed net devices
    is larger than that.
    
    The offending code is in a tracing function that is only called for mlx5
    representors, so there is no straightforward way to reproduce the issue but
    let's fix it for correctness sake by replacing strncpy() with strscpy() to
    ensure that resulting string is always null-terminated.
    
    Fixes: 9724fd5d9c2a ("net/mlx5: Bridge, add tracepoints")
    Reported-by: kernel test robot <lkp@intel.com>
    Signed-off-by: Vlad Buslov <vladbu@nvidia.com>
    Reviewed-by: Roi Dayan <roid@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit b651c3642c77f8c4e9d30233a422d6bbe26c7395
Author: Vlad Buslov <vladbu@nvidia.com>
Date:   Thu Jan 6 16:40:18 2022 +0200

    net/mlx5: Bridge, take rtnl lock in init error handler
    
    commit 04f8c12f031fcd0ffa0c72822eb665ceb2c872e7 upstream.
    
    The mlx5_esw_bridge_cleanup() is expected to be called with rtnl lock
    taken, which is true for mlx5e_rep_bridge_cleanup() function but not for
    error handling code in mlx5e_rep_bridge_init(). Add missing rtnl
    lock/unlock calls and extend both mlx5_esw_bridge_cleanup() and its dual
    function mlx5_esw_bridge_init() with ASSERT_RTNL() to verify the invariant
    from now on.
    
    Fixes: 7cd6a54a8285 ("net/mlx5: Bridge, handle FDB events")
    Fixes: 19e9bfa044f3 ("net/mlx5: Bridge, add offload infrastructure")
    Signed-off-by: Vlad Buslov <vladbu@nvidia.com>
    Reviewed-by: Roi Dayan <roid@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 5ddd37044b9e6f93e66acc48c49593cb49b35bda
Author: Roi Dayan <roid@nvidia.com>
Date:   Tue Jan 4 10:38:02 2022 +0200

    net/mlx5e: TC, Reject rules with drop and modify hdr action
    
    commit a2446bc77a16cefd27de712d28af2396d6287593 upstream.
    
    This kind of action is not supported by firmware and generates a
    syndrome.
    
    kernel: mlx5_core 0000:08:00.0: mlx5_cmd_check:777:(pid 102063): SET_FLOW_TABLE_ENTRY(0x936) op_mod(0x0) failed, status bad parameter(0x3), syndrome (0x8708c3)
    
    Fixes: d7e75a325cb2 ("net/mlx5e: Add offloading of E-Switch TC pedit (header re-write) actions")
    Signed-off-by: Roi Dayan <roid@nvidia.com>
    Reviewed-by: Oz Shlomo <ozsh@nvidia.com>
    Reviewed-by: Maor Dickman <maord@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 15184af88d8ab02cd34bc4b18c83b6cd798079c6
Author: Raed Salem <raeds@nvidia.com>
Date:   Thu Dec 2 17:49:01 2021 +0200

    net/mlx5e: IPsec: Fix tunnel mode crypto offload for non TCP/UDP traffic
    
    commit de47db0cf7f4a9c555ad204e06baa70b50a70d08 upstream.
    
    IPsec Tunnel mode crypto offload software parser (SWP) setting in data
    path currently always set the inner L4 offset regardless of the
    encapsulated L4 header type and whether it exists in the first place,
    this breaks non TCP/UDP traffic as such.
    
    Set the SWP inner L4 offset only when the IPsec tunnel encapsulated L4
    header protocol is TCP/UDP.
    
    While at it fix inner ip protocol read for setting MLX5_ETH_WQE_SWP_INNER_L4_UDP
    flag to address the case where the ip header protocol is IPv6.
    
    Fixes: f1267798c980 ("net/mlx5: Fix checksum issue of VXLAN and IPsec crypto offload")
    Signed-off-by: Raed Salem <raeds@nvidia.com>
    Reviewed-by: Maor Dickman <maord@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 95dde75d5207a66f18e6e7e0b995c152579dd7fe
Author: Raed Salem <raeds@nvidia.com>
Date:   Thu Dec 2 17:43:50 2021 +0200

    net/mlx5e: IPsec: Fix crypto offload for non TCP/UDP encapsulated traffic
    
    commit 5352859b3bfa0ca188b2f1d2c1436fddc781e3b6 upstream.
    
    IPsec crypto offload always set the ethernet segment checksum flags with
    the inner L4 header checksum flag enabled for encapsulated IPsec offloaded
    packet regardless of the encapsulated L4 header type, and even if it
    doesn't exists in the first place, this breaks non TCP/UDP traffic as
    such.
    
    Set the inner L4 checksum flag only when the encapsulated L4 header
    protocol is TCP/UDP using software parser swp_inner_l4_offset field as
    indication.
    
    Fixes: 5cfb540ef27b ("net/mlx5e: Set IPsec WAs only in IP's non checksum partial case.")
    Signed-off-by: Raed Salem <raeds@nvidia.com>
    Reviewed-by: Maor Dickman <maord@nvidia.com>
    Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 41332593b54910ac145142e4c86894c785f9c840
Author: J. Bruce Fields <bfields@redhat.com>
Date:   Tue Jan 18 17:00:51 2022 -0500

    lockd: fix failure to cleanup client locks
    
    commit d19a7af73b5ecaac8168712d18be72b9db166768 upstream.
    
    In my testing, we're sometimes hitting the request->fl_flags & FL_EXISTS
    case in posix_lock_inode, presumably just by random luck since we're not
    actually initializing fl_flags here.
    
    This probably didn't matter before commit 7f024fcd5c97 ("Keep read and
    write fds with each nlm_file") since we wouldn't previously unlock
    unless we knew there were locks.
    
    But now it causes lockd to give up on removing more locks.
    
    We could just initialize fl_flags, but really it seems dubious to be
    calling vfs_lock_file with random values in some of the fields.
    
    Fixes: 7f024fcd5c97 ("Keep read and write fds with each nlm_file")
    Signed-off-by: J. Bruce Fields <bfields@redhat.com>
    [ cel: fixed checkpatch.pl nit ]
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 75fe271b2f41352a028d3c4a528decd6ea098bc8
Author: J. Bruce Fields <bfields@redhat.com>
Date:   Tue Jan 18 17:00:16 2022 -0500

    lockd: fix server crash on reboot of client holding lock
    
    commit 6e7f90d163afa8fc2efd6ae318e7c20156a5621f upstream.
    
    I thought I was iterating over the array when actually the iteration is
    over the values contained in the array?
    
    Ugh, keep it simple.
    
    Symptoms were a null deference in vfs_lock_file() when an NFSv3 client
    that previously held a lock came back up and sent a notify.
    
    Reported-by: Jonathan Woithe <jwoithe@just42.net>
    Fixes: 7f024fcd5c97 ("Keep read and write fds with each nlm_file")
    Signed-off-by: J. Bruce Fields <bfields@redhat.com>
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 20842a8d93e3a595dbb49038588e4c994a500ccb
Author: Miklos Szeredi <mszeredi@redhat.com>
Date:   Fri Jan 14 16:57:56 2022 +0100

    ovl: don't fail copy up if no fileattr support on upper
    
    commit 94fd19752b28aa66c98e7991734af91dfc529f8f upstream.
    
    Christoph Fritz is reporting that failure to copy up fileattr when upper
    doesn't support fileattr or xattr results in a regression.
    
    Return success in these failure cases; this reverts overlayfs to the old
    behavior.
    
    Add a pr_warn_once() in these cases to still let the user know about the
    copy up failures.
    
    Reported-by: Christoph Fritz <chf.fritz@googlemail.com>
    Fixes: 72db82115d2b ("ovl: copy up sync/noatime fileattr flags")
    Cc: <stable@vger.kernel.org> # v5.15
    Signed-off-by: Miklos Szeredi <mszeredi@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2c6be97ad899ad1c1632b6138c2547e53e733a30
Author: Jonathan McDowell <noodles@earth.li>
Date:   Mon Jan 31 13:56:41 2022 +0000

    net: phy: Fix qca8081 with speeds lower than 2.5Gb/s
    
    commit 881cc731df6af99a21622e9be25a23b81adcd10b upstream.
    
    A typo in qca808x_read_status means we try to set SMII mode on the port
    rather than SGMII when the link speed is not 2.5Gb/s. This results in no
    traffic due to the mismatch in configuration between the phy and the
    mac.
    
    v2:
     Only change interface mode when the link is up
    
    Fixes: 79c7bc0521545 ("net: phy: add qca8081 read_status")
    Cc: stable@vger.kernel.org
    Signed-off-by: Jonathan McDowell <noodles@earth.li>
    Reviewed-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit b780215397628fec83cabbb23dae332dc8f8fc41
Author: John Hubbard <jhubbard@nvidia.com>
Date:   Tue Feb 1 19:23:17 2022 -0800

    Revert "mm/gup: small refactoring: simplify try_grab_page()"
    
    commit c36c04c2e132fc39f6b658bf607aed4425427fd7 upstream.
    
    This reverts commit 54d516b1d62ff8f17cee2da06e5e4706a0d00b8a
    
    That commit did a refactoring that effectively combined fast and slow
    gup paths (again).  And that was again incorrect, for two reasons:
    
     a) Fast gup and slow gup get reference counts on pages in different
        ways and with different goals: see Linus' writeup in commit
        cd1adf1b63a1 ("Revert "mm/gup: remove try_get_page(), call
        try_get_compound_head() directly""), and
    
     b) try_grab_compound_head() also has a specific check for
        "FOLL_LONGTERM && !is_pinned(page)", that assumes that the caller
        can fall back to slow gup. This resulted in new failures, as
        recently report by Will McVicker [1].
    
    But (a) has problems too, even though they may not have been reported
    yet.  So just revert this.
    
    Link: https://lore.kernel.org/r/20220131203504.3458775-1-willmcvicker@google.com [1]
    Fixes: 54d516b1d62f ("mm/gup: small refactoring: simplify try_grab_page()")
    Reported-and-tested-by: Will McVicker <willmcvicker@google.com>
    Cc: Christoph Hellwig <hch@lst.de>
    Cc: Minchan Kim <minchan@google.com>
    Cc: Matthew Wilcox <willy@infradead.org>
    Cc: Christian Borntraeger <borntraeger@de.ibm.com>
    Cc: Heiko Carstens <hca@linux.ibm.com>
    Cc: Vasily Gorbik <gor@linux.ibm.com>
    Cc: stable@vger.kernel.org # 5.15
    Signed-off-by: John Hubbard <jhubbard@nvidia.com>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 9c9dbb954e618e3d9110f13cc02c5db1fb73ea5d
Author: Eric W. Biederman <ebiederm@xmission.com>
Date:   Thu Jan 20 11:04:01 2022 -0600

    cgroup-v1: Require capabilities to set release_agent
    
    commit 24f6008564183aa120d07c03d9289519c2fe02af upstream.
    
    The cgroup release_agent is called with call_usermodehelper.  The function
    call_usermodehelper starts the release_agent with a full set fo capabilities.
    Therefore require capabilities when setting the release_agaent.
    
    Reported-by: Tabitha Sable <tabitha.c.sable@gmail.com>
    Tested-by: Tabitha Sable <tabitha.c.sable@gmail.com>
    Fixes: 81a6a5cdd2c5 ("Task Control Groups: automatic userspace notification of idle cgroups")
    Cc: stable@vger.kernel.org # v2.6.24+
    Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 3a63b718200be5b1997f6eb28e5e688ec58ec41b
Author: Maxime Ripard <maxime@cerno.tech>
Date:   Thu Aug 19 15:59:30 2021 +0200

    drm/vc4: hdmi: Make sure the device is powered with CEC
    
    Commit 20b0dfa86bef0e80b41b0e5ac38b92f23b6f27f9 upstream.
    
    The original commit depended on a rework commit (724fc856c09e ("drm/vc4:
    hdmi: Split the CEC disable / enable functions in two")) that
    (rightfully) didn't reach stable.
    
    However, probably because the context changed, when the patch was
    applied to stable the pm_runtime_put called got moved to the end of the
    vc4_hdmi_cec_adap_enable function (that would have become
    vc4_hdmi_cec_disable with the rework) to vc4_hdmi_cec_init.
    
    This means that at probe time, we now drop our reference to the clocks
    and power domains and thus end up with a CPU hang when the CPU tries to
    access registers.
    
    The call to pm_runtime_resume_and_get() is also problematic since the
    .adap_enable CEC hook is called both to enable and to disable the
    controller. That means that we'll now call pm_runtime_resume_and_get()
    at disable time as well, messing with the reference counting.
    
    The behaviour we should have though would be to have
    pm_runtime_resume_and_get() called when the CEC controller is enabled,
    and pm_runtime_put when it's disabled.
    
    We need to move things around a bit to behave that way, but it aligns
    stable with upstream.
    
    Cc: <stable@vger.kernel.org> # 5.10.x
    Cc: <stable@vger.kernel.org> # 5.15.x
    Cc: <stable@vger.kernel.org> # 5.16.x
    Reported-by: Michael Stapelberg <michael+drm@stapelberg.ch>
    Signed-off-by: Maxime Ripard <maxime@cerno.tech>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 62600b0fbc6e69adfec2fca0fb6c69d10b72204b
Author: Alex Elder <elder@linaro.org>
Date:   Wed Jan 12 07:30:12 2022 -0600

    net: ipa: prevent concurrent replenish
    
    commit 998c0bd2b3715244da7639cc4e6a2062cb79c3f4 upstream.
    
    We have seen cases where an endpoint RX completion interrupt arrives
    while replenishing for the endpoint is underway.  This causes another
    instance of replenishing to begin as part of completing the receive
    transaction.  If this occurs it can lead to transaction corruption.
    
    Use a new flag to ensure only one replenish instance for an endpoint
    executes at a time.
    
    Fixes: 84f9bd12d46db ("soc: qcom: ipa: IPA endpoints")
    Signed-off-by: Alex Elder <elder@linaro.org>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 74a0e011912021740c46771a3d8c494440d4688c
Author: Alex Elder <elder@linaro.org>
Date:   Wed Jan 12 07:30:11 2022 -0600

    net: ipa: use a bitmap for endpoint replenish_enabled
    
    commit c1aaa01dbf4cef95af3e04a5a43986c290e06ea3 upstream.
    
    Define a new replenish_flags bitmap to contain Boolean flags
    associated with an endpoint's replenishing state.  Replace the
    replenish_enabled field with a flag in that bitmap.  This is to
    prepare for the next patch, which adds another flag.
    
    Signed-off-by: Alex Elder <elder@linaro.org>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 1223edb9e97e75b0d20bffe9d289c4f5c5512aab
Author: Paolo Abeni <pabeni@redhat.com>
Date:   Thu Jan 20 16:35:29 2022 -0800

    selftests: mptcp: fix ipv6 routing setup
    
    commit 9846921dba4936d92f7608315b5d1e0a8ec3a538 upstream.
    
    MPJ ipv6 selftests currently lack per link route to the server
    net. Additionally, ipv6 subflows endpoints are created without any
    interface specified. The end-result is that in ipv6 self-tests
    subflows are created all on the same link, leading to expected delays
    and sporadic self-tests failures.
    
    Fix the issue by adding the missing setup bits.
    
    Fixes: 523514ed0a99 ("selftests: mptcp: add ADD_ADDR IPv6 test cases")
    Reported-and-tested-by: Geliang Tang <geliang.tang@suse.com>
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Mat Martineau <mathew.j.martineau@linux.intel.com>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 6d6f1f0dac3e3441ecdb1103d4efb11b9ed24dd5
Author: Lukas Wunner <lukas@wunner.de>
Date:   Wed Nov 17 23:22:09 2021 +0100

    PCI: pciehp: Fix infinite loop in IRQ handler upon power fault
    
    commit 23584c1ed3e15a6f4bfab8dc5a88d94ab929ee12 upstream.
    
    The Power Fault Detected bit in the Slot Status register differs from
    all other hotplug events in that it is sticky:  It can only be cleared
    after turning off slot power.  Per PCIe r5.0, sec. 6.7.1.8:
    
      If a power controller detects a main power fault on the hot-plug slot,
      it must automatically set its internal main power fault latch [...].
      The main power fault latch is cleared when software turns off power to
      the hot-plug slot.
    
    The stickiness used to cause interrupt storms and infinite loops which
    were fixed in 2009 by commits 5651c48cfafe ("PCI pciehp: fix power fault
    interrupt storm problem") and 99f0169c17f3 ("PCI: pciehp: enable
    software notification on empty slots").
    
    Unfortunately in 2020 the infinite loop issue was inadvertently
    reintroduced by commit 8edf5332c393 ("PCI: pciehp: Fix MSI interrupt
    race"):  The hardirq handler pciehp_isr() clears the PFD bit until
    pciehp's power_fault_detected flag is set.  That happens in the IRQ
    thread pciehp_ist(), which never learns of the event because the hardirq
    handler is stuck in an infinite loop.  Fix by setting the
    power_fault_detected flag already in the hardirq handler.
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=214989
    Link: https://lore.kernel.org/linux-pci/DM8PR11MB5702255A6A92F735D90A4446868B9@DM8PR11MB5702.namprd11.prod.outlook.com
    Fixes: 8edf5332c393 ("PCI: pciehp: Fix MSI interrupt race")
    Link: https://lore.kernel.org/r/66eaeef31d4997ceea357ad93259f290ededecfd.1637187226.git.lukas@wunner.de
    Reported-by: Joseph Bao <joseph.bao@intel.com>
    Tested-by: Joseph Bao <joseph.bao@intel.com>
    Signed-off-by: Lukas Wunner <lukas@wunner.de>
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    Cc: stable@vger.kernel.org # v4.19+
    Cc: Stuart Hayes <stuart.w.hayes@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>