1. 12 Apr, 2021 10 commits
  2. 11 Apr, 2021 20 commits
    • David S. Miller's avatar
      Merge branch 'ipa-next' · 5b489fea
      David S. Miller authored
      Alex Elder says:
      
      ====================
      net: ipa: support two more platforms
      
      This series adds IPA support for two more Qualcomm SoCs.
      
      The first patch updates the DT binding to add compatible strings.
      
      The second temporarily disables checksum offload support for IPA
      version 4.5 and above.  Changes are required to the RMNet driver
      to support the "inline" checksum offload used for IPA v4.5+, and
      once those are present this capability will be enabled for IPA.
      
      The third and fourth patches add configuration data for IPA versions
      4.5 (used for the SDX55 SoC) and 4.11 (used for the SD7280 SoC).
      ====================
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      5b489fea
    • Alex Elder's avatar
      net: ipa: add IPA v4.11 configuration data · 927c5043
      Alex Elder authored
      Add support for the SC7280 SoC, which includes IPA version 4.11.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      927c5043
    • Alex Elder's avatar
      net: ipa: add IPA v4.5 configuration data · fbb763e7
      Alex Elder authored
      Add support for the SDX55 SoC, which includes IPA version 4.5.
      
      Starting with IPA v4.5, a few of the memory regions have a different
      number of "canary" values; update comments in the where the region
      identifers are defined to accurately reflect that.
      
      I'll note three differences in SDX55 versus the other two existing
      platforms (SDM845 and SC7180):
        - SDX55 uses a 32-bit Linux kernel
        - SDX55 has four interconnects rather than three
        - SDX55 uses IPA v4.5, which uses inline checksum offload
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      fbb763e7
    • Alex Elder's avatar
      net: ipa: disable checksum offload for IPA v4.5+ · c88c34fc
      Alex Elder authored
      Checksum offload for IPA v4.5+ is implemented differently, using
      "inline" offload (which uses a common header format for both upload
      and download offload).
      
      The IPA hardware must be programmed to enable MAP checksum offload,
      but the RMNet driver is responsible for interpreting checksum
      metadata supplied with messages.
      
      Currently, the RMNet driver does not support inline checksum offload.
      This support is imminent, but until it is available, do not allow
      newer versions of IPA to specify checksum offload for endpoints.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      c88c34fc
    • Alex Elder's avatar
      dt-bindings: net: qcom,ipa: add some compatible strings · c3264fee
      Alex Elder authored
      Add existing supported platform "qcom,sc7180-ipa" to the set of IPA
      compatible strings.  Also add newly-supported "qcom,sdx55-ipa",
      "qcom,sc7280-ipa".
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      c3264fee
    • Qiheng Lin's avatar
      ehea: add missing MODULE_DEVICE_TABLE · 95291ced
      Qiheng Lin authored
      This patch adds missing MODULE_DEVICE_TABLE definition which generates
      correct modalias for automatic loading of this driver when it is built
      as an external module.
      Reported-by: default avatarHulk Robot <hulkci@huawei.com>
      Signed-off-by: default avatarQiheng Lin <linqiheng@huawei.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      95291ced
    • David S. Miller's avatar
      Merge branch 'veth-gro' · 23cfa4d4
      David S. Miller authored
      Paolo Abeni  says:
      
      ====================
      veth: allow GRO even without XDP
      
      This series allows the user-space to enable GRO/NAPI on a veth
      device even without attaching an XDP program.
      
      It does not change the default veth behavior (no NAPI, no GRO),
      except that the GRO feature bit on top of this series will be
      effectively off by default on veth devices. Note that currently
      the GRO bit is on by default, but GRO never takes place in
      absence of XDP.
      
      On top of this series, setting the GRO feature bit enables NAPI
      and allows the GRO to take place. The TSO features on the peer
      device are preserved.
      
      The main goal is improving UDP forwarding performances for
      containers in a typical virtual network setup:
      
      (container) veth -> veth peer -> bridge/ovs -> vxlan -> NIC
      
      Enabling the NAPI threaded mode, GRO the NETIF_F_GRO_UDP_FWD
      feature on the veth peer improves the UDP stream performance
      with not void netfilter configuration by 2x factor with no
      measurable overhead for TCP traffic: some heuristic ensures
      that TCP will not go through the additional NAPI/GRO layer.
      
      Some self-tests are added to check the expected behavior in
      the default configuration, with XDP and with plain GRO enabled.
      ====================
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      23cfa4d4
    • Paolo Abeni's avatar
      self-tests: add veth tests · 1c3cadbe
      Paolo Abeni authored
      Add some basic veth tests, that verify the expected flags and
      aggregation with different setups (default, xdp, etc...)
      Signed-off-by: default avatarPaolo Abeni <pabeni@redhat.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      1c3cadbe
    • Paolo Abeni's avatar
      veth: refine napi usage · 47e550e0
      Paolo Abeni authored
      After the previous patch, when enabling GRO, locally generated
      TCP traffic experiences some measurable overhead, as it traverses
      the GRO engine without any chance of aggregation.
      
      This change refine the NAPI receive path admission test, to avoid
      unnecessary GRO overhead in most scenarios, when GRO is enabled
      on a veth peer.
      
      Only skbs that are eligible for aggregation enter the GRO layer,
      the others will go through the traditional receive path.
      Signed-off-by: default avatarPaolo Abeni <pabeni@redhat.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      47e550e0
    • Paolo Abeni's avatar
      veth: allow enabling NAPI even without XDP · d3256efd
      Paolo Abeni authored
      Currently the veth device has the GRO feature bit set, even if
      no GRO aggregation is possible with the default configuration,
      as the veth device does not hook into the GRO engine.
      
      Flipping the GRO feature bit from user-space is a no-op, unless
      XDP is enabled. In such scenario GRO could actually take place, but
      TSO is forced to off on the peer device.
      
      This change allow user-space to really control the GRO feature, with
      no need for an XDP program.
      
      The GRO feature bit is now cleared by default - so that there are no
      user-visible behavior changes with the default configuration.
      
      When the GRO bit is set, the per-queue NAPI instances are initialized
      and registered. On xmit, when napi instances are available, we try
      to use them.
      
      Some additional checks are in place to ensure we initialize/delete NAPIs
      only when needed in case of overlapping XDP and GRO configuration
      changes.
      Signed-off-by: default avatarPaolo Abeni <pabeni@redhat.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      d3256efd
    • Paolo Abeni's avatar
      veth: use skb_orphan_partial instead of skb_orphan · c75fb320
      Paolo Abeni authored
      As described by commit 9c4c3252 ("skbuff: preserve sock
      reference when scrubbing the skb."), orphaning a skb
      in the TX path will cause OoO.
      
      Let's use skb_orphan_partial() instead of skb_orphan(), so
      that we keep the sk around for queue's selection sake and we
      still avoid the problem fixed with commit 4bf9ffa0 ("veth:
      Orphan skb before GRO")
      Signed-off-by: default avatarPaolo Abeni <pabeni@redhat.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      c75fb320
    • David S. Miller's avatar
      Merge branch 'ethtool-eeprom' · 7dc85b59
      David S. Miller authored
      Moshe Shemesh says:
      
      ====================
      ethtool: Extend module EEPROM dump API
      
      Ethtool supports module EEPROM dumps via the `ethtool -m <dev>` command.
      But in current state its functionality is limited - offset and length
      parameters, which are used to specify a linear desired region of EEPROM
      data to dump, is not enough, considering emergence of complex module
      EEPROM layouts such as CMIS 4.0.
      Moreover, CMIS 4.0 extends the amount of pages that may be accessible by
      introducing another parameter for page addressing - banks.
      
      Besides, currently module EEPROM is represented as a chunk of
      concatenated pages, where lower 128 bytes of all pages, except page 00h,
      are omitted. Offset and length are used to address parts of this fake
      linear memory. But in practice drivers, which implement
      get_module_info() and get_module_eeprom() ethtool ops still calculate
      page number and set I2C address on their own.
      
      This series tackles these issues by adding ethtool op, which allows to
      pass page number, bank number and I2C address in addition to offset and
      length parameters to the driver, adds corresponding netlink
      infrastructure and implements the new interface in mlx5 driver.
      
      This allows to extend userspace 'ethtool -m' CLI by adding new
      parameters - page, bank and i2c. New command line format:
       ethtool -m <dev> [hex on|off] [raw on|off] [offset N] [length N] [page N] [bank N] [i2c N]
      
      The consequence of this series is a possibility to dump arbitrary EEPROM
      page at a time, in contrast to dumps of concatenated pages. Therefore,
      offset and length change their semantics and may be used only to specify
      a part of data within half page boundary, which size is currently limited
      to 128 bytes.
      
      As for drivers that support legacy get_module_info() and
      get_module_eeprom() pair, the series addresses it by implementing a
      fallback mechanism. As mentioned earlier, such drivers derive a page
      number from 'global' offset, so this can be done vice versa without
      their involvement thanks to standardization. If kernel netlink handler
      of 'ethtool -m' command detects that new ethtool op is not supported by
      the driver, it calculates offset from given page number and page offset
      and calls old ndos, if they are available.
      ====================
      
      \Signed-off-by: David S. Miller <davem@davemloft.net>
      7dc85b59
    • Andrew Lunn's avatar
      ethtool: wire in generic SFP module access · c97a31f6
      Andrew Lunn authored
      If the device has a sfp bus attached, call its
      sfp_get_module_eeprom_by_page() function, otherwise use the ethtool op
      for the device. This follows how the IOCTL works.
      Signed-off-by: default avatarAndrew Lunn <andrew@lunn.ch>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      c97a31f6
    • Andrew Lunn's avatar
      phy: sfp: add netlink SFP support to generic SFP code · d740513f
      Andrew Lunn authored
      The new netlink API for reading SFP data requires a new op to be
      implemented. The idea of the new netlink SFP code is that userspace is
      responsible to parsing the EEPROM data and requesting pages, rather
      than have the kernel decide what pages are interesting and returning
      them. This allows greater flexibility for newer formats.
      
      Currently the generic SFP code only supports simple SFPs. Allow i2c
      address 0x50 and 0x51 to be accessed with page and bank must always be
      0. This interface will later be extended when for example QSFP support
      is added.
      Signed-off-by: default avatarAndrew Lunn <andrew@lunn.ch>
      Signed-off-by: default avatarVladyslav Tarasiuk <vladyslavt@nvidia.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      d740513f
    • Vladyslav Tarasiuk's avatar
      ethtool: Add fallback to get_module_eeprom from netlink command · 96d971e3
      Vladyslav Tarasiuk authored
      In case netlink get_module_eeprom_by_page() callback is not implemented
      by the driver, try to call old get_module_info() and get_module_eeprom()
      pair. Recalculate parameters to get_module_eeprom() offset and len using
      page number and their sizes. Return error if this can't be done.
      Signed-off-by: default avatarVladyslav Tarasiuk <vladyslavt@nvidia.com>
      Signed-off-by: default avatarAndrew Lunn <andrew@lunn.ch>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      96d971e3
    • Andrew Lunn's avatar
      net: ethtool: Export helpers for getting EEPROM info · 95dfc7ef
      Andrew Lunn authored
      There are two ways to retrieve information from SFP EEPROMs.  Many
      devices make use of the common code, and assign the sfp_bus pointer in
      the netdev to point to the bus holding the SFP device. Some MAC
      drivers directly implement ops in there ethool structure.
      
      Export within net/ethtool the two helpers used to call these methods,
      so that they can also be used in the new netlink code.
      Signed-off-by: default avatarAndrew Lunn <andrew@lunn.ch>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      95dfc7ef
    • Vladyslav Tarasiuk's avatar
      net/mlx5: Add support for DSFP module EEPROM dumps · 4c88fa41
      Vladyslav Tarasiuk authored
      Allow the driver to recognise DSFP transceiver module ID and therefore
      allow its EEPROM dumps using ethtool.
      Signed-off-by: default avatarVladyslav Tarasiuk <vladyslavt@nvidia.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      4c88fa41
    • Vladyslav Tarasiuk's avatar
      net/mlx5: Implement get_module_eeprom_by_page() · e109d2b2
      Vladyslav Tarasiuk authored
      Implement ethtool_ops::get_module_eeprom_by_page() to enable
      support of new SFP standards.
      Signed-off-by: default avatarVladyslav Tarasiuk <vladyslavt@nvidia.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      e109d2b2
    • Vladyslav Tarasiuk's avatar
      net/mlx5: Refactor module EEPROM query · e19b0a34
      Vladyslav Tarasiuk authored
      Prepare for ethtool_ops::get_module_eeprom_data() implementation by
      extracting common part of mlx5_query_module_eeprom() into a separate
      function.
      Signed-off-by: default avatarVladyslav Tarasiuk <vladyslavt@nvidia.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      e19b0a34
    • Vladyslav Tarasiuk's avatar
      ethtool: Allow network drivers to dump arbitrary EEPROM data · c781ff12
      Vladyslav Tarasiuk authored
      Define get_module_eeprom_by_page() ethtool callback and implement
      netlink infrastructure.
      
      get_module_eeprom_by_page() allows network drivers to dump a part of
      module's EEPROM specified by page and bank numbers along with offset and
      length. It is effectively a netlink replacement for get_module_info()
      and get_module_eeprom() pair, which is needed due to emergence of
      complex non-linear EEPROM layouts.
      Signed-off-by: default avatarVladyslav Tarasiuk <vladyslavt@nvidia.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      c781ff12
  3. 10 Apr, 2021 10 commits
    • Jakub Kicinski's avatar
      Merge branch 'net-ipa-a-few-small-fixes' · cbd31253
      Jakub Kicinski authored
      Alex Elder says:
      
      ====================
      net: ipa: a few small fixes
      
      This series implements some minor bug fixes or improvements.
      
      The first patch removes an apparently unnecessary restriction, which
      results in an error on a 32-bit ARM build.
      
      The second makes a definition used for SDM845 match what is used in
      the downstream code.
      
      The third just ensures two netdev pointers are only non-null when
      valid.
      
      The fourth simplifies a little code, knowing that a called function
      never returns an error.
      
      The fifth and sixth just remove some empty/place holder functions.
      
      And the last patch fixes a comment, makes a function private, and
      removes an unnecessary double-negation of a Boolean variable.  This
      patch produces a warning from checkpatch, indicating that a pair of
      parentheses is unnecessary.  I agree with that advice, but it
      conflicts with a suggestion from the compiler.  I left the "problem"
      in place to avoid the compiler warning.
      ====================
      
      Link: https://lore.kernel.org/r/20210409180722.1176868-1-elder@linaro.orgSigned-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      cbd31253
    • Alex Elder's avatar
      net: ipa: three small fixes · 602a1c76
      Alex Elder authored
      Some time ago changes were made to stop referring to clearing the
      hardware pipeline as a "tag process."  Fix a comment to use the
      newer terminology.
      
      Get rid of a pointless double-negation of the Boolean toward_ipa
      flag in ipa_endpoint_config().
      
      make ipa_endpoint_exit_one() private; it's only referenced inside
      "ipa_endpoint.c".
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      602a1c76
    • Alex Elder's avatar
      net: ipa: get rid of empty GSI functions · 57ab8ca4
      Alex Elder authored
      There are place holder functions in the GSI code that do nothing.
      Remove these, knowing we can add something back in their place if
      they're really needed someday.
      
      Some of these are inverse functions (such as teardown to match setup).
      Explicitly comment that there is no inverse in these cases.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      57ab8ca4
    • Alex Elder's avatar
      net: ipa: get rid of empty IPA functions · 74858b63
      Alex Elder authored
      There are place holder functions in the IPA code that do nothing.
      For the most part these are inverse functions, for example, once the
      routing or filter tables are set up there is no need to perform any
      matching teardown activity at shutdown, or in the case of an error.
      
      These can be safely removed, resulting in some code simplification.
      Add comments in these spots making it explicit that there is no
      inverse.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      74858b63
    • Alex Elder's avatar
      net: ipa: ipa_stop() does not return an error · 077e770f
      Alex Elder authored
      In ipa_modem_stop(), if the modem netdev pointer is non-null we call
      ipa_stop().  We check for an error and if one is returned we handle
      it.  But ipa_stop() never returns an error, so this extra handling
      is unnecessary.  Simplify the code in ipa_modem_stop() based on the
      knowledge no error handling is needed at this spot.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      077e770f
    • Alex Elder's avatar
      net: ipa: only set endpoint netdev pointer when in use · 57f63faf
      Alex Elder authored
      In ipa_modem_start(), we set endpoint netdev pointers before the
      network device is registered.  If registration fails, we don't undo
      those assignments.  Instead, wait to assign the netdev pointer until
      after registration succeeds.
      
      Set these endpoint netdev pointers to NULL in ipa_modem_stop()
      before unregistering the network device.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      57f63faf
    • Alex Elder's avatar
      net: ipa: update sequence type for modem TX endpoint · 49e76a41
      Alex Elder authored
      On IPA v3.5.1, the sequencer type for the modem TX endpoint does not
      define the replication portion in the same way the downstream code
      does.  This difference doesn't affect the behavior of the upstream
      code, but I'd prefer the two code bases use the same configuration
      value here.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      49e76a41
    • Alex Elder's avatar
      net: ipa: relax pool entry size requirement · 7ad3bd52
      Alex Elder authored
      I no longer know why a validation check ensured the size of an entry
      passed to gsi_trans_pool_init() was restricted to be a multiple of 8.
      For 32-bit builds, this condition doesn't always hold, and for DMA
      pools, the size is rounded up to a power of 2 anyway.
      
      Remove this restriction.
      Signed-off-by: default avatarAlex Elder <elder@linaro.org>
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      7ad3bd52
    • Jakub Kicinski's avatar
      Merge git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net · 8859a44e
      Jakub Kicinski authored
      Conflicts:
      
      MAINTAINERS
       - keep Chandrasekar
      drivers/net/ethernet/mellanox/mlx5/core/en_main.c
       - simple fix + trust the code re-added to param.c in -next is fine
      include/linux/bpf.h
       - trivial
      include/linux/ethtool.h
       - trivial, fix kdoc while at it
      include/linux/skmsg.h
       - move to relevant place in tcp.c, comment re-wrapped
      net/core/skmsg.c
       - add the sk = sk // sk = NULL around calls
      net/tipc/crypto.c
       - trivial
      Signed-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      8859a44e
    • Claudiu Manoil's avatar
      enetc: Use generic rule to map Tx rings to interrupt vectors · 6c5e6b4c
      Claudiu Manoil authored
      Even if the current mapping is correct for the 1 CPU and 2 CPU cases
      (currently enetc is included in SoCs with up to 2 CPUs only), better
      use a generic rule for the mapping to cover all possible cases.
      The number of CPUs is the same as the number of interrupt vectors:
      
      Per device Tx rings -
      device_tx_ring[idx], where idx = 0..n_rings_total-1
      
      Per interrupt vector Tx rings -
      int_vector[i].ring[j], where i = 0..n_int_vects-1
      			     j = 0..n_rings_per_v-1
      
      Mapping rule -
      n_rings_per_v = n_rings_total / n_int_vects
      for i = 0..n_int_vects - 1:
      	for j = 0..n_rings_per_v - 1:
      		idx = n_int_vects * j + i
      		int_vector[i].ring[j] <- device_tx_ring[idx]
      Signed-off-by: default avatarClaudiu Manoil <claudiu.manoil@nxp.com>
      Tested-by: default avatarVladimir Oltean <vladimir.oltean@nxp.com>
      Link: https://lore.kernel.org/r/20210409071613.28912-1-claudiu.manoil@nxp.comSigned-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      6c5e6b4c