From patchwork Mon Aug 9 11:47:15 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Xueming(Steven) Li" X-Patchwork-Id: 96730 X-Patchwork-Delegate: ferruh.yigit@amd.com Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id 83586A0C4C; Mon, 9 Aug 2021 13:48:12 +0200 (CEST) Received: from [217.70.189.124] (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id 036194068A; Mon, 9 Aug 2021 13:48:12 +0200 (CEST) Received: from NAM11-CO1-obe.outbound.protection.outlook.com (mail-co1nam11on2040.outbound.protection.outlook.com [40.107.220.40]) by mails.dpdk.org (Postfix) with ESMTP id ECA834003C for ; Mon, 9 Aug 2021 13:48:09 +0200 (CEST) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Rr1qN3ILkPiAtADhgwlLJLM6uPmvU8v5MO9OmeCp2l4UPQWRi/b7Vpy1e7/Pz1q3VPmdIPBnDWusRMp/k/NtFJ4bDFTfbyV9D3NjL6alz0ng2zjrqaxJF5FegcVvJRJZRuqkqvq+UDdSpMDULWDuoHTCLYOO+84eC6YySmjsNZt1v8AsJ2vC92XYVH/vKwfLk3B7ATGsvk0AkpzYuqAl9NecTJpT+j+hemWfv/+mcavTINTCPCeME6FCvZy95rsAgebVhCclkaEiUu5cvlwnZU9oUiqO8XugbLkQt9e/jxYi5NNu2NLtE7t4Oie27WQhx6mleXP3fLDgII+wr4QDlA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=kSHZX1mNiAfHi3Grd4Zw7y/5B2N1RhHG6rwKFPX2+G8=; b=AezyZ1UnQ7nf8suz6m0XtwBVrBkHa6q2GYH6cXzLfLzfG2D4e4WbjKnIoG55btk8aKlGsLEPVDwwSlgbx0X4myTirhDF/B8cv5GzArC9foPUuRSPeVnl03LeHSOKX1ooV81Ys1kLW/ZclmNn7bEfGz7aLhpbHTLKTaMhSHEb1CQeJkFMVXL1uTKztiN3vM3muvPCy60U9rrvq6wp76mhJx+hOu2uZBFITuLWgwheHId+YLgbJKbSV/JInpMPb+dVRoQDhSfh2lvU5n9SY7c003ObApkUQYrC0zyswas19s2v+oQEhIQnVC4w7ki0Ijmo+ay2j+cvSNe3mrToGNHhxw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 216.228.112.34) smtp.rcpttodomain=monjalon.net smtp.mailfrom=nvidia.com; dmarc=pass (p=quarantine sp=none pct=100) action=none header.from=nvidia.com; dkim=none (message not signed); arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=Nvidia.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=kSHZX1mNiAfHi3Grd4Zw7y/5B2N1RhHG6rwKFPX2+G8=; b=JJ/R7zwXKUYd+UKmMFeKVa7W4Kco67uXPOeTHCwJ/P6hDgwNfve+cVMrCjYmmONrA2ephuNDoeZTAsBVirASwWxotP8039UCXumCh/RTKm/5qxHFFDCkr/5cm7xoZW+AtOusvJgLIAPB8KSvL5iZJZ6IwzIZP++Al/rI7pD7U/8YUvKz6IQq4byQw7TfK+MXmQpvzO61izg/IhjU/EbHsMO6laSfKIB9+nqWLg9VAcHlnxTSHIVptB8xPg0I870aI68YTg00Vx2zxDfzFDbSdcuGao4tb19OgRq+ca9wZN9gKGoMtX6wVE2VFNz0bjumygh2mGOHLUKWy7hGdGqRCw== Received: from MWHPR2001CA0007.namprd20.prod.outlook.com (2603:10b6:301:15::17) by BN9PR12MB5340.namprd12.prod.outlook.com (2603:10b6:408:105::22) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.16; Mon, 9 Aug 2021 11:48:08 +0000 Received: from CO1NAM11FT007.eop-nam11.prod.protection.outlook.com (2603:10b6:301:15:cafe::17) by MWHPR2001CA0007.outlook.office365.com (2603:10b6:301:15::17) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.15 via Frontend Transport; Mon, 9 Aug 2021 11:48:07 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 216.228.112.34) smtp.mailfrom=nvidia.com; monjalon.net; dkim=none (message not signed) header.d=none;monjalon.net; dmarc=pass action=none header.from=nvidia.com; Received-SPF: Pass (protection.outlook.com: domain of nvidia.com designates 216.228.112.34 as permitted sender) receiver=protection.outlook.com; client-ip=216.228.112.34; helo=mail.nvidia.com; Received: from mail.nvidia.com (216.228.112.34) by CO1NAM11FT007.mail.protection.outlook.com (10.13.174.131) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384) id 15.20.4394.16 via Frontend Transport; Mon, 9 Aug 2021 11:48:07 +0000 Received: from nvidia.com (172.20.187.6) by HQMAIL107.nvidia.com (172.20.187.13) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Mon, 9 Aug 2021 11:48:05 +0000 From: Xueming Li To: CC: , , Ferruh Yigit , Thomas Monjalon , "Andrew Rybchenko" Date: Mon, 9 Aug 2021 19:47:15 +0800 Message-ID: <20210809114716.22035-1-xuemingl@nvidia.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20210727034204.20649-1-xuemingl@nvidia.com> References: <20210727034204.20649-1-xuemingl@nvidia.com> MIME-Version: 1.0 X-Originating-IP: [172.20.187.6] X-ClientProxiedBy: HQMAIL107.nvidia.com (172.20.187.13) To HQMAIL107.nvidia.com (172.20.187.13) X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id: 14cd5ad3-faef-4095-3cef-08d95b2b8dda X-MS-TrafficTypeDiagnostic: BN9PR12MB5340: X-LD-Processed: 43083d15-7273-40c1-b7db-39efd9ccc17a,ExtAddr X-Microsoft-Antispam-PRVS: X-MS-Oob-TLC-OOBClassifiers: OLM:8882; X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: 0avbtpt6hE+GN6YejLTPHeJiAJwhUsvCzoQ39n6a2DO5qXgz6AnBDUfnaqjQIBJPh7vEFNuDPWYAwxOTJwDi9+dl92SjN+8QuUUOp5FEt6M8g6nEJzpCJ0PsMBb5RjLXcOJoAL+7q9NP7BaCbTcf04RK+RVhk/jvtrGB+OW6rchJPfQtb7h0eAyMENLrnBO92C+VxLIqMHJQN6ojqr3Vpf5tbDtqGOwGsCPlY6MArj0ce9hgkhyYCv5bJn8H2cz0miWXWTRSGoAX8HwC0MN7nhqYBOQEsr65eiQW5tckXdb7TKD6ZDyBA5uoYyiIcOUS7oyZUeRrL8n3wx9gyovHzjosve8Nm2NI0SvAFnWKp1fvIhGsmxNYOPurHLcj9gw7KpIHvz6SMILbBVOSpbASbjptxk9wSl3gKJScvmCvcdCf1RPuj1NFKleQurJjqotYcBFGfm3HSvHgQnKCLumPq7sjNupy1uNxy7fLFwmOwdHhY10KIk0d1b9IN+nSsnSbg6WeZPw0kGmii3Qjq4BxdlM9QmYY4ERJWVfbTnf6jXSA7WT9genfQ58Ag8I+bErFq/P0tTKiKKmlEOSe9sg6f/UQL8QLATTY0YBGIUtaESJs11KHbmx9bmLkeqGq+1cSc5kET0zY+Hp7/yCETAxbL6lDPbb08JMacY01pA9JZGgBHUMuPX969PC77sTdORJgasCLvwsiDRSlL+4dMyrtyBbqIr0pYRMR3uvnhAPqD5cMJdGsN0REsYvxcNxYClvWgYpmVq7rHu4ePFhyDxmiDwahPQf13ZwKYGjYjaD5atB8FdSakDTaVCwmaBLrFwE6 X-Forefront-Antispam-Report: CIP:216.228.112.34; CTRY:US; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:mail.nvidia.com; PTR:schybrid03.nvidia.com; CAT:NONE; SFS:(4636009)(396003)(39860400002)(136003)(376002)(346002)(46966006)(36840700001)(478600001)(6286002)(2906002)(54906003)(70206006)(109986005)(426003)(4326008)(336012)(55016002)(2616005)(36860700001)(82310400003)(70586007)(5660300002)(356005)(26005)(1076003)(6666004)(8676002)(7636003)(36906005)(86362001)(8936002)(83380400001)(82740400003)(316002)(186003)(47076005)(16526019)(36756003)(7696005)(266003); DIR:OUT; SFP:1101; X-OriginatorOrg: Nvidia.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 09 Aug 2021 11:48:07.0197 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 14cd5ad3-faef-4095-3cef-08d95b2b8dda X-MS-Exchange-CrossTenant-Id: 43083d15-7273-40c1-b7db-39efd9ccc17a X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=43083d15-7273-40c1-b7db-39efd9ccc17a; Ip=[216.228.112.34]; Helo=[mail.nvidia.com] X-MS-Exchange-CrossTenant-AuthSource: CO1NAM11FT007.eop-nam11.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: BN9PR12MB5340 Subject: [dpdk-dev] [PATCH v1] ethdev: introduce shared Rx queue X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" In current DPDK framework, each RX queue is pre-loaded with mbufs for incoming packets. When number of representors scale out in a switch domain, the memory consumption became significant. Most important, polling all ports leads to high cache miss, high latency and low throughput. This patch introduces shared RX queue. Ports with same configuration in a switch domain could share RX queue set by specifying sharing group. Polling any queue using same shared RX queue receives packets from all member ports. Source port is identified by mbuf->port. Port queue number in a shared group should be identical. Queue index is 1:1 mapped in shared group. Share RX queue is supposed to be polled on same thread. Multiple groups is supported by group ID. Signed-off-by: Xueming Li Signed-off-by: Xueming Li > --- doc/guides/nics/features.rst | 11 +++++++++++ doc/guides/nics/features/default.ini | 1 + doc/guides/prog_guide/switch_representation.rst | 10 ++++++++++ lib/ethdev/rte_ethdev.c | 1 + lib/ethdev/rte_ethdev.h | 7 +++++++ 5 files changed, 30 insertions(+) diff --git a/doc/guides/nics/features.rst b/doc/guides/nics/features.rst index a96e12d155..2e2a9b1554 100644 --- a/doc/guides/nics/features.rst +++ b/doc/guides/nics/features.rst @@ -624,6 +624,17 @@ Supports inner packet L4 checksum. ``tx_offload_capa,tx_queue_offload_capa:DEV_TX_OFFLOAD_OUTER_UDP_CKSUM``. +.. _nic_features_shared_rx_queue: + +Shared Rx queue +--------------- + +Supports shared Rx queue for ports in same switch domain. + +* **[uses] rte_eth_rxconf,rte_eth_rxmode**: ``offloads:RTE_ETH_RX_OFFLOAD_SHARED_RXQ``. +* **[provides] mbuf**: ``mbuf.port``. + + .. _nic_features_packet_type_parsing: Packet type parsing diff --git a/doc/guides/nics/features/default.ini b/doc/guides/nics/features/default.ini index 754184ddd4..ebeb4c1851 100644 --- a/doc/guides/nics/features/default.ini +++ b/doc/guides/nics/features/default.ini @@ -19,6 +19,7 @@ Free Tx mbuf on demand = Queue start/stop = Runtime Rx queue setup = Runtime Tx queue setup = +Shared Rx queue = Burst mode info = Power mgmt address monitor = MTU update = diff --git a/doc/guides/prog_guide/switch_representation.rst b/doc/guides/prog_guide/switch_representation.rst index ff6aa91c80..45bf5a3a10 100644 --- a/doc/guides/prog_guide/switch_representation.rst +++ b/doc/guides/prog_guide/switch_representation.rst @@ -123,6 +123,16 @@ thought as a software "patch panel" front-end for applications. .. [1] `Ethernet switch device driver model (switchdev) `_ +- Memory usage of representors is huge when number of representor grows, + because PMD always allocate mbuf for each descriptor of Rx queue. + Polling the large number of ports brings more CPU load, cache miss and + latency. Shared Rx queue can be used to share Rx queue between PF and + representors in same switch domain. ``RTE_ETH_RX_OFFLOAD_SHARED_RXQ`` + is present in Rx offloading capability of device info. Setting the + offloading flag in device Rx mode or Rx queue configuration to enable + shared Rx queue. Polling any member port of shared Rx queue can return + packets of all ports in group, port ID is saved in ``mbuf.port``. + Basic SR-IOV ------------ diff --git a/lib/ethdev/rte_ethdev.c b/lib/ethdev/rte_ethdev.c index 9d95cd11e1..1361ff759a 100644 --- a/lib/ethdev/rte_ethdev.c +++ b/lib/ethdev/rte_ethdev.c @@ -127,6 +127,7 @@ static const struct { RTE_RX_OFFLOAD_BIT2STR(OUTER_UDP_CKSUM), RTE_RX_OFFLOAD_BIT2STR(RSS_HASH), RTE_ETH_RX_OFFLOAD_BIT2STR(BUFFER_SPLIT), + RTE_ETH_RX_OFFLOAD_BIT2STR(SHARED_RXQ), }; #undef RTE_RX_OFFLOAD_BIT2STR diff --git a/lib/ethdev/rte_ethdev.h b/lib/ethdev/rte_ethdev.h index d2b27c351f..a578c9db9d 100644 --- a/lib/ethdev/rte_ethdev.h +++ b/lib/ethdev/rte_ethdev.h @@ -1047,6 +1047,7 @@ struct rte_eth_rxconf { uint8_t rx_drop_en; /**< Drop packets if no descriptors are available. */ uint8_t rx_deferred_start; /**< Do not start queue with rte_eth_dev_start(). */ uint16_t rx_nseg; /**< Number of descriptions in rx_seg array. */ + uint32_t shared_group; /**< Shared port group index in switch domain. */ /** * Per-queue Rx offloads to be set using DEV_RX_OFFLOAD_* flags. * Only offloads set on rx_queue_offload_capa or rx_offload_capa @@ -1373,6 +1374,12 @@ struct rte_eth_conf { #define DEV_RX_OFFLOAD_OUTER_UDP_CKSUM 0x00040000 #define DEV_RX_OFFLOAD_RSS_HASH 0x00080000 #define RTE_ETH_RX_OFFLOAD_BUFFER_SPLIT 0x00100000 +/** + * Rx queue is shared among ports in same switch domain to save memory, + * avoid polling each port. Any port in group can be used to receive packets. + * Real source port number saved in mbuf->port field. + */ +#define RTE_ETH_RX_OFFLOAD_SHARED_RXQ 0x00200000 #define DEV_RX_OFFLOAD_CHECKSUM (DEV_RX_OFFLOAD_IPV4_CKSUM | \ DEV_RX_OFFLOAD_UDP_CKSUM | \