From patchwork Tue Oct 4 14:58:50 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Olivier Matz
X-Patchwork-Id: 117316
X-Patchwork-Delegate: david.marchand@redhat.com
From: Olivier Matz
To: dev@dpdk.org
Cc: Ray Kinsella, Parav Pandit, Xueming Li, Hemant Agrawal,
 Sachin Saxena, Stephen Hemminger, Long Li, david.marchand@redhat.com
Subject: [PATCH v2] drivers/bus: set device NUMA node to unknown by default
Date: Tue, 4 Oct 2022 16:58:50 +0200
Message-Id: <20221004145850.32331-1-olivier.matz@6wind.com>
X-Mailer: git-send-email 2.30.2

The dev->device.numa_node field is set by each bus driver for
every device it manages to indicate on which NUMA node this device lies.

When this information is unknown, the assigned value is not consistent
across the bus drivers.

Set the default value to SOCKET_ID_ANY (-1) by all bus drivers
when the NUMA information is unavailable.

This change impacts rte_eth_dev_socket_id() in the same manner.

Signed-off-by: Olivier Matz
---

v2
* use SOCKET_ID_ANY instead of -1 in drivers/dma/idxd (David)
* document the behavior change of rte_eth_dev_socket_id()
* fix few examples where rte_eth_dev_socket_id() was expected to
  return 0 on unknown socket

 doc/guides/rel_notes/deprecation.rst     |  7 -------
 doc/guides/rel_notes/release_22_11.rst   |  6 ++++++
 drivers/bus/auxiliary/auxiliary_common.c |  8 ++------
 drivers/bus/auxiliary/linux/auxiliary.c  | 13 +++++--------
 drivers/bus/dpaa/dpaa_bus.c              |  1 +
 drivers/bus/fslmc/fslmc_bus.c            |  1 +
 drivers/bus/pci/bsd/pci.c                |  2 +-
 drivers/bus/pci/linux/pci.c              | 16 ++++++----------
 drivers/bus/pci/pci_common.c             |  8 ++------
 drivers/bus/pci/windows/pci.c            |  1 -
 drivers/bus/vmbus/linux/vmbus_bus.c      |  1 -
 drivers/bus/vmbus/vmbus_common.c         |  8 ++------
 drivers/dma/idxd/idxd_bus.c              |  3 ++-
 examples/distributor/main.c              |  4 ++--
 examples/flow_classify/flow_classify.c   |  2 ++
 examples/rxtx_callbacks/main.c           |  2 +-
 lib/ethdev/rte_ethdev.h                  |  4 ++--
 17 files changed, 35 insertions(+), 52 deletions(-)

diff --git a/doc/guides/rel_notes/deprecation.rst b/doc/guides/rel_notes/deprecation.rst
index a991fa14de..2a1a6ff899 100644
--- a/doc/guides/rel_notes/deprecation.rst
+++ b/doc/guides/rel_notes/deprecation.rst
@@ -33,13 +33,6 @@ Deprecation Notices
   ``__atomic_thread_fence`` must be used for patches that need to be merged in
   20.08 onwards. This change will not introduce any performance degradation.
 
-* bus: The ``dev->device.numa_node`` field is set by each bus driver for
-  every device it manages to indicate on which NUMA node this device lies.
-  When this information is unknown, the assigned value is not consistent
-  across the bus drivers.
-  In DPDK 22.11, the default value will be set to -1 by all bus drivers
-  when the NUMA information is unavailable.
-
 * kni: The KNI kernel module and library are not recommended for use by new
   applications - other technologies such as virtio-user are recommended instead.
   Following the DPDK technical board
diff --git a/doc/guides/rel_notes/release_22_11.rst b/doc/guides/rel_notes/release_22_11.rst
index 53fe21453c..d52f823694 100644
--- a/doc/guides/rel_notes/release_22_11.rst
+++ b/doc/guides/rel_notes/release_22_11.rst
@@ -317,6 +317,12 @@ ABI Changes
 * eventdev: Added ``weight`` and ``affinity`` fields
   to ``rte_event_queue_conf`` structure.
 
+* bus: Changed the device numa node to -1 when NUMA information is unavailable.
+  The ``dev->device.numa_node`` field is set by each bus driver for
+  every device it manages to indicate on which NUMA node this device lies.
+  When this information is unknown, the assigned value was not consistent
+  across the bus drivers. This similarly impacts ``rte_eth_dev_socket_id()``.
+
 
 Known Issues
 ------------
diff --git a/drivers/bus/auxiliary/auxiliary_common.c b/drivers/bus/auxiliary/auxiliary_common.c
index 259ff152c4..6bb1fe7c96 100644
--- a/drivers/bus/auxiliary/auxiliary_common.c
+++ b/drivers/bus/auxiliary/auxiliary_common.c
@@ -105,12 +105,8 @@ rte_auxiliary_probe_one_driver(struct rte_auxiliary_driver *drv,
 		return -1;
 	}
 
-	if (dev->device.numa_node < 0) {
-		if (rte_socket_count() > 1)
-			AUXILIARY_LOG(INFO, "Device %s is not NUMA-aware, defaulting socket to 0",
-					dev->name);
-		dev->device.numa_node = 0;
-	}
+	if (dev->device.numa_node < 0 && rte_socket_count() > 1)
+		RTE_LOG(INFO, EAL, "Device %s is not NUMA-aware\n", dev->name);
 
 	iova_mode = rte_eal_iova_mode();
 	if ((drv->drv_flags & RTE_AUXILIARY_DRV_NEED_IOVA_AS_VA) > 0 &&
diff --git a/drivers/bus/auxiliary/linux/auxiliary.c b/drivers/bus/auxiliary/linux/auxiliary.c
index d4c564cd78..02fc9285dc 100644
--- a/drivers/bus/auxiliary/linux/auxiliary.c
+++ b/drivers/bus/auxiliary/linux/auxiliary.c
@@ -40,14 +40,11 @@ auxiliary_scan_one(const char *dirname, const char *name)
 	/* Get NUMA node, default to 0 if not present */
 	snprintf(filename, sizeof(filename), "%s/%s/numa_node",
 		 dirname, name);
-	if (access(filename, F_OK) != -1) {
-		if (eal_parse_sysfs_value(filename, &tmp) == 0)
-			dev->device.numa_node = tmp;
-		else
-			dev->device.numa_node = -1;
-	} else {
-		dev->device.numa_node = 0;
-	}
+	if (access(filename, F_OK) == 0 &&
+	    eal_parse_sysfs_value(filename, &tmp) == 0)
+		dev->device.numa_node = tmp;
+	else
+		dev->device.numa_node = SOCKET_ID_ANY;
 
 	auxiliary_on_scan(dev);
 
diff --git a/drivers/bus/dpaa/dpaa_bus.c b/drivers/bus/dpaa/dpaa_bus.c
index 682427ba2c..447b222a76 100644
--- a/drivers/bus/dpaa/dpaa_bus.c
+++ b/drivers/bus/dpaa/dpaa_bus.c
@@ -179,6 +179,7 @@ dpaa_create_device_list(void)
 		}
 
 		dev->device.bus = &rte_dpaa_bus.bus;
+		dev->device.numa_node = SOCKET_ID_ANY;
 
 		/* Allocate interrupt handle instance */
 		dev->intr_handle =
diff --git a/drivers/bus/fslmc/fslmc_bus.c b/drivers/bus/fslmc/fslmc_bus.c
index 8503004e3d..57bfb5111a 100644
--- a/drivers/bus/fslmc/fslmc_bus.c
+++ b/drivers/bus/fslmc/fslmc_bus.c
@@ -156,6 +156,7 @@ scan_one_fslmc_device(char *dev_name)
 	}
 
 	dev->device.bus = &rte_fslmc_bus.bus;
+	dev->device.numa_node = SOCKET_ID_ANY;
 
 	/* Allocate interrupt instance */
 	dev->intr_handle =
diff --git a/drivers/bus/pci/bsd/pci.c b/drivers/bus/pci/bsd/pci.c
index 844d145fed..7459d15c7e 100644
--- a/drivers/bus/pci/bsd/pci.c
+++ b/drivers/bus/pci/bsd/pci.c
@@ -246,7 +246,7 @@ pci_scan_one(int dev_pci_fd, struct pci_conf *conf)
 	dev->max_vfs = 0;
 
 	/* FreeBSD has no NUMA support (yet) */
-	dev->device.numa_node = 0;
+	dev->device.numa_node = SOCKET_ID_ANY;
 
 	pci_common_set(dev);
 
diff --git a/drivers/bus/pci/linux/pci.c b/drivers/bus/pci/linux/pci.c
index c8703d52f3..ade17079fc 100644
--- a/drivers/bus/pci/linux/pci.c
+++ b/drivers/bus/pci/linux/pci.c
@@ -283,17 +283,13 @@ pci_scan_one(const char *dirname, const struct rte_pci_addr *addr)
 	}
 
 	/* get numa node, default to 0 if not present */
-	snprintf(filename, sizeof(filename), "%s/numa_node",
-		 dirname);
+	snprintf(filename, sizeof(filename), "%s/numa_node", dirname);
 
-	if (access(filename, F_OK) != -1) {
-		if (eal_parse_sysfs_value(filename, &tmp) == 0)
-			dev->device.numa_node = tmp;
-		else
-			dev->device.numa_node = -1;
-	} else {
-		dev->device.numa_node = 0;
-	}
+	if (access(filename, F_OK) == 0 &&
+	    eal_parse_sysfs_value(filename, &tmp) == 0)
+		dev->device.numa_node = tmp;
+	else
+		dev->device.numa_node = SOCKET_ID_ANY;
 
 	pci_common_set(dev);
 
diff --git a/drivers/bus/pci/pci_common.c b/drivers/bus/pci/pci_common.c
index 5ea72bcf23..a59c5b4286 100644
--- a/drivers/bus/pci/pci_common.c
+++ b/drivers/bus/pci/pci_common.c
@@ -235,12 +235,8 @@ rte_pci_probe_one_driver(struct rte_pci_driver *dr,
 		return 1;
 	}
 
-	if (dev->device.numa_node < 0) {
-		if (rte_socket_count() > 1)
-			RTE_LOG(INFO, EAL, "Device %s is not NUMA-aware, defaulting socket to 0\n",
-					dev->name);
-		dev->device.numa_node = 0;
-	}
+	if (dev->device.numa_node < 0 && rte_socket_count() > 1)
+		RTE_LOG(INFO, EAL, "Device %s is not NUMA-aware\n", dev->name);
 
 	already_probed = rte_dev_is_probed(&dev->device);
 	if (already_probed && !(dr->drv_flags & RTE_PCI_DRV_PROBE_AGAIN)) {
diff --git a/drivers/bus/pci/windows/pci.c b/drivers/bus/pci/windows/pci.c
index 3f7a8b9432..5cf05ce1a0 100644
--- a/drivers/bus/pci/windows/pci.c
+++ b/drivers/bus/pci/windows/pci.c
@@ -249,7 +249,6 @@ get_device_resource_info(HDEVINFO dev_info,
 		DWORD error = GetLastError();
 		if (error == ERROR_NOT_FOUND) {
 			/* On older CPUs, NUMA is not bound to PCIe locality. */
-			dev->device.numa_node = 0;
 			return ERROR_SUCCESS;
 		}
 		RTE_LOG_WIN32_ERR("SetupDiGetDevicePropertyW"
diff --git a/drivers/bus/vmbus/linux/vmbus_bus.c b/drivers/bus/vmbus/linux/vmbus_bus.c
index f502783f7a..01d8111b85 100644
--- a/drivers/bus/vmbus/linux/vmbus_bus.c
+++ b/drivers/bus/vmbus/linux/vmbus_bus.c
@@ -293,7 +293,6 @@ vmbus_scan_one(const char *name)
 			goto error;
 		dev->device.numa_node = tmp;
 	} else {
-		/* if no NUMA support, set default to 0 */
 		dev->device.numa_node = SOCKET_ID_ANY;
 	}
 
diff --git a/drivers/bus/vmbus/vmbus_common.c b/drivers/bus/vmbus/vmbus_common.c
index 03b39c82b7..8d32d66504 100644
--- a/drivers/bus/vmbus/vmbus_common.c
+++ b/drivers/bus/vmbus/vmbus_common.c
@@ -111,12 +111,8 @@ vmbus_probe_one_driver(struct rte_vmbus_driver *dr,
 	/* reference driver structure */
 	dev->driver = dr;
 
-	if (dev->device.numa_node < 0) {
-		if (rte_socket_count() > 1)
-			VMBUS_LOG(INFO, "Device %s is not NUMA-aware, defaulting socket to 0",
-					guid);
-		dev->device.numa_node = 0;
-	}
+	if (dev->device.numa_node < 0 && rte_socket_count() > 1)
+		VMBUS_LOG(INFO, "Device %s is not NUMA-aware", guid);
 
 	/* call the driver probe() function */
 	VMBUS_LOG(INFO, " probe driver: %s", dr->driver.name);
diff --git a/drivers/dma/idxd/idxd_bus.c b/drivers/dma/idxd/idxd_bus.c
index 9b7b16c6e3..bbbfd3f648 100644
--- a/drivers/dma/idxd/idxd_bus.c
+++ b/drivers/dma/idxd/idxd_bus.c
@@ -12,6 +12,7 @@
 #include
 #include
 #include
+#include
 #include
 #include
 #include
@@ -322,7 +323,7 @@ dsa_scan(void)
 
 	while ((wq = readdir(dev_dir)) != NULL) {
 		struct rte_dsa_device *dev;
-		int numa_node = -1;
+		int numa_node = SOCKET_ID_ANY;
 
 		if (strncmp(wq->d_name, "wq", 2) != 0)
 			continue;
diff --git a/examples/distributor/main.c b/examples/distributor/main.c
index 68f07cc7fb..d41c3bdb14 100644
--- a/examples/distributor/main.c
+++ b/examples/distributor/main.c
@@ -231,7 +231,7 @@ lcore_rx(struct lcore_params *p)
 		if ((enabled_port_mask & (1 << port)) == 0)
 			continue;
 
-		if (rte_eth_dev_socket_id(port) > 0 &&
+		if (rte_eth_dev_socket_id(port) >= 0 &&
 				rte_eth_dev_socket_id(port) != socket_id)
 			printf("WARNING, port %u is on remote NUMA node to "
 					"RX thread.\n\tPerformance will not "
@@ -406,7 +406,7 @@ lcore_tx(struct rte_ring *in_r)
 		if ((enabled_port_mask & (1 << port)) == 0)
 			continue;
 
-		if (rte_eth_dev_socket_id(port) > 0 &&
+		if (rte_eth_dev_socket_id(port) >= 0 &&
 				rte_eth_dev_socket_id(port) != socket_id)
 			printf("WARNING, port %u is on remote NUMA node to "
 					"TX thread.\n\tPerformance will not "
diff --git a/examples/flow_classify/flow_classify.c b/examples/flow_classify/flow_classify.c
index 97708b7084..cdd51b2476 100644
--- a/examples/flow_classify/flow_classify.c
+++ b/examples/flow_classify/flow_classify.c
@@ -818,6 +818,8 @@ main(int argc, char *argv[])
 		printf("\nWARNING: Too many lcores enabled. Only 1 used.\n");
 
 	socket_id = rte_eth_dev_socket_id(0);
+	if (socket_id == SOCKET_ID_ANY)
+		socket_id = rte_lcore_to_socket_id(rte_get_next_lcore(-1, 0, 0));
 
 	/* Memory allocation. 8< */
 	size = RTE_CACHE_LINE_ROUNDUP(sizeof(struct flow_classifier_acl));
diff --git a/examples/rxtx_callbacks/main.c b/examples/rxtx_callbacks/main.c
index edf0ab9b08..59eee49208 100644
--- a/examples/rxtx_callbacks/main.c
+++ b/examples/rxtx_callbacks/main.c
@@ -249,7 +249,7 @@ lcore_main(void)
 	uint16_t port;
 
 	RTE_ETH_FOREACH_DEV(port)
-		if (rte_eth_dev_socket_id(port) > 0 &&
+		if (rte_eth_dev_socket_id(port) >= 0 &&
 				rte_eth_dev_socket_id(port) !=
 						(int)rte_socket_id())
 			printf("WARNING, port %u is on remote NUMA node to "
diff --git a/lib/ethdev/rte_ethdev.h b/lib/ethdev/rte_ethdev.h
index a21f58b9cd..dd8d25d6d4 100644
--- a/lib/ethdev/rte_ethdev.h
+++ b/lib/ethdev/rte_ethdev.h
@@ -2445,8 +2445,8 @@ int rte_eth_hairpin_unbind(uint16_t tx_port, uint16_t rx_port);
  *   The port identifier of the Ethernet device
  * @return
  *   The NUMA socket ID to which the Ethernet device is connected or
- *   a default of zero if the socket could not be determined.
- *   -1 is returned is the port_id value is out of range.
+ *   a default of -1 (SOCKET_ID_ANY) if the socket could not be determined.
+ *   -1 is also returned if the port_id is invalid.
  */
 int rte_eth_dev_socket_id(uint16_t port_id);
 
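
Note for application authors (illustration only, not part of the patch):
code that relied on rte_eth_dev_socket_id() returning 0 for an unknown
socket may need the same kind of adjustment as the examples/ hunks in this
patch. The sketch below isolates that logic. It is a minimal sketch under
stated assumptions: SOCKET_ID_ANY is redefined locally so that the snippet
builds without the DPDK headers, and resolve_socket_id() and
is_remote_port() are hypothetical helper names that do not exist in DPDK.

```c
#include <stdbool.h>

/* Local stand-in for DPDK's SOCKET_ID_ANY (-1); an assumption made so that
 * this sketch compiles without the DPDK headers. */
#ifndef SOCKET_ID_ANY
#define SOCKET_ID_ANY -1
#endif

/*
 * Hypothetical helper: pick the socket to allocate memory from.
 * dev_socket is the value returned by rte_eth_dev_socket_id();
 * fallback_socket is a socket the application knows to be valid,
 * for instance the socket of one of its lcores.
 */
static inline int
resolve_socket_id(int dev_socket, int fallback_socket)
{
	if (dev_socket == SOCKET_ID_ANY)
		return fallback_socket;
	return dev_socket;
}

/*
 * Hypothetical helper: a port counts as remote only when its socket is
 * known (>= 0) and differs from the polling thread's socket. This mirrors
 * the ">= 0" test used in the distributor and rxtx_callbacks examples.
 */
static inline bool
is_remote_port(int dev_socket, int thread_socket)
{
	return dev_socket >= 0 && dev_socket != thread_socket;
}
```

With the DPDK headers available, a call site could look like
resolve_socket_id(rte_eth_dev_socket_id(port), (int)rte_socket_id()).
Since -1 is also returned for an invalid port_id, a caller that needs to
distinguish the two cases would have to validate the port separately, for
example with rte_eth_dev_is_valid_port().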