From patchwork Thu Feb 8 18:16:40 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: =?utf-8?q?Mattias_R=C3=B6nnblom?= X-Patchwork-Id: 136557 X-Patchwork-Delegate: thomas@monjalon.net Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id 622E143AE1; Thu, 8 Feb 2024 19:24:55 +0100 (CET) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id E43A042E3B; Thu, 8 Feb 2024 19:24:50 +0100 (CET) Received: from EUR03-DBA-obe.outbound.protection.outlook.com (mail-dbaeur03on2074.outbound.protection.outlook.com [40.107.104.74]) by mails.dpdk.org (Postfix) with ESMTP id 50E6540273 for ; Thu, 8 Feb 2024 19:24:49 +0100 (CET) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=i4L5IokDkvTOw2TTFRVAbKOrxvykN7CHAQzvXeuEMYzg+fnoGpu1W8K39jfkv34dkAZoNRNFAT5LjTmH6QBi8slt52yMbPksZqM1enAA/XcfOnPl18UEqq7biXNYnnFAS4FqDikXjf7YM5p5WCwGZ0zO7jm7FYTjxcjsySjG85nMYqYryVAl6YSBTQBYvoxEHJ/Clf3jTqkpQq19rZPomgJTo6NRMyv5e7N47vuwm9XQJE/eyapXH2R0Yla4eOfGutqtTGIK0IMW7lFOclEb1HTCcGWC0OfduRr4QEucsUq8EsnTKRBUjJrCyS2jE8HPDKJKZMHpX75cu0IoRFdhdQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=ZyAefawRm4Tdz70HkmXYjunTT+hUJRosOdPnN7Uiy8c=; b=j772hBVCtUJN3vYqVpokYvYWNQzFt19CQkQJLqbM2X8/+8m/CSmhKnXydvCDUAotkcS9ckbkpt4OrrKv083iiFhz+2q79v/fEe5p08iQsoeW0oBay+5U3FNoHzzy1QNOLKy8xfc6kpU5FIFCk0jKKhpyjLAasnQGsDXvZ/PL6IR7Mf5fSLM1wrv6vEtuoYu3xBIZ8g1rFfwP0Du/0r3kq7flafOLu9RmtJ5yw5YGr4XWny1IFsybw4YJHtPXh0LFrX+vcpn2fUvBSrP8/WcAAiCYTdOwGdhplu5POw0xjbo8RzAthJAYLgJduK25F1U0/mF3wKr6t4m32V3bcF/kEQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 192.176.1.74) smtp.rcpttodomain=dpdk.org smtp.mailfrom=ericsson.com; dmarc=pass (p=reject sp=reject pct=100) action=none header.from=ericsson.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ZyAefawRm4Tdz70HkmXYjunTT+hUJRosOdPnN7Uiy8c=; b=j4d64/loWzYS/4hnQv8+iMeS0wkDwZ7LF/Ow5YRiAH/PxPgPosGErQgFWgeoV2A6AXTeHeEUgc/eG6rWW1nzlg9VCzJ614aB61oDOhIyxatOCy2yRAvSllGwI2Kh2zUc+ZBFuofI+8TQtKdLMF+KQyPOcJK74mRKRoNjtbWVF8BGNLKc6uEp4ej8eYABSV0VdmlIPruisfVtLxx8hmyLbbWUhkL9NNEcPPyXwmYINEpmWVmbsAS9AEGmVSOQejsRx+wTQ4xvNlWl02U44uwhlBGh7ItAmKb5Iu1eB0Odalnl5BJN/Ug40mtfEfZvxdlVSJZqAchhY/7eat9R8ZpUcA== Received: from DUZPR01CA0027.eurprd01.prod.exchangelabs.com (2603:10a6:10:46b::14) by AS4PR07MB8706.eurprd07.prod.outlook.com (2603:10a6:20b:4f0::11) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.39; Thu, 8 Feb 2024 18:24:47 +0000 Received: from DB1PEPF000509E6.eurprd03.prod.outlook.com (2603:10a6:10:46b:cafe::f3) by DUZPR01CA0027.outlook.office365.com (2603:10a6:10:46b::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.39 via Frontend Transport; Thu, 8 Feb 2024 18:24:47 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 192.176.1.74) smtp.mailfrom=ericsson.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=ericsson.com; Received-SPF: Pass (protection.outlook.com: domain of ericsson.com designates 192.176.1.74 as permitted sender) receiver=protection.outlook.com; client-ip=192.176.1.74; helo=oa.msg.ericsson.com; pr=C Received: from oa.msg.ericsson.com (192.176.1.74) by DB1PEPF000509E6.mail.protection.outlook.com (10.167.242.56) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.19 via Frontend Transport; Thu, 8 Feb 2024 18:24:46 +0000 Received: from seliicinfr00049.seli.gic.ericsson.se (153.88.142.248) by smtp-central.internal.ericsson.com (100.87.178.68) with Microsoft SMTP Server id 15.2.1258.12; Thu, 8 Feb 2024 19:24:46 +0100 Received: from breslau.. (seliicwb00002.seli.gic.ericsson.se [10.156.25.100]) by seliicinfr00049.seli.gic.ericsson.se (Postfix) with ESMTP id 1AE11380061; Thu, 8 Feb 2024 19:24:46 +0100 (CET) From: =?utf-8?q?Mattias_R=C3=B6nnblom?= To: CC: , =?utf-8?q?Morten_Br=C3=B8rup?= , Stephen Hemminger , =?utf-8?q?Mattias_R=C3=B6nn?= =?utf-8?q?blom?= Subject: [RFC 1/5] eal: add static per-lcore memory allocation facility Date: Thu, 8 Feb 2024 19:16:40 +0100 Message-ID: <20240208181644.455233-2-mattias.ronnblom@ericsson.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> References: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> MIME-Version: 1.0 X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: DB1PEPF000509E6:EE_|AS4PR07MB8706:EE_ X-MS-Office365-Filtering-Correlation-Id: 9bf02fdc-5ceb-4fa6-8ae1-08dc28d33abf X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: 5WrCMg6t1moi2pOiT86PTaN/h5X8P+yOIZBz6V6whux8n131fXPd5IR7OxHrNAFqQL+nee3YF9a26t/04pBqrbtUUGeLLn3isxi1g5YCMbxkmPUKbEDorGn5yiEiSY/kaWs1XR/VitqXytdYCFYx5VawlsWkF+Oi5OjlYRdo8LgUzQAHU5n9n0giWtiTonik5RfKQ7SwhO5CK9pHuHu67k07jGvtDFOhFv8Zcp+0k7J5blYVwFU9RKayhGhuYjfCC3ROeWSPDEFfgncXR7LWfYlGKKdd0Sv3d7Pi+fYxkjd7sd/nL5OJ2foO1q+tnfCYDRBONDq8xCTNNkc4EX9xldT5ge9kO/bzFqgBAxIWUREyxBUZak6yXFTqRqw+cVba1w3DRZq8Ypz1IXiM5qTtdY1ss23zbVzjcgsgJBfFvPb8ZMaXpReBzYJq0IqA2fwK2kQSCea6wZaImIjJfRZw7VgSgoUk0MKbCkW4wWrbfIQ10fCZk+FCWyiwD2re7wxdBv2wdMQeiLhgREKIvpdRilaf/sCV/Gl3LqDsvtswUNlPiHgpylW0IKS0lbsP4Cjl6dpHWYpxKccBOOWyux+dvZ5rfzHOjuue8DSH2L9FqXk= X-Forefront-Antispam-Report: CIP:192.176.1.74; CTRY:SE; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:oa.msg.ericsson.com; PTR:office365.se.ericsson.net; CAT:NONE; SFS:(13230031)(4636009)(346002)(136003)(396003)(376002)(39860400002)(230273577357003)(230922051799003)(186009)(82310400011)(64100799003)(451199024)(1800799012)(46966006)(36840700001)(40470700004)(8936002)(4326008)(30864003)(5660300002)(41300700001)(8676002)(2906002)(83380400001)(86362001)(7636003)(82960400001)(356005)(316002)(36756003)(107886003)(70586007)(26005)(70206006)(6266002)(6666004)(6916009)(66574015)(336012)(54906003)(82740400003)(1076003)(2616005)(478600001); DIR:OUT; SFP:1101; X-OriginatorOrg: ericsson.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 08 Feb 2024 18:24:46.7828 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 9bf02fdc-5ceb-4fa6-8ae1-08dc28d33abf X-MS-Exchange-CrossTenant-Id: 92e84ceb-fbfd-47ab-be52-080c6b87953f X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=92e84ceb-fbfd-47ab-be52-080c6b87953f; Ip=[192.176.1.74]; Helo=[oa.msg.ericsson.com] X-MS-Exchange-CrossTenant-AuthSource: DB1PEPF000509E6.eurprd03.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: AS4PR07MB8706 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Introduce DPDK per-lcore id variables, or lcore variables for short. An lcore variable has one value for every current and future lcore id-equipped thread. The primary use case is for statically allocating small chunks of often-used data, which is related logically, but where there are performance benefits to reap from having updates being local to an lcore. Lcore variables are similar to thread-local storage (TLS, e.g., C11 _Thread_local), but decoupling the values' life time with that of the threads. Lcore variables are also similar in terms of functionality provided by FreeBSD kernel's DPCPU_*() family of macros and the associated build-time machinery. DPCPU uses linker scripts, which effectively prevents the reuse of its, otherwise seemingly viable, approach. The currently-prevailing way to solve the same problem as lcore variables is to keep a module's per-lcore data as RTE_MAX_LCORE-sized array of cache-aligned, RTE_CACHE_GUARDed structs. The benefit of lcore variables over this approach is that data related to the same lcore now is close (spatially, in memory), rather than data used by the same module, which in turn avoid excessive use of padding, polluting caches with unused data. Signed-off-by: Mattias Rönnblom --- config/rte_config.h | 1 + doc/api/doxy-api-index.md | 1 + lib/eal/common/eal_common_lcore_var.c | 80 ++++++ lib/eal/common/meson.build | 1 + lib/eal/include/meson.build | 1 + lib/eal/include/rte_lcore_var.h | 352 ++++++++++++++++++++++++++ lib/eal/version.map | 4 + 7 files changed, 440 insertions(+) create mode 100644 lib/eal/common/eal_common_lcore_var.c create mode 100644 lib/eal/include/rte_lcore_var.h diff --git a/config/rte_config.h b/config/rte_config.h index da265d7dd2..884482e473 100644 --- a/config/rte_config.h +++ b/config/rte_config.h @@ -30,6 +30,7 @@ /* EAL defines */ #define RTE_CACHE_GUARD_LINES 1 #define RTE_MAX_HEAPS 32 +#define RTE_MAX_LCORE_VAR 1048576 #define RTE_MAX_MEMSEG_LISTS 128 #define RTE_MAX_MEMSEG_PER_LIST 8192 #define RTE_MAX_MEM_MB_PER_LIST 32768 diff --git a/doc/api/doxy-api-index.md b/doc/api/doxy-api-index.md index a6a768bd7c..bb06bb7ca1 100644 --- a/doc/api/doxy-api-index.md +++ b/doc/api/doxy-api-index.md @@ -98,6 +98,7 @@ The public API headers are grouped by topics: [interrupts](@ref rte_interrupts.h), [launch](@ref rte_launch.h), [lcore](@ref rte_lcore.h), + [lcore-varible](@ref rte_lcore_var.h), [per-lcore](@ref rte_per_lcore.h), [service cores](@ref rte_service.h), [keepalive](@ref rte_keepalive.h), diff --git a/lib/eal/common/eal_common_lcore_var.c b/lib/eal/common/eal_common_lcore_var.c new file mode 100644 index 0000000000..5276fe7192 --- /dev/null +++ b/lib/eal/common/eal_common_lcore_var.c @@ -0,0 +1,80 @@ +/* SPDX-License-Identifier: BSD-3-Clause + * Copyright(c) 2024 Ericsson AB + */ + +/* XXX: should this file be called eal_common_ldata.c or rte_ldata.c? */ + +#include + +#include +#include +#include + +#include + +#include "eal_private.h" + +#define WARN_THRESHOLD 75 +#define MAX_AUTO_ALIGNMENT 16U + +/* + * Avoid using offset zero, since it would result in a NULL-value + * "handle" (offset) pointer, which in principle and per the API + * definition shouldn't be an issue, but may confuse some tools and + * users. + */ +#define INITIAL_OFFSET MAX_AUTO_ALIGNMENT + +char rte_lcore_var[RTE_MAX_LCORE][RTE_MAX_LCORE_VAR] __rte_cache_aligned; + +static uintptr_t allocated = INITIAL_OFFSET; + +static void +verify_allocation(uintptr_t new_allocated) +{ + static bool has_warned; + + RTE_VERIFY(new_allocated < RTE_MAX_LCORE_VAR); + + if (new_allocated > (WARN_THRESHOLD * RTE_MAX_LCORE_VAR) / 100 && + !has_warned) { + EAL_LOG(WARNING, "Per-lcore data usage has exceeded %d%% " + "of the maximum capacity (%d bytes)", WARN_THRESHOLD, + RTE_MAX_LCORE_VAR); + has_warned = true; + } +} + +static void * +lcore_var_alloc(size_t size, size_t alignment) +{ + uintptr_t new_allocated = RTE_ALIGN_CEIL(allocated, alignment); + + void *offset = (void *)new_allocated; + + new_allocated += size; + + verify_allocation(new_allocated); + + allocated = new_allocated; + + EAL_LOG(DEBUG, "Allocated %"PRIuPTR" bytes of per-lcore data with a " + "%"PRIuPTR"-byte alignment", size, alignment); + + return offset; +} + +void * +rte_lcore_var_alloc(size_t size) +{ + RTE_BUILD_BUG_ON(RTE_MAX_LCORE_VAR % RTE_CACHE_LINE_SIZE != 0); + + /* Allocations are naturally aligned (i.e., the same alignment + * as the object size, up to a maximum of 16 bytes, which + * should satisify alignment requirements of any kind of + * object. + */ + size_t alignment = RTE_MIN(size, MAX_AUTO_ALIGNMENT); + + return lcore_var_alloc(size, alignment); +} diff --git a/lib/eal/common/meson.build b/lib/eal/common/meson.build index 22a626ba6f..d41403680b 100644 --- a/lib/eal/common/meson.build +++ b/lib/eal/common/meson.build @@ -18,6 +18,7 @@ sources += files( 'eal_common_interrupts.c', 'eal_common_launch.c', 'eal_common_lcore.c', + 'eal_common_lcore_var.c', 'eal_common_mcfg.c', 'eal_common_memalloc.c', 'eal_common_memory.c', diff --git a/lib/eal/include/meson.build b/lib/eal/include/meson.build index e94b056d46..9449253e23 100644 --- a/lib/eal/include/meson.build +++ b/lib/eal/include/meson.build @@ -27,6 +27,7 @@ headers += files( 'rte_keepalive.h', 'rte_launch.h', 'rte_lcore.h', + 'rte_lcore_var.h', 'rte_lock_annotations.h', 'rte_malloc.h', 'rte_mcslock.h', diff --git a/lib/eal/include/rte_lcore_var.h b/lib/eal/include/rte_lcore_var.h new file mode 100644 index 0000000000..c1854dc6a4 --- /dev/null +++ b/lib/eal/include/rte_lcore_var.h @@ -0,0 +1,352 @@ +/* SPDX-License-Identifier: BSD-3-Clause + * Copyright(c) 2024 Ericsson AB + */ + +#ifndef _RTE_LCORE_VAR_H_ +#define _RTE_LCORE_VAR_H_ + +/** + * @file + * + * RTE Per-lcore id variables + * + * This API provides a mechanism to create and access per-lcore id + * variables in a space- and cycle-efficient manner. + * + * A per-lcore id variable (or lcore variable for short) has one value + * for each EAL thread and registered non-EAL thread. In other words, + * there's one copy of its value for each and every current and future + * lcore id-equipped thread, with the total number of copies amounting + * to \c RTE_MAX_LCORE. + * + * In order to access the values of an lcore variable, a handle is + * used. The type of the handle is a pointer to the value's type + * (e.g., for \c uint32_t lcore variable, the handle is a + * uint32_t *. A handle may be passed between modules and + * threads just like any pointer, but its value is not the address of + * any particular object, but rather just an opaque identifier, stored + * in a typed pointer (to inform the access macro the type of values). + * + * @b Creation + * + * An lcore variable is created in two steps: + * 1. Define a lcore variable handle by using \ref RTE_LCORE_VAR_HANDLE. + * 2. Allocate lcore variable storage and initialize the handle with + * a unique identifier by \ref RTE_LCORE_VAR_ALLOC or + * \ref RTE_LCORE_VAR_INIT. Allocation generally occurs the time of + * module initialization, but may be done at any time. + * + * An lcore variable is not tied to the owning thread's lifetime. It's + * available for use by any thread immediately after having been + * allocated, and continues to be available throughout the lifetime of + * the EAL. + * + * Lcore variables cannot and need not be freed. + * + * @b Access + * + * The value of any lcore variable for any lcore id may be accessed + * from any thread (including unregistered threads), but is should + * generally only *frequently* read from or written to by the owner. + * + * Values of the same lcore variable but owned by to different lcore + * ids *may* be frequently read or written by the owners without the + * risk of false sharing. + * + * An appropriate synchronization mechanism (e.g., atomics) should + * employed to assure there are no data races between the owning + * thread and any non-owner threads accessing the same lcore variable + * instance. + * + * The value of the lcore variable for a particular lcore id may be + * retrieved with \ref RTE_LCORE_VAR_LCORE_GET. To get a pointer to the + * same object, use \ref RTE_LCORE_VAR_LCORE_PTR. + * + * To modify the value of an lcore variable for a particular lcore id, + * either access the object through the pointer retrieved by \ref + * RTE_LCORE_VAR_LCORE_PTR or, for primitive types, use \ref + * RTE_LCORE_VAR_LCORE_SET. + * + * The access macros each has a short-hand which may be used by an EAL + * thread or registered non-EAL thread to access the lcore variable + * instance of its own lcore id. Those are \ref RTE_LCORE_VAR_GET, + * \ref RTE_LCORE_VAR_PTR, and \ref RTE_LCORE_VAR_SET. + * + * Although the handle (as defined by \ref RTE_LCORE_VAR_HANDLE) is a + * pointer with the same type as the value, it may not be directly + * dereferenced and must be treated as an opaque identifier. The + * *identifier* value is common across all lcore ids. + * + * @b Storage + * + * An lcore variable's values may by of a primitive type like \c int, + * but would more typically be a \c struct. An application may choose + * to define an lcore variable, which it then it goes on to never + * allocate. + * + * The lcore variable handle introduces a per-variable (not + * per-value/per-lcore id) overhead of \c sizeof(void *) bytes, so + * there are some memory footprint gains to be made by organizing all + * per-lcore id data for a particular module as one lcore variable + * (e.g., as a struct). + * + * The sum of all lcore variables, plus any padding required, must be + * less than the DPDK build-time constant \c RTE_MAX_LCORE_VAR. A + * violation of this maximum results in the process being terminated. + * + * It's reasonable to expected that \c RTE_MAX_LCORE_VAR is on the + * same order of magnitude in size as a thread stack. + * + * The lcore variable storage buffers are kept in the BSS section in + * the resulting binary, where data generally isn't mapped in until + * it's accessed. This means that unused portions of the lcore + * variable storage area will not occupy any physical memory (with a + * granularity of the memory page size [usually 4 kB]). + * + * Lcore variables should generally *not* be \ref __rte_cache_aligned + * and need *not* include a \ref RTE_CACHE_GUARD field, since the use + * of these constructs are designed to avoid false sharing. In the + * case of an lcore variable instance, all nearby data structures + * should almost-always be written to by a single thread (the lcore + * variable owner). Adding padding will increase the effective memory + * working set size, and potentially reducing performance. + * + * @b Example + * + * Below is an example of the use of an lcore variable: + * + * \code{.c} + * struct foo_lcore_state { + * int a; + * long b; + * }; + * + * static RTE_LCORE_VAR_HANDLE(struct foo_lcore_state, lcore_states); + * + * long foo_get_a_plus_b(void) + * { + * struct foo_lcore_state *state = RTE_LCORE_VAR_PTR(lcore_states); + * + * return state->a + state->b; + * } + * + * RTE_INIT(rte_foo_init) + * { + * unsigned int lcore_id; + * + * RTE_LCORE_VAR_ALLOC(foo_state); + * + * struct foo_lcore_state *state; + * RTE_LCORE_VAR_FOREACH(lcore_states) { + * (initialize 'state') + * } + * + * (other initialization) + * } + * \endcode + * + * + * @b Alternatives + * + * Lcore variables are designed to replace a pattern exemplified below: + * \code{.c} + * struct foo_lcore_state { + * int a; + * long b; + * RTE_CACHE_GUARD; + * } __rte_cache_aligned; + * + * static struct foo_lcore_state lcore_states[RTE_MAX_LCORE]; + * \endcode + * + * This scheme is simple and effective, but has one drawback: the data + * is organized so that objects related to all lcores for a particular + * module is kept close in memory. At a bare minimum, this forces the + * use of cache-line alignment to avoid false sharing. With CPU + * hardware prefetching and memory loads resulting from speculative + * execution (functions which seemingly are getting more eager faster + * than they are getting more intelligent), one or more "guard" cache + * lines may be required to separate one lcore's data from another's. + * + * Lcore variables has the upside of working with, not against, the + * CPU's assumptions and for example next-line prefetchers may well + * work the way its designers intended (i.e., to the benefit, not + * detriment, of system performance). + * + * Another alternative to \ref rte_lcore_var.h is the \ref + * rte_per_lcore.h API, which make use of thread-local storage (TLS, + * e.g., GCC __thread or C11 _Thread_local). The main differences + * between by using the various forms of TLS (e.g., \ref + * RTE_DEFINE_PER_LCORE or _Thread_local) and the use of lcore + * variables are: + * + * * The existence and non-existence of a thread-local variable + * instance follow that of particular thread's. The data cannot be + * accessed before the thread has been created, nor after it has + * exited. One effect of this is thread-local variables must + * initialized in a "lazy" manner (e.g., at the point of thread + * creation). Lcore variables may be accessed immediately after + * having been allocated (which is usually prior any thread beyond + * the main thread is running). + * * A thread-local variable is duplicated across all threads in the + * process, including unregistered non-EAL threads (i.e., + * "regular" threads). For DPDK applications heavily relying on + * multi-threading (in conjunction to DPDK's "one thread per core" + * pattern), either by having many concurrent threads or + * creating/destroying threads at a high rate, an excessive use of + * thread-local variables may cause inefficiencies (e.g., + * increased thread creation overhead due to thread-local storage + * initialization or increased total RAM footprint usage). Lcore + * variables *only* exist for threads with an lcore id, and thus + * not for such "regular" threads. + * * If data in thread-local storage may be shared between threads + * (i.e., can a pointer to a thread-local variable be passed to + * and successfully dereferenced by non-owning thread) depends on + * the details of the TLS implementation. With GCC __thread and + * GCC _Thread_local, such data sharing is supported. In the C11 + * standard, the result of accessing another thread's + * _Thread_local object is implementation-defined. Lcore variable + * instances may be accessed reliably by any thread. + */ + +#ifdef __cplusplus +extern "C" { +#endif + +#include + +#include +#include +#include + +/** + * Given the lcore variable type, produces the type of the lcore + * variable handle. + */ +#define RTE_LCORE_VAR_HANDLE_TYPE(type) \ + type * + +/** + * Define a lcore variable handle. + * + * This macro defines a variable which is used as a handle to access + * the various per-lcore id instances of a per-lcore id variable. + * + * The aim with this macro is to make clear at the point of + * declaration that this is an lcore handler, rather than a regular + * pointer. + * + * Add @b static as a prefix in case the lcore variable are only to be + * accessed from a particular translation unit. + */ +#define RTE_LCORE_VAR_HANDLE(type, name) \ + RTE_LCORE_VAR_HANDLE_TYPE(type) name + +/** + * Allocate space for an lcore variable, and initialize its handle. + */ +#define RTE_LCORE_VAR_ALLOC_SZ(name, size) \ + name = rte_lcore_var_alloc(size) + +/** + * Allocate space for an lcore variable of the size suggested by the + * handler pointer type and initialize its handle. + */ +#define RTE_LCORE_VAR_ALLOC(name) \ + RTE_LCORE_VAR_ALLOC_SZ(name, sizeof(*(name))) + +/** + * Allocate an explicitly-sized lcore variable by means of a \ref + * RTE_INIT constructor. + */ +#define RTE_LCORE_VAR_INIT_SZ(name) \ + RTE_INIT(rte_lcore_var_init_ ## name) \ + { \ + RTE_LCORE_VAR_ALLOC_SZ(name); \ + } + +/** + * Allocate an lcore variable by means of a \ref RTE_INIT constructor. + */ +#define RTE_LCORE_VAR_INIT(name) \ + RTE_INIT(rte_lcore_var_init_ ## name) \ + { \ + RTE_LCORE_VAR_ALLOC(name); \ + } + +#define __RTE_LCORE_VAR_LCORE_PTR(lcore_id, name) \ + ((void *)(&rte_lcore_var[lcore_id][(uintptr_t)(name)])) + +/** + * Get pointer to lcore variable instance with the specified lcore id. + */ +#define RTE_LCORE_VAR_LCORE_PTR(lcore_id, name) \ + ((typeof(name))__RTE_LCORE_VAR_LCORE_PTR(lcore_id, name)) + +/** + * Get value of a lcore variable instance of the specified lcore id. + */ +#define RTE_LCORE_VAR_LCORE_GET(lcore_id, name) \ + (*(RTE_LCORE_VAR_LCORE_PTR(lcore_id, name))) + +/** + * Set the value of a lcore variable instance of the specified lcore id. + */ +#define RTE_LCORE_VAR_LCORE_SET(lcore_id, name, value) \ + (*(RTE_LCORE_VAR_LCORE_PTR(lcore_id, name)) = (value)) + +/** + * Get pointer to lcore variable instance of the current thread. + * + * May only be used by EAL threads and registered non-EAL threads. + */ +#define RTE_LCORE_VAR_PTR(name) RTE_LCORE_VAR_LCORE_PTR(rte_lcore_id(), name) + +/** + * Get value of lcore variable instance of the current thread. + * + * May only be used by EAL threads and registered non-EAL threads. + */ +#define RTE_LCORE_VAR_GET(name) RTE_LCORE_VAR_LCORE_GET(rte_lcore_id(), name) + +/** + * Set value of lcore variable instance of the current thread. + * + * May only be used by EAL threads and registered non-EAL threads. + */ +#define RTE_LCORE_VAR_SET(name, value) \ + RTE_LCORE_VAR_LCORE_SET(rte_lcore_id(), name, value) + +/** + * Iterate over each lcore id's value for a lcore variable. + */ +#define RTE_LCORE_VAR_FOREACH(var, name) \ + for (unsigned int lcore_id = \ + (((var) = RTE_LCORE_VAR_LCORE_PTR(0, name)), 0); \ + lcore_id < RTE_MAX_LCORE; \ + lcore_id++, (var) = RTE_LCORE_VAR_LCORE_PTR(lcore_id, name)) + +extern char rte_lcore_var[RTE_MAX_LCORE][RTE_MAX_LCORE_VAR]; + +/** + * Allocate space in the per-lcore id buffer for a lcore variable. + * + * The pointer returned is only an opaque identifer of the variable. To + * get an actual pointer to a particular instance of the variable use + * \ref RTE_LCORE_VAR_PTR or \ref RTE_LCORE_VAR_LCORE_PTR. + * + * The allocation is always successful, barring an fatal exhaustion of + * the per-lcore id buffer space. + * + * @return + * The id of the variable, stored in a void pointer value. + */ +__rte_experimental +void * +rte_lcore_var_alloc(size_t size); + +#ifdef __cplusplus +} +#endif + +#endif /* _RTE_LCORE_VAR_H_ */ diff --git a/lib/eal/version.map b/lib/eal/version.map index 5e0cd47c82..e90b86115a 100644 --- a/lib/eal/version.map +++ b/lib/eal/version.map @@ -393,6 +393,10 @@ EXPERIMENTAL { # added in 23.07 rte_memzone_max_get; rte_memzone_max_set; + + # added in 24.03 + rte_lcore_var_alloc; + rte_lcore_var; }; INTERNAL { From patchwork Thu Feb 8 18:16:41 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: =?utf-8?q?Mattias_R=C3=B6nnblom?= X-Patchwork-Id: 136558 X-Patchwork-Delegate: thomas@monjalon.net Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id C25E343AE1; Thu, 8 Feb 2024 19:25:03 +0100 (CET) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id 8A39242E5E; Thu, 8 Feb 2024 19:24:52 +0100 (CET) Received: from EUR05-VI1-obe.outbound.protection.outlook.com (mail-vi1eur05on2048.outbound.protection.outlook.com [40.107.21.48]) by mails.dpdk.org (Postfix) with ESMTP id 81CCB402D1 for ; Thu, 8 Feb 2024 19:24:49 +0100 (CET) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=UsgGUnpdPei8Pbxzx9KX9xZff6lcaAkhn8qAGar9egPjp5lL9fL0mXHr8J9FDppPImr6ns+vePMAiJrJn0ulOi48yL/K1Ff5y2/LXVWg/XpeWJhTrM2ZCDnP/xUhDvYlByfhbPlILSNKYCY+xZDtJ4ASy+ISfTBA46dvTb/oQ/74vAMUvM/uzSpk7nacvXufDghMVox7ax28U6ZPdZ/Xi3QhwVggeoNkvYbhMIJWcwRJ6mblqwGxq1ng/+hbdb/aHSJwCPn8YRm7xhRMIyVqXhP3Joi60acJjmwg7G/y6peNhWUsshp8YTbn3wHNOWlY+AGC4ZPecX8VxBHr/YDVQQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=ZOMLdiBdVDZxM95hhq8RR8gJEdhE5xReFfNoLsshPc4=; b=GBEsj/pmff/+RSDKTDAnBnkrDTgKUxwU6QofUdHEyEsH1Ed5LyKb21L4jun73wDttZNzMtXIhzrF/e06FmaG5m5WAzaYk+oal6gbxSaZ8dZbXGPHu+so9lPdPQyj1h7AJzr8G1456X1eGlyLSU8gycFNxnk9JhJVgBQlHuKgxe63TMF9SAGTTje5XqIsgRF4alf8I1Sa+Qs1tCDswfojXEp1gp2TzDmwj4x8pGoXPZmUBz2EDy1S6ZwgFp+Uc4/9j2FcevqoFs2nXQMnEVAa1uh+CYIqPZa7nYGFlHFIdWZLAAGHKEgsQUaedqUvub+CC5GOFMBPiAk6fsLxj70evw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 192.176.1.74) smtp.rcpttodomain=dpdk.org smtp.mailfrom=ericsson.com; dmarc=pass (p=reject sp=reject pct=100) action=none header.from=ericsson.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ZOMLdiBdVDZxM95hhq8RR8gJEdhE5xReFfNoLsshPc4=; b=BnCGTS1j7ReTF+GtOYGJjIoy2SrhNRrm72Bt2rBlV5t+p8eNmlwZtbAcvsZoZa60i23m3DOZUqbq2VMQmlcCqdLdkrs52brcJp/Qvt4BD4ohnT45V/z6RIpL7CK3n3SgEhMgAlYD5/wbkgAF0QLtM3eF92Q8iLBSxfuGVHdSr+r/nnTX+HuHnawciO1bnBOiLgOU5H+Pk/CAozeug1Soxv5vQNejZzjK2atAkvltWQiZ2QqgDAnJRq4KbyiFW+GVkrk38kQAMLhoXMjuSKgGKe4JM5mR1fRyYuJwn0t3qsbRbSvbg39+mvKKoCtWFPMrtAI3s8XOzicnbVRXZ+kNXA== Received: from AM0PR03CA0016.eurprd03.prod.outlook.com (2603:10a6:208:14::29) by PAXPR07MB7968.eurprd07.prod.outlook.com (2603:10a6:102:13e::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.41; Thu, 8 Feb 2024 18:24:47 +0000 Received: from AM3PEPF0000A793.eurprd04.prod.outlook.com (2603:10a6:208:14:cafe::19) by AM0PR03CA0016.outlook.office365.com (2603:10a6:208:14::29) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.37 via Frontend Transport; Thu, 8 Feb 2024 18:24:47 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 192.176.1.74) smtp.mailfrom=ericsson.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=ericsson.com; Received-SPF: Pass (protection.outlook.com: domain of ericsson.com designates 192.176.1.74 as permitted sender) receiver=protection.outlook.com; client-ip=192.176.1.74; helo=oa.msg.ericsson.com; pr=C Received: from oa.msg.ericsson.com (192.176.1.74) by AM3PEPF0000A793.mail.protection.outlook.com (10.167.16.122) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.19 via Frontend Transport; Thu, 8 Feb 2024 18:24:47 +0000 Received: from seliicinfr00049.seli.gic.ericsson.se (153.88.142.248) by smtp-central.internal.ericsson.com (100.87.178.63) with Microsoft SMTP Server id 15.2.1258.12; Thu, 8 Feb 2024 19:24:46 +0100 Received: from breslau.. (seliicwb00002.seli.gic.ericsson.se [10.156.25.100]) by seliicinfr00049.seli.gic.ericsson.se (Postfix) with ESMTP id C84DE380061; Thu, 8 Feb 2024 19:24:46 +0100 (CET) From: =?utf-8?q?Mattias_R=C3=B6nnblom?= To: CC: , =?utf-8?q?Morten_Br=C3=B8rup?= , Stephen Hemminger , =?utf-8?q?Mattias_R=C3=B6nn?= =?utf-8?q?blom?= Subject: [RFC 2/5] eal: add lcore variable test suite Date: Thu, 8 Feb 2024 19:16:41 +0100 Message-ID: <20240208181644.455233-3-mattias.ronnblom@ericsson.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> References: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> MIME-Version: 1.0 X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: AM3PEPF0000A793:EE_|PAXPR07MB7968:EE_ X-MS-Office365-Filtering-Correlation-Id: 9b494fcf-19a3-41ac-fea3-08dc28d33b10 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: Ic/ViIC6RkmlQ3SpbLq8lhqUC5goZiktrZVfzvX4snDnQ2W/ElJzep5qkFD/0VnSTxnDrvkORYbnoXBl5qr3710VnVfN+qFc5JhwWJJkyD+7hsgaJ9P24FyQVOwiQVefnu+rlq9H+GYatVMuHg8i/lQN4uMK+BXrARfJDW7fAyarW6PPFHD5MUP3/6Bh66HbkWtVet4+SavEf6br6cOdcgs0JH+evZMs4f9M6nw8sCkK7CjQnepom1jvT3MX0ZfTc1J65KlKsTuK8lhWHR4SsZtHFJqZ6IDAS/dJHTVyFWjMg6jqaJiyuCiOKW8BkpRj7+FQFc2XoxrcI4ys6nmkGOypyVuBWFCjXkvotR/aFZRqaQxXmNr4KcmHFBs+Rov5UB+gw6s4SzlseCBpy03YfN2nwF2dXYcvFRyWfeXVwoDWYomxBUbLTJgUe+LH7i7Khpryl7NOFZ5BfRZC5IX4ZU4mubCFfYerFf2uGqUEx8qcdTUlt1iDpmtgYrQIiWFvgJXo57GvAMeWY/ijVPY2C5dR+ZWyWoHUFSQEPFFMR8y3lsxVBGjKvqj0KCI5IjAZP7tOvxg4OaUjNngSwcaROZ9skPdloHibd6aMak4s5k8= X-Forefront-Antispam-Report: CIP:192.176.1.74; CTRY:SE; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:oa.msg.ericsson.com; PTR:office365.se.ericsson.net; CAT:NONE; SFS:(13230031)(4636009)(346002)(136003)(396003)(39860400002)(376002)(230922051799003)(82310400011)(186009)(64100799003)(1800799012)(451199024)(36840700001)(46966006)(40470700004)(2906002)(8936002)(30864003)(41300700001)(5660300002)(4326008)(8676002)(82960400001)(83380400001)(356005)(7636003)(86362001)(36756003)(26005)(1076003)(66574015)(70586007)(107886003)(336012)(6666004)(478600001)(6916009)(6266002)(54906003)(70206006)(2616005)(316002)(82740400003); DIR:OUT; SFP:1101; X-OriginatorOrg: ericsson.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 08 Feb 2024 18:24:47.3479 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 9b494fcf-19a3-41ac-fea3-08dc28d33b10 X-MS-Exchange-CrossTenant-Id: 92e84ceb-fbfd-47ab-be52-080c6b87953f X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=92e84ceb-fbfd-47ab-be52-080c6b87953f; Ip=[192.176.1.74]; Helo=[oa.msg.ericsson.com] X-MS-Exchange-CrossTenant-AuthSource: AM3PEPF0000A793.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: PAXPR07MB7968 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Signed-off-by: Mattias Rönnblom --- app/test/meson.build | 1 + app/test/test_lcore_var.c | 384 ++++++++++++++++++++++++++++++++++++++ 2 files changed, 385 insertions(+) create mode 100644 app/test/test_lcore_var.c diff --git a/app/test/meson.build b/app/test/meson.build index 6389ae83ee..93412cce51 100644 --- a/app/test/meson.build +++ b/app/test/meson.build @@ -101,6 +101,7 @@ source_file_deps = { 'test_ipsec_sad.c': ['ipsec'], 'test_kvargs.c': ['kvargs'], 'test_latencystats.c': ['ethdev', 'latencystats', 'metrics'] + sample_packet_forward_deps, + 'test_lcore_var.c': [], 'test_lcores.c': [], 'test_link_bonding.c': ['ethdev', 'net_bond', 'net'] + packet_burst_generator_deps + virtual_pmd_deps, diff --git a/app/test/test_lcore_var.c b/app/test/test_lcore_var.c new file mode 100644 index 0000000000..0229f90bf2 --- /dev/null +++ b/app/test/test_lcore_var.c @@ -0,0 +1,384 @@ +/* SPDX-License-Identifier: BSD-3-Clause + * Copyright(c) 2024 Ericsson AB + */ + +#include +#include +#include + +#include +#include +#include + +#include "test.h" + +#define MIN_LCORES 2 + +RTE_LCORE_VAR_HANDLE(int, test_int); + +struct int_checker_state { + int old_value; + int new_value; + bool success; +}; + +static bool +rand_bool(void) +{ + return rte_rand() & 1; +} + +static void +rand_blk(void *blk, size_t size) +{ + size_t i; + + for (i = 0; i < size; i++) + ((unsigned char *)blk)[i] = (unsigned char)rte_rand(); +} + +static int +check_int(void *arg) +{ + struct int_checker_state *state = arg; + + int *ptr = RTE_LCORE_VAR_PTR(test_int); + + bool naturally_aligned = RTE_PTR_ALIGN_CEIL(ptr, sizeof(int)) == ptr; + + bool equal; + + if (rand_bool()) + equal = RTE_LCORE_VAR_GET(test_int) == state->old_value; + else + equal = *(RTE_LCORE_VAR_PTR(test_int)) == state->old_value; + + state->success = equal && naturally_aligned; + + if (rand_bool()) + RTE_LCORE_VAR_SET(test_int, state->new_value); + else + *ptr = state->new_value; + + return 0; +} + +RTE_LCORE_VAR_INIT(test_int); + +static int +test_int_lvar(void) +{ + unsigned int lcore_id; + + struct int_checker_state states[RTE_MAX_LCORE] = {}; + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + struct int_checker_state *state = &states[lcore_id]; + + state->old_value = (int)rte_rand(); + state->new_value = (int)rte_rand(); + + RTE_LCORE_VAR_LCORE_SET(lcore_id, test_int, state->old_value); + } + + RTE_LCORE_FOREACH_WORKER(lcore_id) + rte_eal_remote_launch(check_int, &states[lcore_id], lcore_id); + + rte_eal_mp_wait_lcore(); + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + struct int_checker_state *state = &states[lcore_id]; + + TEST_ASSERT(state->success, "Unexpected value " + "encountered on lcore %d", lcore_id); + + TEST_ASSERT_EQUAL(state->new_value, + RTE_LCORE_VAR_LCORE_GET(lcore_id, test_int), + "Lcore %d failed to update int", lcore_id); + } + + /* take the opportunity to test the foreach macro */ + int *v; + lcore_id = 0; + RTE_LCORE_VAR_FOREACH(v, test_int) { + printf("expected %d actual %d\n", + states[lcore_id].new_value, *v); + TEST_ASSERT_EQUAL(states[lcore_id].new_value, *v, + "Unexpected value on lcore %d during " + "iteration", lcore_id); + lcore_id++; + } + + return TEST_SUCCESS; +} + +/* private, larger, struct */ +#define TEST_STRUCT_DATA_SIZE 1234 + +struct test_struct { + uint8_t data[TEST_STRUCT_DATA_SIZE]; +}; + +static RTE_LCORE_VAR_HANDLE(char, before_struct); +static RTE_LCORE_VAR_HANDLE(struct test_struct, test_struct); +static RTE_LCORE_VAR_HANDLE(char, after_struct); + +struct struct_checker_state { + struct test_struct old_value; + struct test_struct new_value; + bool success; +}; + +static int check_struct(void *arg) +{ + struct struct_checker_state *state = arg; + + struct test_struct *lcore_struct = RTE_LCORE_VAR_PTR(test_struct); + + /* + * Lcore variable alignment is based on object size, not any + * particular requirements on the struct's field. + */ + bool properly_aligned = + RTE_PTR_ALIGN_CEIL(lcore_struct, 16) == lcore_struct; + + bool equal = memcmp(lcore_struct->data, state->old_value.data, + TEST_STRUCT_DATA_SIZE) == 0; + + state->success = equal && properly_aligned; + + memcpy(lcore_struct->data, state->new_value.data, + TEST_STRUCT_DATA_SIZE); + + return 0; +} + +static int +test_struct_lvar(void) +{ + unsigned int lcore_id; + + RTE_LCORE_VAR_ALLOC(before_struct); + RTE_LCORE_VAR_ALLOC(test_struct); + RTE_LCORE_VAR_ALLOC(after_struct); + + struct struct_checker_state states[RTE_MAX_LCORE]; + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + struct struct_checker_state *state = &states[lcore_id]; + + rand_blk(state->old_value.data, TEST_STRUCT_DATA_SIZE); + rand_blk(state->new_value.data, TEST_STRUCT_DATA_SIZE); + + memcpy(RTE_LCORE_VAR_LCORE_PTR(lcore_id, test_struct)->data, + state->old_value.data, TEST_STRUCT_DATA_SIZE); + } + + RTE_LCORE_FOREACH_WORKER(lcore_id) + rte_eal_remote_launch(check_struct, &states[lcore_id], + lcore_id); + + rte_eal_mp_wait_lcore(); + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + struct struct_checker_state *state = &states[lcore_id]; + struct test_struct *lstruct = + RTE_LCORE_VAR_LCORE_PTR(lcore_id, test_struct); + + TEST_ASSERT(state->success, "Unexpected value encountered on " + "lcore %d", lcore_id); + + bool equal = memcmp(lstruct->data, state->new_value.data, + TEST_STRUCT_DATA_SIZE) == 0; + + TEST_ASSERT(equal, "Lcore %d failed to update struct", + lcore_id); + } + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + char before = RTE_LCORE_VAR_LCORE_GET(lcore_id, before_struct); + char after = RTE_LCORE_VAR_LCORE_GET(lcore_id, after_struct); + + TEST_ASSERT_EQUAL(before, 0, "Lcore variable before test " + "struct was modified on lcore %d", lcore_id); + TEST_ASSERT_EQUAL(after, 0, "Lcore variable after test " + "struct was modified on lcore %d", lcore_id); + } + + return TEST_SUCCESS; +} + +#define TEST_ARRAY_SIZE 99 + +typedef uint16_t test_array_t[TEST_ARRAY_SIZE]; + +static void test_array_init_rand(test_array_t a) +{ + size_t i; + for (i = 0; i < TEST_ARRAY_SIZE; i++) + a[i] = (uint16_t)rte_rand(); +} + +static bool test_array_equal(test_array_t a, test_array_t b) +{ + size_t i; + for (i = 0; i < TEST_ARRAY_SIZE; i++) { + if (a[i] != b[i]) + return false; + } + return true; +} + +static void test_array_copy(test_array_t dst, const test_array_t src) +{ + size_t i; + for (i = 0; i < TEST_ARRAY_SIZE; i++) + dst[i] = src[i]; +} + +static RTE_LCORE_VAR_HANDLE(char, before_array); +static RTE_LCORE_VAR_HANDLE(test_array_t, test_array); +static RTE_LCORE_VAR_HANDLE(char, after_array); + +struct array_checker_state +{ + test_array_t old_value; + test_array_t new_value; + bool success; +}; + +static int check_array(void *arg) +{ + struct array_checker_state *state = arg; + + test_array_t *lcore_array = RTE_LCORE_VAR_PTR(test_array); + + /* + * Lcore variable alignment is based on object size, not any + * particular requirements on the struct's field. + */ + bool properly_aligned = + RTE_PTR_ALIGN_CEIL(lcore_array, 16) == lcore_array; + + bool equal = test_array_equal(*lcore_array, state->old_value); + + state->success = equal && properly_aligned; + + test_array_copy(*lcore_array, state->new_value); + + return 0; +} + +static int +test_array_lvar(void) +{ + unsigned int lcore_id; + + RTE_LCORE_VAR_ALLOC(before_array); + RTE_LCORE_VAR_ALLOC(test_array); + RTE_LCORE_VAR_ALLOC(after_array); + + struct array_checker_state states[RTE_MAX_LCORE]; + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + struct array_checker_state *state = &states[lcore_id]; + + test_array_init_rand(state->new_value); + test_array_init_rand(state->old_value); + + test_array_copy(RTE_LCORE_VAR_LCORE_GET(lcore_id, test_array), + state->old_value); + } + + RTE_LCORE_FOREACH_WORKER(lcore_id) + rte_eal_remote_launch(check_array, &states[lcore_id], + lcore_id); + + rte_eal_mp_wait_lcore(); + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + struct array_checker_state *state = &states[lcore_id]; + test_array_t *larray = + RTE_LCORE_VAR_LCORE_PTR(lcore_id, test_array); + + TEST_ASSERT(state->success, "Unexpected value encountered on " + "lcore %d", lcore_id); + + bool equal = test_array_equal(*larray, state->new_value); + + TEST_ASSERT(equal, "Lcore %d failed to update array", + lcore_id); + } + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + char before = RTE_LCORE_VAR_LCORE_GET(lcore_id, before_array); + char after = RTE_LCORE_VAR_LCORE_GET(lcore_id, after_array); + + TEST_ASSERT_EQUAL(before, 0, "Lcore variable before test " + "array was modified on lcore %d", lcore_id); + TEST_ASSERT_EQUAL(after, 0, "Lcore variable after test " + "array was modified on lcore %d", lcore_id); + } + + return TEST_SUCCESS; +} + +#define MANY_LVARS (RTE_MAX_LCORE_VAR / 2) + +static int +test_many_lvars(void) +{ + void **handlers = malloc(sizeof(void *) * MANY_LVARS); + int i; + + TEST_ASSERT(handlers != NULL, "Unable to allocate memory"); + + for (i = 0; i < MANY_LVARS; i++) { + void *handle = rte_lcore_var_alloc(1); + + uint8_t *b = __RTE_LCORE_VAR_LCORE_PTR(rte_lcore_id(), handle); + + *b = (uint8_t)i; + + handlers[i] = handle; + } + + for (i = 0; i < MANY_LVARS; i++) { + unsigned int lcore_id; + + RTE_LCORE_FOREACH_WORKER(lcore_id) { + uint8_t *b = __RTE_LCORE_VAR_LCORE_PTR(rte_lcore_id(), + handlers[i]); + TEST_ASSERT_EQUAL((uint8_t)i, *b, + "Unexpected lcore variable value."); + } + } + + free(handlers); + + return TEST_SUCCESS; +} + +static struct unit_test_suite lcore_var_testsuite = { + .suite_name = "lcore variable autotest", + .unit_test_cases = { + TEST_CASE(test_int_lvar), + TEST_CASE(test_struct_lvar), + TEST_CASE(test_array_lvar), + TEST_CASE(test_many_lvars), + TEST_CASES_END() + }, +}; + +static int test_lcore_var(void) +{ + if (rte_lcore_count() < MIN_LCORES) { + printf("Not enough cores for lcore_var_autotest; expecting at " + "least %d.\n", MIN_LCORES); + return TEST_SKIPPED; + } + + return unit_test_suite_runner(&lcore_var_testsuite); +} + +REGISTER_FAST_TEST(lcore_var_autotest, true, false, test_lcore_var); From patchwork Thu Feb 8 18:16:42 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: =?utf-8?q?Mattias_R=C3=B6nnblom?= X-Patchwork-Id: 136559 X-Patchwork-Delegate: thomas@monjalon.net Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id 5B1F643AE1; Thu, 8 Feb 2024 19:25:10 +0100 (CET) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id BABE542E65; Thu, 8 Feb 2024 19:24:53 +0100 (CET) Received: from EUR04-VI1-obe.outbound.protection.outlook.com (mail-vi1eur04on2080.outbound.protection.outlook.com [40.107.8.80]) by mails.dpdk.org (Postfix) with ESMTP id 01FD3402D1 for ; Thu, 8 Feb 2024 19:24:49 +0100 (CET) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=oKacDfYbRMlxU1Vp5BjJuhQwWsGV/JCIgEOH7OtBu65T7jzKh/uPZnlQvD1lmOHfDEVFphXZEYoUhG53rqufc9fGl2tkPcJLpPz6HZ96/oHWd6VK2EFOl0PWwG89hzgbnNW4oq6KTgOVfTkkq8fVSc/mP800Ol5oFLezbfQq6tuOM+aq1xZZ03gKeplfcOVXigaDPBsRQI49WcCcQapo5iFmh8AZGg6D8siu3endOZzP2trYQUpsoDb1VhPNT0Ouhr4OWm44UlXCSJMgu+J0BIkc263oFquslZ2whXursf+9ltsq6nld/jsT5z9x1yxTW9FkPwicNCYhS1Rxqi4Wfg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=PwVPW22hNXCaG/41uZ2V2v+iJOvFP7KGk1YCJ2lynMI=; b=fBPKeDgRChP3ZSiKPmOfJqyVPKAMyJW6sJ1Mb3Gvfqynb9BZtFO/H30Y1VW6wJfa2Vpdz2g/ZwOvWbsmq0nuakzHSEBKDksYfnM8FaahkEnI1ffVtAer6UQ2LlxTuRVQnkbSBrMVKqL4RJUPcBU0xhmgVnGoZHelHabs7NJkkJzzLT7VzJzOFqrH2bICVeR6d6fxO7bjU6ejy/3LPk8Kk9pKruDXzg9OBTvAfy4BhciOkJ+wrvEj7WQA7FbCycuhpQYgz/LfnNBOM3ebqLsQ+wZmFpz5J/eNG5Dpmrn4xOOzjS1pYautRmFvvm34YlLrtT0u13r3HwivaPspCK/yNw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 192.176.1.74) smtp.rcpttodomain=dpdk.org smtp.mailfrom=ericsson.com; dmarc=pass (p=reject sp=reject pct=100) action=none header.from=ericsson.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=PwVPW22hNXCaG/41uZ2V2v+iJOvFP7KGk1YCJ2lynMI=; b=GQdn4KkrzW/tQY6hO8V+aQMkFxWiR1APrRQ/diV7D/PnjnDsak+k6VvdIjtM9RlVGXoU+zr01/+ILwjnoC5Pnu3CkD+6pVmMUNhyLGkIVUQLRJvfXyEhnVIThkSBh84dlkooQOouBPa6DKUTsTkhA9jf/EMHmOwJGy1cU8tpBf0t0xYflY1d8FGiGWzi/fQqy1QHuES4CUHRa4wjKlccVjHc5oshSBv/nzuxCLTl4toHKobjFgUDjlUcSIk78EuC9ryUekYZsGbevTvAWY828sssu9E24ks0wj0/uls7LB+tSEnG17Izxnu7LOVX/AZcEhCvRxhTStzvmhw+ca1lcQ== Received: from DUZPR01CA0052.eurprd01.prod.exchangelabs.com (2603:10a6:10:469::8) by AS8PR07MB8236.eurprd07.prod.outlook.com (2603:10a6:20b:37b::18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.39; Thu, 8 Feb 2024 18:24:48 +0000 Received: from DB5PEPF00014B96.eurprd02.prod.outlook.com (2603:10a6:10:469:cafe::56) by DUZPR01CA0052.outlook.office365.com (2603:10a6:10:469::8) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.41 via Frontend Transport; Thu, 8 Feb 2024 18:24:48 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 192.176.1.74) smtp.mailfrom=ericsson.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=ericsson.com; Received-SPF: Pass (protection.outlook.com: domain of ericsson.com designates 192.176.1.74 as permitted sender) receiver=protection.outlook.com; client-ip=192.176.1.74; helo=oa.msg.ericsson.com; pr=C Received: from oa.msg.ericsson.com (192.176.1.74) by DB5PEPF00014B96.mail.protection.outlook.com (10.167.8.234) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.19 via Frontend Transport; Thu, 8 Feb 2024 18:24:48 +0000 Received: from seliicinfr00049.seli.gic.ericsson.se (153.88.142.248) by smtp-central.internal.ericsson.com (100.87.178.60) with Microsoft SMTP Server id 15.2.1258.12; Thu, 8 Feb 2024 19:24:47 +0100 Received: from breslau.. (seliicwb00002.seli.gic.ericsson.se [10.156.25.100]) by seliicinfr00049.seli.gic.ericsson.se (Postfix) with ESMTP id 5FC2E380061; Thu, 8 Feb 2024 19:24:47 +0100 (CET) From: =?utf-8?q?Mattias_R=C3=B6nnblom?= To: CC: , =?utf-8?q?Morten_Br=C3=B8rup?= , Stephen Hemminger , =?utf-8?q?Mattias_R=C3=B6nn?= =?utf-8?q?blom?= Subject: [RFC 3/5] random: keep PRNG state in lcore variable Date: Thu, 8 Feb 2024 19:16:42 +0100 Message-ID: <20240208181644.455233-4-mattias.ronnblom@ericsson.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> References: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> MIME-Version: 1.0 X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: DB5PEPF00014B96:EE_|AS8PR07MB8236:EE_ X-MS-Office365-Filtering-Correlation-Id: 14a19984-45c4-4651-42e3-08dc28d33b7e X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: Lxtbdspak07E8lSfCHgs1YwMJE0sceGuMSW++N4T/eNKWQ/z52jktfshUuzoo1U3DmWDSWyPzD1QZNmfijvMsUu+CvcymZRTzo7hDe626RjN3sW7Btv01yM79QtF7CQ2GlSwD/I6WXKWgK4EMG/yd0lX5okVULNG2tCvTvPIWqbZJCmiFzDB9MDk+SXvowPiChGU/lSeCgtfiByLClhS3Zfrt595TvNxyVzoaSOajZ1rzLQNtVLDb1qnnO3BMCdnygDoK+Z6bOHY8DVt9ef4Qi+0qnmoCF3zFJyLwWoKNW8vGxVWeCFRheFpWz1kMexf5HWdZD1EmD+2HGLCVy+2YBzk26cW3dw+7k/OhBiSDmILRXHgKbqcvukPulVn8n85sBWyxX7vnj5cvMjFeHqjcfSicJdyGOr0ujUZO6U9ljuLsidNSy4sKYJ/u46paMZP0+TUqI4ZO/WhLAo5iA/yAQpeKjhjfSIHINW0yQ1WyaGdshiO6YQWbjAk+U5aleJ2KFBCfvPXIvdBQks1erjipIAqTgf+N/Y5YJ9vxWbRlGAfdhbhMnzpkAVicY+6yzW3pji7WQ9b7280/E9sKUn+FAfRoT3/Dc4QHlQ1m6Yyuq4= X-Forefront-Antispam-Report: CIP:192.176.1.74; CTRY:SE; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:oa.msg.ericsson.com; PTR:office365.se.ericsson.net; CAT:NONE; SFS:(13230031)(4636009)(376002)(346002)(136003)(39860400002)(396003)(230922051799003)(186009)(64100799003)(82310400011)(1800799012)(451199024)(40470700004)(36840700001)(46966006)(36756003)(41300700001)(54906003)(70206006)(8936002)(6916009)(316002)(70586007)(5660300002)(478600001)(8676002)(4326008)(82960400001)(86362001)(6266002)(336012)(82740400003)(356005)(7636003)(2906002)(83380400001)(1076003)(26005)(6666004)(107886003)(2616005); DIR:OUT; SFP:1101; X-OriginatorOrg: ericsson.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 08 Feb 2024 18:24:48.0327 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 14a19984-45c4-4651-42e3-08dc28d33b7e X-MS-Exchange-CrossTenant-Id: 92e84ceb-fbfd-47ab-be52-080c6b87953f X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=92e84ceb-fbfd-47ab-be52-080c6b87953f; Ip=[192.176.1.74]; Helo=[oa.msg.ericsson.com] X-MS-Exchange-CrossTenant-AuthSource: DB5PEPF00014B96.eurprd02.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: AS8PR07MB8236 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Move keeping PRNG state in a RTE_MAX_LCORE-sized static array of cache-aligned and RTE_CACHE_GUARDed struct instances to keeping the same state in to a more cache-friendly lcore variable. Signed-off-by: Mattias Rönnblom --- lib/eal/common/rte_random.c | 30 ++++++++++++++++++------------ 1 file changed, 18 insertions(+), 12 deletions(-) diff --git a/lib/eal/common/rte_random.c b/lib/eal/common/rte_random.c index 7709b8f2c6..af9fffd81b 100644 --- a/lib/eal/common/rte_random.c +++ b/lib/eal/common/rte_random.c @@ -11,6 +11,7 @@ #include #include #include +#include #include struct rte_rand_state { @@ -19,14 +20,12 @@ struct rte_rand_state { uint64_t z3; uint64_t z4; uint64_t z5; - RTE_CACHE_GUARD; -} __rte_cache_aligned; +}; -/* One instance each for every lcore id-equipped thread, and one - * additional instance to be shared by all others threads (i.e., all - * unregistered non-EAL threads). - */ -static struct rte_rand_state rand_states[RTE_MAX_LCORE + 1]; +RTE_LCORE_VAR_HANDLE(struct rte_rand_state, rand_state); + +/* instance to be shared by all unregistered non-EAL threads */ +static struct rte_rand_state unregistered_rand_state __rte_cache_aligned; static uint32_t __rte_rand_lcg32(uint32_t *seed) @@ -85,8 +84,14 @@ rte_srand(uint64_t seed) unsigned int lcore_id; /* add lcore_id to seed to avoid having the same sequence */ - for (lcore_id = 0; lcore_id < RTE_DIM(rand_states); lcore_id++) - __rte_srand_lfsr258(seed + lcore_id, &rand_states[lcore_id]); + for (lcore_id = 0; lcore_id < RTE_MAX_LCORE; lcore_id++) { + struct rte_rand_state *lcore_state = + RTE_LCORE_VAR_LCORE_PTR(lcore_id, rand_state); + + __rte_srand_lfsr258(seed + lcore_id, lcore_state); + } + + __rte_srand_lfsr258(seed + lcore_id, &unregistered_rand_state); } static __rte_always_inline uint64_t @@ -124,11 +129,10 @@ struct rte_rand_state *__rte_rand_get_state(void) idx = rte_lcore_id(); - /* last instance reserved for unregistered non-EAL threads */ if (unlikely(idx == LCORE_ID_ANY)) - idx = RTE_MAX_LCORE; + return &unregistered_rand_state; - return &rand_states[idx]; + return RTE_LCORE_VAR_PTR(rand_state); } uint64_t @@ -228,6 +232,8 @@ RTE_INIT(rte_rand_init) { uint64_t seed; + RTE_LCORE_VAR_ALLOC(rand_state); + seed = __rte_random_initial_seed(); rte_srand(seed); From patchwork Thu Feb 8 18:16:43 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: =?utf-8?q?Mattias_R=C3=B6nnblom?= X-Patchwork-Id: 136561 X-Patchwork-Delegate: thomas@monjalon.net Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id 625D443AAA; Thu, 8 Feb 2024 19:25:24 +0100 (CET) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id D432642E5C; Thu, 8 Feb 2024 19:24:59 +0100 (CET) Received: from EUR04-DB3-obe.outbound.protection.outlook.com (mail-db3eur04on2080.outbound.protection.outlook.com [40.107.6.80]) by mails.dpdk.org (Postfix) with ESMTP id 1EE6F42E38 for ; Thu, 8 Feb 2024 19:24:57 +0100 (CET) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=kv8/+LMicH1TGZPKm+uJYFUuJmkDEenYeA9TlayG3MZZIqKVefosvqHMQj6MP9WpzBew5owvRNZlWhhDZgIoTArY8ary/FJ/kZMiQYbqhPjR14mlHxA/TFYRiiEfzQdQ/JuU1h70HeMWT6tQeIzz2ZtkQzRaLxYxBrYRZ4YhYFNS0J2Az5T1oqstadOURQ4+8Disb27kgOgZcAtFVnH03KK9eUp4AJT8HYcfSqo7GaFQx82TjaCm9Z+dAk3f8zNg4arvhUuSpkL6mKCYjEarOpQ/VZGwOXSqpbDrRf5p35NLJ20jy5XEOBYX8cjKI4cY+cR0tPWxy6M6+O8tH7rC4w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=6M5eDI/e9A/eoZGR022N1+fXYGp1IWmdBUcIORrYIWg=; b=OY9Mg4S7l7iHYiujZphjXrSi37XvHDSDHBvgpw28O8OODYIiQ0rXV+ZH4P6Z8LFDEh+SOT/XGRKZOaYV/Q8+rLNffVTZpxeK3WyWhhKk9K/XY3FnaUWR3pMeH52wr9LVFeMw3RYwtvIeA31A+cj/Pq47iaV6w8zOiqeosIjxcu0DjlGjN4RnQVuytTbyCJ/vJBGBH5C2wiW0IERMZMby6bcXrQPtGKxrqIr9zpz4jLtb4rfCj89aPjnytsD3wiawpY/W9jTpae230DrJN01OMQ9ry0ruLiu2pbd6ar3NLpLKCL+wy2/KLiMBewDzmu9VLNKfcSIZZ+RXNXf5YkxOVQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 192.176.1.74) smtp.rcpttodomain=dpdk.org smtp.mailfrom=ericsson.com; dmarc=pass (p=reject sp=reject pct=100) action=none header.from=ericsson.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=6M5eDI/e9A/eoZGR022N1+fXYGp1IWmdBUcIORrYIWg=; b=t1Ex+qJTYsaYiNg186SCtupmOHj8LrFYGiMz7JKsA8cvN4WDjuqGrl39jbLIwtKhQjySg6Wo3jMzx7j6pmhPQM5PXOY5KfZIxxlE/sR7QfUwJE78OX/9NhnGIkJT9+xLx1g30WPUjMF2ho9+xCKzWc/u4979FdCUpRTZJk+AN72Jaw5ar5TUSYNT7c76FON0AP85CmD5jD3bGRpRDWY5XZ4Vh8CCH+00mDHySCYqaA7La+0Hm/qJLQvMiShcQH9IcNQgVDkzxYOLQTnYutfFRvJco+1sTKZ9PjuD2sW4Fy6Q6WaUNs39rrVvWAGvFLas5MVEEKYt7pQSmhvfj7sosQ== Received: from DB8PR06CA0063.eurprd06.prod.outlook.com (2603:10a6:10:120::37) by PA4PR07MB7534.eurprd07.prod.outlook.com (2603:10a6:102:ce::18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.30; Thu, 8 Feb 2024 18:24:54 +0000 Received: from DB5PEPF00014B9F.eurprd02.prod.outlook.com (2603:10a6:10:120:cafe::39) by DB8PR06CA0063.outlook.office365.com (2603:10a6:10:120::37) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.39 via Frontend Transport; Thu, 8 Feb 2024 18:24:54 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 192.176.1.74) smtp.mailfrom=ericsson.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=ericsson.com; Received-SPF: Pass (protection.outlook.com: domain of ericsson.com designates 192.176.1.74 as permitted sender) receiver=protection.outlook.com; client-ip=192.176.1.74; helo=oa.msg.ericsson.com; pr=C Received: from oa.msg.ericsson.com (192.176.1.74) by DB5PEPF00014B9F.mail.protection.outlook.com (10.167.8.169) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.19 via Frontend Transport; Thu, 8 Feb 2024 18:24:54 +0000 Received: from seliicinfr00049.seli.gic.ericsson.se (153.88.142.248) by smtp-central.internal.ericsson.com (100.87.178.61) with Microsoft SMTP Server id 15.2.1258.12; Thu, 8 Feb 2024 19:24:48 +0100 Received: from breslau.. (seliicwb00002.seli.gic.ericsson.se [10.156.25.100]) by seliicinfr00049.seli.gic.ericsson.se (Postfix) with ESMTP id E6A2E380061; Thu, 8 Feb 2024 19:24:47 +0100 (CET) From: =?utf-8?q?Mattias_R=C3=B6nnblom?= To: CC: , =?utf-8?q?Morten_Br=C3=B8rup?= , Stephen Hemminger , =?utf-8?q?Mattias_R=C3=B6nn?= =?utf-8?q?blom?= Subject: [RFC 4/5] power: keep per-lcore state in lcore variable Date: Thu, 8 Feb 2024 19:16:43 +0100 Message-ID: <20240208181644.455233-5-mattias.ronnblom@ericsson.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> References: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> MIME-Version: 1.0 X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: DB5PEPF00014B9F:EE_|PA4PR07MB7534:EE_ X-MS-Office365-Filtering-Correlation-Id: 39fa8eb4-1838-4d34-c22c-08dc28d33f3c X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: UexH1BathVelAkCmji1MABFUEM1XuRQX4QejEN5KfUXrG+e5oOXcDitFrwo6QCoMorR8e5UwNJ93CtqoYwUIJEoQdZCRuiixsQ1b6dQntU10ZDWJKwPrkd+uNxZlwp1f+WABl14RzoNYAyixHM/ptJnhXH/x70lJDnvHe49FjlRxjW8BWK0knQz0BAIKbkdIxKHaV73SvVh5DqTnTNonIgAg1f407t2JLKdecwPHxPgUoj2XyaQ+55g4NIc0n2ZQfK4n7qK+3VFaJsMGdryLWZoM0LtbFu2nEnaJea6fv9771b/KqFfrlcC0eRvvro285oztsLqIeCEcBrnBodC9NZA8BQVVu1ampLtMicwpGRqDFxCTOWf0tooKllRcbcOPif3a7XVGcRHareii3qiWw70N501NwCWo5pEWj9JTAbgG2+bQLjnvADxRgBfxeLbIcbPrLGblZa3P02LOV6abBdgpXkkjorWrnkVTaZACUbaJmrNrvcHI1LlALfzdfsAVWIrVwJdvOpwVwU3gKiIB+LOHxf+rCfaXzwGxcKlUVeyjp5bPIGtUNGXOcDdEMvhLLrDH4v1H3eOeD4v9XNeeGCg5l+lPd/ezpMgYshg1ekA= X-Forefront-Antispam-Report: CIP:192.176.1.74; CTRY:SE; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:oa.msg.ericsson.com; PTR:office365.se.ericsson.net; CAT:NONE; SFS:(13230031)(4636009)(396003)(376002)(136003)(346002)(39860400002)(230922051799003)(1800799012)(82310400011)(64100799003)(451199024)(186009)(36840700001)(46966006)(40470700004)(41300700001)(1076003)(107886003)(2616005)(86362001)(6266002)(336012)(26005)(5660300002)(478600001)(2906002)(70586007)(70206006)(6916009)(316002)(54906003)(8676002)(8936002)(4326008)(36756003)(6666004)(82960400001)(356005)(83380400001)(7636003)(82740400003); DIR:OUT; SFP:1101; X-OriginatorOrg: ericsson.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 08 Feb 2024 18:24:54.3143 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 39fa8eb4-1838-4d34-c22c-08dc28d33f3c X-MS-Exchange-CrossTenant-Id: 92e84ceb-fbfd-47ab-be52-080c6b87953f X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=92e84ceb-fbfd-47ab-be52-080c6b87953f; Ip=[192.176.1.74]; Helo=[oa.msg.ericsson.com] X-MS-Exchange-CrossTenant-AuthSource: DB5PEPF00014B9F.eurprd02.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: PA4PR07MB7534 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Signed-off-by: Mattias Rönnblom --- lib/power/rte_power_pmd_mgmt.c | 27 ++++++++++++++------------- 1 file changed, 14 insertions(+), 13 deletions(-) diff --git a/lib/power/rte_power_pmd_mgmt.c b/lib/power/rte_power_pmd_mgmt.c index 591fc69f36..bb20e564de 100644 --- a/lib/power/rte_power_pmd_mgmt.c +++ b/lib/power/rte_power_pmd_mgmt.c @@ -5,6 +5,7 @@ #include #include +#include #include #include #include @@ -68,8 +69,8 @@ struct pmd_core_cfg { /**< Number of queues ready to enter power optimized state */ uint64_t sleep_target; /**< Prevent a queue from triggering sleep multiple times */ -} __rte_cache_aligned; -static struct pmd_core_cfg lcore_cfgs[RTE_MAX_LCORE]; +}; +static RTE_LCORE_VAR_HANDLE(struct pmd_core_cfg, lcore_cfgs); static inline bool queue_equal(const union queue *l, const union queue *r) @@ -252,12 +253,11 @@ clb_multiwait(uint16_t port_id __rte_unused, uint16_t qidx __rte_unused, struct rte_mbuf **pkts __rte_unused, uint16_t nb_rx, uint16_t max_pkts __rte_unused, void *arg) { - const unsigned int lcore = rte_lcore_id(); struct queue_list_entry *queue_conf = arg; struct pmd_core_cfg *lcore_conf; const bool empty = nb_rx == 0; - lcore_conf = &lcore_cfgs[lcore]; + lcore_conf = RTE_LCORE_VAR_PTR(lcore_cfgs); /* early exit */ if (likely(!empty)) @@ -317,13 +317,12 @@ clb_pause(uint16_t port_id __rte_unused, uint16_t qidx __rte_unused, struct rte_mbuf **pkts __rte_unused, uint16_t nb_rx, uint16_t max_pkts __rte_unused, void *arg) { - const unsigned int lcore = rte_lcore_id(); struct queue_list_entry *queue_conf = arg; struct pmd_core_cfg *lcore_conf; const bool empty = nb_rx == 0; uint32_t pause_duration = rte_power_pmd_mgmt_get_pause_duration(); - lcore_conf = &lcore_cfgs[lcore]; + lcore_conf = RTE_LCORE_VAR_PTR(lcore_cfgs); if (likely(!empty)) /* early exit */ @@ -358,9 +357,8 @@ clb_scale_freq(uint16_t port_id __rte_unused, uint16_t qidx __rte_unused, struct rte_mbuf **pkts __rte_unused, uint16_t nb_rx, uint16_t max_pkts __rte_unused, void *arg) { - const unsigned int lcore = rte_lcore_id(); const bool empty = nb_rx == 0; - struct pmd_core_cfg *lcore_conf = &lcore_cfgs[lcore]; + struct pmd_core_cfg *lcore_conf = RTE_LCORE_VAR_PTR(lcore_cfgs); struct queue_list_entry *queue_conf = arg; if (likely(!empty)) { @@ -518,7 +516,7 @@ rte_power_ethdev_pmgmt_queue_enable(unsigned int lcore_id, uint16_t port_id, goto end; } - lcore_cfg = &lcore_cfgs[lcore_id]; + lcore_cfg = RTE_LCORE_VAR_LCORE_PTR(lcore_id, lcore_cfgs); /* check if other queues are stopped as well */ ret = cfg_queues_stopped(lcore_cfg); @@ -619,7 +617,7 @@ rte_power_ethdev_pmgmt_queue_disable(unsigned int lcore_id, } /* no need to check queue id as wrong queue id would not be enabled */ - lcore_cfg = &lcore_cfgs[lcore_id]; + lcore_cfg = RTE_LCORE_VAR_LCORE_PTR(lcore_id, lcore_cfgs); /* check if other queues are stopped as well */ ret = cfg_queues_stopped(lcore_cfg); @@ -772,10 +770,13 @@ RTE_INIT(rte_power_ethdev_pmgmt_init) { size_t i; int j; + RTE_LCORE_VAR_ALLOC(lcore_cfgs); + /* initialize all tailqs */ - for (i = 0; i < RTE_DIM(lcore_cfgs); i++) { - struct pmd_core_cfg *cfg = &lcore_cfgs[i]; - TAILQ_INIT(&cfg->head); + for (i = 0; i < RTE_MAX_LCORE; i++) { + struct pmd_core_cfg *lcore_cfg = + RTE_LCORE_VAR_LCORE_PTR(i, lcore_cfgs); + TAILQ_INIT(&lcore_cfg->head); } /* initialize config defaults */ From patchwork Thu Feb 8 18:16:44 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: =?utf-8?q?Mattias_R=C3=B6nnblom?= X-Patchwork-Id: 136560 X-Patchwork-Delegate: thomas@monjalon.net Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id BEEEE43AAA; Thu, 8 Feb 2024 19:25:16 +0100 (CET) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id EA02F42E6B; Thu, 8 Feb 2024 19:24:54 +0100 (CET) Received: from EUR04-DB3-obe.outbound.protection.outlook.com (mail-db3eur04on2057.outbound.protection.outlook.com [40.107.6.57]) by mails.dpdk.org (Postfix) with ESMTP id B87FF42E29 for ; Thu, 8 Feb 2024 19:24:50 +0100 (CET) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=GqjEHLouInGfzKbnHpMt8R7x2MHcbFVVpU+iVIyYDWoNveXHgY124lI0vQwJjGA4bBHPj6r9uGzac8/pQGyrIngGxhKOQb37/a8hOnrsDPTM6EawvhfAVPA/l9qHgUreZckKWn3I+VR9gkIHLlN6lEvlVNPz8chHo4wf4FyvO8I32xCj86Pbu4nQQmZ3O97d5cqsnG6vB0uGpgw5m37JNABvAf806G5a5vv4hJgcC4WO/R+MtalXAe9wAttGefW68zAOfqBj1ICn0soI7gbPkT1gpK+H+sn+ns9qLenk1KasQWKE2BdqtrkxhHcyPD4HKazWClpWRNC/qniULTWBcw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=ijM1pTnQ/aEviQO1uE4SbNkUyzna6IxG+tW3EWB12Kw=; b=UQ8btmqz+GCm7IuwmmiqntfHEYLzQ9J3esyzGOiqxILW6lluVtwL0BNdL48k/P2NkoFvfT5Y3L5Y6zRNJNjLNWwrzvcKFTuUvLS/Pv+a48cryz9p6OvWfIzatKPNG+/3052EdcjRC2H1ECqHKxQS7eiXelI2evTk7zi+t/tGgD/tpnyWPUi6kydE6TcvGMdxL8IbKbYoOBnc0cruHFHnzjVzp8DcBoLA9A4C5eANAgWM6rIG46N4FI2yO9o0WzvU7e8yCt1pzu6lN5zT546XEGoV9KGe1nLUf7f1Y02DBCFUqlBB/xow6kI0btVKL4JLgUuhAY/+AeQzeXNuaKPNJw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 192.176.1.74) smtp.rcpttodomain=dpdk.org smtp.mailfrom=ericsson.com; dmarc=pass (p=reject sp=reject pct=100) action=none header.from=ericsson.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ijM1pTnQ/aEviQO1uE4SbNkUyzna6IxG+tW3EWB12Kw=; b=r9+epGjtSioaK1S3wIEgG5i3f9iUjW+8xvDFTdai3EM0+rU8fgYMj33HE4iCgCBL+grFpcGkK+q933MnYoPqHY+sixwqjrMEyLMr90TA+5/FHPmJwk1GvEAHr0NZcikPVC9kZ6wFiTUaP7RCM8LG07tABflYI12esHcw8sKi6xJ4R+d++ulSHRdxXCRUnkVtGVum0fMxJwn6pMeClKgm6Cu8ZlqVRziS+bUxU4wG/HWtwvRcUn4bFseRrnyEHyxC9c5cMBNLNCt+x/6vTV2DdTs9BePrGIqfRuCYNITQxb2QhJHJzEAyDGD5QuzjxbsEBBqZdlH4Qfj/lhF+QO6yIA== Received: from DUZPR01CA0068.eurprd01.prod.exchangelabs.com (2603:10a6:10:3c2::18) by DBBPR07MB7353.eurprd07.prod.outlook.com (2603:10a6:10:1e5::15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.39; Thu, 8 Feb 2024 18:24:49 +0000 Received: from DB1PEPF000509E5.eurprd03.prod.outlook.com (2603:10a6:10:3c2:cafe::61) by DUZPR01CA0068.outlook.office365.com (2603:10a6:10:3c2::18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.37 via Frontend Transport; Thu, 8 Feb 2024 18:24:49 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 192.176.1.74) smtp.mailfrom=ericsson.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=ericsson.com; Received-SPF: Pass (protection.outlook.com: domain of ericsson.com designates 192.176.1.74 as permitted sender) receiver=protection.outlook.com; client-ip=192.176.1.74; helo=oa.msg.ericsson.com; pr=C Received: from oa.msg.ericsson.com (192.176.1.74) by DB1PEPF000509E5.mail.protection.outlook.com (10.167.242.55) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7249.19 via Frontend Transport; Thu, 8 Feb 2024 18:24:49 +0000 Received: from seliicinfr00049.seli.gic.ericsson.se (153.88.142.248) by smtp-central.internal.ericsson.com (100.87.178.68) with Microsoft SMTP Server id 15.2.1258.12; Thu, 8 Feb 2024 19:24:48 +0100 Received: from breslau.. (seliicwb00002.seli.gic.ericsson.se [10.156.25.100]) by seliicinfr00049.seli.gic.ericsson.se (Postfix) with ESMTP id 7B907380061; Thu, 8 Feb 2024 19:24:48 +0100 (CET) From: =?utf-8?q?Mattias_R=C3=B6nnblom?= To: CC: , =?utf-8?q?Morten_Br=C3=B8rup?= , Stephen Hemminger , =?utf-8?q?Mattias_R=C3=B6nn?= =?utf-8?q?blom?= Subject: [RFC 5/5] service: keep per-lcore state in lcore variable Date: Thu, 8 Feb 2024 19:16:44 +0100 Message-ID: <20240208181644.455233-6-mattias.ronnblom@ericsson.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> References: <20240208181644.455233-1-mattias.ronnblom@ericsson.com> MIME-Version: 1.0 X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: DB1PEPF000509E5:EE_|DBBPR07MB7353:EE_ X-MS-Office365-Filtering-Correlation-Id: 1b6240c6-fa1c-40fe-d9d0-08dc28d33c29 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: OqV3BWpD4KZlMIj6/GtWN7700vOUCAgKywuy6B0OPDBQEFNgOzkZ/hvGSeGf5i1tjo4LI/JfVMFK0tHBeTdorWAIfPFhhYSzN4kfWBAw3hRrq4npv6zME17EDVyQdHybGoTziEa0zru2zIA6Ra0szqhmkWXwNkWUCalVJw21NzzRWZGISdHC8SacFlYBlN1DoX4WmTN/R9iIbULPCXKiMaGaAVf/HsbOawPnJuvRIA5z5zdRsFV4PL8+SvHEdhw+LxD39PNL6m0saAE2QEljfMFPZUMVReVnbKFhbZmuiDYpsCz/rLQYR1Jec1YMcYkaIYAXzdapNTbQ0ti9APimwv4xUjGf3UtlRKhajCsuBPp9pPX2flMAv5JPekhs/bMEyg7BBCf57LlXKoYnia1swI0UMNGYBtjcNbg4D56ks1JSZ8lwz68CpZ/vM71xumaCPgcq0x/MtRQrnExvlfFBXPgLQX4Nv5mGZ1lEsNzs5n72SSbj3v0rdpJ6jJhIoDrREt4tEFjBWxRWCtpjzS5+OikhpUcMDN/mH7+YFgFq5LCD4coFXESpUhejKzeKWJafmI0oIfbBUh47QOK0VzGwgXB3PG7Rdc3bcgDg6XtBqlM= X-Forefront-Antispam-Report: CIP:192.176.1.74; CTRY:SE; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:oa.msg.ericsson.com; PTR:office365.se.ericsson.net; CAT:NONE; SFS:(13230031)(4636009)(39860400002)(346002)(136003)(376002)(396003)(230922051799003)(64100799003)(451199024)(1800799012)(82310400011)(186009)(36840700001)(46966006)(40470700004)(41300700001)(6666004)(478600001)(2906002)(70206006)(6916009)(54906003)(83380400001)(70586007)(2616005)(5660300002)(26005)(336012)(1076003)(6266002)(66574015)(107886003)(30864003)(36756003)(7636003)(86362001)(8676002)(8936002)(316002)(356005)(82740400003)(82960400001)(4326008); DIR:OUT; SFP:1101; X-OriginatorOrg: ericsson.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 08 Feb 2024 18:24:49.1572 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 1b6240c6-fa1c-40fe-d9d0-08dc28d33c29 X-MS-Exchange-CrossTenant-Id: 92e84ceb-fbfd-47ab-be52-080c6b87953f X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=92e84ceb-fbfd-47ab-be52-080c6b87953f; Ip=[192.176.1.74]; Helo=[oa.msg.ericsson.com] X-MS-Exchange-CrossTenant-AuthSource: DB1PEPF000509E5.eurprd03.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: DBBPR07MB7353 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Signed-off-by: Mattias Rönnblom --- lib/eal/common/rte_service.c | 119 ++++++++++++++++++++--------------- 1 file changed, 68 insertions(+), 51 deletions(-) diff --git a/lib/eal/common/rte_service.c b/lib/eal/common/rte_service.c index d959c91459..c557e80409 100644 --- a/lib/eal/common/rte_service.c +++ b/lib/eal/common/rte_service.c @@ -11,6 +11,7 @@ #include #include +#include #include #include #include @@ -75,7 +76,7 @@ struct core_state { static uint32_t rte_service_count; static struct rte_service_spec_impl *rte_services; -static struct core_state *lcore_states; +static RTE_LCORE_VAR_HANDLE(struct core_state, lcore_states); static uint32_t rte_service_library_initialized; int32_t @@ -101,11 +102,12 @@ rte_service_init(void) goto fail_mem; } - lcore_states = rte_calloc("rte_service_core_states", RTE_MAX_LCORE, - sizeof(struct core_state), RTE_CACHE_LINE_SIZE); - if (!lcore_states) { - EAL_LOG(ERR, "error allocating core states array"); - goto fail_mem; + if (lcore_states == NULL) + RTE_LCORE_VAR_ALLOC(lcore_states); + else { + struct core_state *cs; + RTE_LCORE_VAR_FOREACH(cs, lcore_states) + memset(cs, 0, sizeof(struct core_state)); } int i; @@ -122,7 +124,6 @@ rte_service_init(void) return 0; fail_mem: rte_free(rte_services); - rte_free(lcore_states); return -ENOMEM; } @@ -136,7 +137,6 @@ rte_service_finalize(void) rte_eal_mp_wait_lcore(); rte_free(rte_services); - rte_free(lcore_states); rte_service_library_initialized = 0; } @@ -286,7 +286,6 @@ rte_service_component_register(const struct rte_service_spec *spec, int32_t rte_service_component_unregister(uint32_t id) { - uint32_t i; struct rte_service_spec_impl *s; SERVICE_VALID_GET_OR_ERR_RET(id, s, -EINVAL); @@ -294,9 +293,10 @@ rte_service_component_unregister(uint32_t id) s->internal_flags &= ~(SERVICE_F_REGISTERED); + struct core_state *cs; /* clear the run-bit in all cores */ - for (i = 0; i < RTE_MAX_LCORE; i++) - lcore_states[i].service_mask &= ~(UINT64_C(1) << id); + RTE_LCORE_VAR_FOREACH(cs, lcore_states) + cs->service_mask &= ~(UINT64_C(1) << id); memset(&rte_services[id], 0, sizeof(struct rte_service_spec_impl)); @@ -454,7 +454,10 @@ rte_service_may_be_active(uint32_t id) return -EINVAL; for (i = 0; i < lcore_count; i++) { - if (lcore_states[ids[i]].service_active_on_lcore[id]) + struct core_state *cs = + RTE_LCORE_VAR_LCORE_PTR(ids[i], lcore_states); + + if (cs->service_active_on_lcore[id]) return 1; } @@ -464,7 +467,7 @@ rte_service_may_be_active(uint32_t id) int32_t rte_service_run_iter_on_app_lcore(uint32_t id, uint32_t serialize_mt_unsafe) { - struct core_state *cs = &lcore_states[rte_lcore_id()]; + struct core_state *cs = RTE_LCORE_VAR_PTR(lcore_states); struct rte_service_spec_impl *s; SERVICE_VALID_GET_OR_ERR_RET(id, s, -EINVAL); @@ -486,8 +489,7 @@ service_runner_func(void *arg) { RTE_SET_USED(arg); uint8_t i; - const int lcore = rte_lcore_id(); - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_PTR(lcore_states); rte_atomic_store_explicit(&cs->thread_active, 1, rte_memory_order_seq_cst); @@ -533,13 +535,16 @@ service_runner_func(void *arg) int32_t rte_service_lcore_may_be_active(uint32_t lcore) { - if (lcore >= RTE_MAX_LCORE || !lcore_states[lcore].is_service_core) + struct core_state *cs = + RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); + + if (lcore >= RTE_MAX_LCORE || !cs->is_service_core) return -EINVAL; /* Load thread_active using ACQUIRE to avoid instructions dependent on * the result being re-ordered before this load completes. */ - return rte_atomic_load_explicit(&lcore_states[lcore].thread_active, + return rte_atomic_load_explicit(&cs->thread_active, rte_memory_order_acquire); } @@ -547,9 +552,11 @@ int32_t rte_service_lcore_count(void) { int32_t count = 0; - uint32_t i; - for (i = 0; i < RTE_MAX_LCORE; i++) - count += lcore_states[i].is_service_core; + + struct core_state *cs; + RTE_LCORE_VAR_FOREACH(cs, lcore_states) + count += cs->is_service_core; + return count; } @@ -566,7 +573,8 @@ rte_service_lcore_list(uint32_t array[], uint32_t n) uint32_t i; uint32_t idx = 0; for (i = 0; i < RTE_MAX_LCORE; i++) { - struct core_state *cs = &lcore_states[i]; + struct core_state *cs = + RTE_LCORE_VAR_LCORE_PTR(i, lcore_states); if (cs->is_service_core) { array[idx] = i; idx++; @@ -582,7 +590,7 @@ rte_service_lcore_count_services(uint32_t lcore) if (lcore >= RTE_MAX_LCORE) return -EINVAL; - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); if (!cs->is_service_core) return -ENOTSUP; @@ -634,30 +642,31 @@ rte_service_start_with_defaults(void) static int32_t service_update(uint32_t sid, uint32_t lcore, uint32_t *set, uint32_t *enabled) { + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); + /* validate ID, or return error value */ if (!service_valid(sid) || lcore >= RTE_MAX_LCORE || - !lcore_states[lcore].is_service_core) + !cs->is_service_core) return -EINVAL; uint64_t sid_mask = UINT64_C(1) << sid; if (set) { - uint64_t lcore_mapped = lcore_states[lcore].service_mask & - sid_mask; + uint64_t lcore_mapped = cs->service_mask & sid_mask; if (*set && !lcore_mapped) { - lcore_states[lcore].service_mask |= sid_mask; + cs->service_mask |= sid_mask; rte_atomic_fetch_add_explicit(&rte_services[sid].num_mapped_cores, 1, rte_memory_order_relaxed); } if (!*set && lcore_mapped) { - lcore_states[lcore].service_mask &= ~(sid_mask); + cs->service_mask &= ~(sid_mask); rte_atomic_fetch_sub_explicit(&rte_services[sid].num_mapped_cores, 1, rte_memory_order_relaxed); } } if (enabled) - *enabled = !!(lcore_states[lcore].service_mask & (sid_mask)); + *enabled = !!(cs->service_mask & (sid_mask)); return 0; } @@ -685,13 +694,14 @@ set_lcore_state(uint32_t lcore, int32_t state) { /* mark core state in hugepage backed config */ struct rte_config *cfg = rte_eal_get_configuration(); + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); cfg->lcore_role[lcore] = state; /* mark state in process local lcore_config */ lcore_config[lcore].core_role = state; /* update per-lcore optimized state tracking */ - lcore_states[lcore].is_service_core = (state == ROLE_SERVICE); + cs->is_service_core = (state == ROLE_SERVICE); rte_eal_trace_service_lcore_state_change(lcore, state); } @@ -702,14 +712,16 @@ rte_service_lcore_reset_all(void) /* loop over cores, reset all to mask 0 */ uint32_t i; for (i = 0; i < RTE_MAX_LCORE; i++) { - if (lcore_states[i].is_service_core) { - lcore_states[i].service_mask = 0; + struct core_state *cs = + RTE_LCORE_VAR_LCORE_PTR(i, lcore_states); + if (cs->is_service_core) { + cs->service_mask = 0; set_lcore_state(i, ROLE_RTE); /* runstate act as guard variable Use * store-release memory order here to synchronize * with load-acquire in runstate read functions. */ - rte_atomic_store_explicit(&lcore_states[i].runstate, + rte_atomic_store_explicit(&cs->runstate, RUNSTATE_STOPPED, rte_memory_order_release); } } @@ -725,17 +737,19 @@ rte_service_lcore_add(uint32_t lcore) { if (lcore >= RTE_MAX_LCORE) return -EINVAL; - if (lcore_states[lcore].is_service_core) + + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); + if (cs->is_service_core) return -EALREADY; set_lcore_state(lcore, ROLE_SERVICE); /* ensure that after adding a core the mask and state are defaults */ - lcore_states[lcore].service_mask = 0; + cs->service_mask = 0; /* Use store-release memory order here to synchronize with * load-acquire in runstate read functions. */ - rte_atomic_store_explicit(&lcore_states[lcore].runstate, RUNSTATE_STOPPED, + rte_atomic_store_explicit(&cs->runstate, RUNSTATE_STOPPED, rte_memory_order_release); return rte_eal_wait_lcore(lcore); @@ -747,7 +761,7 @@ rte_service_lcore_del(uint32_t lcore) if (lcore >= RTE_MAX_LCORE) return -EINVAL; - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); if (!cs->is_service_core) return -EINVAL; @@ -771,7 +785,7 @@ rte_service_lcore_start(uint32_t lcore) if (lcore >= RTE_MAX_LCORE) return -EINVAL; - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); if (!cs->is_service_core) return -EINVAL; @@ -801,6 +815,8 @@ rte_service_lcore_start(uint32_t lcore) int32_t rte_service_lcore_stop(uint32_t lcore) { + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); + if (lcore >= RTE_MAX_LCORE) return -EINVAL; @@ -808,12 +824,11 @@ rte_service_lcore_stop(uint32_t lcore) * memory order here to synchronize with store-release * in runstate update functions. */ - if (rte_atomic_load_explicit(&lcore_states[lcore].runstate, rte_memory_order_acquire) == + if (rte_atomic_load_explicit(&cs->runstate, rte_memory_order_acquire) == RUNSTATE_STOPPED) return -EALREADY; uint32_t i; - struct core_state *cs = &lcore_states[lcore]; uint64_t service_mask = cs->service_mask; for (i = 0; i < RTE_SERVICE_NUM_MAX; i++) { @@ -834,7 +849,7 @@ rte_service_lcore_stop(uint32_t lcore) /* Use store-release memory order here to synchronize with * load-acquire in runstate read functions. */ - rte_atomic_store_explicit(&lcore_states[lcore].runstate, RUNSTATE_STOPPED, + rte_atomic_store_explicit(&cs->runstate, RUNSTATE_STOPPED, rte_memory_order_release); rte_eal_trace_service_lcore_stop(lcore); @@ -845,7 +860,7 @@ rte_service_lcore_stop(uint32_t lcore) static uint64_t lcore_attr_get_loops(unsigned int lcore) { - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); return rte_atomic_load_explicit(&cs->loops, rte_memory_order_relaxed); } @@ -853,7 +868,7 @@ lcore_attr_get_loops(unsigned int lcore) static uint64_t lcore_attr_get_cycles(unsigned int lcore) { - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); return rte_atomic_load_explicit(&cs->cycles, rte_memory_order_relaxed); } @@ -861,7 +876,7 @@ lcore_attr_get_cycles(unsigned int lcore) static uint64_t lcore_attr_get_service_calls(uint32_t service_id, unsigned int lcore) { - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); return rte_atomic_load_explicit(&cs->service_stats[service_id].calls, rte_memory_order_relaxed); @@ -870,7 +885,7 @@ lcore_attr_get_service_calls(uint32_t service_id, unsigned int lcore) static uint64_t lcore_attr_get_service_cycles(uint32_t service_id, unsigned int lcore) { - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); return rte_atomic_load_explicit(&cs->service_stats[service_id].cycles, rte_memory_order_relaxed); @@ -886,7 +901,10 @@ attr_get(uint32_t id, lcore_attr_get_fun lcore_attr_get) uint64_t sum = 0; for (lcore = 0; lcore < RTE_MAX_LCORE; lcore++) { - if (lcore_states[lcore].is_service_core) + struct core_state *cs = + RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); + + if (cs->is_service_core) sum += lcore_attr_get(id, lcore); } @@ -930,12 +948,11 @@ int32_t rte_service_lcore_attr_get(uint32_t lcore, uint32_t attr_id, uint64_t *attr_value) { - struct core_state *cs; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); if (lcore >= RTE_MAX_LCORE || !attr_value) return -EINVAL; - cs = &lcore_states[lcore]; if (!cs->is_service_core) return -ENOTSUP; @@ -960,7 +977,8 @@ rte_service_attr_reset_all(uint32_t id) return -EINVAL; for (lcore = 0; lcore < RTE_MAX_LCORE; lcore++) { - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = + RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); cs->service_stats[id] = (struct service_stats) {}; } @@ -971,12 +989,11 @@ rte_service_attr_reset_all(uint32_t id) int32_t rte_service_lcore_attr_reset_all(uint32_t lcore) { - struct core_state *cs; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); if (lcore >= RTE_MAX_LCORE) return -EINVAL; - cs = &lcore_states[lcore]; if (!cs->is_service_core) return -ENOTSUP; @@ -1011,7 +1028,7 @@ static void service_dump_calls_per_lcore(FILE *f, uint32_t lcore) { uint32_t i; - struct core_state *cs = &lcore_states[lcore]; + struct core_state *cs = RTE_LCORE_VAR_LCORE_PTR(lcore, lcore_states); fprintf(f, "%02d\t", lcore); for (i = 0; i < RTE_SERVICE_NUM_MAX; i++) {