From patchwork Thu Aug 31 16:47:35 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Van Haaren, Harry" X-Patchwork-Id: 131000 X-Patchwork-Delegate: jerinj@marvell.com Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id B48EE42214; Thu, 31 Aug 2023 18:47:56 +0200 (CEST) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id C348040295; Thu, 31 Aug 2023 18:47:52 +0200 (CEST) Received: from mgamail.intel.com (mgamail.intel.com [192.55.52.136]) by mails.dpdk.org (Postfix) with ESMTP id C67554027B for ; Thu, 31 Aug 2023 18:47:49 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1693500470; x=1725036470; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=v9JcLbWYg7BgpBZ2DsiNxRgkLgEzpCky6J4Tcm6+TDo=; b=b/kvj5KFLapCBOEpIW0YlC023HSkbx2l8Wj+EHgUlJClb8wcIUImwSX8 uoLoT7twgYqPa/clfS6dL968PflIhvNb2jEjGNfyy/ezhZKgW/9RJA94i QIBOEBTebWfeN0ALWLrEzGhhE8h4G/wZJgjOKNu09F7+Y0pGfz5NytWi7 Ap7dDBcr3xvmPSNYg7YQpNpwuxC1cnY2OJ2Gz3HwU+45rBNjQ4OlTNdtI x7MgJQIXxNoKzFNbEA7KUqNIWB08jTREjvB/TKe3CtiZImkwxI26q8b3G /qedejQK3Q4BwlbffGBNLzL4WKUS6tyy91JlPxsDzEdXh95HyBByB0n97 Q==; X-IronPort-AV: E=McAfee;i="6600,9927,10819"; a="355499850" X-IronPort-AV: E=Sophos;i="6.02,217,1688454000"; d="scan'208";a="355499850" Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga106.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 31 Aug 2023 09:47:47 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10819"; a="863163390" X-IronPort-AV: E=Sophos;i="6.02,217,1688454000"; d="scan'208";a="863163390" Received: from silpixa00401454.ir.intel.com ([10.55.128.147]) by orsmga004.jf.intel.com with ESMTP; 31 Aug 2023 09:47:44 -0700 From: Harry van Haaren To: dev@dpdk.org Cc: Harry van Haaren Subject: [PATCH 1/2] event/sw: add selftest for ordered history list Date: Thu, 31 Aug 2023 17:47:35 +0100 Message-Id: <20230831164736.2472671-1-harry.van.haaren@intel.com> X-Mailer: git-send-email 2.34.1 MIME-Version: 1.0 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org This commit adds a unit test for an issue identified where ordered history-list entries are not correctly cleared when the returned event is of op RELEASE type. The result of the history-list bug is that a future event which re-uses that history-list slot, but has an op type of FORWARD will incorrectly be reordered. The existing unit-tests did not cover the RELEASE of an ORDERED queue, and then stress-test the history-list by iterating HIST_LIST times afterwards. Signed-off-by: Harry van Haaren --- drivers/event/sw/sw_evdev_selftest.c | 132 +++++++++++++++++++++++++++ 1 file changed, 132 insertions(+) diff --git a/drivers/event/sw/sw_evdev_selftest.c b/drivers/event/sw/sw_evdev_selftest.c index 3aa8d76ca8..59afa260c6 100644 --- a/drivers/event/sw/sw_evdev_selftest.c +++ b/drivers/event/sw/sw_evdev_selftest.c @@ -2959,6 +2959,132 @@ dev_stop_flush(struct test *t) /* test to check we can properly flush events */ return -1; } +static int +ordered_atomic_hist_completion(struct test *t) +{ + const int rx_enq = 0; + int err; + + /* Create instance with 1 atomic QID going to 3 ports + 1 prod port */ + if (init(t, 2, 2) < 0 || + create_ports(t, 2) < 0 || + create_ordered_qids(t, 1) < 0 || + create_atomic_qids(t, 1) < 0) + return -1; + + /* Helpers to identify queues */ + const uint8_t qid_ordered = t->qid[0]; + const uint8_t qid_atomic = t->qid[1]; + + /* CQ mapping to QID */ + if (rte_event_port_link(evdev, t->port[1], &t->qid[0], NULL, 1) != 1) { + printf("%d: error mapping port 1 qid\n", __LINE__); + return -1; + } + if (rte_event_port_link(evdev, t->port[1], &t->qid[1], NULL, 1) != 1) { + printf("%d: error mapping port 1 qid\n", __LINE__); + return -1; + } + if (rte_event_dev_start(evdev) < 0) { + printf("%d: Error with start call\n", __LINE__); + return -1; + } + + /* Enqueue 1x ordered event, to be RELEASE-ed by the worker + * CPU, which may cause hist-list corruption (by not comleting) + */ + struct rte_event ord_ev = { + .op = RTE_EVENT_OP_NEW, + .queue_id = qid_ordered, + .event_type = RTE_EVENT_TYPE_CPU, + .priority = RTE_EVENT_DEV_PRIORITY_NORMAL, + }; + err = rte_event_enqueue_burst(evdev, t->port[rx_enq], &ord_ev, 1); + if (err != 1) { + printf("%d: Failed to enqueue\n", __LINE__); + return -1; + } + + /* call the scheduler. This schedules the above event as a single + * event in an ORDERED queue, to the worker. + */ + rte_service_run_iter_on_app_lcore(t->service_id, 1); + + /* Dequeue ORDERED event 0 from port 1, so that we can then drop */ + struct rte_event ev; + if (!rte_event_dequeue_burst(evdev, t->port[1], &ev, 1, 0)) { + printf("%d: failed to dequeue\n", __LINE__); + return -1; + } + + /* drop the ORDERED event. Here the history list should be completed, + * but might not be if the hist-list bug exists. Call scheduler to make + * it act on the RELEASE that was enqueued. + */ + rte_event_enqueue_burst(evdev, t->port[1], &release_ev, 1); + rte_service_run_iter_on_app_lcore(t->service_id, 1); + + /* Enqueue 1x atomic event, to then FORWARD to trigger atomic hist-list + * completion. If the bug exists, the ORDERED entry may be completed in + * error (aka, using the ORDERED-ROB for the ATOMIC event). This is the + * main focus of this unit test. + */ + { + struct rte_event ev = { + .op = RTE_EVENT_OP_NEW, + .queue_id = qid_atomic, + .event_type = RTE_EVENT_TYPE_CPU, + .priority = RTE_EVENT_DEV_PRIORITY_NORMAL, + .flow_id = 123, + }; + + err = rte_event_enqueue_burst(evdev, t->port[rx_enq], &ev, 1); + if (err != 1) { + printf("%d: Failed to enqueue\n", __LINE__); + return -1; + } + } + rte_service_run_iter_on_app_lcore(t->service_id, 1); + + /* Deq ATM event, then forward it for more than HIST_LIST_SIZE times, + * to re-use the history list entry that may be corrupted previously. + */ + for (int i = 0; i < SW_PORT_HIST_LIST + 2; i++) { + if (!rte_event_dequeue_burst(evdev, t->port[1], &ev, 1, 0)) { + printf("%d: failed to dequeue, did corrupt ORD hist " + "list steal this ATM event?\n", __LINE__); + return -1; + } + + /* Re-enqueue the ATM event as FWD, trigger hist-list. */ + ev.op = RTE_EVENT_OP_FORWARD; + err = rte_event_enqueue_burst(evdev, t->port[1], &ev, 1); + if (err != 1) { + printf("%d: Failed to enqueue\n", __LINE__); + return -1; + } + + rte_service_run_iter_on_app_lcore(t->service_id, 1); + } + + /* If HIST-LIST + N count of dequeues succeed above, the hist list + * has not been corrupted. If it is corrupted, the ATM event is pushed + * into the ORDERED-ROB and will not dequeue. + */ + + /* release the ATM event that's been forwarded HIST_LIST times */ + err = rte_event_enqueue_burst(evdev, t->port[1], &release_ev, 1); + if (err != 1) { + printf("%d: Failed to enqueue\n", __LINE__); + return -1; + } + + rte_service_run_iter_on_app_lcore(t->service_id, 1); + + cleanup(t); + return 0; +} + static int worker_loopback_worker_fn(void *arg) { @@ -3388,6 +3514,12 @@ test_sw_eventdev(void) printf("ERROR - Stop Flush test FAILED.\n"); goto test_fail; } + printf("*** Running Ordered & Atomic hist-list completion test...\n"); + ret = ordered_atomic_hist_completion(t); + if (ret != 0) { + printf("ERROR - Ordered & Atomic hist-list test FAILED.\n"); + goto test_fail; + } if (rte_lcore_count() >= 3) { printf("*** Running Worker loopback test...\n"); ret = worker_loopback(t, 0); From patchwork Thu Aug 31 16:47:36 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Van Haaren, Harry" X-Patchwork-Id: 130999 X-Patchwork-Delegate: jerinj@marvell.com Return-Path: X-Original-To: patchwork@inbox.dpdk.org Delivered-To: patchwork@inbox.dpdk.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id 2AD8042214; Thu, 31 Aug 2023 18:47:51 +0200 (CEST) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id ACB5940285; Thu, 31 Aug 2023 18:47:50 +0200 (CEST) Received: from mgamail.intel.com (mgamail.intel.com [192.55.52.136]) by mails.dpdk.org (Postfix) with ESMTP id 2D2FA4027B for ; Thu, 31 Aug 2023 18:47:48 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1693500469; x=1725036469; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=e1gYnforJyllAPJWL5wHAxr6FexckH70KJnb0FPNKRU=; b=LlZTxhZqdI6b3iWTp7paaWRmoa8Jp0PkCcAqc9Ylb1x8IJm/S27tM/0S K4mWUxzL6NpCmfToqmJ2mDJf1LNe8rUGHAzr4vbR9E7IA7hs6HLUAS7NV NmIf0WtMOfELe2zhhvruSIywPQ5yOToAygoeHHzvhDUAZcxLm84FNtnLH qJe4uGNPjLWQ6AWqoIXndp5e59F388mS/khPJafjByvQxWiIfL7y9Zq8M 1Q9hU/zqZtFvRk13+3nsMUGeFd29kBhrmUvQMQEAj3m4mx/XJu5Hwhr/U 9/YBmz+9isOW28iHOXUjXRapKE+YhYDjPg3xaXRIRHpO7MxZGUnYFWGaO Q==; X-IronPort-AV: E=McAfee;i="6600,9927,10819"; a="355499851" X-IronPort-AV: E=Sophos;i="6.02,217,1688454000"; d="scan'208";a="355499851" Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga106.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 31 Aug 2023 09:47:48 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10819"; a="863163395" X-IronPort-AV: E=Sophos;i="6.02,217,1688454000"; d="scan'208";a="863163395" Received: from silpixa00401454.ir.intel.com ([10.55.128.147]) by orsmga004.jf.intel.com with ESMTP; 31 Aug 2023 09:47:46 -0700 From: Harry van Haaren To: dev@dpdk.org Cc: Harry van Haaren Subject: [PATCH 2/2] event/sw: fix ordering corruption with op release Date: Thu, 31 Aug 2023 17:47:36 +0100 Message-Id: <20230831164736.2472671-2-harry.van.haaren@intel.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20230831164736.2472671-1-harry.van.haaren@intel.com> References: <20230831164736.2472671-1-harry.van.haaren@intel.com> MIME-Version: 1.0 X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org This commit changes the logic in the scheduler to always reset reorder-buffer entries in the QE_FLAG_COMPLETE path, and not just the QE_FLAG_VALID path. A release event is a COMPLETE but not VALID (no new event). As a result, release events previously left the history-list in an inconsistent state, and future events with op type of forward could be incorrectly reordered. Signed-off-by: Harry van Haaren --- drivers/event/sw/sw_evdev_scheduler.c | 45 ++++++++++++++++----------- 1 file changed, 27 insertions(+), 18 deletions(-) diff --git a/drivers/event/sw/sw_evdev_scheduler.c b/drivers/event/sw/sw_evdev_scheduler.c index 8bc21944f5..9ee6698525 100644 --- a/drivers/event/sw/sw_evdev_scheduler.c +++ b/drivers/event/sw/sw_evdev_scheduler.c @@ -360,10 +360,15 @@ __pull_port_lb(struct sw_evdev *sw, uint32_t port_id, int allow_reorder) while (port->pp_buf_count) { const struct rte_event *qe = &port->pp_buf[port->pp_buf_start]; - struct sw_hist_list_entry *hist_entry = NULL; uint8_t flags = qe->op; const uint16_t eop = !(flags & QE_FLAG_NOT_EOP); - int needs_reorder = 0; + + /* rob_entry being NULL or a value is used as the distinction + * between reordering being required (mark ROB as ready) or + * just an Atomic completion. + */ + struct reorder_buffer_entry *rob_ptr = NULL; + /* if no-reordering, having PARTIAL == NEW */ if (!allow_reorder && !eop) flags = QE_FLAG_VALID; @@ -386,6 +391,7 @@ __pull_port_lb(struct sw_evdev *sw, uint32_t port_id, int allow_reorder) const uint32_t hist_tail = port->hist_tail & (SW_PORT_HIST_LIST - 1); + struct sw_hist_list_entry *hist_entry; hist_entry = &port->hist_list[hist_tail]; const uint32_t hist_qid = hist_entry->qid; const uint32_t hist_fid = hist_entry->fid; @@ -396,17 +402,24 @@ __pull_port_lb(struct sw_evdev *sw, uint32_t port_id, int allow_reorder) if (fid->pcount == 0) fid->cq = -1; + /* Assign current hist-list entry to the rob_entry, to + * allow VALID code below re-use it for checks. + */ + rob_ptr = hist_entry->rob_entry; + + /* Clear the rob entry in this COMPLETE flag phase, as + * RELEASE events must clear hist-list, but MIGHT NOT + * contain a VALID flag too. + */ + hist_entry->rob_entry = NULL; + if (allow_reorder) { - /* set reorder ready if an ordered QID */ - uintptr_t rob_ptr = - (uintptr_t)hist_entry->rob_entry; const uintptr_t valid = (rob_ptr != 0); - needs_reorder = valid; - rob_ptr |= - ((valid - 1) & (uintptr_t)&dummy_rob); + uintptr_t tmp = (uintptr_t)rob_ptr; + tmp |= ((valid - 1) & (uintptr_t)&dummy_rob); struct reorder_buffer_entry *tmp_rob_ptr = - (struct reorder_buffer_entry *)rob_ptr; - tmp_rob_ptr->ready = eop * needs_reorder; + (struct reorder_buffer_entry *)tmp; + tmp_rob_ptr->ready = eop * valid; } port->inflights -= eop; @@ -415,22 +428,18 @@ __pull_port_lb(struct sw_evdev *sw, uint32_t port_id, int allow_reorder) if (flags & QE_FLAG_VALID) { port->stats.rx_pkts++; - if (allow_reorder && needs_reorder) { - struct reorder_buffer_entry *rob_entry = - hist_entry->rob_entry; - - hist_entry->rob_entry = NULL; + if (allow_reorder && rob_ptr) { /* Although fragmentation not currently * supported by eventdev API, we support it * here. Open: How do we alert the user that * they've exceeded max frags? */ - int num_frag = rob_entry->num_fragments; + int num_frag = rob_ptr->num_fragments; if (num_frag == SW_FRAGMENTS_MAX) sw->stats.rx_dropped++; else { - int idx = rob_entry->num_fragments++; - rob_entry->fragments[idx] = *qe; + int idx = rob_ptr->num_fragments++; + rob_ptr->fragments[idx] = *qe; } goto end_qe; }