[dpdk-dev] net/mlx5: fix port dead lock when port shutdown

Message ID 20180524152005.1833-1-xuemingl@mellanox.com (mailing list archive)
State Superseded, archived
Delegated to: Shahaf Shuler
Headers

Checks

Context Check Description
ci/checkpatch success coding style OK
ci/Intel-compilation success Compilation OK

Commit Message

Xueming Li May 24, 2018, 3:20 p.m. UTC
  This patch add check to avoid init device MR data twcie which caused
priv data linked to itself in list.

Fixes: 974f1e7ef146 ("net/mlx5: add new memory region support")
Cc: yskoh@mellanox.com

Signed-off-by: Xueming Li <xuemingl@mellanox.com>
---
 drivers/net/mlx5/mlx5_ethdev.c | 2 ++
 drivers/net/mlx5/mlx5_mr.c     | 1 +
 2 files changed, 3 insertions(+)
  

Comments

Yongseok Koh May 24, 2018, 6:29 p.m. UTC | #1
> On May 24, 2018, at 8:20 AM, Xueming Li <xuemingl@mellanox.com> wrote:
> 
> This patch add check to avoid init device MR data twcie which caused
> priv data linked to itself in list.

A typo.

> Fixes: 974f1e7ef146 ("net/mlx5: add new memory region support")
> Cc: yskoh@mellanox.com
> 
> Signed-off-by: Xueming Li <xuemingl@mellanox.com>
> ---
> drivers/net/mlx5/mlx5_ethdev.c | 2 ++
> drivers/net/mlx5/mlx5_mr.c     | 1 +
> 2 files changed, 3 insertions(+)
> 
> diff --git a/drivers/net/mlx5/mlx5_ethdev.c b/drivers/net/mlx5/mlx5_ethdev.c
> index c52ec6fb7..7ffd17ff3 100644
> --- a/drivers/net/mlx5/mlx5_ethdev.c
> +++ b/drivers/net/mlx5/mlx5_ethdev.c
> @@ -398,6 +398,8 @@ mlx5_dev_configure(struct rte_eth_dev *dev)
> 	 * deadlock. If it overflows, lookup should be done by searching MR list
> 	 * linearly, which is slow.
> 	 */
> +	if (priv->mr.cache.table)
> +		return 0;

Thanks, nice catch!

Adrien submitted a fix [1] for the similar issue. Adding device to the memory event
callback list is moved to mlx5_pci_probe() as it should be done once. So, how about
moving the rest (mlx5_mr_btree_init()) to mlx5_pci_probe() as well? I mean, instead
of adding this check, you can move the btree_init() above the line where Adrien moved
LIST_INSERT_HEAD() to. You'll need to rebase on Adrien's patch and add a comment to
Shahaf there's dependency.


[1]
net/mlx5: fix crash when configure is not called
http://dpdk.org/dev/patchwork/patch/40404/


Yongseok

> 	if (mlx5_mr_btree_init(&priv->mr.cache, MLX5_MR_BTREE_CACHE_N * 2,
> 			       dev->device->numa_node)) {
> 		/* rte_errno is already set. */
> diff --git a/drivers/net/mlx5/mlx5_mr.c b/drivers/net/mlx5/mlx5_mr.c
> index abb1f5179..08105a443 100644
> --- a/drivers/net/mlx5/mlx5_mr.c
> +++ b/drivers/net/mlx5/mlx5_mr.c
> @@ -191,6 +191,7 @@ mlx5_mr_btree_init(struct mlx5_mr_btree *bt, int n, int socket)
> 		rte_errno = EINVAL;
> 		return -rte_errno;
> 	}
> +	assert(!bt->table && !bt->size);
> 	memset(bt, 0, sizeof(*bt));
> 	bt->table = rte_calloc_socket("B-tree table",
> 				      n, sizeof(struct mlx5_mr_cache),
> -- 
> 2.13.3
>
  
Yongseok Koh May 24, 2018, 6:33 p.m. UTC | #2
> On May 24, 2018, at 11:29 AM, Yongseok Koh <yskoh@mellanox.com> wrote:
> 
>> 
>> On May 24, 2018, at 8:20 AM, Xueming Li <xuemingl@mellanox.com> wrote:
>> 
>> This patch add check to avoid init device MR data twcie which caused
>> priv data linked to itself in list.
> 
> A typo.
> 
>> Fixes: 974f1e7ef146 ("net/mlx5: add new memory region support")
>> Cc: yskoh@mellanox.com
>> 
>> Signed-off-by: Xueming Li <xuemingl@mellanox.com>
>> ---
>> drivers/net/mlx5/mlx5_ethdev.c | 2 ++
>> drivers/net/mlx5/mlx5_mr.c     | 1 +
>> 2 files changed, 3 insertions(+)
>> 
>> diff --git a/drivers/net/mlx5/mlx5_ethdev.c b/drivers/net/mlx5/mlx5_ethdev.c
>> index c52ec6fb7..7ffd17ff3 100644
>> --- a/drivers/net/mlx5/mlx5_ethdev.c
>> +++ b/drivers/net/mlx5/mlx5_ethdev.c
>> @@ -398,6 +398,8 @@ mlx5_dev_configure(struct rte_eth_dev *dev)
>> 	 * deadlock. If it overflows, lookup should be done by searching MR list
>> 	 * linearly, which is slow.
>> 	 */
>> +	if (priv->mr.cache.table)
>> +		return 0;
> 
> Thanks, nice catch!
> 
> Adrien submitted a fix [1] for the similar issue. Adding device to the memory event
> callback list is moved to mlx5_pci_probe() as it should be done once. So, how about
> moving the rest (mlx5_mr_btree_init()) to mlx5_pci_probe() as well? I mean, instead
> of adding this check, you can move the btree_init() above the line where Adrien moved
> LIST_INSERT_HEAD() to. You'll need to rebase on Adrien's patch and add a comment to
> Shahaf there's dependency.
> 
> 
> [1]
> net/mlx5: fix crash when configure is not called
> https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdpdk.org%2Fdev%2Fpatchwork%2Fpatch%2F40404%2F&data=02%7C01%7Cyskoh%40mellanox.com%7Ca731aadfe6834a7a435f08d5c1a45fcf%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7C0%7C636627834121490391&sdata=D8HjhDMzMfRucGCTSWCieRkNmptGXLJHo8it0Py%2F3gY%3D&reserved=0

And you might want to write the same patch for mlx4.

Thanks,
Yongseok

>> 	if (mlx5_mr_btree_init(&priv->mr.cache, MLX5_MR_BTREE_CACHE_N * 2,
>> 			       dev->device->numa_node)) {
>> 		/* rte_errno is already set. */
>> diff --git a/drivers/net/mlx5/mlx5_mr.c b/drivers/net/mlx5/mlx5_mr.c
>> index abb1f5179..08105a443 100644
>> --- a/drivers/net/mlx5/mlx5_mr.c
>> +++ b/drivers/net/mlx5/mlx5_mr.c
>> @@ -191,6 +191,7 @@ mlx5_mr_btree_init(struct mlx5_mr_btree *bt, int n, int socket)
>> 		rte_errno = EINVAL;
>> 		return -rte_errno;
>> 	}
>> +	assert(!bt->table && !bt->size);
>> 	memset(bt, 0, sizeof(*bt));
>> 	bt->table = rte_calloc_socket("B-tree table",
>> 				      n, sizeof(struct mlx5_mr_cache),
>> -- 
>> 2.13.3
  
Xueming Li May 25, 2018, 6:04 a.m. UTC | #3
> -----Original Message-----

> From: Yongseok Koh

> Sent: Friday, May 25, 2018 2:30 AM

> To: Xueming(Steven) Li <xuemingl@mellanox.com>

> Cc: Shahaf Shuler <shahafs@mellanox.com>; dev@dpdk.org; Adrien Mazarguil <adrien.mazarguil@6wind.com>

> Subject: Re: [PATCH] net/mlx5: fix port dead lock when port shutdown

> 

> 

> > On May 24, 2018, at 8:20 AM, Xueming Li <xuemingl@mellanox.com> wrote:

> >

> > This patch add check to avoid init device MR data twcie which caused

> > priv data linked to itself in list.

> 

> A typo.

> 

> > Fixes: 974f1e7ef146 ("net/mlx5: add new memory region support")

> > Cc: yskoh@mellanox.com

> >

> > Signed-off-by: Xueming Li <xuemingl@mellanox.com>

> > ---

> > drivers/net/mlx5/mlx5_ethdev.c | 2 ++

> > drivers/net/mlx5/mlx5_mr.c     | 1 +

> > 2 files changed, 3 insertions(+)

> >

> > diff --git a/drivers/net/mlx5/mlx5_ethdev.c

> > b/drivers/net/mlx5/mlx5_ethdev.c index c52ec6fb7..7ffd17ff3 100644

> > --- a/drivers/net/mlx5/mlx5_ethdev.c

> > +++ b/drivers/net/mlx5/mlx5_ethdev.c

> > @@ -398,6 +398,8 @@ mlx5_dev_configure(struct rte_eth_dev *dev)

> > 	 * deadlock. If it overflows, lookup should be done by searching MR list

> > 	 * linearly, which is slow.

> > 	 */

> > +	if (priv->mr.cache.table)

> > +		return 0;

> 

> Thanks, nice catch!

> 

> Adrien submitted a fix [1] for the similar issue. Adding device to the memory event callback list is

> moved to mlx5_pci_probe() as it should be done once. So, how about moving the rest

> (mlx5_mr_btree_init()) to mlx5_pci_probe() as well? I mean, instead of adding this check, you can move

> the btree_init() above the line where Adrien moved

> LIST_INSERT_HEAD() to. You'll need to rebase on Adrien's patch and add a comment to Shahaf there's

> dependency.

> 


Adrien's patch seems resolved the dead lock issue as well :) 
Moving mlx5_mr_btree_init() not related to it, I'll start a new patch.

> 

> [1]

> net/mlx5: fix crash when configure is not called http://dpdk.org/dev/patchwork/patch/40404/

> 

> 

> Yongseok

> 

> > 	if (mlx5_mr_btree_init(&priv->mr.cache, MLX5_MR_BTREE_CACHE_N * 2,

> > 			       dev->device->numa_node)) {

> > 		/* rte_errno is already set. */

> > diff --git a/drivers/net/mlx5/mlx5_mr.c b/drivers/net/mlx5/mlx5_mr.c

> > index abb1f5179..08105a443 100644

> > --- a/drivers/net/mlx5/mlx5_mr.c

> > +++ b/drivers/net/mlx5/mlx5_mr.c

> > @@ -191,6 +191,7 @@ mlx5_mr_btree_init(struct mlx5_mr_btree *bt, int n, int socket)

> > 		rte_errno = EINVAL;

> > 		return -rte_errno;

> > 	}

> > +	assert(!bt->table && !bt->size);

> > 	memset(bt, 0, sizeof(*bt));

> > 	bt->table = rte_calloc_socket("B-tree table",

> > 				      n, sizeof(struct mlx5_mr_cache),

> > --

> > 2.13.3

> >
  
Xueming Li May 25, 2018, 6:40 a.m. UTC | #4
Please bypass this patch, new patch submitted at:
	http://www.dpdk.org/dev/patchwork/patch/40412/

> -----Original Message-----

> From: Xueming(Steven) Li

> Sent: Friday, May 25, 2018 2:04 PM

> To: Yongseok Koh <yskoh@mellanox.com>

> Cc: Shahaf Shuler <shahafs@mellanox.com>; dev@dpdk.org; Adrien Mazarguil <adrien.mazarguil@6wind.com>

> Subject: RE: [PATCH] net/mlx5: fix port dead lock when port shutdown

> 

> 

> 

> > -----Original Message-----

> > From: Yongseok Koh

> > Sent: Friday, May 25, 2018 2:30 AM

> > To: Xueming(Steven) Li <xuemingl@mellanox.com>

> > Cc: Shahaf Shuler <shahafs@mellanox.com>; dev@dpdk.org; Adrien

> > Mazarguil <adrien.mazarguil@6wind.com>

> > Subject: Re: [PATCH] net/mlx5: fix port dead lock when port shutdown

> >

> >

> > > On May 24, 2018, at 8:20 AM, Xueming Li <xuemingl@mellanox.com> wrote:

> > >

> > > This patch add check to avoid init device MR data twcie which caused

> > > priv data linked to itself in list.

> >

> > A typo.

> >

> > > Fixes: 974f1e7ef146 ("net/mlx5: add new memory region support")

> > > Cc: yskoh@mellanox.com

> > >

> > > Signed-off-by: Xueming Li <xuemingl@mellanox.com>

> > > ---

> > > drivers/net/mlx5/mlx5_ethdev.c | 2 ++

> > > drivers/net/mlx5/mlx5_mr.c     | 1 +

> > > 2 files changed, 3 insertions(+)

> > >

> > > diff --git a/drivers/net/mlx5/mlx5_ethdev.c

> > > b/drivers/net/mlx5/mlx5_ethdev.c index c52ec6fb7..7ffd17ff3 100644

> > > --- a/drivers/net/mlx5/mlx5_ethdev.c

> > > +++ b/drivers/net/mlx5/mlx5_ethdev.c

> > > @@ -398,6 +398,8 @@ mlx5_dev_configure(struct rte_eth_dev *dev)

> > > 	 * deadlock. If it overflows, lookup should be done by searching MR list

> > > 	 * linearly, which is slow.

> > > 	 */

> > > +	if (priv->mr.cache.table)

> > > +		return 0;

> >

> > Thanks, nice catch!

> >

> > Adrien submitted a fix [1] for the similar issue. Adding device to the

> > memory event callback list is moved to mlx5_pci_probe() as it should

> > be done once. So, how about moving the rest

> > (mlx5_mr_btree_init()) to mlx5_pci_probe() as well? I mean, instead of

> > adding this check, you can move the btree_init() above the line where

> > Adrien moved

> > LIST_INSERT_HEAD() to. You'll need to rebase on Adrien's patch and add

> > a comment to Shahaf there's dependency.

> >

> 

> Adrien's patch seems resolved the dead lock issue as well :) Moving mlx5_mr_btree_init() not related

> to it, I'll start a new patch.

> 

> >

> > [1]

> > net/mlx5: fix crash when configure is not called

> > http://dpdk.org/dev/patchwork/patch/40404/

> >

> >

> > Yongseok

> >

> > > 	if (mlx5_mr_btree_init(&priv->mr.cache, MLX5_MR_BTREE_CACHE_N * 2,

> > > 			       dev->device->numa_node)) {

> > > 		/* rte_errno is already set. */

> > > diff --git a/drivers/net/mlx5/mlx5_mr.c b/drivers/net/mlx5/mlx5_mr.c

> > > index abb1f5179..08105a443 100644

> > > --- a/drivers/net/mlx5/mlx5_mr.c

> > > +++ b/drivers/net/mlx5/mlx5_mr.c

> > > @@ -191,6 +191,7 @@ mlx5_mr_btree_init(struct mlx5_mr_btree *bt, int n, int socket)

> > > 		rte_errno = EINVAL;

> > > 		return -rte_errno;

> > > 	}

> > > +	assert(!bt->table && !bt->size);

> > > 	memset(bt, 0, sizeof(*bt));

> > > 	bt->table = rte_calloc_socket("B-tree table",

> > > 				      n, sizeof(struct mlx5_mr_cache),

> > > --

> > > 2.13.3

> > >
  

Patch

diff --git a/drivers/net/mlx5/mlx5_ethdev.c b/drivers/net/mlx5/mlx5_ethdev.c
index c52ec6fb7..7ffd17ff3 100644
--- a/drivers/net/mlx5/mlx5_ethdev.c
+++ b/drivers/net/mlx5/mlx5_ethdev.c
@@ -398,6 +398,8 @@  mlx5_dev_configure(struct rte_eth_dev *dev)
 	 * deadlock. If it overflows, lookup should be done by searching MR list
 	 * linearly, which is slow.
 	 */
+	if (priv->mr.cache.table)
+		return 0;
 	if (mlx5_mr_btree_init(&priv->mr.cache, MLX5_MR_BTREE_CACHE_N * 2,
 			       dev->device->numa_node)) {
 		/* rte_errno is already set. */
diff --git a/drivers/net/mlx5/mlx5_mr.c b/drivers/net/mlx5/mlx5_mr.c
index abb1f5179..08105a443 100644
--- a/drivers/net/mlx5/mlx5_mr.c
+++ b/drivers/net/mlx5/mlx5_mr.c
@@ -191,6 +191,7 @@  mlx5_mr_btree_init(struct mlx5_mr_btree *bt, int n, int socket)
 		rte_errno = EINVAL;
 		return -rte_errno;
 	}
+	assert(!bt->table && !bt->size);
 	memset(bt, 0, sizeof(*bt));
 	bt->table = rte_calloc_socket("B-tree table",
 				      n, sizeof(struct mlx5_mr_cache),