RE: [PATCH] media: staging/intel-ipu3: Implement lock for stream on/off operations

From: Mani, Rajmohan
Date: Fri Feb 01 2019 - 14:13:22 EST


Hi Sakari,

> Subject: Re: [PATCH] media: staging/intel-ipu3: Implement lock for stream
> on/off operations
>
> Hi Raj,
>
> On Wed, Jan 30, 2019 at 05:17:15PM +0000, Mani, Rajmohan wrote:
> > Hi Sakari,
> >
> > > -----Original Message-----
> > > From: Sakari Ailus [mailto:sakari.ailus@xxxxxxxxxxxxxxx]
> > > Sent: Wednesday, January 30, 2019 12:59 AM
> > > To: Mani, Rajmohan <rajmohan.mani@xxxxxxxxx>
> > > Cc: Mauro Carvalho Chehab <mchehab@xxxxxxxxxx>; Greg Kroah-Hartman
> > > <gregkh@xxxxxxxxxxxxxxxxxxx>; linux-media@xxxxxxxxxxxxxxx;
> > > devel@xxxxxxxxxxxxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx; Laurent
> > > Pinchart <laurent.pinchart@xxxxxxxxxxxxxxxx>; Jacopo Mondi
> > > <jacopo@xxxxxxxxxx>; Qiu, Tian Shu <tian.shu.qiu@xxxxxxxxx>; Cao,
> > > Bingbu <bingbu.cao@xxxxxxxxx>; Zhi@xxxxxxxxxxxxxxxxxxxxxx; Zhi, Yong
> > > <yong.zhi@xxxxxxxxx>; hverkuil@xxxxxxxxx; tfiga@xxxxxxxxxxxx
> > > Subject: Re: [PATCH] media: staging/intel-ipu3: Implement lock for
> > > stream on/off operations
> > >
> > > Hi Rajmohan,
> > >
> > > On Tue, Jan 29, 2019 at 02:27:36PM -0800, Rajmohan Mani wrote:
> > > > Currently concurrent stream off operations on ImgU nodes are not
> > > > synchronized, leading to use-after-free bugs (as reported by KASAN).
> > > >
> > > > > [ 250.090724] BUG: KASAN: use-after-free in ipu3_dmamap_free+0xc5/0x116 [ipu3_imgu]
> > > > > [ 250.090726] Read of size 8 at addr ffff888127b29bc0 by task yavta/18836
> > > > > [ 250.090731] Hardware name: HP Soraka/Soraka, BIOS Google_Soraka.10431.17.0 03/22/2018
> > > > > [ 250.090732] Call Trace:
> > > > > [ 250.090735] dump_stack+0x6a/0xb1
> > > > > [ 250.090739] print_address_description+0x8e/0x279
> > > > > [ 250.090743] ? ipu3_dmamap_free+0xc5/0x116 [ipu3_imgu]
> > > > > [ 250.090746] kasan_report+0x260/0x28a
> > > > > [ 250.090750] ipu3_dmamap_free+0xc5/0x116 [ipu3_imgu]
> > > > > [ 250.090754] ipu3_css_pool_cleanup+0x24/0x37 [ipu3_imgu]
> > > > > [ 250.090759] ipu3_css_pipeline_cleanup+0x61/0xb9 [ipu3_imgu]
> > > > > [ 250.090763] ipu3_css_stop_streaming+0x1f2/0x321 [ipu3_imgu]
> > > > > [ 250.090768] imgu_s_stream+0x94/0x443 [ipu3_imgu]
> > > > > [ 250.090772] ? ipu3_vb2_buf_queue+0x280/0x280 [ipu3_imgu]
> > > > > [ 250.090775] ? vb2_dma_sg_unmap_dmabuf+0x16/0x6f [videobuf2_dma_sg]
> > > > > [ 250.090778] ? vb2_buffer_in_use+0x36/0x58 [videobuf2_common]
> > > > > [ 250.090782] ipu3_vb2_stop_streaming+0xf9/0x135 [ipu3_imgu]
> > > >
> > > > Implemented a lock to synchronize imgu stream on / off operations
> > > > and the modification of streaming flag (in struct imgu_device), to
> > > > prevent these issues.
> > > >
> > > > Reported-by: Laurent Pinchart <laurent.pinchart@xxxxxxxxxxxxxxxx>
> > > > Suggested-by: Laurent Pinchart <laurent.pinchart@xxxxxxxxxxxxxxxx>
> > > >
> > > > Signed-off-by: Rajmohan Mani <rajmohan.mani@xxxxxxxxx>
> > > > ---
> > > > drivers/staging/media/ipu3/ipu3-v4l2.c | 6 ++++++
> > > > drivers/staging/media/ipu3/ipu3.c | 3 +++
> > > > drivers/staging/media/ipu3/ipu3.h | 4 ++++
> > > > 3 files changed, 13 insertions(+)
> > > >
> > > > diff --git a/drivers/staging/media/ipu3/ipu3-v4l2.c
> > > > b/drivers/staging/media/ipu3/ipu3-v4l2.c
> > > > index c7936032beb9..cf7e917cd0c8 100644
> > > > --- a/drivers/staging/media/ipu3/ipu3-v4l2.c
> > > > +++ b/drivers/staging/media/ipu3/ipu3-v4l2.c
> > > > @@ -507,12 +507,15 @@ static int ipu3_vb2_start_streaming(struct
> > > vb2_queue *vq, unsigned int count)
> > > > goto fail_stop_pipeline;
> > > > }
> > > >
> > > > + mutex_lock(&imgu->streaming_lock);
> > > > +
> > >
> > > You appear to be using imgu_device.lock (while searching buffers to
> > > queue to the device) as well as imgu_video_device.lock (qbuf, dqbuf)
> > > to serialise access to imgu_video_device.buffers list.
> >
> > Ack
> >
> > > The two locks may be acquired at the same time but each by different
> > > processes. That needs to be addressed, but probably not in this
> > > patch.
> > >
> >
> > The node specific locks will be used by different processes and all of
> > these processes will be competing commonly (and successfully) for the
> > imgu_device lock.
> > I will look into this more.
> >
> > > I wonder if it'd be more simple to use imgu->lock here instead of
> > > adding a new one.
> > >
> >
> > Extending imgu->lock here does not work in this case, as
> > imgu_queue_buffers() will be stuck acquiring imgu->lock, which was
> > already acquired by imgu_s_stream() through ipu3_vb2_start_streaming().
>
> You could move acquiring the lock out of these functions. It would also seem
> that there is device-wide streaming state etc. information to which the access
> should also be serialised. Currently it's relying on the node-specific lock only
> which does not help.
>

Ack. Let me look into this more.
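
For illustration only, here is a userspace sketch of that restructuring.
It uses pthread mutexes as a stand-in for the kernel mutex, and every
fake_* name is hypothetical rather than taken from the driver. The
error-checking mutex type is an assumption made so that the self-relock
reports EDEADLK instead of hanging; a kernel mutex would simply block.

```c
#define _POSIX_C_SOURCE 200809L	/* for PTHREAD_MUTEX_ERRORCHECK */
#include <assert.h>
#include <errno.h>
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical stand-in for struct imgu_device; not the driver's layout. */
struct fake_imgu {
	pthread_mutex_t lock;
	bool streaming;
	int queued;
};

static int fake_imgu_init(struct fake_imgu *imgu)
{
	pthread_mutexattr_t attr;
	int r;

	r = pthread_mutexattr_init(&attr);
	if (r)
		return r;
	/* Error-checking type: a relock by the owner fails instead of hanging. */
	r = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ERRORCHECK);
	if (!r)
		r = pthread_mutex_init(&imgu->lock, &attr);
	pthread_mutexattr_destroy(&attr);
	imgu->streaming = false;
	imgu->queued = 0;
	return r;
}

/* Caller must hold imgu->lock. */
static void fake_queue_buffers_locked(struct fake_imgu *imgu)
{
	imgu->queued++;
}

/* Problematic shape: takes the lock itself, so a caller that already
 * owns the lock cannot use it. */
static int fake_queue_buffers(struct fake_imgu *imgu)
{
	int r = pthread_mutex_lock(&imgu->lock);

	if (r)
		return r;	/* EDEADLK when the caller already owns the lock */
	fake_queue_buffers_locked(imgu);
	pthread_mutex_unlock(&imgu->lock);
	return 0;
}

/* Caller must hold imgu->lock. */
static int fake_s_stream_locked(struct fake_imgu *imgu, bool enable)
{
	if (enable)
		fake_queue_buffers_locked(imgu);
	imgu->streaming = enable;
	return 0;
}

/* The lock is taken once, at the outermost entry point. */
static int fake_start_streaming(struct fake_imgu *imgu)
{
	int r, unlock_err;

	r = pthread_mutex_lock(&imgu->lock);
	if (r)
		return r;
	r = fake_s_stream_locked(imgu, true);
	unlock_err = pthread_mutex_unlock(&imgu->lock);
	assert(unlock_err == 0);	/* would be EPERM if we were not the owner */
	(void)unlock_err;
	return r;
}
```

The point of the sketch is only the split between an outer function that
takes the lock and *_locked helpers that expect it to be held, which also
lets the device-wide streaming state be updated under the same lock.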

> Can you grab the lock right after dev_dbg() line in the function?
>

I placed the lock here to keep the amount of code that runs with the lock
held to a minimum.

Do you see any issues with calling into the imgu_sd subdevs and
media_pipeline_start() without the lock held?

> The lock should be also acquired before testing imgu->streaming in
> ipu3_vb2_buf_queue() to make sure it won't change in the meantime.
>

I considered this and decided against it, since in a typical camera use case
it is unlikely that multiple application instances, each using a single node,
run simultaneously.
Please let me know if you see significant issues without the lock here, and I
will add it there as well.
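
For reference, a userspace sketch of what taking the lock around the
streaming test would look like. pthread mutexes stand in for the kernel
mutex, and the fake_* names and the hw_queued / pending counters are
hypothetical, not the driver's data structures.

```c
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical device state; not the layout of struct imgu_device. */
struct fake_dev {
	pthread_mutex_t streaming_lock;
	bool streaming;
	int hw_queued;	/* buffers handed to the "hardware" */
	int pending;	/* buffers kept back until streaming starts */
};

/* The flag is read and acted upon under the same lock that protects
 * writes to it, so it cannot change between the test and the queuing. */
static void fake_buf_queue(struct fake_dev *dev)
{
	pthread_mutex_lock(&dev->streaming_lock);
	if (dev->streaming)
		dev->hw_queued++;
	else
		dev->pending++;
	pthread_mutex_unlock(&dev->streaming_lock);
}

/* Buffers are returned before the lock is dropped, so a concurrent
 * fake_buf_queue() sees either the old state or the fully stopped one. */
static void fake_stop_streaming(struct fake_dev *dev)
{
	pthread_mutex_lock(&dev->streaming_lock);
	dev->streaming = false;
	dev->hw_queued = 0;
	pthread_mutex_unlock(&dev->streaming_lock);
}
```

Without the lock in fake_buf_queue(), a stop could land between the test of
the flag and the increment, leaving a buffer queued on a stopped device.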

> >
> > > > /* Start streaming of the whole pipeline now */
> > > > dev_dbg(dev, "IMGU streaming is ready to start");
> > > > r = imgu_s_stream(imgu, true);
> > > > if (!r)
> > > > imgu->streaming = true;
> > > >
> > > > + mutex_unlock(&imgu->streaming_lock);
> > > > return 0;
> > > >
> > > > fail_stop_pipeline:
> > > > @@ -543,6 +546,8 @@ static void ipu3_vb2_stop_streaming(struct
> > > vb2_queue *vq)
> > > > dev_err(&imgu->pci_dev->dev,
> > > > "failed to stop subdev streaming\n");
> > > >
> > > > + mutex_lock(&imgu->streaming_lock);
> > > > +
> > > > /* Was this the first node with streaming disabled? */
> > > > if (imgu->streaming && ipu3_all_nodes_streaming(imgu, node)) {
> > > > > /* Yes, really stop streaming now */
> > > > > @@ -552,6 +557,7 @@ static void ipu3_vb2_stop_streaming(struct vb2_queue *vq)
> > > > imgu->streaming = false;
> > > > }
> > > >
> > > > + mutex_unlock(&imgu->streaming_lock);
> > > > ipu3_return_all_buffers(imgu, node, VB2_BUF_STATE_ERROR);
>
> I'd also call ipu3_return_all_buffers() before releasing the lock: in principle the
> user may have queued new buffers on the devices before the driver marks the
> buffers as faulty.
>
> > >
> > > > media_pipeline_stop(&node->vdev.entity);
> > > > }
> > > > diff --git a/drivers/staging/media/ipu3/ipu3.c
> > > > b/drivers/staging/media/ipu3/ipu3.c
> > > > index d521b3afb8b1..2daee51cd845 100644
> > > > --- a/drivers/staging/media/ipu3/ipu3.c
> > > > +++ b/drivers/staging/media/ipu3/ipu3.c
> > > > @@ -635,6 +635,7 @@ static int imgu_pci_probe(struct pci_dev
> *pci_dev,
> > > > return r;
> > > >
> > > > mutex_init(&imgu->lock);
> > > > + mutex_init(&imgu->streaming_lock);
> > > > atomic_set(&imgu->qbuf_barrier, 0);
> > > > init_waitqueue_head(&imgu->buf_drain_wq);
> > > >
> > > > @@ -699,6 +700,7 @@ static int imgu_pci_probe(struct pci_dev
> *pci_dev,
> > > > ipu3_css_set_powerdown(&pci_dev->dev, imgu->base);
> > > > out_mutex_destroy:
> > > > mutex_destroy(&imgu->lock);
> > > > + mutex_destroy(&imgu->streaming_lock);
> > > >
> > > > return r;
> > > > }
> > > > @@ -716,6 +718,7 @@ static void imgu_pci_remove(struct pci_dev
> > > *pci_dev)
> > > > ipu3_dmamap_exit(imgu);
> > > > ipu3_mmu_exit(imgu->mmu);
> > > > mutex_destroy(&imgu->lock);
> > > > + mutex_destroy(&imgu->streaming_lock);
> > > > }
> > > >
> > > > > static int __maybe_unused imgu_suspend(struct device *dev)
> > > > > diff --git a/drivers/staging/media/ipu3/ipu3.h b/drivers/staging/media/ipu3/ipu3.h
> > > > index 04fc99f47ebb..f732315f0701 100644
> > > > --- a/drivers/staging/media/ipu3/ipu3.h
> > > > +++ b/drivers/staging/media/ipu3/ipu3.h
> > > > @@ -146,6 +146,10 @@ struct imgu_device {
> > > > * vid_buf.list and css->queue
> > > > */
> > > > struct mutex lock;
> > > > +
> > > > + /* Lock to protect writes to streaming flag in this struct */
> > > > + struct mutex streaming_lock;
> > > > +
> > > > /* Forbit streaming and buffer queuing during system suspend. */
> > > > atomic_t qbuf_barrier;
> > > > /* Indicate if system suspend take place while imgu is
> > > > streaming. */
> > >
>
> --
> Sakari Ailus
> sakari.ailus@xxxxxxxxxxxxxxx