Re: [PATCH 1a/7] dlm: core locking

From: David Teigland
Date: Wed Apr 27 2005 - 22:43:22 EST

Next message: Lee Revell: "Re: [RFC][PATCH] Reduce ext3 allocate-with-reservation locklatencies"
Previous message: Li Shaohua: "Re: [PATCH]broadcast IPI race condition on CPU hotplug"
In reply to: Daniel Phillips: "Re: [PATCH 1a/7] dlm: core locking"
Next in thread: Stephen C. Tweedie: "Re: [PATCH 1a/7] dlm: core locking"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Wed, Apr 27, 2005 at 02:41:36PM -0700, Mark Fasheh wrote:

> > +#define DLM_LVB_LEN (32)
> Why so small? In OCFS2 land, a nice healthy 64 bytes helps us fit most of
> our important inode bits, thus ensuring that we don't have to go to disk to
> update our metadata in some cases :)

We were questioned for 32 being unnecessarily large when we started, which
seems to make a case for it being configurable.

> > + * DLM_LKF_EXPEDITE
> > + *
> > + * Used only with new requests for NL mode locks. Tells the lock manager
> > + * to grant the lock, ignoring other locks in convert and wait queues.

> What's happens to non DLM_LKF_EXPEDITE NL mode requests? It seems that
> an new NL mode lock should always be immediately be put on the grant
> queue...

A comment in _can_be_granted() quotes the VMS rule:

"By default, a new request is immediately granted only if all three of the
following conditions are satisfied when the request is issued:

- The queue of ungranted conversion requests for the resoure is empty.
- The queue of ungranted new requests for the resource is empty.
- The mode of the new request is compatible with the most
restrictive mode of all granted locks on the resource."

Which means without EXPEDITE it could go on the waiting queue. I suspect
EXPEDITE was invented because most people want NL requests to work as you
suggest, despite the rules.

> Where's the LKM_LOCAL equivalent? What happens a dlm user wants to create a
> lock on a resource it knows to be unique in the cluster (think file creation
> for a cfs)? Does it have to hit the network for a resource lookup on those
> locks?
>
> Perhaps I should explain how this is interpreted in the OCFS2 dlm: When
> LKM_LOCAL is provided with a request for a new lock, the normal master
> lookup process is skipped and the resource is immediately created and
> mastered to the local node. Obviously this requires that users be careful
> not to create duplicate resources in the cluster. Any future requests for
> the lock from other nodes go through the master discovery process and will
> find it on the originating node.
>
> We explicitly do not support LKM_FINDLOCAL - the notion of "local only"
> lookups does not apply as the resource is only considered to have been
> created locally and explicitly *not* hidden from the rest of the cluster.
>
> >From a light skimming of dir.c (specifically dlm_dir_name2nodeid), I have a
> hunch that our methods for determing a resource master are fundamentally
> different, which would make implementation of LKM_LOCAL (at least as I have
> described it) on your side, difficult.

Interesting, I was reading about this recently and wondered if people
really used it. I figured parent/child locks were probably a more common
way to get similar benefits.

Just to clarify, though: when the LOCAL resource is immediately created
and mastered locally, there must be a resource directory entry added for
it, right? For us, the resource directory entry is added as part of a new
master lookup (which is being skipped). If you don't add a directory
entry, how does another node that later wants to lock the same resource
(without LOCAL) discover who the master is?

If I understand LOCAL correctly, it should be simple for us to do. We'd
still have a LOCAL request _send_ the lookup to create the directory
entry, but we'd simply not wait for the reply. We'd assume, based on
LOCAL, that the lookup result indicates we're the master.

Some people don't use a resource directory, and maybe that's why
dlm_dir_name2nodeid() doesn't look familiar? That function determines the
directory node for a resource, not the master node. The nodeid returned
from that function is where we send the master lookup, and the lookup
reply says where we send the lock request.

[We'll be adding a simple config option to change this so you can operate
without a resource directory in which case there's never a master lookup
to do. The downside is that the first node to request a lock on a
resource would no longer always be the master of it.]

> > +static struct list_head ast_queue;
> > +static struct semaphore ast_queue_lock;

> Why a semaphore here? On quick inspection I'm not seeing much more than list
> operations being protected by ast_queue_lock... A spinlock might be more
> appropriate.

You're right, thanks.
Dave

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Lee Revell: "Re: [RFC][PATCH] Reduce ext3 allocate-with-reservation locklatencies"
Previous message: Li Shaohua: "Re: [PATCH]broadcast IPI race condition on CPU hotplug"
In reply to: Daniel Phillips: "Re: [PATCH 1a/7] dlm: core locking"
Next in thread: Stephen C. Tweedie: "Re: [PATCH 1a/7] dlm: core locking"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]