cluster-devel.redhat.com archive mirror
 help / color / mirror / Atom feed
* [Cluster-devel] problem about dlm posix file lock (sorry for missing subjuct)
       [not found] <561D4DDD020000D40000DB70@relay2.provo.novell.com>
@ 2015-10-13 14:36 ` David Teigland
  2015-10-14  1:09   ` [Cluster-devel] problem about dlm posix file lock (sorry for missing subject) Eric Ren
  0 siblings, 1 reply; 2+ messages in thread
From: David Teigland @ 2015-10-13 14:36 UTC (permalink / raw)
  To: cluster-devel.redhat.com

On Tue, Oct 13, 2015 at 04:30:53AM -0600, Zhen Ren wrote:
> It expects alarm timeout to send SIGALRM, and wake up the sleep process,
> as "man fcntl" says: "If a  signal  is  caught  while waiting,  then
> the call is interrupted and (after the signal handler has returned)
> returns immediately (with return value -1 and errno set to EINTR".

Hi, thanks for investigating this.  Have you verified that SIGALRM
interrupts the process on a local fs?  Is it any signal?  Is this causing
problems for a real program?

> dlm_posix_lock -> wait_event_killable
> And wait_event_killable will put process into "TASK_KILLABLE" state
> which's like "UNINTERRUPTABLE" but can be waked up by fatal signals. I
> did some tests, SIGTERM can did it, but SIGALRM cannot.

Perhaps we should replace that wait_event_killable with
wait_event_interruptible.  Could you recompile with that change and see if
it does what you want?

> Did this go against posix file lock semanteme? Any hints would be very
> appreciated!  I can provide any infos as I can if needed;-)

Posix locks on a cluster file system are always going to behave somewhat
differently than a local fs, but this may be a case where the difference
can be easily avoided with the change above.  Changing the behavior after
so long is a slightly concerning, though.

Dave



^ permalink raw reply	[flat|nested] 2+ messages in thread

* [Cluster-devel] problem about dlm posix file lock (sorry for missing subject)
  2015-10-13 14:36 ` [Cluster-devel] problem about dlm posix file lock (sorry for missing subjuct) David Teigland
@ 2015-10-14  1:09   ` Eric Ren
  0 siblings, 0 replies; 2+ messages in thread
From: Eric Ren @ 2015-10-14  1:09 UTC (permalink / raw)
  To: cluster-devel.redhat.com

Hi Divad,

Please see comments in lines;-)

David Teigland <teigland@redhat.com> write?
> 
>> On Tue, Oct 13, 2015 at 04:30:53AM -0600, Zhen Ren wrote:
>> It expects alarm timeout to send SIGALRM, and wake up the sleep process,
>> as "man fcntl" says: "If a  signal  is  caught  while waiting,  then
>> the call is interrupted and (after the signal handler has returned)
>> returns immediately (with return value -1 and errno set to EINTR".
> 
> Hi, thanks for investigating this.  Have you verified that SIGALRM
> interrupts the process on a local fs?  Is it any signal?  Is this causing
> problems for a real program?

Yeah,I tried on ext4. Two processes fcntl the same file with F_SETLKW in F_WRLCK mode. The second blocked process can be interrupted by SIGALRM.

Also, for ocfs2,  on local node did the same as ext4, SIGALRM works! And I noticed that the status of the blocked process on multiple nodes was "Dl",but was "D+" on local ocfs2. Don't know if it matters. It's some odd.

> 
>> dlm_posix_lock -> wait_event_killable
>> And wait_event_killable will put process into "TASK_KILLABLE" state
>> which's like "UNINTERRUPTABLE" but can be waked up by fatal signals. I
>> did some tests, SIGTERM can did it, but SIGALRM cannot.
> 
> Perhaps we should replace that wait_event_killable with
> wait_event_interruptible.  Could you recompile with that change and see if
> it does what you want?

OK,I'll try;-)

> 
>> Did this go against posix file lock semanteme? Any hints would be very
>> appreciated!  I can provide any infos as I can if needed;-)
> 
> Posix locks on a cluster file system are always going to behave somewhat
> differently than a local fs, but this may be a case where the difference
> can be easily avoided with the change above.  Changing the behavior after
> so long is a slightly concerning, though.

Yeah, more test after change, ensure no unexpected effects.

thanks,
Eric

> 
> Dave
> 



^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2015-10-14  1:09 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <561D4DDD020000D40000DB70@relay2.provo.novell.com>
2015-10-13 14:36 ` [Cluster-devel] problem about dlm posix file lock (sorry for missing subjuct) David Teigland
2015-10-14  1:09   ` [Cluster-devel] problem about dlm posix file lock (sorry for missing subject) Eric Ren

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).