From: Ian Johnson <ian@ianjohnson.dev>
To: linux-btrfs@vger.kernel.org
Subject: Possible readdir regression with BTRFS
Date: Fri, 08 Sep 2023 21:07:10 -0400 [thread overview]
Message-ID: <YR1P0S.NGASEG570GJ8@ianjohnson.dev> (raw)
I was debugging an issue recently and came across what seems to be a
change in the behavior of opendir and readdir in Btrfs seemingly
starting with 9b378f6ad48cfa195ed868db9123c09ee7ec5ea2, specifically
when the contents of a directory are changed between the call to opendir
and the initial call to readdir. The following C program demonstrates
the behavior:
#import <dirent.h>
#import <stdio.h>
int main(void) {
DIR *dir = opendir("test");
FILE *file;
file = fopen("test/1", "w");
fwrite("1", 1, 1, file);
fclose(file);
file = fopen("test/2", "w");
fwrite("2", 1, 1, file);
fclose(file);
/* rewinddir(dir); */
struct dirent *entry;
while ((entry = readdir(dir))) {
printf("%s\n", entry->d_name);
}
closedir(dir);
return 0;
}
Running this program with an initially empty test directory will print
entry 1 but not entry 2 on Btrfs, while on Ext4 it prints both entries.
Evidently this particular result is not an issue, as POSIX states:
> If a file is removed from or added to the directory after the most
> recent call to opendir() or rewinddir(), whether a subsequent call to
> readdir_r() returns an entry for that file is unspecified.
However, the same program with rewinddir uncommented above yields the
same behavior, seemingly in contradiction of POSIX:
> It [rewinddir] shall also cause the directory stream to refer to the
> current state of the corresponding directory, as a call to opendir()
> would have done.
I know that the functions used here are part of the C library and not
the kernel, but given this seems to be a recent change, I wanted to
bring this up. I looked over the man pages for getdents and lseek and
didn't see any mention of guarantees (or lack thereof) in the face of
concurrent directory updates, so it's quite possible that this is within
the realm of expected/allowed behavior.
next reply other threads:[~2023-09-09 1:18 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-09-09 1:07 Ian Johnson [this message]
2023-09-09 7:27 ` Possible readdir regression with BTRFS Filipe Manana
2023-09-09 11:52 ` Evangelos Foutras
2023-09-09 12:18 ` Filipe Manana
2023-09-11 11:56 ` Filipe Manana
2023-09-09 12:16 ` Filipe Manana
2023-09-09 19:27 ` Ian Johnson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YR1P0S.NGASEG570GJ8@ianjohnson.dev \
--to=ian@ianjohnson.dev \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox