diff mbox series

[v2,1/2] t: add a test helper to truncate files

Message ID 20231012160930.330618-2-sandals@crustytoothpaste.net (mailing list archive)
State New, archived
Headers show
Series Prevent re-reading 4 GiB files on every status | expand

Commit Message

brian m. carlson Oct. 12, 2023, 4:09 p.m. UTC
From: "brian m. carlson" <bk2204@github.com>

In a future commit, we're going to work with some large files which will
be at least 4 GiB in size.  To take advantage of the sparseness
functionality on most Unix systems and avoid running the system out of
disk, it would be convenient to use truncate(2) to simply create a
sparse file of sufficient size.

However, the GNU truncate(1) utility isn't portable, so let's write a
tiny test helper that does the work for us.

Signed-off-by: brian m. carlson <bk2204@github.com>
---
 Makefile                 |  1 +
 t/helper/test-tool.c     |  1 +
 t/helper/test-tool.h     |  1 +
 t/helper/test-truncate.c | 27 +++++++++++++++++++++++++++
 4 files changed, 30 insertions(+)
 create mode 100644 t/helper/test-truncate.c

Comments

Eric Sunshine Oct. 12, 2023, 5:49 p.m. UTC | #1
On Thu, Oct 12, 2023 at 12:10 PM brian m. carlson
<sandals@crustytoothpaste.net> wrote:
> In a future commit, we're going to work with some large files which will
> be at least 4 GiB in size.  To take advantage of the sparseness
> functionality on most Unix systems and avoid running the system out of
> disk, it would be convenient to use truncate(2) to simply create a
> sparse file of sufficient size.
>
> However, the GNU truncate(1) utility isn't portable, so let's write a
> tiny test helper that does the work for us.
>
> Signed-off-by: brian m. carlson <bk2204@github.com>
> ---
> diff --git a/t/helper/test-truncate.c b/t/helper/test-truncate.c
> @@ -0,0 +1,27 @@
> +int cmd__truncate(int argc, const char **argv)
> +{
> +       char *p = NULL;
> +       uintmax_t sz = 0;
> +       int fd = -1;
> +
> +       if (argc != 3)
> +               die("expected filename and size");
> +
> +       sz = strtoumax(argv[2], &p, 0);
> +       if (*p)
> +               die("invalid size");

Do you want to check 'errno' here, as well (probably before the '*p' check)?

Or is that being too defensive for a 'test-tool' command?

> +       fd = open(argv[1], O_WRONLY | O_CREAT, 0600);
> +       if (fd < 0)
> +               die_errno("failed to open file %s", argv[1]);
> +
> +       if (ftruncate(fd, (off_t) sz) < 0)
> +               die_errno("failed to truncate file");
> +       return 0;
> +}
Junio C Hamano Oct. 12, 2023, 10:52 p.m. UTC | #2
"brian m. carlson" <sandals@crustytoothpaste.net> writes:

> +	fd = open(argv[1], O_WRONLY | O_CREAT, 0600);
> +	if (fd < 0)
> +		die_errno("failed to open file %s", argv[1]);

contrib/coccinelle/xopen.cocci tells us to write this simply as

	fd = xopen(argv[1], O_WRONLY | O_CREAT, 0600);

https://github.com/git/git/actions/runs/6500777388/job/17656837393#step:4:657
brian m. carlson Oct. 13, 2023, 8:18 p.m. UTC | #3
On 2023-10-12 at 22:52:59, Junio C Hamano wrote:
> "brian m. carlson" <sandals@crustytoothpaste.net> writes:
> 
> > +	fd = open(argv[1], O_WRONLY | O_CREAT, 0600);
> > +	if (fd < 0)
> > +		die_errno("failed to open file %s", argv[1]);
> 
> contrib/coccinelle/xopen.cocci tells us to write this simply as
> 
> 	fd = xopen(argv[1], O_WRONLY | O_CREAT, 0600);

Sure, I can do that.
brian m. carlson Oct. 13, 2023, 8:23 p.m. UTC | #4
On 2023-10-12 at 17:49:13, Eric Sunshine wrote:
> > +       sz = strtoumax(argv[2], &p, 0);
> > +       if (*p)
> > +               die("invalid size");
> 
> Do you want to check 'errno' here, as well (probably before the '*p' check)?
> 
> Or is that being too defensive for a 'test-tool' command?

I don't believe that's necessary.  The Linux manual page leads me to
believe that if *argv[2] is not 0 but *p is 0, then the entire string is
valid, which would imply that errno is not set.  I'm happy to ignore for
the moment the case where the user specifies "" as the argument, because
it is a test helper and "don't do weird, unexpected things with the test
helper without looking at the source code first" is legitimate advice
for Git developers.

If this were user-facing, improved robustness would be warranted, I
agree.
Junio C Hamano Oct. 13, 2023, 8:32 p.m. UTC | #5
"brian m. carlson" <sandals@crustytoothpaste.net> writes:

> On 2023-10-12 at 22:52:59, Junio C Hamano wrote:
>> "brian m. carlson" <sandals@crustytoothpaste.net> writes:
>> 
>> > +	fd = open(argv[1], O_WRONLY | O_CREAT, 0600);
>> > +	if (fd < 0)
>> > +		die_errno("failed to open file %s", argv[1]);
>> 
>> contrib/coccinelle/xopen.cocci tells us to write this simply as
>> 
>> 	fd = xopen(argv[1], O_WRONLY | O_CREAT, 0600);
>
> Sure, I can do that.

Unless there are other changes needed, I'll handle it on this end.

Thanks.
Jeff King Oct. 16, 2023, 11:53 p.m. UTC | #6
On Thu, Oct 12, 2023 at 04:09:29PM +0000, brian m. carlson wrote:

> +int cmd__truncate(int argc, const char **argv)
> +{
> +	char *p = NULL;
> +	uintmax_t sz = 0;
> +	int fd = -1;
> +
> +	if (argc != 3)
> +		die("expected filename and size");
> +
> +	sz = strtoumax(argv[2], &p, 0);
> +	if (*p)
> +		die("invalid size");
> +
> +	fd = open(argv[1], O_WRONLY | O_CREAT, 0600);
> +	if (fd < 0)
> +		die_errno("failed to open file %s", argv[1]);
> +
> +	if (ftruncate(fd, (off_t) sz) < 0)
> +		die_errno("failed to truncate file");
> +	return 0;
> +}

Coverity flagged this as leaking the descriptor "fd" (which is obviously
true). I guess it doesn't matter much in practice, since we're exiting
the process directly afterwards. If it were a memory leak we'd free() it
anyway to shut up the leak detector, but I don't know if we want to be
as careful here (in theory it creates noise that distracts from real
problems, but Coverity is noisy enough that I don't even bother looking
at existing problems).

-Peff
diff mbox series

Patch

diff --git a/Makefile b/Makefile
index 9c6a2f125f..03adcb5a48 100644
--- a/Makefile
+++ b/Makefile
@@ -852,6 +852,7 @@  TEST_BUILTINS_OBJS += test-submodule-nested-repo-config.o
 TEST_BUILTINS_OBJS += test-submodule.o
 TEST_BUILTINS_OBJS += test-subprocess.o
 TEST_BUILTINS_OBJS += test-trace2.o
+TEST_BUILTINS_OBJS += test-truncate.o
 TEST_BUILTINS_OBJS += test-urlmatch-normalization.o
 TEST_BUILTINS_OBJS += test-userdiff.o
 TEST_BUILTINS_OBJS += test-wildmatch.o
diff --git a/t/helper/test-tool.c b/t/helper/test-tool.c
index 9010ac6de7..876cd2dc31 100644
--- a/t/helper/test-tool.c
+++ b/t/helper/test-tool.c
@@ -86,6 +86,7 @@  static struct test_cmd cmds[] = {
 	{ "submodule-nested-repo-config", cmd__submodule_nested_repo_config },
 	{ "subprocess", cmd__subprocess },
 	{ "trace2", cmd__trace2 },
+	{ "truncate", cmd__truncate },
 	{ "userdiff", cmd__userdiff },
 	{ "urlmatch-normalization", cmd__urlmatch_normalization },
 	{ "xml-encode", cmd__xml_encode },
diff --git a/t/helper/test-tool.h b/t/helper/test-tool.h
index f134f96b97..70dd4eba11 100644
--- a/t/helper/test-tool.h
+++ b/t/helper/test-tool.h
@@ -79,6 +79,7 @@  int cmd__submodule_config(int argc, const char **argv);
 int cmd__submodule_nested_repo_config(int argc, const char **argv);
 int cmd__subprocess(int argc, const char **argv);
 int cmd__trace2(int argc, const char **argv);
+int cmd__truncate(int argc, const char **argv);
 int cmd__userdiff(int argc, const char **argv);
 int cmd__urlmatch_normalization(int argc, const char **argv);
 int cmd__xml_encode(int argc, const char **argv);
diff --git a/t/helper/test-truncate.c b/t/helper/test-truncate.c
new file mode 100644
index 0000000000..bd3fde364b
--- /dev/null
+++ b/t/helper/test-truncate.c
@@ -0,0 +1,27 @@ 
+#include "test-tool.h"
+#include "git-compat-util.h"
+
+/*
+ * Truncate a file to the given size.
+ */
+int cmd__truncate(int argc, const char **argv)
+{
+	char *p = NULL;
+	uintmax_t sz = 0;
+	int fd = -1;
+
+	if (argc != 3)
+		die("expected filename and size");
+
+	sz = strtoumax(argv[2], &p, 0);
+	if (*p)
+		die("invalid size");
+
+	fd = open(argv[1], O_WRONLY | O_CREAT, 0600);
+	if (fd < 0)
+		die_errno("failed to open file %s", argv[1]);
+
+	if (ftruncate(fd, (off_t) sz) < 0)
+		die_errno("failed to truncate file");
+	return 0;
+}