diff mbox series

reftable/writer: ensure valid range for log's update_index

Message ID 20241205-424-reftable-writer-add-check-for-limits-v1-1-b287b055204e@gmail.com (mailing list archive)
State Superseded
Headers show
Series reftable/writer: ensure valid range for log's update_index | expand

Commit Message

karthik nayak Dec. 5, 2024, 3:49 p.m. UTC
Each reftable addition has an associated update_index. While writing
refs, the update_index is verified to be within the range of the
reftable writer, i.e. `writer.min_update_index < ref.update_index` and
`writer.max_update_index > ref.update_index`.

The corresponding check for reflogs in `reftable_writer_add_log` is
however missing. Add a similar check, but only check for the upper
limit. This is because reflogs are treated a bit differently than refs.
Each reflog entry in reftable has an associated update_index and we also
allow expiring entries in the middle, which is done by simply writing a
new reflog entry with the same update_index. This means, writing reflog
entries with update_index lesser than the writer's update_index is an
expected scenario.

Add a new unit test to check for the limits and fix some of the existing
tests, which were setting arbitrary values for the update_index by
ensuring they stay within the now checked limits.

Signed-off-by: Karthik Nayak <karthik.188@gmail.com>
---
 reftable/writer.c                   | 12 ++++++++++
 t/unit-tests/t-reftable-readwrite.c | 44 +++++++++++++++++++++++++++++++++++--
 t/unit-tests/t-reftable-stack.c     |  8 +++++--
 3 files changed, 60 insertions(+), 4 deletions(-)


---



--- 

base-commit: cc01bad4a9f566cf4453c7edd6b433851b0835e2
change-id: 20241120-424-reftable-writer-add-check-for-limits-01b1fb703528

Thanks
- Karthik

Comments

Patrick Steinhardt Dec. 6, 2024, 9:06 a.m. UTC | #1
On Thu, Dec 05, 2024 at 04:49:57PM +0100, Karthik Nayak wrote:
> Each reftable addition has an associated update_index. While writing
> refs, the update_index is verified to be within the range of the
> reftable writer, i.e. `writer.min_update_index < ref.update_index` and
> `writer.max_update_index > ref.update_index`.

These should probably be `<=` and `>=`, respectively.

> diff --git a/reftable/writer.c b/reftable/writer.c
> index fd136794d5a27b33b5017f36fbd6b095ab8dac5b..f87086777cd20a9890a63f10c5d6932310dd5610 100644
> --- a/reftable/writer.c
> +++ b/reftable/writer.c
> @@ -412,6 +412,18 @@ int reftable_writer_add_log(struct reftable_writer *w,
>  	if (log->value_type == REFTABLE_LOG_DELETION)
>  		return reftable_writer_add_log_verbatim(w, log);
>  
> +	/*
> +	 * Verify only the upper limit of the update_index. Each reflog entry
> +	 * is tied to a specific update_index. Entries in the reflog can be
> +	 * replaced by adding a new entry with the same update_index,
> +	 * effectively canceling the old one.
> +	 *
> +	 * Consequently, reflog updates may include update_index values lower
> +	 * than the writer's min_update_index.
> +	 */
> +	if (log->update_index > w->max_update_index)
> +		return REFTABLE_API_ERROR;

Yup, looks sensible.

>  	if (!log->refname)
>  		return REFTABLE_API_ERROR;
>  
> diff --git a/t/unit-tests/t-reftable-readwrite.c b/t/unit-tests/t-reftable-readwrite.c
> index d279b86df0aeda11b3fb4d2c15803760ae394941..5ad1c72f6901abcfe7fdc6c3e69e26b58d0013a6 100644
> --- a/t/unit-tests/t-reftable-readwrite.c
> +++ b/t/unit-tests/t-reftable-readwrite.c
> @@ -151,6 +151,45 @@ static void t_log_overflow(void)
>  	reftable_buf_release(&buf);
>  }
>  
> +static void t_log_write_limits(void)
> +{
> +	struct reftable_write_options opts = { 0 };
> +	struct reftable_buf buf = REFTABLE_BUF_INIT;
> +	struct reftable_writer *w = t_reftable_strbuf_writer(&buf, &opts);
> +	struct reftable_log_record log = {
> +		.refname = (char *)"refs/head/master",
> +		.update_index = 1,
> +		.value_type = REFTABLE_LOG_UPDATE,
> +		.value = {
> +			.update = {
> +				.old_hash = { 1 },
> +				.new_hash = { 2 },
> +				.name = (char *)"Han-Wen Nienhuys",
> +				.email = (char *)"hanwen@google.com",
> +				.tz_offset = 100,
> +				.time = 0x5e430672,
> +			},
> +		},
> +	};
> +	int err;
> +
> +	reftable_writer_set_limits(w, 1, 2);
> +
> +	err = reftable_writer_add_log(w, &log);
> +	check_int(err, ==, 0);
> +
> +	log.update_index = 2;
> +	err = reftable_writer_add_log(w, &log);
> +	check_int(err, ==, 0);
> +
> +	log.update_index = 3;
> +	err = reftable_writer_add_log(w, &log);
> +	check_int(err, ==, REFTABLE_API_ERROR);
> +
> +	reftable_writer_free(w);
> +	reftable_buf_release(&buf);
> +}

Makes sense, as well. We could trivially extend this test to also assert
that we can successfully write a log record with update index 0, which
would be smaller than the lower bound.

Patrick
karthik nayak Dec. 6, 2024, 11:24 a.m. UTC | #2
Patrick Steinhardt <ps@pks.im> writes:

> On Thu, Dec 05, 2024 at 04:49:57PM +0100, Karthik Nayak wrote:
>> Each reftable addition has an associated update_index. While writing
>> refs, the update_index is verified to be within the range of the
>> reftable writer, i.e. `writer.min_update_index < ref.update_index` and
>> `writer.max_update_index > ref.update_index`.
>
> These should probably be `<=` and `>=`, respectively.
>

Indeed, good catch, will fix.

>> diff --git a/reftable/writer.c b/reftable/writer.c
>> index fd136794d5a27b33b5017f36fbd6b095ab8dac5b..f87086777cd20a9890a63f10c5d6932310dd5610 100644
>> --- a/reftable/writer.c
>> +++ b/reftable/writer.c
>> @@ -412,6 +412,18 @@ int reftable_writer_add_log(struct reftable_writer *w,
>>  	if (log->value_type == REFTABLE_LOG_DELETION)
>>  		return reftable_writer_add_log_verbatim(w, log);
>>
>> +	/*
>> +	 * Verify only the upper limit of the update_index. Each reflog entry
>> +	 * is tied to a specific update_index. Entries in the reflog can be
>> +	 * replaced by adding a new entry with the same update_index,
>> +	 * effectively canceling the old one.
>> +	 *
>> +	 * Consequently, reflog updates may include update_index values lower
>> +	 * than the writer's min_update_index.
>> +	 */
>> +	if (log->update_index > w->max_update_index)
>> +		return REFTABLE_API_ERROR;
>
> Yup, looks sensible.
>
>>  	if (!log->refname)
>>  		return REFTABLE_API_ERROR;
>>
>> diff --git a/t/unit-tests/t-reftable-readwrite.c b/t/unit-tests/t-reftable-readwrite.c
>> index d279b86df0aeda11b3fb4d2c15803760ae394941..5ad1c72f6901abcfe7fdc6c3e69e26b58d0013a6 100644
>> --- a/t/unit-tests/t-reftable-readwrite.c
>> +++ b/t/unit-tests/t-reftable-readwrite.c
>> @@ -151,6 +151,45 @@ static void t_log_overflow(void)
>>  	reftable_buf_release(&buf);
>>  }
>>
>> +static void t_log_write_limits(void)
>> +{
>> +	struct reftable_write_options opts = { 0 };
>> +	struct reftable_buf buf = REFTABLE_BUF_INIT;
>> +	struct reftable_writer *w = t_reftable_strbuf_writer(&buf, &opts);
>> +	struct reftable_log_record log = {
>> +		.refname = (char *)"refs/head/master",
>> +		.update_index = 1,
>> +		.value_type = REFTABLE_LOG_UPDATE,
>> +		.value = {
>> +			.update = {
>> +				.old_hash = { 1 },
>> +				.new_hash = { 2 },
>> +				.name = (char *)"Han-Wen Nienhuys",
>> +				.email = (char *)"hanwen@google.com",
>> +				.tz_offset = 100,
>> +				.time = 0x5e430672,
>> +			},
>> +		},
>> +	};
>> +	int err;
>> +
>> +	reftable_writer_set_limits(w, 1, 2);
>> +
>> +	err = reftable_writer_add_log(w, &log);
>> +	check_int(err, ==, 0);
>> +
>> +	log.update_index = 2;
>> +	err = reftable_writer_add_log(w, &log);
>> +	check_int(err, ==, 0);
>> +
>> +	log.update_index = 3;
>> +	err = reftable_writer_add_log(w, &log);
>> +	check_int(err, ==, REFTABLE_API_ERROR);
>> +
>> +	reftable_writer_free(w);
>> +	reftable_buf_release(&buf);
>> +}
>
> Makes sense, as well. We could trivially extend this test to also assert
> that we can successfully write a log record with update index 0, which
> would be smaller than the lower bound.
>
> Patrick

Yeah, that does make sense, will add and send in a new version. Thanks
for the quick review!

Karthik
diff mbox series

Patch

diff --git a/reftable/writer.c b/reftable/writer.c
index fd136794d5a27b33b5017f36fbd6b095ab8dac5b..f87086777cd20a9890a63f10c5d6932310dd5610 100644
--- a/reftable/writer.c
+++ b/reftable/writer.c
@@ -412,6 +412,18 @@  int reftable_writer_add_log(struct reftable_writer *w,
 	if (log->value_type == REFTABLE_LOG_DELETION)
 		return reftable_writer_add_log_verbatim(w, log);
 
+	/*
+	 * Verify only the upper limit of the update_index. Each reflog entry
+	 * is tied to a specific update_index. Entries in the reflog can be
+	 * replaced by adding a new entry with the same update_index,
+	 * effectively canceling the old one.
+	 *
+	 * Consequently, reflog updates may include update_index values lower
+	 * than the writer's min_update_index.
+	 */
+	if (log->update_index > w->max_update_index)
+		return REFTABLE_API_ERROR;
+
 	if (!log->refname)
 		return REFTABLE_API_ERROR;
 
diff --git a/t/unit-tests/t-reftable-readwrite.c b/t/unit-tests/t-reftable-readwrite.c
index d279b86df0aeda11b3fb4d2c15803760ae394941..5ad1c72f6901abcfe7fdc6c3e69e26b58d0013a6 100644
--- a/t/unit-tests/t-reftable-readwrite.c
+++ b/t/unit-tests/t-reftable-readwrite.c
@@ -90,7 +90,7 @@  static void t_log_buffer_size(void)
 	int i;
 	struct reftable_log_record
 		log = { .refname = (char *) "refs/heads/master",
-			.update_index = 0xa,
+			.update_index = update_index,
 			.value_type = REFTABLE_LOG_UPDATE,
 			.value = { .update = {
 					   .name = (char *) "Han-Wen Nienhuys",
@@ -127,7 +127,7 @@  static void t_log_overflow(void)
 	int err;
 	struct reftable_log_record log = {
 		.refname = (char *) "refs/heads/master",
-		.update_index = 0xa,
+		.update_index = update_index,
 		.value_type = REFTABLE_LOG_UPDATE,
 		.value = {
 			.update = {
@@ -151,6 +151,45 @@  static void t_log_overflow(void)
 	reftable_buf_release(&buf);
 }
 
+static void t_log_write_limits(void)
+{
+	struct reftable_write_options opts = { 0 };
+	struct reftable_buf buf = REFTABLE_BUF_INIT;
+	struct reftable_writer *w = t_reftable_strbuf_writer(&buf, &opts);
+	struct reftable_log_record log = {
+		.refname = (char *)"refs/head/master",
+		.update_index = 1,
+		.value_type = REFTABLE_LOG_UPDATE,
+		.value = {
+			.update = {
+				.old_hash = { 1 },
+				.new_hash = { 2 },
+				.name = (char *)"Han-Wen Nienhuys",
+				.email = (char *)"hanwen@google.com",
+				.tz_offset = 100,
+				.time = 0x5e430672,
+			},
+		},
+	};
+	int err;
+
+	reftable_writer_set_limits(w, 1, 2);
+
+	err = reftable_writer_add_log(w, &log);
+	check_int(err, ==, 0);
+
+	log.update_index = 2;
+	err = reftable_writer_add_log(w, &log);
+	check_int(err, ==, 0);
+
+	log.update_index = 3;
+	err = reftable_writer_add_log(w, &log);
+	check_int(err, ==, REFTABLE_API_ERROR);
+
+	reftable_writer_free(w);
+	reftable_buf_release(&buf);
+}
+
 static void t_log_write_read(void)
 {
 	struct reftable_write_options opts = {
@@ -917,6 +956,7 @@  int cmd_main(int argc UNUSED, const char *argv[] UNUSED)
 	TEST(t_corrupt_table_empty(), "read-write on an empty table");
 	TEST(t_log_buffer_size(), "buffer extension for log compression");
 	TEST(t_log_overflow(), "log overflow returns expected error");
+	TEST(t_log_write_limits(), "writer limits for writing log records");
 	TEST(t_log_write_read(), "read-write on log records");
 	TEST(t_log_zlib_corruption(), "reading corrupted log record returns expected error");
 	TEST(t_table_read_api(), "read on a table");
diff --git a/t/unit-tests/t-reftable-stack.c b/t/unit-tests/t-reftable-stack.c
index 72f6747064f62106e87c9a90e5fe315139d46e60..52b81475c36aa94ff09f3bf77a7d23992957deb4 100644
--- a/t/unit-tests/t-reftable-stack.c
+++ b/t/unit-tests/t-reftable-stack.c
@@ -770,8 +770,12 @@  static void t_reftable_stack_tombstone(void)
 		}
 
 		logs[i].refname = xstrdup(buf);
-		/* update_index is part of the key. */
-		logs[i].update_index = 42;
+		/*
+		 * update_index is part of the key so should be constant.
+		 * The value itself should be less than the writer's upper
+		 * limit.
+		 */
+		logs[i].update_index = 1;
 		if (i % 2 == 0) {
 			logs[i].value_type = REFTABLE_LOG_UPDATE;
 			t_reftable_set_hash(logs[i].value.update.new_hash, i,