diff mbox series

[v2] seq_file: copy as much as possible to user buffer in seq_read()

Message ID 20241220140819.9887-1-00107082@163.com (mailing list archive)
State New
Headers show
Series [v2] seq_file: copy as much as possible to user buffer in seq_read() | expand

Commit Message

David Wang Dec. 20, 2024, 2:08 p.m. UTC
seq_read() yields at most seq_file->size bytes to userspace, even when
user buffer is prepared to hold more data. This causes lots of extra
*read* syscalls to fetch data from /proc/*.
For example, on an 8-core system, cat /proc/interrupts needs three
*read*:
	$ strace -T -e read cat /proc/interrupts  > /dev/null
	...
	 43 read(3, "            CPU0       CPU1     "..., 131072) = 4082 <0.000068>
	 44 read(3, "  75:   13490876          0     "..., 131072) = 2936 <0.000148>
	 45 read(3, "", 131072)                     = 0 <0.000010>
On a system with hundreds of cpus, it would need tens of more read calls.
A more convincing example is /proc/allocinfo, which is available when
CONFIG_MEM_ALLOC_PROFILING=y. When cat /proc/allocinfo, 4k+ lines need ~100
read calls.

Fill up user buffer as much as possible in seq_read(), extra read
calls can be avoided with a larger user buffer, and 2%~10% performance
improvement would be observed:
	$ strace -T -e read cat /proc/interrupts  > /dev/null
	...
	 56 read(3, "            CPU0       CPU1     "..., 131072) = 7018 <0.000208>
	 57 read(3, "", 131072)                     = 0 <0.000010>

Signed-off-by: David Wang <00107082@163.com>
---
 fs/seq_file.c | 6 ++++++
 1 file changed, 6 insertions(+)

Comments

Markus Elfring Dec. 20, 2024, 2:34 p.m. UTC | #1
> seq_read() yields at most seq_file->size bytes to userspace, …

                                                    user space?


…
> 	$ strace -T -e read cat /proc/interrupts  > /dev/null> 	 45 read(3, "", 131072)                     = 0 <0.000010>
> On a system with hundreds of cpus, it would need …

                               CPUs?


Is it a bit nicer to separate test output and subsequent comments by blank lines?


…
> Fill up user buffer as much as possible in seq_read(), extra read
> calls can be avoided with a larger user buffer, and 2%~10% performance
> improvement would be observed:
Will it help to split such a paragraph into three sentences
(on separate lines)?

Regards,
Markus
David Wang Dec. 20, 2024, 4:48 p.m. UTC | #2
At 2024-12-20 22:34:12, "Markus Elfring" <Markus.Elfring@web.de> wrote:
>> seq_read() yields at most seq_file->size bytes to userspace, …
>
>                                                    user space?
>
>
>…
>> 	$ strace -T -e read cat /proc/interrupts  > /dev/null
>…
>> 	 45 read(3, "", 131072)                     = 0 <0.000010>
>> On a system with hundreds of cpus, it would need …
>
>                               CPUs?
>
>
>Is it a bit nicer to separate test output and subsequent comments by blank lines?
>
>
>…
>> Fill up user buffer as much as possible in seq_read(), extra read
>> calls can be avoided with a larger user buffer, and 2%~10% performance
>> improvement would be observed:
>Will it help to split such a paragraph into three sentences
>(on separate lines)?
>
>Regards,
>Markus

Thanks for the comments, I will address it later.
Any concern about the code?

David
diff mbox series

Patch

diff --git a/fs/seq_file.c b/fs/seq_file.c
index 8bbb1ad46335..2cda43aec4a2 100644
--- a/fs/seq_file.c
+++ b/fs/seq_file.c
@@ -220,6 +220,7 @@  ssize_t seq_read_iter(struct kiocb *iocb, struct iov_iter *iter)
 		if (m->count)	// hadn't managed to copy everything
 			goto Done;
 	}
+Restart:
 	// get a non-empty record in the buffer
 	m->from = 0;
 	p = m->op->start(m, &m->index);
@@ -282,6 +283,11 @@  ssize_t seq_read_iter(struct kiocb *iocb, struct iov_iter *iter)
 	copied += n;
 	m->count -= n;
 	m->from = n;
+	/*
+	 * Keep reading in case more data could be copied into user buffer.
+	 */
+	if (m->count == 0)
+		goto Restart;
 Done:
 	if (unlikely(!copied)) {
 		copied = m->count ? -EFAULT : err;