Message ID | 20200219014433.88424-1-minchan@kernel.org (mailing list archive) |
---|---|
Headers | show |
Series | introduce memory hinting API for external process | expand |
On Tue, 18 Feb 2020 17:44:26 -0800 Minchan Kim <minchan@kernel.org> wrote: > Now, we have MADV_PAGEOUT and MADV_COLD as madvise hinting API. With that, > application could give hints to kernel what memory range are preferred to be > reclaimed. However, in some platform(e.g., Android), the information > required to make the hinting decision is not known to the app. > Instead, it is known to a centralized userspace daemon(e.g., ActivityManagerService), > and that daemon must be able to initiate reclaim on its own without any app > involvement. > This patchset doesn't seem to be getting a lot of interest from other potential users? It seems very specialized. Are there or will there ever be any users of this apart from one Android daemon? Also, it doesn't terribly hard for ActivityManagerService to tell another process "now run madvise with these arguments". Please explain why this is not practical in ActivityManagerService and also within other potential users of this syscall.
On Wed, Feb 19, 2020 at 12:01 PM Andrew Morton <akpm@linux-foundation.org> wrote: > > On Tue, 18 Feb 2020 17:44:26 -0800 Minchan Kim <minchan@kernel.org> wrote: > > > Now, we have MADV_PAGEOUT and MADV_COLD as madvise hinting API. With that, > > application could give hints to kernel what memory range are preferred to be > > reclaimed. However, in some platform(e.g., Android), the information > > required to make the hinting decision is not known to the app. > > Instead, it is known to a centralized userspace daemon(e.g., ActivityManagerService), > > and that daemon must be able to initiate reclaim on its own without any app > > involvement. > > > > This patchset doesn't seem to be getting a lot of interest from other > potential users? It seems very specialized. Are there or will there > ever be any users of this apart from one Android daemon? Don't know if this can be considered another user since it's still in the Android realm of things. I'm interested in extending process_madvise() to support MADV_DONTNEED to expedite memory ripping of a process killed by Android Low Memory Killer. But for that I need process_madvise() to be accepted first. IIRC Crome team was interested in these madv hints as well at some point... > > Also, it doesn't terribly hard for ActivityManagerService to tell > another process "now run madvise with these arguments". Please explain > why this is not practical in ActivityManagerService and also within > other potential users of this syscall. > >
Hi Andrew, On Wed, Feb 19, 2020 at 12:01:23PM -0800, Andrew Morton wrote: > On Tue, 18 Feb 2020 17:44:26 -0800 Minchan Kim <minchan@kernel.org> wrote: > > > Now, we have MADV_PAGEOUT and MADV_COLD as madvise hinting API. With that, > > application could give hints to kernel what memory range are preferred to be > > reclaimed. However, in some platform(e.g., Android), the information > > required to make the hinting decision is not known to the app. > > Instead, it is known to a centralized userspace daemon(e.g., ActivityManagerService), > > and that daemon must be able to initiate reclaim on its own without any app > > involvement. > > > > This patchset doesn't seem to be getting a lot of interest from other > potential users? It seems very specialized. Are there or will there > ever be any users of this apart from one Android daemon? Quote from http://lkml.kernel.org/r/20190531064313.193437-1-minchan@kernel.org " Brian Geffon in ChromeOS team had an experiment with process_madvise(2) Quote form him: "What I found is that by using process_madvise after a tab has been back grounded for more than 45 seconds reduced the average tab switch times by 25%! This is a huge result and very obvious validation that process_madvise hints works well for the ChromeOS use case." " > > Also, it doesn't terribly hard for ActivityManagerService to tell > another process "now run madvise with these arguments". Please explain > why this is not practical in ActivityManagerService and also within > other potential users of this syscall. I think that's the almost a same question why ptrace doesn't work so I summarizes the part in [2/7]: * makes target task runnable creates memory layout change window so hiniting a wrong vma * target task(e.g., background task) could live in little core with cpuset/group limited environment so we couldn't react quick enough, which causes more killing. Thanks.
To expand on how ChromeOS benefits from this, we've advanced far beyond the experimentation phase we've deployed an older version of this idea that was procfs based on several ChromeOS kernels. These are now rolled out to ChromeOS stable channel where we've been testing and the results have been amazing. To elaborate on the setup, Chrome is a multi process architecture where each tab is a separate process and sometimes a single tab can even represent multiple processes. The primary Chrome process has a lot of visibility into the amount of time a user has been spending interacting with a tab (process) and using this knowledge these hints provided to the kernel allow it to make much better swap decisions and amortize the cost of swap over different memory pressure levels meaning that we were better able to reclaim memory which allow us to avoid having to discard tabs or even worse oom. I'd be happy to expand even more if anyone is interested. Brian On Wed, Feb 19, 2020 at 2:32 PM Minchan Kim <minchan@kernel.org> wrote: > > Hi Andrew, > > On Wed, Feb 19, 2020 at 12:01:23PM -0800, Andrew Morton wrote: > > On Tue, 18 Feb 2020 17:44:26 -0800 Minchan Kim <minchan@kernel.org> wrote: > > > > > Now, we have MADV_PAGEOUT and MADV_COLD as madvise hinting API. With that, > > > application could give hints to kernel what memory range are preferred to be > > > reclaimed. However, in some platform(e.g., Android), the information > > > required to make the hinting decision is not known to the app. > > > Instead, it is known to a centralized userspace daemon(e.g., ActivityManagerService), > > > and that daemon must be able to initiate reclaim on its own without any app > > > involvement. > > > > > > > This patchset doesn't seem to be getting a lot of interest from other > > potential users? It seems very specialized. Are there or will there > > ever be any users of this apart from one Android daemon? > > > Quote from http://lkml.kernel.org/r/20190531064313.193437-1-minchan@kernel.org > > " > Brian Geffon in ChromeOS team had an experiment with process_madvise(2) > Quote form him: > "What I found is that by using process_madvise after a tab has been back > grounded for more than 45 seconds reduced the average tab switch times by > 25%! This is a huge result and very obvious validation that process_madvise > hints works well for the ChromeOS use case." > " > > > > > Also, it doesn't terribly hard for ActivityManagerService to tell > > another process "now run madvise with these arguments". Please explain > > why this is not practical in ActivityManagerService and also within > > other potential users of this syscall. > > I think that's the almost a same question why ptrace doesn't work so > I summarizes the part in [2/7]: > > * makes target task runnable creates memory layout change window so > hiniting a wrong vma > > * target task(e.g., background task) could live in little core with > cpuset/group limited environment so we couldn't react quick enough, > which causes more killing. > > > Thanks.