[RFC,41/41] random: lower per-IRQ entropy estimate upon health test failure

Currently, if fips_enabled is set, a per-IRQ min-entropy estimate of
either 1 bit or 1/8 bit is assumed, depending on whether a high resolution
get_cycles() is available or not. The statistical NIST SP800-90B startup
health tests are run on a certain amount of noise samples and are intended
to reject in case this hypothesis turns out to be wrong, i.e. if the
actual min-entropy is smaller. As long as the startup tests haven't
finished, entropy dispatch and thus, the initial crng seeding, is
inhibited. On test failure, the startup tests would restart themselves
from the beginning.

It follows that in case a system's actual per-IRQ min-entropy is smaller
than the more or less arbitrarily assessed 1 bit or 1/8 bit resp., there
will be a good chance that the initial crng seed will never complete.
AFAICT, such a situation could potentially prevent certain userspace
daemons like OpenSSH from loading.

In order to still be able to make any progress, make
add_interrupt_randomness() lower the per-IRQ min-entropy by one half upon
each health test failure, but only until the minimum supported value of
1/64 bits has been reached. Note that health test failures will cause a
restart of the startup health tests already and thus, a certain number of
additional noise samples resp. IRQ events will have to get examined by the
health tests before the initial crng seeding can take place. This number
of fresh events required is reciprocal to the estimated per-IRQ
min-entropy H: for the Adaptive Proportion Test (APT) it equals ~128 / H.
It follows that this patch won't be of much help for embedded systems or
VMs with poor IRQ rates at boot time, at least not without manual
intervention. But there aren't many options left when fips_enabled is set.

With respect to NIST SP800-90B conformance, this patch enters kind of a
gray area: NIST SP800-90B has no notion of such a dynamically adjusted
min-entropy estimate. Instead, it is assumed that some fixed value has been
estimated based on general principles and subsequently validated in the
course of the certification process. However, I would argue that if a
system had successfully passed certification for 1 bit or 1/8 bit resp. of
estimated min-entropy per sample, it would automatically be approved for
all smaller values as well. Had we started out with such a lower value
passing the health tests from the beginning, the latter would never have
complained in the first place and the system would have come up just fine.

Finally, note that all statistical tests have a non-zero probability of
false positives and so do the NIST SP800-90B health tests. In order to not
keep the estimated per-IRQ entropy at a smaller level than necessary for
forever after spurious health test failures, make
add_interrupt_randomness() attempt to double it again after a certain
number of successful health test passes at the degraded entropy level have
been completed. This threshold should not be too small in order to avoid
excessive entropy accounting loss due to continuously alternating between
a too large per-IRQ entropy estimate and the next smaller value. For now,
choose a value of five as a compromise between quick recovery and limiting
said accounting loss.

So, introduce a new member ->good_tests to struct fast_pool for keeping
track of the number of successfult health test passes. Make
add_interrupt_randomness() increment it upon successful healh test
completion and reset it to zero on failures. Make
add_interrupt_randomness() double the current min-entropy estimate and
restart the startup health in case ->good_tests is > 4 and the entropy
had previously been lowered.

Signed-off-by: Nicolai Stange <nstange@suse.de>
---
 drivers/char/random.c | 25 +++++++++++++++++++++++--
 1 file changed, 23 insertions(+), 2 deletions(-)

Message ID	20200921075857.4424-42-nstange@suse.de (mailing list archive)
State	Not Applicable
Delegated to:	Herbert Xu
Headers	show Return-Path: <SRS0=YoIG=C6=vger.kernel.org=linux-crypto-owner@kernel.org> Received: from mail.kernel.org (pdx-korg-mail-1.web.codeaurora.org [172.30.200.123]) by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id 73C751668 for <patchwork-linux-crypto@patchwork.kernel.org>; Mon, 21 Sep 2020 08:01:09 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 64CE4214F1 for <patchwork-linux-crypto@patchwork.kernel.org>; Mon, 21 Sep 2020 08:01:09 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726796AbgIUIBI (ORCPT <rfc822;patchwork-linux-crypto@patchwork.kernel.org>); Mon, 21 Sep 2020 04:01:08 -0400 Received: from mx2.suse.de ([195.135.220.15]:57940 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726592AbgIUH7j (ORCPT <rfc822;linux-crypto@vger.kernel.org>); Mon, 21 Sep 2020 03:59:39 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id E0F6EB533; Mon, 21 Sep 2020 08:00:12 +0000 (UTC) From: Nicolai Stange <nstange@suse.de> To: "Theodore Y. Ts'o" <tytso@mit.edu> Cc: linux-crypto@vger.kernel.org, LKML <linux-kernel@vger.kernel.org>, Arnd Bergmann <arnd@arndb.de>, Greg Kroah-Hartman <gregkh@linuxfoundation.org>, "Eric W. Biederman" <ebiederm@xmission.com>, "Alexander E. Patrakov" <patrakov@gmail.com>, "Ahmed S. Darwish" <darwish.07@gmail.com>, Willy Tarreau <w@1wt.eu>, Matthew Garrett <mjg59@srcf.ucam.org>, Vito Caputo <vcaputo@pengaru.com>, Andreas Dilger <adilger.kernel@dilger.ca>, Jan Kara <jack@suse.cz>, Ray Strode <rstrode@redhat.com>, William Jon McCann <mccann@jhu.edu>, zhangjs <zachary@baishancloud.com>, Andy Lutomirski <luto@kernel.org>, Florian Weimer <fweimer@redhat.com>, Lennart Poettering <mzxreary@0pointer.de>, Peter Matthias <matthias.peter@bsi.bund.de>, Marcelo Henrique Cerri <marcelo.cerri@canonical.com>, Roman Drahtmueller <draht@schaltsekun.de>, Neil Horman <nhorman@redhat.com>, Randy Dunlap <rdunlap@infradead.org>, Julia Lawall <julia.lawall@inria.fr>, Dan Carpenter <dan.carpenter@oracle.com>, Andy Lavr <andy.lavr@gmail.com>, Eric Biggers <ebiggers@kernel.org>, "Jason A. Donenfeld" <Jason@zx2c4.com>, =?utf-8?q?Stephan_M=C3=BCller?= <smueller@chronox.de>, Torsten Duwe <duwe@suse.de>, Petr Tesarik <ptesarik@suse.cz>, Nicolai Stange <nstange@suse.de> Subject: [RFC PATCH 41/41] random: lower per-IRQ entropy estimate upon health test failure Date: Mon, 21 Sep 2020 09:58:57 +0200 Message-Id: <20200921075857.4424-42-nstange@suse.de> X-Mailer: git-send-email 2.26.2 In-Reply-To: <20200921075857.4424-1-nstange@suse.de> References: <20200921075857.4424-1-nstange@suse.de> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <linux-crypto.vger.kernel.org> X-Mailing-List: linux-crypto@vger.kernel.org
Series	random: possible ways towards NIST SP800-90B compliance \| expand [DISCUSSION,00/41] random: possible ways towards NIST SP800-90B compliance [RFC,01/41] random: remove dead code in credit_entropy_bits() [RFC,02/41] random: remove dead code for nbits < 0 in credit_entropy_bits() [RFC,03/41] random: prune dead assignment to entropy_bits in credit_entropy_bits() [RFC,04/41] random: drop 'reserved' parameter from extract_entropy() [RFC,05/41] random: don't reset entropy to zero on overflow [RFC,06/41] random: factor the exponential approximation in credit_entropy_bits() out [RFC,07/41] random: let pool_entropy_delta() take nbits in units of 2^-ENTROPY_SHIFT [RFC,08/41] random: introduce __credit_entropy_bits_fast() for hot paths [RFC,09/41] random: protect ->entropy_count with the pool spinlock [RFC,10/41] random: implement support for delayed entropy dispatching [RFC,11/41] random: convert add_timer_randomness() to queued_entropy API [RFC,12/41] random: convert add_interrupt_randomness() to queued_entropy API [RFC,13/41] random: convert try_to_generate_entropy() to queued_entropy API [RFC,14/41] random: drop __credit_entropy_bits_fast() [RFC,15/41] random: convert add_hwgenerator_randomness() to queued_entropy API [RFC,16/41] random: convert random_ioctl() to queued_entropy API [RFC,17/41] random: drop credit_entropy_bits() and credit_entropy_bits_safe() [RFC,18/41] random: move arch_get_random_seed() calls in crng_reseed() into own loop [RFC,19/41] random: reintroduce arch_has_random() + arch_has_random_seed() [RFC,20/41] random: provide min_crng_reseed_pool_entropy() [RFC,21/41] random: don't invoke arch_get_random_long() from add_interrupt_randomness() [RFC,22/41] random: introduce arch_has_sp800_90b_random_seed() [RFC,23/41] random: don't award entropy to non-SP800-90B arch RNGs in FIPS mode [RFC,24/41] init: call time_init() before rand_initialize() [RFC,25/41] random: probe cycle counter resolution at initialization [RFC,26/41] random: implement support for evaluating larger fast_pool entropies [RFC,27/41] random: increase per-IRQ event entropy estimate if in FIPS mode [RFC,28/41] random: don't award entropy to disk + input events if in FIPS mode [RFC,29/41] random: move definition of struct queued_entropy and related API upwards [RFC,30/41] random: add a queued_entropy instance to struct fast_pool [RFC,31/41] random: introduce struct health_test + health_test_reset() placeholders [RFC,32/41] random: introduce health test stub and wire it up [RFC,33/41] random: make health_test_process() maintain the get_cycles() delta [RFC,34/41] random: implement the "Adaptive Proportion" NIST SP800-90B health test [RFC,35/41] random: improve the APT's statistical power [RFC,36/41] random: optimize the APT's presearch [RFC,37/41] random: implement the "Repetition Count" NIST SP800-90B health test [RFC,38/41] random: enable NIST SP800-90B startup tests [RFC,39/41] random: make the startup tests include muliple APT invocations [RFC,40/41] random: trigger startup health test on any failure of the health tests [RFC,41/41] random: lower per-IRQ entropy estimate upon health test failure

[RFC,41/41] random: lower per-IRQ entropy estimate upon health test failure

Commit Message

Patch