KES

In Cardano, nodes use Key Evolving Signatures (KES). This is another asymmetric key cryptographic scheme, also relying on the use of public and private key pairs. These signature schemes provide forward cryptographic security, meaning that a compromised key does not make it easier for an adversary to forge a signature that allegedly had been signed in the past.

In KES, the public verification key stays constant, but the corresponding private key evolves incrementally. For this reason, KES signing keys are indexed by integers representing the step in the key's evolution. Since the private key evolves incrementally in a KES scheme, the ledger rules require the pool operators to evolve their keys every time a certain number of slots have passed. The details of when these keys are evolved are out of the scope of this document, and the reader is directed to the ledger spec.

Generalised specification

We use the iterated sum construction from Section 4.3 of MMM¹. A KES signature algorithm is parametrized by four algorithms, $KeyGen, Sign, Update$ and $Verify$ . The sum construction depends on two signing algorithms, $Σ_{0} = (KeyGen_{0}, Sign_{0}, Update_{0}, Verify_{0})$ and $Σ_{1} = (KeyGen_{1}, Sign_{1}, Update_{1}, Verify_{1})$ , which are forward-secure with $T_{0}$ and $T_{1}$ time periods respectively. As specified in the paper, any digital signature algorithm can be considered as a forward-secure signature algorithm with 1 time period. We use two specialized versions of the key generation, one to generate the public part, and another to generate the secret part, $P KeyGen, S KeyGen$ respectively. The KES algorithm uses a length doubling pseudorandom generator, $F : {0, 1}^{λ} \to {0, 1}^{λ} \times {0, 1}^{λ}$ , where given a random seed, $s$ , returns two random seeds of the same size $s_{1}, s_{2}$ . The sum construction begins by generating a signing algorithm with two periods by merging two instances of the base signature, and proceeds recursively until the desired level of periods is reached.

The Chang hard fork will bring optimizations to the KES signature size, and we therefore have different verification criteria pre and post-Chang. Both instances use a tree of depth 6. For ease of exposure, we compare both constructions using a simple binary tree of depth 3 (see figure below). Consider each node as a KES instance with $2^{n}$ periods, where $n$ is the height of the node with respect to the leafs. For instance, A is a KES instance with $8$ periods, while node D is a KES instance with $2$ periods. Note that each KES instance is created by two child instances with half the periods. The only relevant details for compatibility on how KES algorithm works is signature generation and signature verification, however we provide a description of the other functions. The details on how the keys are managed are details of implementation which will not be covered here.

digraph keys {
  { 
     A, F, G, J, K, L, M, N, O, t [ shape = box, color = "black" ];
     B, D [ shape = box, color = "black", style = "filled", fillcolor = "green" ];
     C, E, H, I [ shape = box, color = "red", style = "filled", fillcolor = "green" ];
   };
   
   A -> B;
   A -> C;
   B -> D;
   B -> E;
   C -> F;
   C -> G;
   D -> H;
   D -> I;
   E -> J;
   E -> K;
   F -> L;
   F -> M;
   G -> N;
   G -> O;
   H -> t
}

In green, we have the nodes for which we need to store the public key in a signature during pre-babbage eras. With red borders we have the nodes for which we need to store the public keys in post-babbage eras.

Pre-Chang

The signing instances of nodes H-O are single period KES instances, and nodes A-G are defined by recursively calling on the algorithms presented above. In pre-Chang eras the signatures are computed in a naive manner, meaning that a signature is represented (recursively) as the underlying signature and the two public keys, $σ = (σ^{'}, v k_{0}, v k_{1})$ . So for instance, the signature in node A of period 0 contains the signature of node B, $σ_{b}$ , the public key of node B $v k_{b}$ (which is a hash of $v k_{d}$ and $v k_{e}$ ) and the public key of node C $v k_{c}$ (which is a hash of $v k_{f}$ and $v k_{g}$ ). Signature $σ_{b}$ in turn contains signature of node D $σ_{d}$ , the public key of node D $v k_{d}$ (which is a hash of $v k_{h}$ and $v k_{i}$ ) and public key of node E $v k_{e}$ (which is a hash of $v k_{j}$ and $v k_{k}$ ). The signature of node D, $σ_{d}$ contains in turn the signature of node H, $σ_{h}$ , the public key of node H, $v k_{h}$ and the public key of node I $v k_{i}$ . Verification of the signature works in the naive recursive manner. We check that $H (v k_{b}, v k_{c}) = v k_{a}$ , and $H (v k_{d}, v k_{e}) = v k_{b}$ , and $H (v k_{h}, v k_{i}) = v k_{d}$ , and finally that $Verify (m, v k_{h}, σ_{h}) = true$ . More specifically:

$KeyGen (r)$ takes as input a random seed. It then extends the seed into two parts, $(r_{0}, r_{1}) \leftarrow F (r)$ , and uses each seed to generate the key material of the next layer. In particular $(s k_{0}, v k_{0}) \leftarrow KeyGen (r_{0})$ and $v k_{1} \leftarrow P KeyGen (r_{1})$ . Finally, it computes the pair's public key $v k \leftarrow H (v k_{0}, v k_{1})$ and returns $(⟨ s k_{0}, r_{1}, v k_{0}, v k_{1} ⟩, v k)$ . \item $Sign (t, ⟨ s k^{'}, r_{1}, v k_{0}, v k_{1} s k ⟩, m)$ takes as input a time period $t$ , a signing key $s k$ and a message. If $t < T_{0}$ , then it computes the signature using the first signature algorithm, $σ^{'} \leftarrow Sign_{0} (t, s k^{'}, m)$ , otherwise it uses the other $σ^{'} \leftarrow Sign_{1} (t - T_{0}, s k^{'}, m)$ . Finally, returns $(⟨ σ^{'}, v k_{0}, v k_{1} ⟩, t)$ .
$Update (t, ⟨, s k^{'}, r_{1}, v k_{0}, v k_{1} ⟩ s k)$ takes as input a time period $t$ , and a signing key $s k$ . If $t + 1 < T_{0}$ , then $s k^{'} \leftarrow Update_{0} (t, s k^{'})$ . Otherwise, it checks if its changing the key generation algorithm. Specifically, if $t + 1 = T_{0}$ , then $s k^{'} \leftarrow S KeyGen_{1} (r_{1})$ and sets the seed to zero $r_{1} \leftarrow 0$ . Otherwise, $s k^{'} \leftarrow Update_{1} (t - T_{0}, s k^{'})$ .
$Verify (v k, m, ⟨ σ^{'}, v k_{0}, v k_{1} ⟩ σ, t)$ takes as input a verification key, $v k$ , a message, $m$ , a signature, $σ$ , and a time period, $t$ . First, it checks that $H (v k_{0}, v k_{1}) = v k$ . If that is not the case, it returns $false$ . Otherwise, if $t < T_{0}$ then $Verify_{0} (v k_{0}, m, σ, t)$ , else $Verify_{1} (v k_{1}, m, σ, t - T_{0})$ . If verification fails, it returns $false$ , otherwise it returns $true$ .

Post-Chang

The naive definition of the signature used in Pre-Chang results in poor performance with respect to the signature size. We don't need to verify the hash equality at each level, and we simply need to do so at the root. In the Chang hardfork we introduced such an optimization. In particular, instead of storing both public keys in each signature, we only store the one of the branch that we are not in. For the case of the KES instances with 1 period, the KES signature contains not only the underlying signature, but also the public key, which allows us to re-walk the merkle path. Again, assume we are in period 0. Then, the signature in node A of period 0 contains the signature of node B, $σ_{b}$ , and the public key of node C $v k_{c}$ (which is a hash of $v k_{f}$ and $v k_{g}$ ). Signature $σ_{b}$ in turn contains signature of node D $σ_{d}$ and public key of node E $v k_{e}$ (which is a hash of $v k_{j}$ and $v k_{k}$ ). The signature of node D, $σ_{d}$ contains in turn the signature of node H, $σ_{h}$ , and the public key of node I $v k_{i}$ . In this case, $σ_{h} = (σ_{0, h}, v k_{h})$ , where $σ_{0, h}$ is the underlying signature. Verification of the signature works going up the tree, rather than down. We check $Verify (m, v k_{h}, σ_{0, h}) = true$ . Then we compute the expected key of node D, $v k_{d}^{'} \leftarrow H (v k_{h}, v k_{i})$ . Then we use that to compute the expected key of node B, $v k_{b}^{'} \leftarrow H (v k_{d}^{'}, v k_{e})$ . Finally, we check that the leaf indeed is part of the merkle tree by checking that $H (v k_{b}^{'}, v k_{c}) = v k_{a}$ . To derive the missing public key, we introduce a new function, $DeriveVerKey$ .

Specifically, post-Chang KES signature algorithm modifies the $Sign$ and $Verify$ algorithms, and introduces $DeriveVerKey$ as follows:

$KeyGen (r)$ takes as input a random seed. It then extends the seed into two parts, $(r_{0}, r_{1}) \leftarrow F (r)$ , and uses each seed to generate the key material of the next layer. In particular $(s k_{0}, v k_{0}) \leftarrow KeyGen (r_{0})$ and $v k_{1} \leftarrow P KeyGen (r_{1})$ . Finally, it computes the pair's public key $v k \leftarrow H (v k_{0}, v k_{1})$ and returns $(⟨ s k_{0}, r_{1}, v k_{0}, v k_{1} ⟩, v k)$ .
$Sign (t, ⟨ s k^{'}, r_{1}, v k_{0}, v k_{1} s k ⟩, m)$ takes as input a time period $t$ , a signing key $s k$ and a message. If $t < T_{0}$ , then it computes the signature using the first signature algorithm, $σ^{'} \leftarrow Sign_{0} (t, s k^{'}, m)$ and lets $v k_{c} = v k_{1}$ , otherwise it uses the other $σ^{'} \leftarrow Sign_{1} (t - T_{0}, s k^{'}, m)$ , and lets $v k_{c} = v k_{0}$ . Finally, returns $(⟨ σ^{'}, v k ⟩, t)$ .
$Update (t, ⟨, s k^{'}, r_{1}, v k_{0}, v k_{1} ⟩ s k)$ takes as input a time period $t$ , and a signing key $s k$ . If $t + 1 < T_{0}$ , then $s k^{'} \leftarrow Update_{0} (t, s k^{'})$ . Otherwise, it checks if its changing the key generation algorithm. Specifically, if $t + 1 = T_{0}$ , then $s k^{'} \leftarrow S KeyGen_{1} (r_{1})$ and sets the seed to zero $r_{1} \leftarrow 0$ . Otherwise, $s k^{'} \leftarrow Update_{1} (t - T_{0}, s k^{'})$ .
$DeriveVerKey (⟨ σ^{'}, v k_{c} ⟩ σ, m, t)$ takes as input a signature $σ$ and a period $t$ . If $t < T_{0}$ , then $(v k_{n}, res) = DeriveVerKey (σ^{'}, m, t)$ and return $(H (v k_{n}, v k_{c}), res)$ , otherwise $v k_{n} = DeriveVerKey (σ^{'}, m, t - T_{0})$ and return $(H (v k_{c}, v k_{n}), res)$ .
$Verify (v k, m, ⟨ σ^{'}, v k_{c} ⟩ σ, t)$ takes as input a verification key, $v k$ , a message, $m$ , a signature, $σ$ , and a time period, $t$ . First, it computes $v k_{n} \leftarrow DeriveVerKey (⟨ σ^{'}, v k_{c} ⟩ σ, m, t)$ , and then, if $t < T_{0}$ , check that $H (v k_{n}, v k_{c}) = v k$ , otherwise, check that $H (v k_{c}, v k_{n}) = v k$ . If verification fails, it returns $false$ , otherwise it returns $true$ .

For this recursive explanation to be complete, we need to define what happens when we call $DeriveVerKey$ on a leaf signature. Recall that the signature of a leaf contains not only the underlying signature, but also the public key with which it is signed. The derive function at the leaf takes as input $(⟨ σ^{'}, v k_{c} ⟩ σ, m, t)$ . It proceeds by verifying the leaf signature. Parse $σ^{'} = (σ_{0}, v k)$ , and compute the result $res \leftarrow Verify (m, v k, σ_{0})$ . If $t = 0$ , then return $(H (v k, v k_{c}), res)$ , else return $(H (v k_{c}, v k), res)$ .

Parameters of instantiation

The instantiation of both eras is the same. The underlying signature scheme is Ed25519. Regarding $P KeyGen$ and $S KeyGen$ , in Cardano, these functions simply call a seeded version of Ed25519's $KeyGen$ and extract the public or private part. As a hashing function we use Blake2b². Defining the pseudo random function $F$ is not required for compatibility purposes, as it is only used for private key material. However, for sake of completeness, we specify that the cardano node uses Blake2b as a length doubling pseudorandom generator.

Malkin, Micciancio and Miner, Composition and Efficiency Tradeoffs for Forward-Secure Digital Signatures

Aumasson, The BLAKE2 Cryptographic Hash and Message Authentication Code (MAC)

IOG Cryptography Handbook

KES

Generalised specification

Pre-Chang

Post-Chang

Parameters of instantiation