LatticeFold

This article aims to intuitively explain the goals and processes of the LatticeFold protocol.

Introduction

LatticeFold is the first lattice-based folding scheme. Folding schemes require an additively homomorphic commitment scheme, which is why Pedersen commitments are commonly used. These elliptic-curve-based commitments are vulnerable to quantum attacks, require large fields, and rely on non-native field arithmetic for folding verification. In contrast, LatticeFold uses Ajtai commitments based on the Module SIS problem, which is believed to be quantum-resistant, supports small fields, and is cheap to verify.

Background

Folding Schemes: Motivation and Challenges

An instance of a circuit is typically written as a pair $(x; w)$, where the public input $x$ and the witness $w$ satisfy a certain relation $\mathcal{R}$. For example, for a circuit representing $w^2 = x$, the pairs $(9; 3)$ and $(16; 4)$ satisfy the relation. The public input is visible to the verifier, whereas the witness must be kept hidden, so a zk-SNARK proof is used to show that the relation holds without revealing $w$.

As shown in Fig. 1, some cases require expressing sequences of computations. For example, $(2^{16}; 2^8), (2^8; 2^4), (2^4; 2^2)$ and $(2^2; 2)$ represent four successive square-root steps starting from $2^{16}$. Such computational sequences are common, with zkEVMs and zkVMs being other notable examples.

The computational expense of generating a SNARK proof for $(x; w)$ is a major challenge. This inefficiency is compounded when there are multiple instances satisfying the relation, as generating separate proofs for each instance is costly. For example, creating two separate SNARK proofs for $(x_1; w_1)$ and $(x_2; w_2)$, both satisfying the same relation $\mathcal{R}$, can be prohibitively expensive.

In comparison, a folding scheme creates a single new pair $(x_3; w_3)$ from $(x_1; w_1)$ and $(x_2; w_2)$ that also satisfies the relation $\mathcal{R}$. Then, the SNARK proof for the folded $(x_3; w_3)$ not only validates the new relation but also proves the relations for the original pairs $(x_1; w_1)$ and $(x_2; w_2)$. Generating and verifying folding proofs is faster and cheaper than doing the same with SNARK proofs. Therefore, instead of recursively generating and verifying SNARK proofs, the process is optimized by using folding proofs for intermediate steps before creating a final SNARK proof. The example shown here demonstrates 2-to-1 folding, but some folding schemes support n-to-1 folding.

Folding generally employs a technique called Random Linear Combination (RLC) where values are combined together with a random value. To ensure succinctness and to hide the witness, the prover uses commitments of the witness to apply RLC. This means that the commitment must support additive homomorphism.
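To make the role of additive homomorphism concrete, here is a minimal sketch using a toy Pedersen-style commitment. The group parameters (`P`, `Q`, `G`, `H`) and the fixed challenge `alpha` are illustrative assumptions chosen so the arithmetic is easy to follow; they are far too small to be secure and are not taken from any real system.

```python
# Toy Pedersen-style commitment in the order-11 subgroup of Z_23^*.
# All parameters are illustrative assumptions, not secure choices.
P = 23        # group modulus
Q = 11        # prime order of the subgroup containing G and H
G, H = 2, 3   # subgroup generators (in practice log_G(H) must be unknown)

def commit(m: int, r: int) -> int:
    """Commit to message m with blinding factor r: G^m * H^r mod P."""
    return pow(G, m % Q, P) * pow(H, r % Q, P) % P

def fold_commitments(c1: int, c2: int, alpha: int) -> int:
    """Random linear combination computed on the commitments only."""
    return c1 * pow(c2, alpha, P) % P

m1, r1 = 4, 7
m2, r2 = 9, 2
alpha = 5  # verifier challenge, fixed here for reproducibility

folded_commitment = fold_commitments(commit(m1, r1), commit(m2, r2), alpha)
commitment_of_folded = commit(m1 + alpha * m2, r1 + alpha * r2)
print(folded_commitment == commitment_of_folded)  # True
```

The verifier only ever touches `commit(m1, r1)` and `commit(m2, r2)`, yet ends up with a valid commitment to the combined message. This is the property a Merkle-tree commitment lacks.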

Since FRI-based STARKs use Merkle trees as commitments, which are not additively homomorphic, this style of folding cannot be applied directly to FRI-based STARK systems.

Module SIS problem

A lattice is a set of points arranged in a regular, repeating pattern: all integer linear combinations of a set of basis vectors. It is defined as:

L(\bm{b_1}, \bm{b_2}, \dots, \bm{b_n}) = \left\{ \sum_{i=1}^n x_i \bm{b_i} : x_i \in \mathbb{Z} \right\}

Here, the set $\{\bm{b_1}, \bm{b_2}, \dots, \bm{b_n}\} \subset \mathbb{R}^d$ is called the basis, $n$ is the rank, and $d$ is the dimension, with $n \le d$. The basis vectors must be linearly independent, that is:

x_1\bm{b_1} + x_2\bm{b_2} + \cdots + x_n\bm{b_n} = 0 \implies x_1 = x_2 = \cdots = x_n = 0

A given lattice can be generated by many different bases. One notable problem in lattices is the Shortest Vector Problem (SVP), which asks for the shortest non-zero vector in the lattice generated by a given basis. For instance:

\bm{b}_1 = (64, 218, 133), \quad \bm{b}_2 = (71, 205, 111), \quad \bm{b}_3 = (28, -48, -84). \\ \text{Lattice } = \{ x_1\bm{b}_1 + x_2\bm{b}_2 + x_3\bm{b}_3 \mid x_1, x_2, x_3 \in \mathbb{Z} \}.

Under the $L^2$-norm, the solution to the SVP for this lattice is $(1, 3, -1)$, obtained with $x_1 = -322, x_2 = 323, x_3 = -83$ (Reference). The problem becomes significantly harder with poorly chosen bases and in higher dimensions, and is believed to remain hard even for quantum computers.
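The coefficients quoted above can be checked directly. The short sketch below only confirms that the stated combination of basis vectors yields $(1, 3, -1)$ and that this vector is much shorter than any basis vector. It does not prove that no shorter lattice vector exists. The helper names `combine` and `l2_norm` are introduced here for illustration.

```python
import math

basis = [(64, 218, 133), (71, 205, 111), (28, -48, -84)]
coeffs = (-322, 323, -83)

def combine(coeffs, basis):
    """Return sum_i coeffs[i] * basis[i] as an integer vector."""
    dim = len(basis[0])
    return tuple(sum(c * b[j] for c, b in zip(coeffs, basis)) for j in range(dim))

def l2_norm(v):
    """Euclidean length of v."""
    return math.sqrt(sum(x * x for x in v))

short = combine(coeffs, basis)
print(short)                                            # (1, 3, -1)
print(l2_norm(short) < min(l2_norm(b) for b in basis))  # True
```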

The Short Integer Solution (SIS) problem was introduced in Ajtai’s 1996 paper, defined as:

Given a uniformly random integer matrix $\bm{A} \in \mathbb{Z}_q^{\kappa \times m}$ and a norm bound $B \in \mathbb{Z}$, find a non-zero integer vector $\bm{x} \in \mathbb{Z}^m$ such that $\bm{A} \cdot \bm{x} = \bm{0} \in \mathbb{Z}^{\kappa}_q$ and $0 < \|\bm{x}\| \le B$.
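A tiny instance can be solved by exhaustive search, which helps to see what a SIS solution looks like. In the sketch below, the matrix `A`, the modulus `q = 5`, and the use of the infinity norm are assumptions made for illustration. At cryptographic sizes this search is infeasible, which is the whole point of the problem.

```python
from itertools import product

q = 5
A = [[1, 2, 3, 4, 0],
     [2, 0, 1, 3, 4]]   # kappa = 2 rows, m = 5 columns (arbitrary toy values)
B = 1                   # bound on the infinity norm of the solution

def mat_vec_mod(A, x, q):
    """Compute A * x mod q."""
    return [sum(a * xi for a, xi in zip(row, x)) % q for row in A]

def find_short_solution(A, q, B):
    """Exhaustively search for non-zero x with A x = 0 (mod q) and max|x_i| <= B."""
    m = len(A[0])
    for x in product(range(-B, B + 1), repeat=m):
        if any(x) and not any(mat_vec_mod(A, x, q)):
            return x
    return None

x = find_short_solution(A, q, B)
print(x, mat_vec_mod(A, x, q))
```

A solution is guaranteed to exist here by a counting argument: there are $2^5 = 32$ binary vectors but only $5^2 = 25$ possible values of $\bm{A} \cdot \bm{x}$, so two binary vectors must collide, and their difference has entries in $\{-1, 0, 1\}$.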

Ajtai demonstrated a worst-case-to-average-case reduction: an efficient algorithm for random SIS instances would yield an efficient algorithm for approximate SVP on every lattice. The key insight is that the uniformly random matrix $\bm{A}$ makes it easy to sample instances that retain the hardness of worst-case lattice problems, which makes them practical for cryptographic applications. This was a pivotal step in advancing lattice-based cryptography.

The Module SIS (M-SIS) problem with bound $B$, a generalization of SIS, serves as the foundation for LatticeFold. It is defined as:

Given $R = \mathbb{Z}[X] / (X^d + 1)$ and $R_q = R / qR$, a uniformly random matrix $\bm{A} \in R^{\kappa \times m}_q$, and a norm bound $B \in \mathbb{Z}$, find a non-zero vector $\bm{x} \in R^m_q$ such that $\bm{A} \cdot \bm{x} = \bm{0} \in R^\kappa_q$ and $0 < \|\bm{x}\| \le B$. Here, $d$ is a power of 2, so that $X^d + 1$ is a cyclotomic polynomial (Reference).
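Arithmetic in $R_q$ differs from integer arithmetic in one key way: powers of $X$ wrap around with a sign flip because $X^d = -1$. The following sketch of multiplication in $\mathbb{Z}_q[X]/(X^d + 1)$ uses schoolbook multiplication on coefficient lists. The function name `ring_mul` and the toy parameters `q = 17`, `d = 4` are assumptions for illustration, and real implementations typically use faster NTT-based multiplication.

```python
def ring_mul(a, b, q):
    """Multiply two elements of Z_q[X]/(X^d + 1), given as coefficient lists of length d."""
    d = len(a)
    out = [0] * d
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            k = i + j
            if k < d:
                out[k] = (out[k] + ai * bj) % q
            else:
                # X^d = -1, so the term wraps around with a sign flip
                out[k - d] = (out[k - d] - ai * bj) % q
    return out

q, d = 17, 4
x = [0, 1, 0, 0]         # the polynomial X
x_cubed = [0, 0, 0, 1]   # the polynomial X^3
print(ring_mul(x, x_cubed, q))   # [16, 0, 0, 0], i.e. -1 mod 17
```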

Ajtai Commitment

In LatticeFold, the matrix $\bm{A}$ and witness vector $\bm{w}$ create a commitment $\bm{A} \cdot \bm{w}$, which is called an Ajtai commitment. For a commitment scheme, the following properties are required:

Hiding: The original message cannot be inferred from the commitment. This is satisfied if $\bm{A}$ is randomly sampled.
Binding: Each commitment maps to one original message with high probability. If the same commitment is created from two different messages $\bm{x_1}$ and $\bm{x_2}$, each with norm at most $B$, their difference is a solution to M-SIS with bound $2B$, which is assumed to be computationally hard to find.

\bm{A} \cdot \bm{x_1} = \bm{A} \cdot \bm{x_2} \text{ for } \bm{x_1} \ne \bm{x_2} \leftrightarrow \bm{A} \cdot (\bm{x_1} - \bm{x_2}) = 0 \text{ where } 0 < \| \bm{x_1} - \bm{x_2} \| \le 2B

Compression: The commitment size is smaller than the original message. This is achieved if $\kappa < m$.

Additionally, unlike SIS, M-SIS leverages rings instead of fields, introducing challenges in proving knowledge soundness. Specifically, not all ring elements are invertible, and even if they are, their norms can become excessively large, necessitating solutions to address this.

The Ajtai commitment supports additive homomorphism, making it suitable for folding schemes, which use random linear combinations. To preserve the binding property, the norm of the random linear combination of two witness vectors must stay within a certain bound $B$. If performed naively as shown below, the norm may become excessively large, exceeding $B$.

\bm{A} \cdot \bm{w_1} + \bm{A} \cdot \alpha \cdot \bm{w_2} = \bm{A} \cdot (\bm{w_1} + \alpha \cdot \bm{w_2})\text{, } \text{where } 0 < \|\bm{w_1} + \alpha \cdot \bm{w_2}\| < (1 + \alpha) \cdot B

In LatticeFold, the $L^{\infty}$-norm (infinity norm) is used for norm calculations. For a vector $\bm{v} = [v_1,v_2,\dots,v_n]$, the infinity norm is defined as:

\|\bm{v}\|_\infty = \max_{i} |v_i|

From this point onward, we use the infinity norm even without an infinity symbol.
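The tension between homomorphism and norm growth described above can be seen numerically. The sketch below is simplified to plain integers modulo $q$ (the SIS setting) rather than the ring $R_q$. The modulus, dimensions, seed, and witness values are all illustrative assumptions.

```python
import random

q = 257                 # toy modulus (assumption)
kappa, m = 2, 8         # kappa < m, so the commitment compresses
B = 4                   # witnesses must satisfy ||w|| < B

rng = random.Random(0)  # fixed seed so the sketch is reproducible
A = [[rng.randrange(q) for _ in range(m)] for _ in range(kappa)]

def commit(w):
    """Ajtai-style commitment A * w mod q."""
    return [sum(a * wi for a, wi in zip(row, w)) % q for row in A]

def inf_norm(v):
    """Infinity norm: the largest absolute entry."""
    return max(abs(x) for x in v)

w1 = [3, -3, 0, 1, 2, -1, 3, 0]
w2 = [3, 2, -1, 0, 3, -2, 1, 3]
alpha = 5               # a naive integer challenge

folded = [a + alpha * b for a, b in zip(w1, w2)]
lhs = [(c1 + alpha * c2) % q for c1, c2 in zip(commit(w1), commit(w2))]

print(lhs == commit(folded))   # True: the commitment is additively homomorphic
print(inf_norm(folded))        # 18, far above B = 4
```

The commitments combine correctly, but the folded witness no longer satisfies the norm bound. Repeating the fold would make this worse each time, which is the problem the protocol steps described next are designed to address.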

Protocol Explanation

LatticeFold is divided into three steps: Expansion, Decomposition, and Fold.

The figure above is taken from the LatticeFold paper. As it shows, we first define the multilinear extension (MLE), and then define the following two relations.

\mathcal{R}_{comp}^{B} = \{(\bm{A} \cdot \vec{f}; \vec{f}): \bm{A} \in R_q^{\kappa \times m} \land \vec{f} \in R_q^m \land 0 \le \|\vec{f}\| < B \ \} \\ \mathcal{R}_{acc}^{B} = \Big\{ \begin{aligned} &(\bm{A}\cdot \vec{f}, \bm{r}, v; \vec{f}) \end{aligned} : \begin{aligned} &(\bm{A}\cdot \vec{f}; \vec{f}) \in \mathcal{R}_{comp}^{B} \land \\ & \bm{r} \in R_q^{\log m} \land v \in R_q \wedge \mathsf{mle}[\hat{f}](\bm{r}) = v \end{aligned} \Big\}

The difference between the two relations lies with $(\bm{r}, v)$ . For simplicity, the public value $x$ will be omitted.

Expansion

In the Expansion step, $\mathcal{R}_{comp}^{B}$ is transformed into $\mathcal{R}_{acc}^{B}$ using the zero vector. This transformation is necessary because, in the Fold step, the structure must conform to the format of $\mathcal{R}_{acc}$.

(\bm{A} \cdot \vec{f}; \vec{f}) \rightarrow (\bm{A} \cdot \vec{f}, \bm{0}, \mathsf{mle}[\hat{f}](\bm{0}); \vec{f})
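A minimal sketch of the Expansion step follows, simplified to integers modulo a prime $q$ instead of $R_q$. The $\mathsf{eq}$ polynomial is the standard multilinear equality polynomial, and the helper names `eq`, `mle`, and `expand` are introduced here for illustration. Evaluating the MLE at the zero vector simply picks out the first entry of the witness.

```python
from itertools import product

q = 97  # toy prime modulus; the paper works over the ring R_q instead

def eq(r, x):
    """eq(r, x) = prod_i (r_i x_i + (1 - r_i)(1 - x_i)) mod q."""
    out = 1
    for ri, xi in zip(r, x):
        out = out * (ri * xi + (1 - ri) * (1 - xi)) % q
    return out

def mle(f, r):
    """Evaluate the multilinear extension of the table f at the point r."""
    points = product([0, 1], repeat=len(r))
    return sum(eq(r, x) * fx for x, fx in zip(points, f)) % q

def expand(f):
    """Turn a witness into an accumulated instance (r, v) with r = 0."""
    r = [0] * (len(f).bit_length() - 1)   # log m zeros, m a power of 2
    return r, mle(f, r)

f = [5, -2, 7, 1]            # witness of length m = 4, so log m = 2
print(expand(f))             # ([0, 0], 5)
```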

Decomposition

The Decomposition step takes two instances of $\mathcal{R}_{acc}^{B}$ as input and produces $2k$ instances of $\mathcal{R}_{acc}^b$ (see Fig. 6 above).

2 \times (\bm{A} \cdot \vec{f}_i, \bm{r_i}, \mathsf{mle}[\hat{f}_i](\bm{r_i}); \vec{f}_i) \rightarrow 2k \times (\bm{A} \cdot \vec{f}_j, \bm{r}_j, \mathsf{mle}[\hat{f}_j](\bm{r}_j); \vec{f}_j)

The Decomposition proceeds through the steps below for each relation:

The prover provides $\bm{cm_0}, \dots, \bm{cm_{k-1}}, v_0, \dots, v_{k-1}$ to the verifier, satisfying the following conditions:

\vec{f} = \sum_{i = 0}^{k-1}b^i\cdot \vec{f}_i\text{, where } 0 \le \|\vec{f}_i\| < b \\ \bm{cm_i} = \bm{A} \cdot \vec{f}_i \\ v_i = \mathsf{mle}[\hat{f}_i](\bm{r})

The verifier checks the following conditions:

\bm{A} \cdot \vec{f} \stackrel{?}= \sum_{i=0}^{k-1}b^i \cdot \bm{cm_i} \space \land v \stackrel{?}= \sum_{i = 0}^{k-1}b^i\cdot v_i

If the conditions above are satisfied, each $\mathcal{R}_{acc}^{B}$ instance is transformed into $k$ instances of $\mathcal{R}_{acc}^b$.

(\bm{A} \cdot \vec{f}, \bm{r}, \mathsf{mle}[\hat{f}](\bm{r}); \vec{f}) \rightarrow k \times (\bm{A} \cdot \vec{f}_i, \bm{r}, \mathsf{mle}[\hat{f}_i](\bm{r}); \vec{f}_i)

Applying this to both inputs turns the 2 instances of $\mathcal{R}_{acc}^B$ into $2k$ instances of $\mathcal{R}_{acc}^b$. This restricts the norm to $b$, which is smaller than $B$, and prevents the norm from growing excessively during the Fold step. However, note that the range check for each $\vec{f}_i$ has not yet been performed.
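The decomposition and the verifier's commitment check can be sketched as follows, again simplified to integers modulo $q$. The sign-magnitude digit decomposition used in `decompose` is one simple way to obtain limbs with entries in $(-b, b)$ and is an assumption of this sketch, as are the toy matrix and parameters.

```python
q = 257
b, k = 4, 3             # small base b and number of limbs k, so B = b**k = 64
A = [[3, 141, 59, 26], [53, 58, 97, 93]]   # arbitrary toy matrix, kappa = 2, m = 4

def commit(f):
    """Ajtai-style commitment A * f mod q."""
    return [sum(a * x for a, x in zip(row, f)) % q for row in A]

def decompose(f, b, k):
    """Split f into k vectors f_0..f_{k-1} with entries in (-b, b) and f = sum b^i f_i."""
    limbs = [[0] * len(f) for _ in range(k)]
    for pos, value in enumerate(f):
        sign = -1 if value < 0 else 1
        rest = abs(value)
        for i in range(k):
            limbs[i][pos] = sign * (rest % b)
            rest //= b
        assert rest == 0, "entry does not fit in k base-b digits"
    return limbs

f = [63, -17, 0, 42]    # ||f|| < B
limbs = decompose(f, b, k)

# Verifier-side check on commitments only: A f == sum_i b^i * cm_i (mod q).
cms = [commit(fi) for fi in limbs]
recombined = [sum(b**i * cm[j] for i, cm in enumerate(cms)) % q for j in range(len(A))]
print(limbs)                     # [[3, -1, 0, 2], [3, 0, 0, 2], [3, -1, 0, 2]]
print(recombined == commit(f))   # True
```

The check passes because the commitment is linear, but it says nothing about whether a dishonest prover's limbs are actually small. That is why a separate range check is still required.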

Fold

To verify that the entries of the $\vec{f}_i$ from the previous step lie within the range $(-b, b)$, the following polynomial is used.

P(X) = \prod_{i = -(b-1)}^{b- 1}(X - i)

The integers in $(-b, b)$ are the roots of $P(X)$. Therefore, if every entry of $\vec{f}_i$ lies within $(-b, b)$, evaluating $P$ at each entry gives 0, and the range check for $\vec{f}_i$ succeeds (this is used in the sumcheck step below).
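The behaviour of the range-check polynomial can be verified on small numbers. This sketch evaluates $P$ over integers modulo a prime `q`, which is assumed to be larger than $2b$ so that out-of-range values cannot accidentally hit a root. The values of `q` and `b` are illustrative.

```python
q = 97   # toy prime modulus, assumed larger than 2b
b = 3

def range_poly(x):
    """P(x) = prod_{i=-(b-1)}^{b-1} (x - i) mod q."""
    out = 1
    for i in range(-(b - 1), b):
        out = out * (x - i) % q
    return out

in_range = [range_poly(v) for v in range(-(b - 1), b)]
out_of_range = [range_poly(v) for v in (-b, b, 10)]
print(in_range)                            # [0, 0, 0, 0, 0]
print(all(p != 0 for p in out_of_range))   # True
```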

As mentioned earlier, the random challenges must also be sampled carefully. For this purpose, we sample from $C_{\mathsf{small}}$, known as a strong sampling set. Multiplying a witness by $\rho$ sampled from this set increases its norm by at most a factor of $C$, known as the expansion factor. If $C \cdot b \cdot 2k < B$, the norm of the folded witness remains less than $B$. Formally:

\left\|\sum_{i=0}^{2k-1}\rho_i \cdot \vec{f}_i\right\| \le \sum_{i=0}^{2k-1}\|\rho_i \cdot \vec{f}_i\| \le \sum_{i=0}^{2k-1} C \cdot \| \vec{f}_i\| \le \sum_{i=0}^{2k-1} C \cdot b = 2k \cdot C \cdot b < B
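The bound can be checked numerically on a toy example. In the sketch below each witness is a single ring element of $\mathbb{Z}[X]/(X^4 + 1)$ rather than a vector, and the challenges `rhos` are hypothetical stand-ins with coefficients in $\{-1, 0, 1\}$, not the actual set $C_{\mathsf{small}}$ from the paper. For these stand-ins, the sum of absolute coefficients of $\rho$ is a valid expansion factor, since every coefficient of $\rho \cdot f$ is a signed sum of products of one coefficient of $\rho$ and one of $f$.

```python
def negacyclic_mul(a, b):
    """Multiply in Z[X]/(X^d + 1) without reducing mod q, so norms stay visible."""
    d = len(a)
    out = [0] * d
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            if i + j < d:
                out[i + j] += ai * bj
            else:
                out[i + j - d] -= ai * bj   # X^d = -1
    return out

def inf_norm(v):
    """Infinity norm: the largest absolute coefficient."""
    return max(abs(x) for x in v)

b, k = 2, 2
# 2k = 4 decomposed witnesses, each a single ring element with ||f_i|| < b
fs = [[1, -1, 1, 1], [-1, -1, 0, 1], [1, 1, 1, -1], [0, 1, -1, -1]]
# hypothetical challenges with coefficients in {-1, 0, 1}
rhos = [[1, 0, -1, 0], [0, 1, 1, 0], [1, 1, 0, 0], [0, 0, 1, -1]]

# Expansion factor for this toy challenge set: the largest L1 norm of any rho.
C = max(sum(abs(c) for c in rho) for rho in rhos)

folded = [0] * 4
for rho, f in zip(rhos, fs):
    folded = [x + y for x, y in zip(folded, negacyclic_mul(rho, f))]

print(inf_norm(folded), "<", 2 * k * C * b)
```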

The Fold step takes $2k$ instances of $\mathcal{R}_{acc}^b$ as input and produces a single instance of $\mathcal{R}_{acc}^B$.

2k \times (\bm{A} \cdot \vec{f}_i, \bm{r_i}, \mathsf{mle}[\hat{f}_i](\bm{r_i}); \vec{f}_i) \rightarrow (\bm{A} \cdot \vec{f'}, \bm{r_{out}}, \mathsf{mle}[\hat{f'}](\bm{r_{out}}); \vec{f'}), \\ \text{where } \vec{f'} = \sum_{i=0}^{2k-1}\rho_i \cdot \vec{f}_i

The Fold proceeds through the steps below:

The verifier samples $\alpha_0, \dots, \alpha_{2k-1}, \mu_0, \dots, \mu_{2k - 1}, \bm{\beta}$ and sends them to the prover.
The prover performs the sumcheck protocol to prove the following.

\sum_{\bm{x} \in \{0, 1\}^{\log m}} g(\bm{x}) = \sum_{i=0}^{2k-1} \alpha_i \cdot v_i \\ \text{where } g(\bm{x}) = g_{\mathsf{eval}}(\bm{x}) + g_{\mathsf{norm}}(\bm{x}) \\ g_{\mathsf{eval}}(\bm{x}) = \sum_{i=0}^{2k-1}\alpha_i \cdot \mathsf{eq}(\bm{r_i}, \bm{x})\cdot \mathsf{mle}[\hat{f}_i](\bm{x}) \\ g_{\mathsf{norm}}(\bm{x}) = \sum_{i=0}^{2k-1}\mu_i \cdot \mathsf{eq}(\bm{\beta}, \bm{x})\cdot \prod_{j = -(b - 1)}^{b -1}(\mathsf{mle}[\hat{f}_i](\bm{x}) - j)

At the end of the sumcheck protocol, the following evaluation claim is obtained:

g(\bm{r_{out}}) \stackrel{?}= s

The prover sends $\theta_0, \dots, \theta_{2k - 1}$ to the verifier, where:

\theta_i = \mathsf{mle}[\hat{f}_i](\bm{r_{out}})

The verifier checks the following condition:

s \stackrel{?}= \sum_{i = 0}^{2k - 1}\left[\alpha_i \cdot \mathsf{eq}(\bm{r_i}, \bm{r_{out}})\cdot \theta_i + \mu_i \cdot \mathsf{eq}(\bm{\beta}, \bm{r_{out}})\cdot \prod_{j = -(b - 1)}^{b-1}(\theta_i - j)\right]

The verifier samples $\rho_0, \dots, \rho_{2k-1}$ from $C_{\mathsf{small}}$ and sends them to the prover. Using these, a new relation is created.

(\bm{A} \cdot \vec{f'}, \bm{r_{out}}, \mathsf{mle}[\hat{f'}](\bm{r_{out}}); \vec{f'})\text{, where } \vec{f'} = \sum_{i=0}^{2k-1}\rho_i \cdot \vec{f}_i
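To see why the sumcheck claim enforces both the evaluation claims and the range check, the sketch below brute-forces the sum over the hypercube instead of running the interactive sumcheck protocol. It is simplified to integers modulo a prime $q$ with only two witnesses. All concrete values (`rs`, `alphas`, `mus`, `beta`, `rhos`, `r_out`) are illustrative assumptions standing in for verifier randomness.

```python
from itertools import product

q = 97   # toy prime modulus; the protocol itself works over R_q
b = 2    # decomposed witnesses must have entries in (-b, b) = {-1, 0, 1}
n = 2    # log m
CUBE = list(product([0, 1], repeat=n))

def eq(r, x):
    """Multilinear equality polynomial eq(r, x) mod q."""
    out = 1
    for ri, xi in zip(r, x):
        out = out * (ri * xi + (1 - ri) * (1 - xi)) % q
    return out

def mle(f, r):
    """Evaluate the multilinear extension of the table f at r."""
    return sum(eq(r, x) * fx for x, fx in zip(CUBE, f)) % q

def range_poly(v):
    """prod_{j=-(b-1)}^{b-1} (v - j) mod q."""
    out = 1
    for j in range(-(b - 1), b):
        out = out * (v - j) % q
    return out

def g(x, fs, rs, alphas, mus, beta):
    """g(x) = g_eval(x) + g_norm(x) as defined in the sumcheck claim."""
    total = 0
    for f, r, a, mu in zip(fs, rs, alphas, mus):
        fx = mle(f, x)
        total += a * eq(r, x) * fx + mu * eq(beta, x) * range_poly(fx)
    return total % q

def claim_holds(fs, rs, alphas, mus, beta):
    """Brute-force the sum that the sumcheck protocol would prove."""
    lhs = sum(g(x, fs, rs, alphas, mus, beta) for x in CUBE) % q
    rhs = sum(a * mle(f, r) for a, f, r in zip(alphas, fs, rs)) % q
    return lhs == rhs

fs = [[1, 0, -1, 1], [0, -1, 1, 1]]
rs = [[3, 8], [5, 2]]
alphas, mus, beta = [7, 11], [13, 17], [3, 5]
print(claim_holds(fs, rs, alphas, mus, beta))    # True

bad = [[2, 0, -1, 1], fs[1]]                     # entry 2 is outside (-b, b)
print(claim_holds(bad, rs, alphas, mus, beta))   # False

# Final fold: by linearity of the MLE, the folded evaluation is sum rho_i * theta_i.
rhos, r_out = [1, -1], [4, 9]
f_new = [sum(rho * f[j] for rho, f in zip(rhos, fs)) for j in range(len(fs[0]))]
thetas = [mle(f, r_out) for f in fs]
print(mle(f_new, r_out) == sum(rho * t for rho, t in zip(rhos, thetas)) % q)  # True
```

When every entry is in range, the $g_{\mathsf{norm}}$ terms vanish on the whole hypercube and only the evaluation terms remain. A single out-of-range entry leaves a non-zero residue, so the claimed sum no longer matches for these challenge values.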

Conclusion

Although not covered in detail here, the paper also discusses the use of extension fields to enable smaller field sizes and methods to apply this approach to HyperNova’s CCS. However, some aspects of LatticeFold limit its application to Protostar, which remains an open problem. The significance of LatticeFold lies in it being the first to apply lattice-based cryptography to folding schemes, paving the way for incorporating lattice-based folding schemes into STARKs.

Written by ryan Kim from A41
