Blockchain Research Newsletter #7: Coded Merkle Tree: Solving Data Availability Attacks in Blockchains

By Mikerah and John Adler

Nov 25, 2019

In this edition of the Blockchain Research Newsletter, we are covering a protocol for solving data availability attacks in blockchains called SPAR (Sparse Fraud Protection) which leverages a new accumulator called a Coded Merkle Tree. This paper was recently published by Yu et al., and provides an efficient data availability scheme for light clients. The authors have implemented a version of SPAR for a Bitcoin client, highlighting the practical relevance of their work.

Motivation

Light clients need to download block headers from a trusted set of full nodes and verify these headers in order to ensure that they represent the blockchain. However, in practice, this is hard to guarantee. Recent work has shown that with fraud proofs, light clients can reduce the trust requirements on the set of full nodes they need to connect to. In fact, with fraud proofs, the trust assumption reduces to only needing a single, honest full node instead of a trustworthy set of full nodes. However, it is possible for a malicious full node to withhold parts of a block in order to fool a light client in to accepting a block header associated with that block. This is known as the data availability problem. Restated, how can a light node ensure that a block whose integrity it is trying to ascertain is available to others in the network?

Background

Before diving into the particulars of how SPAR attempts to solve the data availability problem, one needs a high-level understanding of erasure coding, a technique used to encode data into pieces for efficient retrieval and reconstruction at a later time.

Erasure Coding

Erasure coding is an information theoretic way to extend n-byte sized data into m-byte sized data such that m > n and one need only n of the m pieces in order to reconstruct the data. The code rate is n/m and n’/n is the reception efficiency, where n’ denotes the number of symbols needed to retrieve the data. The code rate tells us the proportion of the data that is non-redundant i.e. the minimum amount of information needed to reconstruct the original data. Erasure coding has applications in distributed systems, most notably in RAID systems. One of the first proposals for using erasure coding to combat data availability was by Buterin. That work was formalized and expanded upon by Al-Bassam in 2018.

Description of SPAR

Before going into the SPAR scheme, we will cover some assumptions it makes about light nodes and full nodes and present the coded merkle tree construction.

Security and Network Model

Light nodes are connected to a set of full nodes in which at least one of the full nodes is honest. The light nodes are not necessarily connected to one another. There is at least one full node connected to all light nodes. All full and light nodes get sent messages about block data. The network is synchronous and network communication is lossless and secure. Users can send messages anonymously. Finally, the scheme allows for a dishonest majority of full nodes (or, more precisely, a dishonest majority of block producers).

Coded Merkle Trees

A coded merkle tree (CMT) is a modified merkle tree in which every layer of the merkle tree is erasure coded. A CMT is constructed as follows: Given k data symbols which are formed from the block, a rate r such that r is at most 1, an undecodable ratio of A,

The leaves of the CMT is formed by first ordering the k data symbols and then placing the the next n-k encoded data symbols. Here, n = k/r. Take the hash of all these symbols.
Then, batch q of these coded symbols to form 1 data symbol for the next layer. Now, we have n/q data symbols.
Iterate this process until you get only t hashes such that t is at least 1.

In the example picture above, k = 16, r = 0.5, q = 4 and t = 4. A vanilla merkle tree is a CMT where r = 1 and q = 2.

The benefits of CMTs are two-fold. First, light clients only need to randomly sample relevant layers in order to prove the availability of every layer. Second, for full nodes, they can prove that layers have been incorrectly encoded and reject the block associated with the CMT and send a proof showing this.

SPAR Scheme

We are now ready to give an overview of the SPAR scheme.

First, a block producer will create a block, along with a CMT root. They will then propagate this to all other nodes in the network. The other full nodes are expected to verify the availability of the block and the correctness of the CMT root. The light clients only need the CMT root in order to ascertain the availability of a given block.

In order for a light client to be reasonably convinced of the availability of a given block, they will request random samples from several full nodes that they are connected to. In particular, they will sample for layers of a given block’s CMT. If the light client receives all the samples it requested within a fixed time, then they can accept that block as available. If the client doesn’t receive all the samples it requested within the timeframe, then it marks that block as pending and will redo the procedure at a later time. On the other hand, if they receive a bad proof i.e. an incorrectly coded proof, they reject that block and update the corresponding layer of the CMT for that block.

For other full nodes in the network, they are in charge of responding to light client queries and ascertaining the availability of blocks. When they receive samples from light clients, they download the original block from other full nodes and attempt to decode the associated CMT. If the CMT was incorrectly coded then they will propagate a proof that the given block is unavailable to all the other nodes, both light and full, in the network. If they have successfully decoded a block, then they will declare this to all the other nodes.

Conclusion

We covered a novel scheme called SPAR that aims to solve the data availability problem for light clients. It makes use of a novel hash-based accumulator called coded merkle tree in order to guarantee its efficiencies. If you would like to read the paper in depth, you can do so here.

Blockchain Research Newsletter #6: Proof-of-Burn

By Mikerah and John Adler

Mikerah

Oct 18, 2019

In this edition of the Blockchain Research Newsletter, we are covering a well-understood but informally defined concept, Proof-of-Burn. The concept of proofs of burn has been in used in Bitcoin and other blockchains extensively. However, until the work of Karantias et al., was poorly defined from a formalization point of view. Being able to formally define concepts enables researchers to discuss work without ambiguity, and more importantly provide proofs of various properties of a system. Moreover, in this paper, the authors provide a general Proof-of-Burn protocol that can be used for any cryptocurrency. They prove that their protocol is correct and secure under the random oracle model. In addition to providing this protocol, they go into depth about a particular use case for a Proof-of-Burn protocol: bootstrapping a new cryptocurrency.

Motivation

With fiat money, there is no way to show that you have destroyed (burned) a certain amount of money in a provable way. In fact, in the context of fiat money, what does burning money even mean? Burning actual legal tender will get you arrested in most countries and debit/credit cards are just numbers in a database that you can't provably show that you can burn. Cryptocurrencies, however, give us a way to prove that a user cannot spend a certain amount of coin. Not only can you provably make money unspendable, this process can be binding---you can trace who the previous owner was---and can be censorship resistant, i.e., no one can prevent you from burning your own coins. There are a few use cases in which one might want to burn a part of their cryptocurrency holdings, e.g., as a Sybil resistance to join a particular application.

Proof-of-Burn

Roughly, a Proof-of-Burn protocol is a way to provably destroy your coins. More formally, a Proof-of-Burn protocol is defined as follows:For a given security parameter k, a Proof-of-Burn protocol consists of the following functions:

A function called GenerateBurnAddr with a nonce and a tag as inputs, returns a burn address
A function called VerifyBurn with a nonce, a tag and a burn address as inputs, returns whether or not the tag is correctly encoded with respect to the burn address.

Moreover, Proof-of-Burn protocols satisfy the following properties:

Unspendability: Once burnt, coins are no longer spendable.
Binding: A burn transaction commits to a single user-generated string called a tag.
Uncensorability: Miners (or more generally, validators) cannot censor burn transactions

Further, the authors provide game-based security definitions for Proof-of-Burn protocols. These definitions are given in the context of a mathematical formalization of unspendability and binding. Although the formalization is outside the scope of this summary, we will provide some intuition for these formulations. Consider a challenger that might want to spend burned coins and commit to multiple tags. In the first case, the challenger wants to generate a tag, a transaction, a signature, and public key such that VerifyBurn and the transaction verification both return true. Thus, a Proof-of-Burn protocol is unspendable if there is negligible function for which the probability of this occurring is small. In the second case, the challenger wants to generate a 2 tags, t and t’, and a burn address such that t and t’ are different, and that VerifyBurn returns true on both t and t’ for the same burn address and nonce. Thus, a Proof-of-Burn protocol is binding if there exists some negligible function for which the probability of this occurring is small. With both of these definitions, we can define what it means for a proof of burn protocol to be secure. A Proof-of-Burn protocol is secure if it is both unspendable and binding.

Protocol

The authors provide a Proof-of-Burn protocol that satisfy all the properties previously discussed. This protocol was designed to work with Bitcoin’s Pay to Public Key Hash (P2PKH), although it can be modified to work with other blockchains.

The Proof-of-Burn protocol is constructed as follows:
GenerateBurnAddr(1^k, t):

   th = H(t)

   th’ = th XOR 1

   return th’

VerifyBurn(1^k, t, th’):

   return whether GenerateBurnAddr(1^k,t) is equal to th’

where H is a cryptographic hash function and XOR is the exclusive or function.

Conclusion

We have summarized what the authors have defined as a Proof-of-Burn protocol and what it means for it to be secure. Moreover, we went over a construction of a Proof-of-Burn protocol that the authors proposed that satisfy the various definitions given for secure Proof-of-Burn protocols. You can read the paper here if you want to deep dive into the mathematical details.

Blockchain Research Newsletter #5: Incentives in Ethereum's Hybrid Casper Protocol

By Mikerah and John Adler

Mikerah

Aug 11, 2019

In this edition of the Blockchain Research Newsletter, we will be covering Incentives in Ethereum’s Hybrid Casper Protocol by Vitalik Buterin et al. This paper provides a mathematical framework for analyzing the incentives of Casper FFG as originally planned in EIP1011. Moreover, it analyzes the liveness and safety properties of Casper FFG and shows that the hybrid Casper protocol provides better guarantees than pure PoW.

Motivation

Pure Proof of Work (PoW) has several properties that aren’t necessarily desirable for blockchain systems, namely probabilistic finality and possibility of block reversions. Probabilistic finality is the property that it is harder and harder to revert a block the deeper it is in the blockchain. It doesn’t guarantee that once your transaction is sent, then it can never be reverted. This is problematic especially for large transactions. As everything is public, a well-resourced adversary can see this transaction and try to revert it. This is why we want to be able to have deterministic finality in blockchain systems. Deterministic finality gives us a guarantee that once a block has been included in the blockchain, that it will not be reverted. This makes it such that large transactions, even though public, can be done without having to worry about a potential block reversion.

Overview

Casper FFG is a stake-based overlay network for PoW that provides deterministic finality to the base PoW chain.

Casper FFG Description

At its core, Casper FFG is a smart contract deployed onto a PoW chain with enough expressiveness like Ethereum. In order to become a validator, one needs to make a deposit to the smart contract. One can cease being a validator by requesting their deposit back from the smart contract but not before a particular exit period has elapsed. In practice, this exit period is around 120 days.

The validators’ primary task is to vote for checkpoints every epoch. An epoch is the number of blocks between 2 checkpoints. A checkpoint is essentially a snapshot of a block in which once it’s finalized, any blocks before that checkpoint cannot be reverted. A checkpoint is said to be justified if at least ⅔ of the validators, in terms of stake, vote for a checkpoint. A checkpoint is said to be finalized if it is justified and it comes before a checkpoint that has been justified. Two checkpoints are conflicting if they are at the same height and neither comes before the other.

In order to gain the property of economic finalization on the PoW chain, amendments to the PoW chain’s fork choice rule need to be made in order to take into account finalized checkpoints. In addition to taking into account a block’s difficulty, clients need to take into account whether a block is a finalized checkpoint. Clients need to periodically query the smart contract to check for these checkpoints and if there is a tie, consider the amount of accumulated work in a block. Clients only need to consider epochs in which the total stake deposited meets a specified threshold. In the case where there are no justified checkpoints after the genesis block, the fork choice rule simply reverts to the PoW’s chain original fork choice rule.

Incentive Analysis

In order to ensure that the validators in Casper FFG behave properly, i.e., finalize and justify checkpoints in according to the protocol description, validators are rewarded and penalized according to how they vote. As we will explain, Casper’s incentives are designed to be incentive compatible.

Each validator has a deposit `D_v`. If a validator votes properly in an epoch, i.e., they vote for non-conflicting blocks and those blocks get finalized, then their deposit `D_v` increases by a positive interest rate. This positive interest rate is dependent on the total deposited from all the validators and the total number of validators voting. If a validator doesn’t vote during an epoch, then they are penalized and their deposit decreases. The amount by which their deposit decreases is dependent on the total number of non-validators. These penalties for non-voting get worse if blocks don’t get finalized for long periods of time. We consider validators who submit incorrect votes as non-voters. As such, they get penalized as would a non-voter. A validator that submits conflicting votes gets their deposits entirely removed or partially penalized depending on how serious the violation was and in proportion to how the protocol is performing, i.e., how many blocks have been finalized so far in proportion to the number of validators.

How much a validator makes is dependent on 3 network parameters, base interest rate, total deposit dependence and base penalty in addition to the number of epochs since the last block has been finalized and whether the validator is voting or not. The math is presented in the paper, which we will not cover in the newsletter. At a high level, each validator deposit for each epoch, `D_{v,i}`, is calculated as a function of a validator’s individual reward factor, which is then used to calculate the collective reward factor. Then, based on the validator’s individual reward factor and the collective reward factor for an epoch, the validator’s deposit increases or decreases for the epoch.

For misbehaving validators, i.e., validators that trigger the slashing conditions:

Voting for 2 blocks at the same height
Voting for blocks that are within each others span, i.e., surround votes

They get there deposits entirely or partially slashed and the validator that finds these violations get a finder’s fee that is 4% of the offending validator’s deposit.

Liveness and Safety Analysis

For the purposes of analyzing the liveness and safety guarantees of hybrid Casper FFG, the usual definitions of liveness and safety have been adapted to fit hybrid Casper FFG, a threat model needs to be defined and fault types within this threat model need to be specified.

Recall, liveness is the guarantee that something good will eventually happen and safety is the guarantee that something bad will never happen. In the context of hybrid Casper FFG, liveness is the guarantee that a proportion of the nodes will finalize checkpoints in finite time and safety is the guarantee that a proportion of the nodes that consider a checkpoint finalized at time `t` will consider that same checkpoint finalized at some time `t’` such that `t’ > t`.

The threat model that is considered is the following:

Assume that an adversary controls 49% of the total stake for an infinite amount of time. Moreover, assume that the honest majority assumption holds. Collusion between miners and validators is not considered. Finally, the network is partially synchronous.

As finalization is dependent on the total amount of validators in terms of stake, liveness and safety are dependent on validator deposits and as such, total stake.

First, let’s consider the liveness of Casper. As defined above, if at least ⅔ of the total stake is controlled by correctly voting validators, then liveness is guaranteed as blocks are getting finalized as per the protocol specification. If less than ⅔ of the total stake is controlled by correctly voting validators, we need to be a bit careful. In this case, finalization doesn’t occur since the total stake is mostly controlled by non-voting validators. However, notice that with each epoch, the non-voting validators deposits decreases and the voting validators deposits increases. Thus, every epoch the total stake controlled by voting validators increases. Eventually, the total stake controlled by voting validators will reach and exceed ⅔ of the total stake. Thus, finalization has resumed and liveness is guaranteed.

Now, let’s consider the safety of Casper. We consider 2 cases: one in which the network is not partitioned and the other in which the network is partitioned. In the case in which the network is not partitioned, we need to worry about the nothing at stake problem. In a non-partitioned network, validators can view all the possible chains that are being finalized and decide to vote on all on them in order to increase their chances of increasing their deposits. Remember that in order for a block to be finalized, it has to be a child of a previously finalized block. So, different checkpoints on different chains can’t be finalized unless ⅓ of the total stake is controlled by incorrectly voting validators, i.e., slashing conditions have been violated. Now, let’s consider the case in which the network is partitioned. Now, we need to worry about long range attacks. Long range attacks are attacks in which an adversary tries to provide an alternative history of the chain with the same genesis block. Due to weak subjectivity, nodes can’t immediately distinguish between the canonical chain and alternate chain. Using reasoning similar to how we showed liveness, notice that the honest validators will always be able to finalize checkpoints on the canonical chain due to how much of the total stake they control. The fork choice rule defined previously ensures that validators will vote on a chain that i) will remain the same in the future, i.e., the canonical chain and ii) such that checkpoints won’t be overwritten.

Conclusion

We summarized Casper FFG, a stake-based finality gadget that adds deterministic finality to a PoW chain. You can read the full paper here. Although EIP1011 has been deprecated, Casper FFG has been modified to fit a pure PoS blockchain and there are plans to use its finality to potentially finalize the legacy PoW chain.

Blockchain Research Newsletter #4: Towards a Functional Fee Market for Cryptocurrencies

By Mikerah and John Adler

Mikerah

Jun 16, 2019

In this edition of the Blockchain Research Newsletter, we will go through Towards a Functional Fee Market for Cryptocurrencies, a paper that proposes an alternative fee system to the current way blockchain networks price transaction fees. Some of the ideas in this paper have been proposed as modifications to Ethereum’s current fee mechanism in EIP 1559.

Motivation

Whenever you send cryptocurrencies to someone else, you need to pay a fee in order to prevent spam in permissionless blockchains. The higher the fee, the more likely your transaction is to get into an earlier block; the lower the fee, the less likely your transaction will be included in an earlier block (with fees low enough, or no fees, your transaction may never get included). This makes for terrible UX in that users can easily overpay to get their transactions included in a block and miners get unstable revenue from fees. In general, you can view this market as a first price auction for transaction space in a block. Miners auction off space in their block to users who want their transactions included.

Background

Before diving into the specific proposal, first we will go over first and second price auctions upon which the paper’s proposal modifies for a cryptocurrency context.

First Price Auctions

Generalized First Price auctions (GFPs) are what people generally think of when they hear the word “auction.” GFPs are auctions in which positions are sold to the highest bidder according to how “high” that position is. Each bidder pays in accordance to their bid. For example, in sponsored search, the highest bidder gets the highest slot on the webpage and pays their bidding price, the second highest bidder gets the second highest slot on the webpage and pays their bidding price, etc.

The main problem with GFPs is that they allow for strategic behavior and as such, don’t possess a pure Nash equilibrium. This is especially problematic in a blockchain context as all miners are pseudonymous and these miners operate under adversarial conditions. This results in a saw-tooth-like pattern for transaction fees.

From "Strategic bidder behavior in sponsored search auctions"

Second Price Auctions

Generalized Second Price auctions (GSPs) are slightly different than GFPs. Just as in GFPs, the highest bidder gets the highest slot, the second highest bidder gets the second highest slot, etc. The main difference here is that the bidders don't pay their original bidding price. They instead pay the price of the bidder below them. In other words, the highest bidder doesn’t pay the highest bid price that they bid but instead the second bidder’s price and so on for each subsequent bidder. Even though GSPs don’t have an equilibrium, in practice, they have produced more predictable and stable prices for bidders. Search engines such as Google use a modified GSP for selling ad slots for their webpages.

Model

In the model used to present the new fee market, each block has K slots, which represent transactions, N users where we assume that N > K and that the probability that a user has their transaction in the mempool is d. Each user is an identically and independently distributed random variable. Each user has a non-zero real-valued bid for their transaction, that is also non-zero and real-valued, to get included in a block.

A miner looking to make a profit will select the K highest bids or all available bids if there are less than K bids available. A miner only makes a profit if the block they mined actually gets included into the blockchain as per the usual blockchain protocol.

Alternative Fee Mechanism Proposal

The fee mechanism is actually quite simple, and can be broken down into a few steps:

Users attach a fee to their transaction (as they do now). Note that this fee is the maximum fee the user is willing to pay to have their transaction included.
All transactions in a block pay the minimum fee paid for any transaction in that block. This step is the one that bears similarities to a GSP.

The model used assumes that each transaction is equally-sized, with K total transactions per blocks. The above steps can be trivially modified to use something like “fee rate per byte” rather than “fee,” to account for differently-sized transactions.

The two steps above are potentially gameable by miners (or more generally block producers), as they can include transactions to themselves and ignore other transactions with lower fees to potentially raise the total fees collected per transactions (by raising the minimum fee in a block). In practice this isn’t actually an issue, as a miner that includes such a transaction is leaving actual fee-paying transactions on the table.

From a theoretical perspective, however, this issue needs to be remedied:

Miners of a block are paid the average fees of the last B blocks. This has the effect of smoothing out how fees are distributed to miners.
Miners always need to fill blocks.
1. A block is full if it has K transactions or it has fewer than K transactions and pays a fill penalty, which is (K - # of tx in block) * fee paid by each tx in block. Alternatively, the miner can declare that there are insufficient transactions to fill the block in the mempool, at which point all transactions in the block pay the minimum fee.
2. The minimum fee is set at the protocol level. It can be implemented as the minimum fee required for mempool inclusion and propagation.

The incentive for the fill penalty is that larger blocks are more likely to be orphaned by other miners (as they take longer to validate). Unfortunately, a metric for this negative incentive is not presented, so it’s unclear where exactly the balance lies. In addition, this negative incentive only exists for PoW cryptocurrencies. Most PoS-based consensus protocols do not have a race to broadcast blocks lest they be orphaned, so it’s unclear how this fee mechanism proposal can be extended for PoS.

Conclusion

In this edition, we presented an alternative proposal to improve fee markets for users and miners in cryptocurrencies. You can read the original paper here. While possibly not perfect, it does surely offer a vast improvement over the current fee mechanism used for most cryptocurrencies today.

Blockchain Research Newsletter #3: NiPoPoW and FlyClient

by Mikerah and John Adler

Mikerah

May 09, 2019

In this edition of the Blockchain Research Newsletter, we summarize two proposals for developing trust-minimized light-clients for proof-of-work blockchains: Non-Interactive Proofs of Proof of Work (NIPoPoWs) and FlyClient. NIPoPoWs were introduced in 2017 by IOHK researchers, Kiayias et al. They haven’t been widely deployed in practice yet but are planned to be leveraged with the Cardano project. FlyClient, first introduced by Benedikt Bünz at Scaling Bitcoin 2018, has been proposed in a ZIP for the Zcash blockchain and is currently also being considered for the Ethereum blockchain.

Motivation

Blockchains require that all nodes in the network validate any state changes that occur. In other words, all nodes have to check the validity of every transaction and thus every block that gets sent across the network. Additionally, nodes have to download the entire blockchain in order to ensure its validity. For fully validating nodes (including miners), this isn’t a problem. However, smaller devices like mobile phones cannot handle downloading and processing the entire history of the blockchain. This is where light-clients come in. Light clients enables less powerful devices to access the state of the blockchain that is relevant to them. They don’t need to validate the entire chain but instead validate only block headers. Applications such as wallets can use light clients in order to easily access blockchain data and interact with the blockchain.

Historical light client designs relied on the honesty of miners and having to store every block header. This can be a bottleneck for light clients, as storage is now linear in the number of block headers---especially problematic is low blocktimes are used. Practically, this means that a light client might end up storing gigabytes of data which is still a lot of for low-power devices such as cell phones.

Non-Interactive Proofs of Proof of Work

Non-Interactive Proofs of Proof of Work (NIPoPoWs) is a light client proposal, originally published in 2017 by Kiayias et al. It builds upon Kiayias’s previous work on Proofs of Proof of Work (PoPoWs) by making this construction non-interactive.

Model

In the model use to present NIPoPoWs, we consider three actors: light clients, full nodes, and miners. In cryptography speak, the light clients are verifiers and the full nodes are provers. Full nodes need to prove the validity of blocks and block headers to light clients that verify these proofs. We assume a synchronous network model. Moreover, we model the proof of work process as a random oracle. In plain English, miners query a fair oracle with “may I produce the next block,” with the oracle randomly assigning a block producer. The model assumes that the difficulty is constant, i.e., the difficulty stays the same throughout the lifetime of the network.

Overview

The underlying data structure that support NIPoPoWs is the interlink. It is a skip-list that allows for a sparse sampling of a sequence of block headers. Specifically, links to superblocks (i.e., blocks that satisfy a higher difficulty target that necessary) are contained within the interlink, which is stored in block headers. This can be implemented via a fork, including a velvet fork; details of this fork procedure can be found in the paper.

Given a difficulty target of d leading zeroes, a superblock hashes to more than d leading zeroes. In expectations, half of all mined blocks will have d+1 leading zeros, a quarter will have d+2 leading zeros, and so on. This information is stored in the interlink as levels, as shown in the figure below. Level 0 of the interlink contains links to blocks that at least meet difficulty d, level 1 contains links to blocks that at least meet difficulty d+1, and so on.

A NIPoPoW can be constructed as follows, shown in the figure below:

Start at the highest level with at least m blocks. In this example, level 2. We don’t want levels with fewer than m blocks to prevent freak outliers from influencing the proof.
From this level, choose the last m blocks, appending them to the proof.
Go down one level, then choose all blocks ahead of the earliest block chosen previously, appending them to the proof.
Repeat step 3 until there are no more levels.

To non-interactively verify proofs: given multiple proofs, select the one with the highest score (we assume that at least one proof is honest). Scores are calculated as follows:

For each level, calculate the number of blocks meeting at least the difficulty target of the level.
Return the maximum of the above.

Note that the parameter m must be carefully chosen. Longer proofs are more secure and resilient to a larger adversarial hashrate, but are longer.

FlyClient

FlyClient is a light client proposal for PoW blockchains such as Bitcoin and Ethereum proposed by Bünz et al. in 2019. It makes use of a variant of Merkle trees called Merkle Mountain Ranges (MMRs). It only needs to store a logarithmic number of block headers and provides strong mechanisms for ensuring the validity of these block headers.

Background: Merkle Mountain Ranges

Before we dive into the FlyClient protocol, let’s go over the MMR data structure. MMRs are an extension of Merkle trees that allow for each append operations. They were first introduced in 2016 by Bitcoin Core developer Peter Todd.

Merkle trees are ubiquitous in blockchain systems. They are a form of authenticated data structure with logarithmic proofs of inclusion. However, they are not without shortcomings, such as when we want to insert an extra leaf into the tree. A previously-balanced Merkle tree would become unbalanced, resulting in larger proof sizes. Rebalancing the tree to keep proof sizes with logarithmic worst-case performance results in significant computational overhead, as many hashes must be re-computed.

In order to get over the shortcomings of plain Merkle trees, Merkle Mountain Ranges (MMRs) were devised. They enable us to append new data to a Merkle tree without having to regenerate it constantly, and keep the resulting tree somewhat balanced.

Visualization of Merkle Mountain Range append operations.

The update process for MMRs work as follows:

Line up all the leaf nodes.
Starting from the left, create a Merkle tree using as many nodes as possible (this will be a power of 2).
Using the remaining nodes, create as many maximal Merkle trees as possible with the remaining leaves (step 2).
Now, we have several Merkle trees. Notice that in the picture above, we see that these Merkle trees have a mountain range-like structure with multiple peaks.
Starting from the rightmost tree, hash the roots of 2 consecutive trees together.
Repeat step 5 as in the usual Merkle tree construction.

Overview

The FlyClient protocol uses three main building blocks: Merkle Mountain Ranges, probabilistic sampling, and the Fiat-Shamir heuristic. In the protocol, a light client has to decide between two chains provided by two full nodes, one of which provides an honest chain. In cryptography language, the light client is known as the verifier and the two full nodes are known as provers. The goal is to have the verifier correctly guess which prover is giving the client the honest chain.

The FlyClient protocol requires changes that can be added through a soft fork or a hard fork. Block headers need to include a Merkle root to a MMR that contains commitments to all the previous block headers of the chain.

The protocol works as follows:

Both full nodes send the latest block header to the light client.
Iterate j times, where j is bounded logarithmically by the number of block headers:
1. The light client queries k random block headers from each full node, where k is determined by a fraction of the honest chain’s computing power. In order to sample blocks, we observe the following: A malicious full node can only mine a subset of the blocks due to limited computing power. Thus, there will be a fork point at which an honest node’s chain and a malicious node’s chain will differ.
2. For every block header, a full node provides a MMR proof that this block is included in its chain at a particular position
3. The light client checks the MMR proof and proof of work done for each block header. If any of these checks are incorrect, the light client rejects the full node that provided the invalid block header.
If the full node has not been rejected, then the light client accepts that full node’s chain as correct.

In practice, step 2 is in fact a hash of the block header at each iteration. This is an instantiation of the Fiat-Shamir heuristic. So, we now have a non-interactive protocol instead of an interactive one.

Comparison

Both proposals aim to tackle the light client problem using different security assumptions and cryptographic primitives. NIPoPoW assumes a constant difficulty, synchronous network. On the other hand, FlyClient uses more realistic assumptions on network conditions, i.e., a variable difficulty, partially synchronous network. Both are designed to have a light client only store a logarithmic number of block headers instead of a linear number of block headers. In terms of efficiency, both protocols offer reasonably efficient proof sizes, with FlyClient providing shorter proof sizes than NIPoPoW. The authors of FlyClient show that FlyClient can provide up to a 40% improvement over NIPoPoW in this aspect.

One major difference is NIPoPoW’s reliance on superblocks. This means that NIPoPoWs cannot be applied to PoS blockchains or more generally, Sybil resistance mechanisms that don’t have an equivalent notion of work. Moreover, the reliance on the rarity of superblocks makes NIPoPoW-based light clients susceptible to bribing attacks. An attacker can pay miners to withhold their blocks in favor of broadcasting superblocks. FlyClient on the other hand, only uses randomness after a block has been mined, as light client randomly samples already-mined blocks. Thus, NIPoPoW is only efficient when there are no adversaries, but remains secure in the presence on an adversarial minority.

Conclusion

In this edition, we have presented two different light client proposals, FlyClient and Non-Interactive Proofs of Proof of Work. Both provide an efficient means to enable low-power devices to send and verify transactions to blockchains, and have applications in cross-chain communications protocols. If you want to read both papers in-depth, you can the read the FlyClient paper here and NIPoPoW paper here.

Loading more posts…