A Trailing Finality Layer for Zcash

This book introduces and specifies a Trailing Finality Layer for the Zcash network. This is version 0.1.0 of the book.

This design augments the existing Zcash Proof‑of‑Work (PoW) network with a new consensus layer which provides trailing finality. This layer enables transactions included via PoW to become final which assures that they cannot be reverted by the protocol. This enables safer and simpler wallets and other infrastructure, and aids trust-minimized cross-chain bridges. This consensus layer uses Proof-of-Stake consensus, and enables ZEC holders to earn protocol rewards for contributing to the security of the Zcash network. By integrating a PoS layer with the current PoW Zcash protocol, this design specifies a hybrid consensus protocol dubbed PoW+TFL.

The rest of this introductory chapter is aimed at a general audience interested in the context of this proposal within Zcash development, status and next steps, motivations, a primer on finality, and tips to get involved.

A Path to Proof-of-Stake Zcash

The TFL design provides a possible first step in transitioning Zcash to a PoS protocol. Here we describe how a transition to PoS relates to “the Zcash roadmap” and how TFL fits into one approach to a PoS transition.

The Zcash Tech-Tree

There are multiple developer orgs working on different proposed features for Zcash. Some of these involve multiple large distinct upgrade steps, and several of these steps depend on other such steps. This could be represented as a directed acyclic graph. We have begun referring to this space of possible future improvements as the Zcash Tech-Tree, taking inspiration from an analogous concept in gaming.¹

We envision a proof-of-stake transition path as one of the potential paths within this tech-tree which is the primary protocol focus of this proposal. An example visualization of this Zcash Tech-Tree might look like this:

A Proof-of-Stake Transition Path

Given that context, we envision a path within the Zcash Tech-Tree for transitioning Zcash to PoS. At the top level we propose that this path should include at least two major milestones:

Transitioning from Zcash NU5/NU6 PoW consensus to a PoW/PoS hybrid consensus protocol dubbed PoW+TFL.
Transitioning from PoW+TFL to pure PoS protocol.

After this transition to pure PoS, there are likely to be future improvements to the PoS protocol, or the consensus protocol more generally. This TFL book focuses almost exclusively on the first step in this sequence.

Our primary motivation for proposing (at least) two steps is to minimize disruption to usability, safety, security, and the ecosystem during each step.

This book primarily focuses this first step: the transition to PoW+TFL. To understand the specific goals for that, see Design Goals.

With this approach, the Zcash Tech Tree with the TFL approach might look something like this:

Why Two Steps?

One question we've gotten in proposing this approach is why take a two-step process with an intermediate hybrid consensus protocol, rather than a single transition directly to a PoS protocol?

Here’s how we think about those trade-offs:

Considering Single Transition (vs Hybrid Multi-Step)

Pros

We already understand the current PoW protocol well, and if we transition to an existing proven PoS protocol, then we could skip the complexity of an intermediate hybrid stage.
The node implementation might be simpler.
Explaining to people what is happening might be simpler. Something like “Zcash has been PoW since it launched, but on DATE (e.g. at block height $X$ ) it will switch to PoS.”
Given that the issuance in a given time period is bounded by the supply curve, the full amount that was previously allocated to mining rewards becomes immediately available for staking rewards at the switch-over, rather than having to share this amount between mining and staking during the hybrid stage.

Cons

If there is any unforeseen show‑stopping problem in the new protocol or the transition process, we’d have to react to a network‑wide issue.
It may be more likely to cause ecosystem disruption; unforeseen differences between PoW and PoS might cause various kinds of snags or papercuts throughout the ecosystem, and these would all pop up around the same time, which may lead to a loss of confidence/retention/adoption or at the very least inconvenience many users for some time.
Losing miners: since the transition would be all at once, we may lose some number of miners, who are participants and users in the ecosystem. Miners may leave prior to the transition in order to take care of their own needs. If there is some showstopper in the transition, one possible short‑term mitigation would be to fall back on PoW which is well known, but if we’ve lost most miners, that may no longer be viable.

Considering Hybrid Multi-Step approach (vs Single-Step):

Note: TFL is one instance of a multi‑step approach.

Pros

We can hopefully be less disruptive across the ecosystem so that there are fewer snags and disruptions with each step.
If there is a show-stopping flaw in any step, the fall‑back possibility seems more plausible. For example, if there is a show-stopper when transitioning from PoW to PoW+TFL, falling back to pure PoW seems more feasible, since both protocols rely on mainnet PoW infrastructure, so those participants will be present in either case.
Retaining miners during a hybrid phase: while it is true that a hybrid protocol will lower miner revenue (since we aim to maintain the issuance schedule constraints), there is also more possibility and likelihood of keeping some of these users engaged. For example, they may begin participating in staking services (either as delegators or as infrastructure operators). If that is successful, then they’re also more likely to remain engaged in the subsequent transition to pure PoS.
This general approach was demonstrated successfully by Ethereum, which is the largest or second largest cryptocurrency network for several important metrics (e.g. market cap, fees paid, user and developer activity, …). So we know this can be done well without disruptions.

Cons

The intermediate hybrid step will be a more novel and less well understood protocol. (It will necessarily be fairly different from Ethereum’s Beacon chain era.)
Consensus nodes will be more complex, involving logic for both sub‑protocols as well as their integration. (Ideally this complexity can be modularized so that the nodes are easier to maintain and improve.)
This may be more complicated to explain to current and potential new users. Something like “Zcash launched as PoW, and on $DATE (block height X) it will transition to a hybrid system, then later to a pure PoS system.”
The available issuance must be shared between mining and staking rewards during the hybrid stage. The security of the PoW layer and of the PoS layer during this stage is partially dependent on the funds allocated to issuance for each protocol, and it is not yet clear to what extent splitting rewards would affect overall security.

Footnotes

See Wikipedia’s Technology Tree - History section for details.

Trailing Finality Layer in a Nutshell

The hybrid PoW/PoS consensus protocol proposed in this book is similar to today’s Zcash NU5/NU6 consensus protocol with the addition of a Trailing Finality Layer:

TODO: Add network topology / software subcomponent diagram #124

The Zcash Trailing Finality Layer refers to a new subprotocol of a new hybrid PoW/PoS protocol, which we refer to as PoW+TFL. This subprotocol introduces assured finality for the Zcash block chain, ensuring that final blocks (and the transactions within them) may never be rolled back.

We use the term “layer” because we can understand this design as introducing a new layer to the Zcash network, making only minimal changes to the existing network and consensus protocol. This modular separation is present in the consensus rules, the network protocol, and the code architecture.

Why Should Users Care?

There are three categories of users this proposed TFL protocol would impact:

Current ZEC Users

Existing ZEC users who are primarily concerned with storing or receiving ZEC, whether private or transparent may benefit from this change in the short or medium term, because it may help lower delay times for some services, such as exchange deposits. As exchanges come to rely on the new finality guarantee, they can often reduce their deposit wait times. Other services with similar confirmation-depth-based wait times can be improved in a similar way to lower these wait times. Other than this improvement, these users should notice no other changes.

In the longer term, providing finality will be useful in establishing trust‑minimized bridges to other block‑chain protocols. We anticipate this can enable better connecting ZEC to the Defi ecosystem, and with the introduction of Zcash Shielded Assets, this can enable other assets to connect to the Zcash shielded pool.

Proof-of-Stake Users

Users who are interested in providing finality infrastructure, or users who want to delegate ZEC towards finality, will be able to earn rewards from the protocol for doing so, while also taking on some risk to their funds (to prevent malicious abuse of the protocol). This may be an important new category of ZEC users and use cases.

Miners

Miners who provide Proof‑of‑Work security will necessarily see some reduction in their block rewards, since this proposal maintains the same issuance schedule and supply cap of ZEC while also spending some rewards on finality.

Important Note: The proportion and details of how much mining rewards will be impacted, and conversely how much finality/PoS providers will earn, are not yet specified in this proposal.

Why is this a Good Approach to a PoS Transition?

This design is appealing as a safer first step in transitioning the Zcash protocol for multiple reasons:

It Enables Proof-of-Stake Mechanisms Conservatively

This transition would enable PoS mechanisms, including the ability to operate PoS infratructure and delegate ZEC towards those providers to earn protocol rewards. While all PoS transitions would accomplish this, this approach does so in a conservative manner: it introduces these mechanics while striving to minimize the impact on existing use cases and protocol security.

In a sense, we can think of this approach as enabling the Zcash community to “dip our toes in the PoS waters” rather than diving in. If the results pan out well, it gives us confidence for further transitions. If we discover challenges, flaws, or risks, we anticipate their impact will be more limited since this is a more cautious transition step.

Minimal Use-Case Disruption

In many cases, existing products, services, and tools can continue using the mainnet chain with no changes to code assuming they rely on existing consensus nodes. We view this as a major benefit which allows Zcash’s existing user network effect to continue safely unperturbed.

There will be certain narrow exceptional areas if those products, services, or tools need to be precise in areas where the protocol has changed, such as mining/staking reward calculations, transaction formats (in particular any new PoS-related fields or logic), or chain rollback logic.

Modular Design

By conceptualizing the TFL as a distinct “layer” or subprotocol, the consensus rules can be described in terms of two consensus subprotocols, one embodying most of the current consensus logic of Zcash and another the TFL. These protocols interact through a hybrid construction. See Design at a Glance to learn more about these distinct subprotocols.

Reasoning about the whole protocol can leverage analysis and understanding of each subprotocol and the hybrid construction somewhat independently due to this modular design. Note that although this design is modular, the hybrid construction may require modifications to the [PoW] and/or [PoS] subprotocols to protect safety and liveness properties. Nevertheless, the modularity still improves analysis and reasoning compared to a monolithic design.

Finally, since one subprotocol is very similar to the existing Zcash NU5/NU6 consensus protocol, this lessens risk that the consensus properties within that subprotocol compromise current NU5/NU6 properties.

Modular Implementation

In addition to the other benefits of protocol design modularity, we anticipate actual implementations can realize this modularity in code. This can help makes implementations more robust, easier to maintain, and more interoperable.

For example, we can envision a standardized interface between PoW & TFL consensus components, enabling different development teams to provide these different components and for “full node” packagers to mix and match them. This is somewhat reminiscent of Ethereum’s execution/consensus layer separation which we believe has shown great success in implementation team and product diversity.

Cracking the Nutshell

In the rest of the introductory section of this book, we describe the status and next steps for the TFL proposal, provide a motivation for finality, and suggestions for getting involved.

Status and Next Steps

This is an early and incomplete protocol design proposal. It has not been well vetted for feasibility and safety. It has not had broad review from the Zcash community, so its status on any Zcash roadmap is undetermined.

Current Components

This Book

This book is intended to become both a high-level overview and introduction to TFL, and a full specification.

Crosslink

The current heart of the design work is an in-progress hybrid consensus protocol construction called Crosslink (definition). This is defined in the Crosslink chapter.

`simtfl`

We've begun creating a simulator called simtfl which we will use to model security and abstract performance concerns. Its development is tracked at https://github.com/zcash/simtfl.

Major Missing Components

PoS subprotocol selection,
Issuance and supply mechanics, such as how much ZEC stakers may earn,
Integrated Zcash transaction semantics,
A transition plan from current Zcash mainnet to this protocol design,
ZIPs specifying the above to the level of specificity required by ZIPs,
Security and safety analyses,
Economic analyses.

This list may be incomplete, and as the design matures the need for major new components may be revealed.

Next Steps

This design proposal is being developed by Electric Coin Company and Shielded Labs as the first major milestone in our focus of deploying Proof-of-Stake to the Zcash protocol. Our rough near-term plan for this proposal is as follows:

Complete the Crosslink description.
Complete core security arguments for Crosslink.
Define the Major Missing Components above, including considerations such as issuance mechanics and Proof-of-Stake mechanisms.
Complete auxillary security arguments and analyses, such as specific attack scenarios, game-theoretic security, and so forth.
Mature simtfl to analyze all cases of interest.
Follow the general Zcash process for proposal/review/refinement, including proposing one or more ZIPs.
Follow the general Zcash governance process for proposal review and refinement.
If accepted, productionize the proposal in ECC products and collaborate with other implementors who implement the proposal.
Celebrate when and if the proposal is activated on Mainnet. 🎉

The fine-grained day-to-day goals and tasks for this project are present in the Zcash Developers Hub in the TFL-focused DAG.

Please also see Get Involved if you are interested in tracking this progress more closely, or in contributing.

Motivating Finality

In Zcash currently, consensus relies solely on PoW, which only provides probabilistic finality, rather than assured finality.¹ This style of consensus does not offer a guarantee that any given block may not be rolled back which may invalidate the transactions it contains. Instead, the probability that a block may be rolled back decreases as more blocks are mined subsquently on top of it.²

Let’s walk through an example of how Zcash’s current PoW with probabilistic finality can impede important use cases. Consider a PoW node which sees this block sequence at time T=0:

When should a user, wallet, or other system choose to act based on a transaction in a block?

For this example, let’s assume a bridging system may have received a deposit for ZEC in block f and issued a corresponding number of proxy tokens on a different network.

At a later time, T=1, this same node may see a longer PoW chain which invalidates some previously seen blocks:

The node has observed a longer chain ending at block h', so PoW consensus calls for that new sequence to be treated as consensus. The previously seen blocks f and g are no longer part of the consensus history, and have been rolled back.

Impact of the Rollback

In our example, the bridging system acted in response to a transaction in the original block f at T=0. If the new sequence ending at h' no longer contains the deposit to the bridging system, the integrity of the bridge has been violated³; the associated proxy tokens may have already been used in a complex chain of Defi applications or deposited onto an exchange and sold, which would make any recovery impossible. The proxy tokens on the other network no longer correspond to the correct amount of ZEC on the Zcash network.

Rollback Complications

This example demonstrates how a lack of assured finality can impede many useful real-world scenarios. In practice, systems and services which need greater assurances wait for more block confirmations.

This has several drawbacks:

It doesn’t remove the vulnerability, it only reduces the likelihood.
Different applications/services may require different block depths, making it difficult to compose or chain together different applications/services.
Different block depth policies potentially confuse users, i.e. “why do I have to wait one hour for my deposit in this exchange, but only 30 minutes on that exchange?”.
It introduces a long delay which inhibits many useful applications.

In addition to these user-facing and economic drawbacks, correctly handling rollbacks makes the code for nodes, wallets, and other infrastructure more complex. Worse still, many systems may not have correct behavior for rollbacks at different depths, and since large rollbacks are rarer, these implementation flaws may not surface until there is a large rollback. While a large rollback would be disruptive all by itself, it becomes even worse when previously undiscovered bugs exacerbate the situation.

Trailing Finality Benefits

Trailing finality extends the existing PoW consensus so that older blocks become final, assuring they cannot be rolled back, and by extension neither can any of the transactions they contain.

This directly addresses the first two flaws above: it completely removes the vulnerability, and it ensures all systems that need finality behave consistently with each other.

As for delay, trailing finality also introduces delay since final blocks “trail behind” the most recent PoW blocks. This can be an improvement for some applications, depending on their latency requirements. For example, if the delay to finality averages around 10 minutes, then this would enable an improvement for an exchange that requires 60 minutes of PoW blocks for a deposit. On the other hand, it would not be an improvement for an application that needs finality faster than 10 minutes.

Finally, implementations can be simplified by relying on the guarantee of finality. For example, a wallet can describe any transaction as pending or final, and does not need to provide difficult and potentially confusing UX (and the supporting database sophistication) for handling rollbacks.

Footnotes

Throughout this book, when we say finality or final without other qualifiers, we specifically are referring to assured finality or an assured final block. Where we call out probabalistic finality we always use that qualifier.

The estimated probability of a rollback relies on a variety of PoW security assumptions, and can be violated in various conditions, such as in mining efficiency breakthroughs, compromises of the PoW challenge algorithm (e.g. hash function collision resistance failure), difficulty-adjustment-algorithm failures, sudden/surprise mining capacity increases, and so on. So the estimated probability can be violated in potential “black swan” events.

This discussion simplifies consideration of transaction rollback vs block rollback. When a block is rolled back, it is possible for some of the transactions contained in it to appear in new canonical blocks. The conditions when this can occur vs when it cannot are multifaceted and also subject to malicious influence, so for simplicity we assume all transactions within a rolled-back block are also rolled back.

Get Involved

We welcome contributions!

There are a variety of ways to contribute to this project:

GitHub

If you have a GitHub account, you can get hands-on via the GitHub repository for this book, including:

Ask a Question - all questions welcome from basics to in-depth.
Suggest an Improvement to the content: anything from typo fixes to major design change proposals.
Report Rendering / Infrastructure Issues in case you're having trouble reading the content, viewing diagrams, rendering on your own computer, etc…

Zcash Forum

The Zcash Forum is a hangout for many Zcash enthusiasts. This is a good spot for more open-ended discussion about this design proposal, alternatives, and other developments in Zcash.

Zcash R&D Discord

You can catch us on the Zcash R&D Discord in the #proof-of-stake channel.

Zcash Arborist Calls

The Zcash Arborist Calls are bi-weekly Zcash protocol development calls, where proposals like this are discussed. Feel free to come lurk, ask questions, or provide feedback or suggestions.

Terminology

This book relies on the following terminology. We strive to keep these definitions as precise as possible and their definitions may deviate from uses elsewhere.

Definitions are sorted alphabetically.

Terms

Assured Finality: A protocol property that assures that transactions cannot be reverted by that protocol. As with all protocol guarantees, a protocol assumes certain conditions must be met. A transaction may either be final or not: transactions which are not final may not become final, whereas once transactions do achieve finality they retain that property indefinitely (so long as protocol requirements are met).

Importantly, it is not feasible for any protocol to prevent reversing final transactions "out of band" from the protocol, such as if a sufficiently large and motivated group of users forks the network to include a specific new validity rule reverting transactions. In some cases this might be desirable, for example to mitigate exploitation of a security flaw. We are investigating the implications for governance and how to incorporate such situations into our security model. In any case, for this reason we eschew the term "absolute finality" sometimes used in technical discussions about consensus protocols.

Consensus Subprotocols: The PoW and PoS subprotocols in PoW+TFL or other hybrid protocols.

Crosslink: A hybrid construction consensus protocol striving to implement the TFL design goals. See Status and Next Steps: Current Components for current status.

Final: A protocol property of transactions. In this book, this always implies assured finality, in contrast to concepts like "probabilistic finality" provided by PoW.

Hybrid Consensus: A consensus protocol that integrates more than one consensus subprotocol. PoW+TFL is an instance of a hybrid protocol integrating PoW and PoS protocols.

Hybrid Construction: The design component of a hybrid consensus which specifies how to integrate subprotocols and what modifications, if any, those subprotocols need to be safely integrated. Examples include Crosslink and Snap-and-Chat.

Liveness: The property of a distributed protocol which ensures that the protocol may progress provided liveness requirements are met. TODO: Fix the definition of Liveness #120

NU5/NU6: The Zcash consensus protocol as of NU5 or NU6 (which do not differ significantly in terms of the base block chain layer).¹

Objective Validity: A validity property of a protocol history (such as a ledger) which can be computed purely from that history with no other context. Objective validity is needed to define consensus rules that will lead to the same protocol state being eventually agreed on by all nodes.

Proof-of-Stake: A PoS protocol achieves consensus on transaction status by taking into account the weighting of staking tokens. PoS protocols exist under a large umbrella and may or may not provide assured finality or other properties this design requires of TFL.

Proof‑of‑Work: A PoW protocol uses Nakamoto consensus pioneered by Bitcoin. The PoW subprotocol within PoW+TFL is a different consensus protocol from NU5/NU6 and encompasses more than narrow Nakamoto PoW consensus, including transaction semantics such as for shielded transfers.

PoW+TFL: the overall complete, integrated consensus protocol specified in this book.

Safety: The property of a distributed protocol that guarantees a participant may safely rely on a consistent local state, provided safety requirements are met. TODO: Provide a rigorous definition of Safety #121

simtfl: a protocol simulator for analyzing TFL security and abstract performance. Development lives at https://github.com/zcash/simtfl. See Status and Next Steps: Current Components for current status.

Snap-and-Chat: A hybrid construction consensus protocol introduced in Ebb-and-Flow Protocols.

TFL: The Trailing Finality Layer subprotocol within PoW+TFL. This is a new PoS subprotocol which provides assured finality for Zcash.

Trailing Finality: A protocol property wherein transactions become final some time after first appearing in PoW blocks.

ZIP: a Zcash Improvement Proposal is the protocol development process the Zcash community uses to safely define potential protocol improvements. See https://zips.z.cash.

TODO: Clarify the distinctions between pure PoW, the PoW subprotocol, NU6, and fork-choice vs all of transaction semantics. #119

Footnotes

If new consensus changes are deployed to Zcash mainnet prior to PoW+TFL design finalization, this design must be updated to refer to the new delta (e.g. by reanalyzing all changes against NU6 or NU7, etc…)

Design Overview

This design augments the existing Zcash Proof‑of‑Work (PoW) network with a new consensus layer which provides trailing finality, called the Trailing Finality Layer (TFL).

This layer enables blocks produced via PoW to become final which ensures they may never be rolled back. This enables safer and simpler wallets and other infrastructure, and aids trust-minimized cross-chain bridges.

This consensus layer uses a finalizing Proof-of-Stake (PoS) consensus protocol, and enables ZEC holders to earn protocol rewards for contributing to the security of the Zcash network. By integrating a PoS layer with the current PoW Zcash protocol, this design specifies a hybrid consensus protocol.

The integration of the current PoW consensus with the TFL produces a new top-level consensus protocol referred to as PoW+TFL.

In the following subchapters we introduce the Design at a Glance, then provide an overview of the major components of the design.

Following this overview chapter, we proceed into a detailed Protocol Specification (TODO).

Design at a Glance

The PoW+TFL consensus protocol is logically an extension of the Zcash consensus rules to introduce trailing finality. This is achieved by compartmentalizing the top-level PoW+TFL protocol into two consensus subprotocols, one embodying most of the current consensus logic of Zcash and another the TFL. These protocols interact through a hybrid construction, which specifies how the protocols interact, and what changes from "off-the-shelf" behavior, if any, need to be imposed on the subprotocols. Each of these components (the two subprotocols and the hybrid construction) are somewhat modular: different subprotocols or hybrid constructions may be combined (with some modification) to produce a candidate PoW+TFL protocol.

TODO: Add a protocol component diagram to "Design at a Glance" #122

Hybrid Construction

The hybrid construction is a major design component of the full consensus protocol which specifies how the subprotocols integrate. So far we have considered three candidates:

The implied/loosely defined hybrid construction presented at Zcon4.
The Snap-and-Chat from the Ebb-and-Flow paper.
The Crosslink construction.

We believe Crosslink is the best candidate, due to its more rigorous specification and security analysis, and due to the issues with Snap-and-Chat described in Notes on Snap-and-Chat.

Subprotocols

The PoW+TFL hybrid consensus consists of two interacting subprotocols:

PoW Subprotocol: this subprotocol is very similar to NU5 or NU6 consensus. It is a design goal of the TFL design to minimize changes to this subprotocol. Note: the shorthand "PoW" is potentially misleading, because this subprotocol is also responsible for the bulk of all supply and transaction semantic consensus rules.
PoS Subprotocol: this is a new subprotocol which provides trailing finality via a finalizing PoS protocol.

TODO: Clarify the distinctions between pure PoW, the PoW subprotocol, NU6, and fork-choice vs all of transaction semantics. #119

Note that the hybrid construction may require modification to the "off-the-shelf" versions of these subprotocols. In particular Crosslink requires each protocol to refer to the state of the other to provide objective validity.

Design Goals

Here we strive to lay out our high level TFL design goals.

Goals, Design, and Trade-offs

Here we lay out ideal goals. As we develop a complete design, we are likely to inevitably encounter trade-offs some of which may preclude achieving the full idealized goals. Wherever possible, we motivate design decisions by these goals, and when goals are impacted by trade-offs we describe that impact and the rationale for the trade-off decision.

For example, one ideal user experience goal below is to avoid disruption to existing wallets. However, the Crosslink construction may require wallets to alter their context of valid transactions differently from the current NU5/NU6 consensus protocol.

User Experience and Use Case Goals

We strive to start our protocol design process from user experience (UX) and use case considerations foremost, since at the end of the day all that matters in a protocol is what user needs it meets and how well.

All currently supported wallet user experience should continue to operate seamlessly without change during or after protocol transitions. This covers the use of addresses, payment flow, transfers, ZEC supply cap and issuance rate, backup/restore, and other features users currently rely on.
There must be no security or safety degradation due to wallet user behavior introduced by PoS transitions, assuming users follow their current behaviors unchanged and continue to use the same cognitive model of the impacts of their behaviors. This goal encompasses all of security and safety, including privacy and transparency or more explicit disclosures.
The protocol should enable users of shielded mobile wallets to delegate ZEC to PoS consensus providers and earn a return on that ZEC coming via ZEC issuance or fees. Doing this may expose users to a risk of loss of delegated ZEC (such as through “slashing fees”). The protocol must guarantee that PoS consensus providers have no discretionary control over such delegated funds (including that they cannot steal those funds).
For any hybrid PoW/PoS protocol (including the PoW+TFL protocol we’re proposing), the process and UX of mining remains unchanged except that the return on investment may be altered. This is true both of consensus-level block miners (i.e. mining pools and solo miners) and mining pool participants.
Any hybrid PoW/PoS protocol (including PoW+TFL) block explorers will continue to function with the same UX through transitions as far as displaying information about transactions, the mempool, and blocks.
Block explorers and other network metrics sites may require UX changes with respect to mining rewards and issuance calculations.
Network metrics sites may require UX changes with respect to the p2p protocol or other network-specific information.
Users can rely on assured finality with an expected time‑to‑finality below 30 minutes.¹

Developer Experience Goals

For a full PoS transition, ecosystem developers for products such as consensus nodes, wallets, mining services, chain analytics, and more will certainly need to update their code to support transitions. However, we carve out a few goals as an exception to this for this category of users:

Wallet developers should not be required to make any changes through protocol transitions as long as they rely solely on the lightwalletd protocol or a full node API (such as the zcashd RPC interface).
For any hybrid PoW/PoS protocol (including PoW+TFL), mining pools and miners should not be required to make any software or protocol changes as long as they rely on zcashd-compatible GetBlockTemplate. One exception to this is software that bakes in assumptions about the block reward schedule, rather than relying on GetBlockTemplate solely.

Safety, Security, and Privacy Goals

Zcash has always had exemplary safety, security, and privacy, and we aim to continue that tradition:

For any hybrid PoW/PoS protocol (including PoW+TFL), the cost‑of‑attack for a 1-hour rollback should not be reduced, given a “reasonably rigorous” security argument.
For any hybrid PoW/PoS protocol (including PoW+TFL), the cost‑of‑attack to halt the chain should be larger than the 24‑hour revenue of PoW mining rewards, given a “reasonably rigorous” security argument.

TODO: Define privacy goals of TFL #118

TODO: Define PoS Subprotocol desiderata which are distinct from Crosslink integration #117

Design Conservatism Goals

We want to follow some conservative design heuristics to minimize risk and mistakes:

Rely as much as possible on design components that are already proven in production environments.
Rely as much as possible on design components with adequate theoretical underpinnings and security analyses.
Minimize changes or variations on the above: strive to only alter existing work when necessary for overall design goals. For example, Zcash’s privacy or issuance constraints are likely less common among existing PoS designs.

Non-goals

These are not goals of the TFL design, either to simplify the scope of the initial design (a.k.a. Out-of-Scope Goals), or because we believe some potential goal should not be supported (a.k.a. Anti-goals).

Out-of-Scope Goals

While these desiderata may be common across the block‑chain consensus design space, they are not specific goals for the initial TFL design. Note that these may be goals for future protocol improvements.

Prioritizing minimal time-to-finality over other considerations (such as protocol simplicity, impact on existing use cases, or other goals above).
In-protocol liquid staking derivatives.
Maximizing the PoS staked-voter count ceiling. For example, Tendermint BFT has a relatively low ceiling of ~hundreds of staked voters, whereas Ethereum’s Gasper supports hundreds of thousands of staked voters.
Reducing energy usage. While this would presumably be a goal of a pure PoS transition, it likely cannot be achieved for hybrid PoW/PoS without loss of security.

Anti-Goals

Distinctly from Out-of-Scope Goals we track "anti-goals" which are potential goals that we explicitly reject, which are potential goals we aim to not support even in future protocol improvements.

We currently have no defined anti-goals.

Footnotes

This requirement comes from a request from a DEX developer. While we have not yet surveyed DEX and Bridge designs, we're relying on this as a good starting point.

Crosslink

Crosslink is the proposed hybrid construction for the Trailing Finality Layer. The current version is Crosslink 2.

The Arguments for Bounded Availability and Finality Overrides

This document considers disadvantages of allowing transactions to continue to be included at the chain tip while the gap from the last finalized block becomes unbounded, and our perspective on what should be done instead. This condition is allowed by Ebb‑and‑Flow protocols [NTT2020].

We also argue that it is necessary to allow for the possibility of overriding finalization in order to respond to certain attacks, and that this should be explicitly modelled and subject to a well-defined governance process.

This is a rewritten version of this forum post, adapting the main argument to take into account the discussion of “tail‑thrashing attacks” and finalization availability from the Addendum. More details of how bounded availability could be implemented in the context of a Snap‑and‑Chat protocol are in Notes on Snap‑and‑Chat.

The proposed changes end up being significant enough to give our construction a new name: “Crosslink”, referring to the cross-links between blocks of the BFT and best-chain protocols. Crosslink has evolved somewhat, and now includes other changes not covered in either this document or Notes on Snap‑and‑Chat. The current version is called Crosslink 2.

Background

“Ebb‑and‑Flow”, as described in [NTT2020] (arXiv version), is a security model for consensus protocols that provide two transaction logs, one with dynamic availability, and a prefix of it with finality.

The paper proposes an instantiation of this security model called a “Snap‑and‑Chat” construction. It composes two consensus subprotocols, a BFT subprotocol and a best-chain subprotocol (it calls this the “longest chain protocol”). The above logs are obtained from the output of these subprotocols in a non-trivial way.

This is claimed by the paper to “resolve” the tension between finality and dynamic availability. However, a necessary consequence is that in a situation where the “final” log stalls and the “available” log does not, the “finalization gap” between the finalization point and the chain tip can grow without bound. In particular, this means that transactions that spend funds can remain unfinalized for an arbitrary length of time.

In this document, we argue that this is unacceptable, and that it is preferable to sacrifice strict dynamic availability. The main idea behind Ebb‑and‑Flow protocols is a good one, and allowing the chain tip to run ahead of the finalization point does make sense and has practical advantages. However, we also argue that it should not be possible to include transactions that spend funds in blocks that are too far ahead of the finalization point.

Note

Naive ways of preventing an unbounded finalization gap, such as stopping the chain completely in the case of a finalization stall, turn out to run into serious security problems — at least when the best-chain protocol uses Proof‑of‑Work. We’ll discuss those in detail in the section on tail‑thrashing attacks.

Our proposed solution will be to require coinbase-only blocks during a long finalization stall. This solution has the advantage of not complicating the security analysis.

We argue that losing strict dynamic availability in favour of “bounded availability” is preferable to the consequences of the unbounded finality gap, if/when a “long finalization stall” occurs.

We also argue that it is beneficial to explicitly allow “finality overrides” under the control of a well-documented governance process. Such overrides allow long rollbacks that may be necessary in the case of an exploited security flaw. This is complementary to the argument for bounded availability, because the latter limits the period of user transactions that could be affected. The governance process can impose a limit on the length of this long rollback if desired.

Finality + Dynamic availability + Possibility of Partition implies an unbounded finalization gap

Since partition between nodes sufficient for finalization cannot be prevented, loosely speaking the CAP theorem implies that any consistent protocol (and therefore any protocol with finality) may stall for at least as long as the partition takes to heal.

Background

That “loosely speaking” is made precise by [LR2020].

Dynamic availability implies that the chain tip will continue to advance, and so the finalization gap increases without bound.

Partition is not necessarily the only condition that could cause a finalization stall, it is just the one that most easily proves that this conclusion is impossible to avoid.

Problems with allowing spends in an unbounded finalization gap

Both the available protocol, and the subprotocol that provides finality, will be used in practice — otherwise, one or both of them might as well not exist. There is always a risk that blocks may be rolled back to the finalization point, by definition.

Suppose, then, that there is a long finalization stall. The final and available protocols are not separate: there is no duplication of tokens between protocols, but the rules about how to determine best-effort balance and guaranteed balance depend on both protocols, how they are composed, and how the history after the finalization point is interpreted.

Discussion

The guaranteed minimum balance of a given party is not just the minimum of their balance at the finalization point and their balance at the current tip. It is the minimum balance taken over all possible transaction histories that extend the finalized chain — taking into account that a party’s previously published transactions might be able to be reapplied in a different context without its explicit consent. The extent to which published transactions can be reapplied depends on technical choices that we must make, subject to some constraints (for example, we know that shielded transactions cannot be reapplied after their anchors have been invalidated). It may be desirable to further constrain re-use in order to make guaranteed minimum balances easier to compute.

As the finalization gap increases, the negative consequences of rolling back user transactions that spend funds increase. (Coinbase transactions do not spend funds; they are a special case that we will discuss later.)

There are several possible —not mutually exclusive— outcomes:

Users of the currency start to consider the available protocol increasingly unreliable.
Users start to consider a rollback to be untenable, and lobby to prevent it or cry foul if it occurs.
Users start to consider finalization increasingly irrelevant. Services that depend on finality become unavailable.
- There is no free lunch that would allow us to avoid availability problems for services that also depend on finality.
Service providers adopt temporary workarounds that may not have had adequate security analysis.

Any of these might precipitate a crisis of confidence, and there are reasons to think this effect might be worse than if the chain had switched to a “Stalled Mode” designed to prevent loss of user funds. Any such crisis may have a negative effect on token prices and long-term adoption.

Note that adding finalization using an Ebb‑and‑Flow protocol does not by itself increase the probability of a rollback in the available chain, provided the PoW remains as secure against rollbacks of a given length as before. But that is a big proviso. We have a design constraint (motivated by limiting token devaluation and by governance issues) to limit issuance to be no greater than that of the original Zcash protocol up to a given height. Since some of the issuance is likely needed to reward staking, the amount of money available for mining rewards is reduced, which may reduce overall hash rate and security of the PoW. Independently, there may be a temptation for design decisions to rely on finality in a way that reduces security of PoW (“risk compensation”). There is also pressure to reduce the energy usage of PoW, which necessarily reduces the global hash rate, and therefore the cost of performing an attack that depends on the adversary having any given proportion of global hash rate.

It could be argued that the issue of availability of services that depend on finality is mainly one of avoiding over-claiming about what is possible. Nevertheless there are also real usability issues if balances as seen by those services can differ significantly and for long periods from balances at the chain tip.

Regardless, incorrect assumptions about the extent to which the finalized and available states can differ are likely to be exposed if there is a finalization stall. And those who made the assumptions may (quite reasonably!) not accept “everything is fine, those assumptions were always wrong” as a satisfactory response.

What is Bounded Availability?

An intuitive notion of “availability” for block‑chain protocols includes the ability to use the protocol as normal to spend funds. So, just to be clear, in a situation where that cannot happen we have lost availability, even if the block chain is advancing.

Bounded availability is a weakening of dynamic availability [DKT2020]. It means that we intentionally sacrifice availability when some potentially hazardous operation —a “hazard” for short— would occur too far after the current finalization point. For now, assume for simplicity that our only hazard is spending funds. More generally, the notion of bounded availability can be applied to a wider range of protocols by tailoring the definition of “hazard” to the protocol.

Background

This talk by Soubhik Deb accompanying [DKT2020] provides a good explanation of the advantages of dynamic availability. We do not define bounded availability formally in this document, but informally, we aim to preserve the ability to securely adapt to large changes in hash rate or total stake.

Terminology note

[NTT2020] calls the dynamically available block‑chain protocol $Π_{lc}$ that provides input to the rest of the contruction, the “longest chain” protocol. There are two reasons to avoid this terminology:

In Bitcoin, Zcash, and most other PoW-based protocols, what is actually used by each node is not its longest observed chain, but its observed consensus-valid chain with most accumulated work. In Zcash this is called the node’s “best valid block chain”, which we shorten to “best chain”.
As footnote 2 on page 3 of [NTT2020] says, that paper does not require $Π_{lc}$ to be a “longest chain” protocol anyway.

Historical note

The error in conflating the “longest chain” with the observed consensus-valid chain with most accumulated work, originates in the Bitcoin whitepaper. [Nakamoto2008, page 3]

We will use the term “best‑chain protocol” instead. Note that this corresponds roughly to $Π_{lc}$ in the Snap‑and‑Chat construction, although the Crosslink 2 protocol that we propose will end up having other significant differences from Snap-and-Chat.

How to block hazards

We have not yet decided how to block hazards during a long finalization stall. We could do so directly, or by stopping block production in the more-available protocol. For reasons explained in the section on tail‑thrashing attacks below, it’s desirable not to stop block production. And so it’s consistent to have bounded availability together with another liveness property —which can be defined similarly to dynamic availability— that says the more-available protocol’s chain is still advancing. This is what we will aim for.

We will call this method of blocking hazards, without stopping block production, “going into Stalled Mode”.

Historical note

This concept of Stalled Mode is very similar to a feature that was discussed early in the development of Zcash, but never fully designed or implemented. (After originally being called “Stalled Mode”, it was at some point renamed to “Emergency Mode”, but then the latter term was used for something else.)

For Zcash, we propose that the main restriction of Stalled Mode should be to require coinbase-only blocks. This achieves a similar effect, for our purposes, as actually stalling the more-available protocol’s chain. Since funds cannot be spent in coinbase-only blocks, the vast majority of attacks that we are worried about would not be exploitable in this state.

Info

It is possible that a security flaw could affect coinbase transactions. We might want to turn off shielded coinbase for Stalled Mode blocks in order to reduce the chance of that.

Also, mining rewards cannot be spent in a coinbase-only block; in particular, mining pools cannot distribute rewards. So there is a risk that an unscrupulous mining pool might try to do a rug-pull after mining of non-coinbase-only blocks resumes, if there were a very long finalization stall. But this approach works at least in the short term, and probably for long enough to allow manual intervention into the finalization protocol, or governance processes if needed.

An analogy for the effect of this on availability that may be familiar to many people, is that it works like video streaming. All video streaming services use a buffer to paper over short-term interruptions or slow-downs of network access. In most cases, this buffer is bounded. This allows the video to be watched uninterrupted and at a constant rate in most circumstances. But if there is a longer-term network failure or insufficient sustained bandwidth, the playback will unavoidably stall. In our case, block production does not literally stall, but it’s the same as far as users’ ability to perform “hazardous” operations is concerned.

Why is this better?

So, why do we advocate this over:

A protocol that only provides dynamic availability;
A protocol that only provides finality;
An unmodified Ebb‑and‑Flow protocol?

The reason to reject option 1 is straightforward: finality is a valuable security property that is necessary for some use cases.

If a protocol only provides finality (option 2), then short-term availability is directly tied to finalization. It may be possible to make finalization stalls sufficiently rare or short-lived that this is tolerable. But that is more likely to be possible if and when there is a well-established staking ecosystem. Before that ecosystem is established, the protocol may be particularly vulnerable to stalls. Furthermore, it’s difficult to get to such a protocol from a pure PoW system like current Zcash.

We argued in the previous section that allowing hazards in an unbounded finalization gap is bad. Option 3 entails an unbounded finalization gap that will allow hazards. However, that isn’t sufficient to argue that bounded availability is better. Perhaps there are no good solutions! What are we gaining from a bounded availability approach that would justify the complexity of a hybrid protocol without obtaining strict dynamic availability?

The argument goes like this:

It is likely that a high proportion of the situations in which a sustained finalization stall happens will require human intervention. If the finality protocol were going to recover without intervention, there is no reason to think that it wouldn’t do so in a relatively short time.
When human intervention is required, the fact that the chain tip is still proceeding apace (in protocols with strict dynamic availability) makes restarting the finality protocol harder, for many potential causes of a finalization stall. This may be less difficult when only “non-hazardous” transactions are present— in particular, when only coinbase transactions (which are subject to fairly strict rules in Zcash and other Bitcoin-derived chains) are present. This argument carries even more force when the protocol also allows “finality overrides”, as discussed later in the Complementarity section.
Nothing about bounded availability prevents us from working hard to design a system that makes finalization stalls as infrequent and short-lived as possible, just as we would for any other option that provides finality.
We want to optimistically minimize the finalization gap under good conditions, because this improves the usability of services that depend on finality. This argues against protocols that try to maintain a fixed gap, and motivates letting the gap vary.
In practice, the likelihood of short finalization stalls is high enough that heuristically retaining availability in those situations is useful.

The argument that it is difficult to completely prevent finalization stalls is supported by experience on Ethereum in May 2023, when there were two stalls within 24 hours, one for about 25 minutes and one for about 64 minutes. This experience is consistent with our argument:

Neither stall required short-term human intervention, and the network did in fact recover from them quickly.
The stalls were caused by a resource exhaustion problem in the Prysm consensus client when handling attestations. It’s plausible to think that if this bug had been more serious, or possibly if Prysm clients had made up more of the network, then it would have required a hotfix release (and/or a significant proportion of nodes switching to another client) in order to resolve the stall. So this lines up with the hypothesis that longer stalls are likely to require manual intervention.
A bounded availability protocol would very likely have resulted in either a shorter or no interruption in availability. If, say, the availability bound were set to be roughly an hour, then the first finalization stall would have been “papered over” and the second would have resulted in only a short loss of availability.

Retaining short-term availability does not result in a risk compensation hazard:

A finalization stall is still very visible, and directly affects applications relying on finality.
Precisely because of the availability bound, it is obvious that it could affect all applications if it lasted long enough.

A potential philosophical objection to lack of strict dynamic availability is that it creates a centralization risk to availability. That is, it becomes more likely that a coalition of validators can deliberately cause a denial of service. This objection may be more prevalent among people who would object to adding a finality layer or PoS at all.

Finality Overrides

Consensus protocols sometimes fail. Potential causes of failure include:

A design problem with the finality layer that causes a stall, or allows a stall to be provoked.
A balance violation or spend authorization flaw that is being exploited or is sufficiently likely to be exploited.
An implementation bug in a widely used node implementation that causes many nodes to diverge from consensus.

In these situations, overriding finality may be better than any other alternative.

An example is a balance violation flaw due to a 64-bit integer overflow that was exploited on Bitcoin mainnet on 15th August 2010. The response was to roll back the chain to before the exploitation, which is widely considered to have been the right decision. The time between the exploit (at block height 74638) and the forked chain overtaking the exploited chain (at block height 74691) was 53 blocks, or around 9 hours.

Of course, Bitcoin used and still uses a pure‑PoW consensus. But the applicability of the example does not depend on that: the flaw was independent of the consensus mechanism.

Another example of a situation that prompted this kind of override was the DAO recursion exploit on the Ethereum main chain in June 2016. The response to this was the forced balance adjustment hard fork on 20th July 2016 commonly known as the DAO fork. Although this adjustment was not implemented as a rollback, and although Ethereum was using PoW at the time and did not make any formal finality guarantees, it did override transfers that would heuristically have been considered final at the fork height. Again, this flaw was independent of the consensus mechanism.

The DAO fork was of course much more controversial than the Bitcoin fork, and a substantial minority of mining nodes split off to form Ethereum Classic. In any case, the point of this example is that it’s always possible to override finality in response to an exceptional situation, and that a chain’s community may decide to do so. The fact that Ethereum 2.0 now does claim a finality guarantee, would not in practice prevent a similar response in future that would override that guarantee.

The question then is whether the procedure to override finality should be formalized or ad hoc. We argue that it should be formalized, including specifying the governance process to be used.

This makes security analysis — of the consensus protocol per se, of the governance process, and of their interaction — much more feasible. Arguably a complete security analysis is not possible at all without it.

It also front‑loads arguing about what procedure should be followed, and so it is more likely that stakeholders will agree to follow the process in any time‑critical incident.

A way of modelling overrides that is insufficient

There is another possible way to model a protocol that claims finality but can be overridden in practice. We could say that the protocol after the override is a brand‑new protocol and chain (inheriting balances from the previous one, possibly modulo adjustments such as those that happened in the DAO fork).

Although that would allow saying that the finality property has technically not been violated, it does not match how users think about an override situation. They are more likely to think of it as a protocol with finality that can be violated in exceptional cases — and they would reasonably want to know what those cases are and how they will be handled. It also does nothing to help with security analysis of such cases.

Complementarity

Finality overrides and bounded availability are complementary in the following way: if a problem is urgent enough, then validators can be asked to stop validating. For genuinely harmful problems, it is likely to be in the interests of enough validators to stop that this causes a finalization stall. If this lasts longer than the availability bound then the protocol will go into Stalled Mode, giving time for the defined governance process to occur and decide what to do. And because the unfinalized consensus chain will contain only a limited period of user transactions that spend funds, the option of a long rollback remains realistically open.

If, on the other hand, there is time pressure to make a governance decision about a rollback in order to reduce its length, that may result in a less well-considered decision.

A possible objection is that there might be a coalition of validators who ignore the request to stop (possibly including the attacker or validators that an attacker can bribe), in which case the finalization stall would not happen. But that just means that we don’t gain the advantage of more time to make a governance decision; it isn’t actively a disadvantage relative to alternative designs. This outcome can also be thought of as a feature rather than a bug: going into Stalled Mode should be a last resort, and if the argument given for the request to stop failed to convince a sufficient number of validators that it was reason enough to do so, then perhaps it wasn’t a good enough reason.

Rationale

This resolves one of the main objections to the original Stalled Mode idea that stopped us from implementing it in Zcash. The original proposal was to use a signature with a key held by ECC to trigger Stalled Mode, which would arguably have been too centralized. The Stalled Mode described in this document, on the other hand, can only be entered by consensus of a larger validator set, or if there is an availability failure of the finalization protocol.

It is also possible to make the argument that the threshold of stake needed is imposed by technical properties of the finality protocol and by the resources of the attacker, which might not be ideal for the purpose described above. However, we would argue that it does not need to be ideal, and will be in the right ballpark in practice.

There’s a caveat related to doing intentional rollbacks when using the Stalled Mode approach, where block production in the more-available protocol continues during a long finalization stall. What happens to incentives of block producers (miners in the case of Proof‑of‑Work), given that they know the consensus chain might be intentionally rolled back? They might reasonably conclude that it is less valuable to produce those blocks, leading to a reduction of hash rate or other violations of the security assumptions of $Π_{lc}$ .

This is actually fairly easy to solve. We have the governance procedures say that if we do an intentional rollback, the coinbase-only mining rewards will be preserved. I.e. we produce a block or blocks that include those rewards paid to the same addresses (adjusting the consensus to allow them to be created from thin air if necessary), have everyone check it thoroughly, and require the chain to restart from that block. So as long as block producers believe that this governance procedure will be followed and that the chain will eventually recover at a reasonable coin price, they will still have incentive to produce on $Π_{lc}$ , at least for a time.

Rationale

Although the community operating the governance procedures has already obtained the security benefit of mining done on the rolled-back chain by the time it creates the new chain, there is a strong incentive not to renege on the agreement with miners, because the same situation may happen again.

Tail-thrashing attacks

Earlier we said that there were two possible approaches to preventing hazards during a long finalization stall:

a) go into a Stalled Mode that directly disallows hazardous transactions (for example, by requiring $Π_{lc}$ blocks to be coinbase-only in Zcash);

b) temporarily cause the more-available chain to stall.

This section describes an important class of potential attacks on approach b) that are difficult to resolve. They are based on the fact that when the unfinalized chain stalls, an adversary has more time to find blocks, and this might violate security assumptions of the more-available protocol. For instance, if the more-available protocol is PoW-based, then its security in the steady state is predicated on the fact that an adversary with a given proportion of hash power has only a limited time to use that power, before the rest of the network finds another block.

Background

For an analysis of the concrete security of Nakamoto-like protocols, see [DKT+2020] and [GKR2020]. These papers confirm the intuition that the “private attack” —in which an adversary races privately against the rest of the network to construct a forking chain— is optimal, obtaining the same tight security bound independently using different techniques.

During a chain stall, the adversary no longer has a limited time to construct a forking chain. If, say, the adversary has 10% hash power, then it can on average find a block in 10 block times. And so in 100 block times it can create a 10-block fork.

It may in fact be worse than this: once miners know that a finalization stall is happening, their incentive to continue mining is reduced, since they know that there is a greater chance that their blocks might be rolled back. So we would expect the global hash rate to fall —even before the finality gap bound is hit— and then the adversary would have a greater proportion of hash rate.

Info

Even in a pure Ebb‑and‑Flow protocol, a finalization stall could cause miners to infer that their blocks are more likely to be rolled back, but the fact that the chain is continuing would make that more difficult to exploit. This issue with the global hash rate is mostly specific to the more-available protocol being PoW: if it were PoS, then its validators might as well continue proposing blocks because it is cheap to do so. There might be other attacks when the more-available protocol is PoS; we haven’t spent much time analyzing that case.

The problem is that the more-available chain does not necessarily just halt during a chain stall. In fact, for a finality gap bound of $k$ blocks, an adversary could cause the $k$ -block “tail” of the chain as seen by any given node to “thrash” between different chains. We will call this a tail‑thrashing attack.

If a protocol allowed such attacks then it would be a regression relative to the security we would normally expect from an otherwise similar PoW-based protocol. It only occurs during a finalization stall, but note that we cannot exclude the possibility of an adversary being able to provoke a finalization stall.

Note that in the Snap‑and‑Chat construction, snapshots of $Π_{lc}$ are used as input to the BFT protocol. That implies that the tail‑thrashing problem could also affect the input to that protocol, which would be bad (not least for security analysis of availability, which seems somewhat intractable in that case).

Also, when restarting $Π_{lc}$ , we would need to take account of the fact that the adversary has had an arbitrary length of time to build long chains from every block that we could potentially restart from. It could be possible to invalidate those chains by requiring blocks after the restart to be dependent on fresh randomness, but that sounds quite tricky (especially given that we want to restart without manual intervention if possible), and there may be other attacks we haven’t thought of. This motivates using approach a) instead.

Note that we have still glossed over precisely how consensus rules would change to enforce a). This will be covered later in The Crosslink 2 Construction, but first we will discuss other issues with Snap-and-Chat.

Notes on Snap‑and‑Chat

The discussion in The Argument for Bounded Availability and Finality Overrides is at an abstract level, applying to any Ebb‑and‑Flow-like protocol.

This document considers specifics of the Snap‑and‑Chat construction proposed in [NTT2020] (arXiv version).

Terminology

We are trying to be precise in this document about use of the terms “Ebb‑and‑Flow”, which is the security model and goal introduced in [NTT2020], vs “Snap‑and‑Chat”, which is the construction proposed in the same paper to achieve that goal. There are other ways to design an Ebb‑and‑Flow protocol that don’t run into the difficulties described in this section (or that run into different difficulties).

Effect on consensus

A general problem with the Snap‑and‑Chat construction is that it does not follow, from enforcement of the original consensus rules on blocks produced in $Π_{lc}$ , that the properties they are intended to enforce hold for the $LOG_{fin, i}^{t}$ or $LOG_{da, i}^{t}$ ledgers. Less obviously, the converse also does not follow: enforcing unmodified consensus rules on $Π_{lc}$ blocks is both too lax and too strict.

Recall from the paper how $LOG_{fin, i}^{t}$ and $LOG_{da, i}^{t}$ are constructed (starting at the end of page 8 of [NTT2020]):

Ledger extraction: Finally, how honest nodes compute $LOG_{fin, i}^{t}$ and $LOG_{da, i}^{t}$ from $Ch_{i}^{t}$ and $ch_{i}^{t}$ is illustrated in Figure 6. Recall that $Ch_{i}^{t}$ is an ordering of snapshots, i.e., a chain of chains of LC blocks. First, $Ch_{i}^{t}$ is flattened, i.e. the chains of blocks are concatenated as ordered to arrive at a single sequence of LC blocks. Then, all but the first occurrence of each block are removed (sanitized) to arrive at the finalized ledger $LOG_{fin, i}^{t}$ of LC blocks. To form the available ledger $LOG_{da, i}^{t}$ , $ch_{i}^{t}$ , which is a sequence of LC blocks, is appended to $LOG_{fin, i}^{t}$ and the result again sanitized.

This says that $LOG_{fin, i}^{t}$ and $LOG_{da, i}^{t}$ are sequences of transactions, not sequences of blocks. Therefore, consensus rules defined at the block level are not applicable.

Zcash-specific

Most of these rules are Proof‑of‑Work‑related checks that can be safely ignored at this level. Some are related to the hashBlockCommitments field intended for use by the FlyClient protocol. It is not at all clear how to make FlyClient (or other uses of this commitment) work with the Snap‑and‑Chat construction. In particular, the hashEarliest{Sapling,Orchard}Root, hashLatest{Sapling,Orchard}Root, and n{Sapling,Orchard}TxCount fields don’t make sense in this context since they could only reflect the values in $ch_{i}^{t}$ , which have no relation in general to those for any subrange of transactions in $LOG_{da, i}^{t}$ . This problem occurs as a result of sanitization and so will be avoided by Crosslink 2, which does not need sanitization.

Since $LOG_{da, i}^{t}$ does not have blocks, it is not well-defined whether it has “coinbase‑only blocks” when in Stalled Mode. That by itself is not so much of a problem because it would be sufficient for it to have only coinbase transactions in that mode.

Effect on issuance

The issuance schedule of Zcash was designed under the assumption that blocks only come from a single chain, and that the difficulty adjustment algorithm keeps the rate of block mining roughly constant over time.

For Snap‑and‑Chat, if there is a rollback longer than $σ$ blocks in $Π_{lc}$ , additional coinbase transactions from the rolled-back chain will be included in $LOG_{fin}$ .

We can argue that this will happen rarely enough not to cause any significant problem for the overall issuance schedule. However, it does mean that issuance is less predictable, because the block subsidies will be computed according to their depth in the $Π_{lc}$ chain on which they were mined. So it would no longer be the case that coinbase transactions issue a deterministic, non-increasing sequence of block subsidies. (Again, this problem will be avoided by Crosslink 2.)

Effect on transaction ordering

The order of transactions in any particular $ch_{i}^{t}$ is not in general preserved in either $LOG_{fin}$ or $LOG_{da}$ . This is considered in the paper (middle of the left column on page 10) but it is very easy to miss it:

Thus, snapshots taken by different nodes or at different times can conflict. However, $Π_{bft}$ is still safe and thus orders these snapshots linearly. Any transactions invalidated by conflicts are sanitized during ledger extraction.

That is, a transaction from one snapshot might double-spend an output already spent in a different transaction of a different snaphot earlier in the flattening order. If it is omitted, then later transactions could depend on the outputs of the omitted one. The paper is saying that each transaction is only included if (in Bitcoin and Zcash terminology) it satisfies contextual checks for double-spending and existence of inputs at the point in the ledger where it would be added.

Since nullifiers for shielded spends are public, it is possible to do this even for shielded transactions. Each node $i$ will construct commitment trees in the order given by $LOG_{da, i}^{t} .$

Info

This means that if $LOG_{fin, i}$ is extended by a block that is not the next block in $LOG_{da, i}$ after the finalization point (and that has different note commitments), then all shielded transactions from that point onward in the previous $LOG_{da, i}$ will be invalidated. It could be possible to do better at the expense of a more complicated note commitment tree structure. In any case, this situation is expected to be rare, because it can only occur if there is a rollback of more than $σ$ blocks in the $Π_{lc}$ consensus chain or a failure of BFT safety.

Subtlety in the definition of sanitization

There are two possible ways to interpret how $LOG_{{fin, da}}$ are constructed in Snap‑and‑Chat:

Concatenate the transactions from each final BFT block snapshot of an LC chain, and sanitize the resulting transaction sequence by including each transaction iff it is contextually valid.
Concatenate the blocks from each final BFT block snapshot of an LC chain, remove duplicate blocks, and only then sanitize the resulting transaction sequence by including each transaction iff it is contextually valid.

These are equivalent in the setting considered by [NTT2020], but the argument for their equivalence is not obvious. We definitely want them to be equivalent: in practice there will be many duplicate blocks from chain prefixes in the input to sanitization, and so a literal implementation of the first variant would have to recheck all duplicate transactions for contextual validity. That would have at least $O (n^{2})$ complexity (more likely $O (n^{2} lo g n)$ ) in the length $n$ of the block chain, because the length of each final snapshot grows with $n$ .

Suppose that, in a particular $Π_{lc}$ , the only reasons for a transaction to be contextually invalid are double-spends and missing inputs. In that case the argument for equivalence is:

If a transaction is omitted due to a double-spend, then any subsequent time it is checked, that input will still have been double-spent.
If a transaction is omitted due to a missing input, this can only be because an earlier transaction in the input to sanitization was omitted. So the structure of omitted transactions forms a DAG in which parent links must be to earlier omitted transactions. The roots of the DAG are at double-spending transactions, which cannot be reinstated. A child cannot be reinstated until its parents have been reinstated. Therefore, no transactions are reinstated.

Note that any other reason for transactions to be contextually invalid might interfere with this argument. Therefore, strictly speaking Snap‑and‑Chat should require of $Π_{lc}$ that there is no such other reason. This does not seem to be explicitly stated anywhere in [NTT2020].

Zcash-specific

In Zcash a transaction can also be contextually invalid because it has expired, or because it has a missing anchor. Expiry can be handled by extending the above argument as follows:

If a transaction is omitted due to having expired, then any subsequent time it is checked, it will still be expired.

It is not obvious how to extend it to handle missing anchors, because it is technically possible for a duplicate transaction that was invalid because of a missing anchor to become valid in a subsequent block. That situation would require careful manipulation of the commitment trees, but there does not seem to be anything preventing it from being provoked intentionally. The argument that was used above for missing inputs does not work here, because there is no corresponding DAG formed by the transactions with missing anchors: the same commitment treestate can be produced by two unrelated transactions.

Spending finalized outputs

Transactions in $ch_{i}^{t}$ need to be able to spend outputs that are not necessarily from any previous transaction in $ch_{i}^{t}$ . This is because, from the point of view of a user of node $i$ at time $t$ , the block chain includes all transactions in $LOG_{da, i}^{t}$ . All of the transactions after the finalization point are guaranteed to also be in $ch_{i}^{t}$ , but the ones before the finalization point (i.e. in $LOG_{fin, i}^{t}$ ) are not, because they could be from some other $ch_{j}^{u}$ for $u \leq t$ and $j \neq = i$ (intuitively, from some long chain fork that was once considered confirmed by enough nodes).

Info

Honest nodes only ever vote for confirmed snapshots, that is, prefixes of their best $Π_{lc}$ chain truncated by the confirmation depth $σ$ . Obviously the whole point of having the BFT protocol is that chain forks longer than $σ$ can occur in $Π_{lc}$ — otherwise we'd just use $Π_{lc}$ and have done. So it is not that we expect this case to be common, but if it happens then it will never fix itself: the consensus chain in $Π_{lc}$ will continue on without ever including the transactions from $LOG_{fin}^{t}$ that were obtained from a snapshot of another fork.

A user must be able to spend outputs for which they hold the spending key from any finalized transaction, otherwise there would be no point to the finalization.

The authors of [NTT2020] probably just missed this: the paper only has evidence that they simulated their construction, rather than implementing it for Bitcoin or any other concrete block chain as $Π_{lc}$ . Let’s try to repair it.

Suppose that node $j$ is trying to determine whether $ch_{i}^{t}$ is a consensus-valid chain, which is necessary for deterministic consensus in $Π_{lc}$ . It cannot decide whether to allow transactions in $ch_{i}^{t}$ to spend outputs not in the history of $ch_{i}^{t}$ on the basis of its own finalized view $LOG_{fin, j}^{t},$ because $LOG_{fin, i}^{t}$ and $LOG_{fin, j}^{t}$ are not in general the same.

Of course, we hope that $LOG_{fin, i}^{t}$ and $LOG_{fin, j}^{t}$ are consistent, i.e. one is a prefix of the other. But even if they are consistent, they are not necessarily the same length. In particular, if $LOG_{fin, j}^{t}$ is shorter than $LOG_{fin, i}^{t},$ then node $j$ does not have enough information to fill in the gap — and so it may incorrectly view a transaction in $ch_{i}^{t}$ as spending an output that does not exist, when actually it does exist in $LOG_{fin, i}^{t} ∖ LOG_{fin, j}^{t} .$ Conversely if $LOG_{fin, j}^{t}$ were longer and node $j$ were to allow spending an output in $LOG_{fin, j}^{t} ∖ LOG_{fin, i}^{t},$ that would be using information that is not necessarily available to other nodes, and so node $j$ could diverge from consensus.

Consensus validity of the block at the tip of $ch_{i}^{t}$ can only be a deterministic function of the block itself and its ancestors in $ch_{i}^{t}$ . It is crucial to be able to eventually spend outputs from the finalized chain. We are forced to conclude that the chain $ch_{i}^{t}$ must include the information needed to calculate $LOG_{fin, i}^{s}$ for some $s$ not too far behind $t$ . That is, $Π_{lc}$ must be modified to ensure that this is the case. This leads us to strengthen the required properties of an Ebb‑and‑Flow protocol to include another property, “finalization availability”.

Finalization Availability

In the absence of security flaws and under the security assumptions required by the finality layer, the finalization point will not be seen by any honest node to roll back. However, that does not imply that all nodes will see the same finalized height — which is impossible given network delays and unreliable messaging.

Both in order to optimize the availability of applications that require finality, and in order to solve the technical issue of spending finalized outputs described in the previous section, we need to consider availability of the information needed to finalize the chain up to a particular point.

Note that in Bitcoin-like consensus protocols, we don’t generally consider it to be an availability flaw that a block header only commits to the previous block hash and to the Merkle tree of transactions in the block, rather than including them directly. These commitments allow nodes to check that they have the correct information, which can then be requested separately.

Suppose, then, that each block header in $Π_{lc}$ commits to the Last Final BFT block known by the $Π_{lc}$ block producer. For an LC block chain with block $H$ at its tip, we will refer to this commitment as $LF (H)$ . We refer to the parent block of $H$ as $H ⌈_{bc}^{1}$ (this is a special case of a notation that will be defined in The Crosslink 2 Construction).

Consensus rule: Extension

We require, as a consensus rule, that if $H$ is not the genesis block header, then this BFT block either descends from or is the same as the final BFT block committed to by the $Π_{lc}$ block’s parent. That is, $LF (H ⌈_{bc}^{1}) ⪯_{bft} LF (H)$ .

This Extension rule will be preserved into Crosslink 2.

The Extension rule does not prevent the BFT chain from rolling back, if the security assumptions of $Π_{bft}$ were violated. However, it means that if a node $i$ does not observe a rollback in $Π_{lc}$ at confirmation depth $σ$ , then it will also not observe any instability in $LOG_{fin, i}$ , even if the security assumptions of $Π_{bft}$ are violated. This property holds by construction, and in fact regardless of $Π_{bft}$ .

Info

In the Snap‑and‑Chat construction, we also have BFT block proposals committing to $Π_{lc}$ snapshots (top of right column of [NTT2020, page 7]):

In addition, $ch_{i}^{t}$ is used as side information in $Π_{bft}$ to boycott the finalization of invalid snapshots proposed by the adversary.

This does not cause any circularity, because each protocol only commits to earlier blocks of the other. In fact, BFT validators have to listen to transmission of $Π_{lc}$ block headers anyway, so that could be also the protocol over which they get the information needed to make and broadcast their own signatures or proposals. (A possible reason not to broadcast individual signatures to all nodes is that with large numbers of validators, the proof that a sufficient proportion of validators/stake has signed can use an aggregate signature, which could be much smaller. Also, $Π_{lc}$ nodes only need to know about successful BFT block proposals.)

Now suppose that, in a Snap‑and‑Chat protocol, the BFT consensus finalizes a $Π_{lc}$ snapshot that does not extend the snapshot in the previous block (which can happen if either $Π_{bft}$ is unsafe, or $Π_{lc}$ suffers a rollback longer than $σ$ blocks). In that case we will initially not be able to spend outputs from the old snapshot in the new chain. But eventually for some node $i$ that sees the header $H$ at the tip of its best chain at time $t$ , $LF (H)$ will be such that from then on (i.e. at time $u \geq t$ ), $LOG_{fin, i}^{u}$ includes the output that we want to spend. This assumes liveness of $Π_{lc}$ and safety of $Π_{bft}$ .

That is, including a reference to a recent final BFT block in $Π_{lc}$ block headers both incentivizes nodes to propagate this information, and can be used to solve the “spending finalized outputs” problem.

Optionally, we could incentivize the block producer to include the latest information it has, for example by burning part of the block reward or by giving the producer some limited mining advantage that depends on how many $Π_{lc}$ blocks back the finalization information is.

This raises the question of how we measure how far ahead a given block is relative to the finalization information it provides. As we said before, $LOG_{fin, i}^{t}$ is a sequence of transactions, not blocks. The transactions will in general be in a different order, and also some transactions from $ch_{i}^{t}$ may have been omitted from $LOG_{fin, i}^{t}$ (and even $LOG_{da, i}^{t}$ ) because they were not contextually valid.

Info

In Crosslink 2, we will sidestep this problem by avoiding the need for sanitization — that is, $LOG_{fin, i}^{t}$ will correspond exactly to a chain of blocks that is a prefix of $ch_{i}^{t}$ . Actually we use the notation $fin_{i}^{t}$ to reflect the fact that it is a bc‑chain, not a sequence of bc‑transactions. This invariant is maintained statefully on each node $i$ : any rollback past $fin_{i}^{t}$ will be ignored. If a new $fin_{i}^{t}$ would conflict with the old one, the node will refuse to use it. This allows each node to straightforwardly measure how many blocks $ch_{i}^{t}$ is ahead of $fin_{i}^{t}$ as the difference in heights. Since this document is intended to explain the development of Crosslink from Snap‑and‑Chat, here we describe the more complicated approach that we originally came up with for Crosslink 1 — which also serves to motivate the simplification in Crosslink 2.

Assume that a block unambiguously specifies its ancestor chain. For a block $H$ , define: $tailhead (H) := last-common-ancestor (snapshot (LF (H)), H) finality-depth (H) := height (H) - height (tailhead (H))$

Here $LF (H)$ is the BFT block we are providing information for, and $snapshot (LF (H))$ is the corresponding $Π_{lc}$ snapshot. For a node $i$ that sees $LF (H)$ as the most recent final BFT block at time $t$ , $LOG_{fin, i}^{t}$ will definitely contain transactions from blocks up to $tailhead (H)$ , but usually will not contain subsequent transactions on $H$ ’s fork.

Info

Strictly speaking, it is possible that a previous BFT block took a snapshot $H^{'}$ that is between $tailhead (H)$ and $H$ . This can only happen if there have been at least two rollbacks longer than $σ$ blocks (i.e. we went more than $σ$ blocks down $H$ ’s fork from $tailhead (H)$ , then reorged to more than $σ$ blocks down $snapshot (LF (H))$ ’s fork, then reorged again to $H$ ’s fork). In that case, the finalized ledger would already have the non-conflicting transactions from blocks between $tailhead (H)$ and $H^{'}$ — and it could be argued that the correct definition of finality depth in such cases is the depth of $H^{'}$ relative to $H$ , not of $tailhead (H)$ relative to $H$ .

However,

The definition above is simpler and easier to compute.
The effect of overestimating the finality depth in such corner cases would only cause us to enforce Stalled Mode slightly sooner, which seems fine (and even desirable) in any situation where there have been at least two rollbacks longer than $σ$ blocks.

By the way, the “tailhead” of a tailed animal is the area where the posterior of the tail joins the rump (also called the “dock” in some animals).

We could alternatively just rely on the fact that some proportion of block producers are honest and will include the latest information they have. However, it turns out that having a definition of finality depth will also be useful to enforce going into Stalled Mode.

Specifically, if we accept the above definition of finality depth, then the security property we want is

Definition

Bounded hazard-freeness for a finality gap bound of $L$ blocks: There is never, for any node $i$ at time $t$ , observed to be a more-available ledger $LOG_{da, i}^{t}$ with a hazardous transaction that comes from block $H$ of $ch_{i}^{t}$ such that $finality-depth (H) > L$ .

Info

This assumes that transactions in the non-finalized suffix $LOG_{da, i}^{t} ∖ LOG_{fin, i}^{t}$ come from blocks in $ch_{i}^{t}$ . In Snap‑and‑Chat they do by definition, but ideally we wouldn’t depend on that. The difficulty in finding a more general security definition is due to the ledgers in an Ebb‑and‑Flow protocol being specified as sequences of transactions, so that a depth in the ledger would have only a very indirect correspondence to time. We could instead base a definition on timestamps, but that could run into difficulties in ensuring timestamp accuracy.

Another possibility would be to count the number of coinbase transactions in $LOG_{da, i}^{t} ∖ LOG_{fin, i}^{t}$ before the hazardous transaction. This would still be somewhat ad hoc (it depends on the fact that coinbase transactions happen once per block and cannot conflict with any other distinct transaction).

In any case, if $finality-depth$ sometimes overestimates the depth, that cannot weaken this security definition.

Note that a node that is validating a chain $ch_{i}^{t}$ must fetch all the chains referenced by BFT blocks reachable from it (back to an ancestor that it has seen before). In theory, there could be a partition that causes there to be multiple disjoint snapshots that get added to the BFT chain in quick succession. However, in practice we expect such long rollbacks to be rare if $Π_{lc}$ is meeting its security goals.

Going into Stalled Mode if there is a long finalization stall helps to reduce the cost of validation when the stall resolves. That is, if there is a partition and nodes build on several long chains, then in unmodified Snap‑and‑Chat, it could be necessary to validate an arbitrary number of transactions on each chain when the stall resolves. Having only coinbase transactions after a certain point in each chain would significantly reduce the concrete validation costs in this situation.

Nodes should not simply trust that the BFT blocks are correct; they should check validator signatures (or aggregate signatures) and finalization rules. Similarly, $Π_{lc}$ snapshots should not be trusted just because they are referenced by BFT blocks; they should be fully validated, including the proofs-of-work.

It is also possible for a snapshot reference to include the subsequent $σ$ block headers, which are guaranteed to be available for a confirmed snapshot. Having all nodes validate the proofs-of-work in these headers is likely to significantly increase the work that an attacker would need to perform to cause disruption under a partial failure of either $Π_{bft}$ or $Π_{lc}$ ’s security properties.

Info

Note that [NTT2020] (bottom of right column, page 9) makes a safety assumption about $Π_{lc}$ in order to prove the consistency of $LOG_{fin}$ with the output of $Π_{lc}$ :

As indicated by Algorithm 1, a snapshot of the output of $Π_{lc}$ becomes final as part of a BFT block only if that snapshot is seen as confirmed by at least one honest node. However, since $Π_{lc}$ is safe [i.e., does not roll back further than the confirmation depth $σ$ ], the fact that one honest node sees that snapshot as confirmed implies that every honest node sees the same snapshot as confirmed.

We claim that, while this may be a reasonable assumption to make for parts of the security analysis, in practice we should always require any adversary to do the relevant amount of Proof‑of‑Work to construct block headers that are plausibly confirmed. This is useful even though we cannot require, for every possible attack, that it had those headers at the time they should originally have appeared.

Enforcing Finalization Availability and Stalled Mode

The following idea for enforcing finalization availability and a bound on the finality gap was originally conceived before we had switched to advocating the Stalled Mode approach. It’s simpler to explain first in that variant.

Suppose that for an $L$ -block availability bound, we required each block header to include the information necessary for a node to finalize to $L$ blocks back. This would automatically enforce a chain stall after the availability bound without any further explicit check, because it would be impossible to produce a block after the bound.

Note that if full nodes have access to the BFT chain, knowing $LF (H)$ is sufficient to tell whether the correct version of any given BFT block in $LF (H)$ ’s ancestor chain has been obtained.

Suppose that the finality gap bound is $L$ blocks. Having already defined $finality-depth$ , the necessary $Π_{lc}$ consensus rule is attractively simple:

Consensus rule

For every $Π_{lc}$ block $H$ , $finality-depth (H) \leq L$ .

To adapt this approach to enforce Stalled Mode instead of stalling the chain, we can allow the alternative of producing a block that follows the Stalled Mode restrictions:

Consensus rule

For every $Π_{lc}$ block $H$ , either $finality-depth (H) \leq L$ , or $H$ follows the Stalled Mode restrictions.

Note that Stalled Mode will be exited automatically as soon as the finalization point catches up to within $L$ blocks (if it does without an intentional rollback). Typically, after recovery from whatever was causing the finalization stall, the validators will be able to obtain consensus on the same chain as $LOG_{da}$ , and so there will be no rollback (or at least not a long one) of $LOG_{da}$ .

Info

An earlier iteration of this idea required the finalization information to be included in $Π_{lc}$ block headers. This is not necessary when we assume that full nodes have access to the BFT chain and can obtain arbitrary BFT blocks. This also sidesteps any need to relax the rule in order to bound the size of $Π_{lc}$ block headers. $Π_{lc}$ block producers are still incentivized to make the relevant BFT blocks available, because without them the above consensus rule cannot be checked, and so their $Π_{lc}$ blocks would not be accepted.

There is, however, a potential denial-of-service attack by claiming the existence of a BFT block that is very far ahead of the actual BFT chain tip. This attack is not very serious as long as nodes limit the number of BFT blocks they will attempt to obtain in parallel before having checked validator signatures.

Comment on security assumptions

Consider Lemma 5:

Moreover, for a BFT block to become final in the view of an honest node $i$ under $(A_{2}^{*}, Z_{2})$ , at least one vote from an honest node is required, and honest nodes only vote for a BFT block if they view the referenced LC block as confirmed.

The stated assumptions are:

$(A_{2} (β), Z_{2})$ formalizes the model of P2, a synchronous network under dynamic participation, with respect to a bound $β$ on the fraction of awake nodes that are adversarial:

At all times, $A_{2}$ is required to deliver all messages sent between honest nodes in at most $Δ$ slots.

At all times, $A_{2}$ determines which honest nodes are awake/asleep and when, subject to the constraint that at all times at most fraction $β$ of awake nodes are adversarial and at least one honest node is awake.

$(A_{2}^{*}, Z_{2})$ is defined as $(A_{2} (\frac{1}{2}, Z_{2}))$ .

Now consider this statement and figure:

Even if $Π_{bft}$ is unsafe (Figure 9c), finalization of a snapshot requires at least one honest vote, and thus only valid snapshots become finalized.

This argument is technically correct but has to be interpreted with care. It only applies when the number of malicious nodes $f$ is such that $n /3 < f < n /2$ . What we are trying to do with Crosslink is to ensure that a similar conclusion holds even if $Π_{bft}$ is completely subverted, i.e. the adversary has 100% of validators (but only < 50% of $Π_{lc}$ hash rate).

The Crosslink 2 Construction

We are now ready to give a description of a protocol that takes into account the issues described in Notes on Snap‑and‑Chat, and that implements bounded availability. We call this the “Crosslink” construction; more precisely the version described here is “Crosslink 2”.

This description will attempt to be self-contained, but [NTT2020] (arXiv version) is useful background on the general model of Ebb-and-Flow protocols.

Conventions

“ $*$ ” is a metavariable for the name of a protocol. We also use it as a wildcard in protocol names of a particular type, for example “ $*$ bc” for the name of some best‑chain protocol.

Protocols are referred to as $Π_{*}$ for a name “ $*$ ”. Where it is useful to avoid ambiguity, when referring to a concept defined by $Π_{*}$ we prefix it with “ $*$ ‑”.

We do not take synchrony or partial synchrony as an implicit assumption of the communication model; that is, unless otherwise specified, messages between protocol participants can be arbitrarily delayed or dropped. A given message is received at most once, and messages are nonmalleably authenticated as originating from a given sender whenever needed by the applicable protocol. Particular subprotocols may require a stronger model.

Background

For an overview of communication models used to analyze distributed protocols, see this blog post by Ittai Abraham.

Discussion of incorrect applications of the GST formalization of partial synchrony to continuously operating protocols.

The original context for the definition of the partially synchronous model in [DLS1988] was for “one‑shot” Byzantine Agreement — called “the consensus problem” in that paper. The following argument is used to justify assuming that all messages from the Global Stabilization Time onward are delivered within the upper time bound $Δ$ :

Therefore, we impose an additional constraint: For each execution there is a global stabilization time (GST), unknown to the processors, such that the message system respects the upper bound $Δ$ from time GST onward.

This constraint might at first seem too strong: In realistic situations, the upper bound cannot reasonably be expected to hold forever after GST, but perhaps only for a limited time. However, any good solution to the consensus problem in this model would have an upper bound $L$ on the amount of time after GST required for consensus to be reached; in this case it is not really necessary that the bound $Δ$ hold forever after time GST, but only up to time GST $+ L$ . We find it technically convenient to avoid explicit mention of the interval length $L$ in the model, but will instead present the appropriate upper bounds on time for each of our algorithms.

Several subsequent authors applying the partially synchronous model to block chains appear to have forgotten or neglected this context. In particular, the argument depends on the protocol completing soon after GST. Obviously a block‑chain protocol does not satisfy this assumption; it is not a “one‑shot” consensus problem.

This assumption could be removed, but some authors of papers about block‑chain protocols have taken it to be an essential aspect of modelling partial synchrony. I believe this is contrary to the intent of [DLS1988]:

Instead of requiring that the consensus problem be solvable in the GST model, we might think of separating the correctness conditions into safety and termination properties. The safety conditions are that no two correct processors should ever reach disagreement, and that no correct processor should ever make a decision that is contrary to the specified validity conditions. The termination property is just that each correct processor should eventually make a decision. Then we might require an algorithm to satisfy the safety conditions no matter how asynchronously the message system behaves, that is, even if $Δ$ does not hold eventually. On the other hand, we might only require termination in case $Δ$ holds eventually. It is easy to see that these safety and termination conditions are [for the consensus problem] equivalent to our GST condition: If an algorithm solves the consensus problem when $Δ$ holds from time GST onward, then that algorithm cannot possibly violate a safety property even if the message system is completely asynchronous. This is because safety violations must occur at some finite point in time, and there would be some continuation of the violating execution in which $Δ$ eventually holds.

This argument is correct as stated, i.e. for the one‑shot consensus problem. Subtly, essentially the same argument can be adapted to protocols with safety properties that need to be satisfied continuously. However, it cannot correctly be applied to liveness properties of non‑terminating protocols. The authors (Cynthia Dwork, Nancy Lynch, and Larry Stockmeyer) would certainly have known this: notice how they carefully distinguish “the GST model” from “partial synchrony”. They cannot plausibly have intended this GST formalization to be applied unmodified to analyze liveness in such protocols, which seems to be common in the block‑chain literature, including in the Ebb-and-Flow paper [NTT2020] and the Streamlet paper [CS2020].

The Ebb-and-Flow paper acknowledges the issue by saying “Although in reality, multiple such periods of (a‑)synchrony could alternate, we follow the long‑standing practice in the BFT literature and study only a single such transition.” This is not adequate: “long‑standing practice” notwithstanding, it is not valid in general to infer that properties holding for the first transition to synchrony also apply to subsequent transitions (where the protocol can be in states that would not occur initially), and it is plausible that this inference could fail for real protocols. The Streamlet paper also refers to “periods of synchrony” which indicates awareness of the issue, but then it uses the unmodified GST model in the proofs.

Informally, to solve this issue it is necessary to also prove that existing progress is maintained during periods of asynchrony, and that during such periods the protocol remains in states where it will be able to take advantage of a future period of synchrony to make further progress.

This provides further motivation to avoid taking the GST formalization of partial synchrony as a basic assumption.

Note that the recent result [CGSW2024] does not contradict anything we say here. Although the GST and Unknown Latency models are “equally demanding” in the sense of existence of protocols that satisfy a given goal, this result does not show that the models are equivalent for any specific protocol. In particular the requirements of the “clock‑slowing” technique fail in practice for any protocol involving Proof‑of‑Work.

A $*$ ‑execution is the complete set of events (message sends/receives and decisions by protocol participants) that occur in a particular run of $Π_{*}$ from its initiation up to a given time. A prefix of a $*$ ‑execution is also a $*$ ‑execution. Since executions always start from protocol initiation, a strict suffix of a $*$ ‑execution is not a $*$ ‑execution.

Times are modelled as values of a totally ordered type $Time$ with minimum value $0$ . For convenience, we consider all protocol executions to start at time $0$ .

Remark

Although protocols may be nondeterministic, an execution fixes the events that occur and times at which they occur, for the purpose of modeling.

For simplicity, we assume that all events occur at global times in a total ordering. This assumption is not realistic in an asynchronous communication model, but it is not essential to the design or analysis and could be removed: we could use a partial happens-before ordering on events in place of a total ordering on times.

A “ $*$ ‑node” is a participant in $Π_{*}$ (the protocol may be implicit). A $*$ ‑node is “honest at time $t$ ” in a given execution iff it has followed the protocol up to and including time $t$ in that execution.

A time series on type $U$ is a function $S ⦂ Time \to U$ assigning a value of $U$ to each time in an execution. By convention, we will write the time $t$ as a superscript: $S^{t} = S (t)$ .

A $*$ ‑chain is a nonempty sequence of $*$ ‑blocks, starting at the “genesis block” $O_{*}$ , in which each subsequent block refers to its preceding or “parent block” by a collision‑resistant hash. The “tip” of a $*$ ‑chain is its last element.

For convenience, we conflate $*$ ‑blocks with $*$ ‑chains; that is, we identify a chain with the block at its tip. This is justified because, assuming that the hash function used for parent links is collision‑resistant, there is exactly one $*$ ‑chain corresponding to a $*$ ‑block; and conversely there is exactly one $*$ ‑block at the tip of a $*$ ‑chain.

If $C$ is a $*$ ‑chain, $C ⌈_{*}^{k}$ means $C$ with the last $k$ blocks pruned, except that if $len (C) \leq k$ , the result is the genesis $*$ ‑chain consisting only of $O_{*}$ .

The block at depth $k \in N$ in a $*$ ‑chain $C$ is defined to be the tip of $C ⌈_{*}^{k}$ . Thus the block at depth $k$ in a chain is the last one that cannot be affected by a rollback of length $k$ (this also applies when $len (C) \leq k$ because the genesis $*$ ‑chain cannot roll back).

Terminology

Our usage of “depth” is different from [NTT2020], which uses “depth” to refer to what Bitcoin and Zcash call “height”. It also differs by $1$ from the convention for confirmation depths in zcashd, where the tip is considered to be at depth $1$ , rather than $0$ .

For $*$ ‑blocks $B$ and $C$ :

The notation $B ⪯_{*} C$ means that the $*$ ‑chain with tip $B$ is a prefix of the one with tip $C$ . This includes the case $B = C$ .
The notation $B ⪯ ⪰_{*} C$ means that either $B ⪯_{*} C$ or $C ⪯_{*} B$ . That is, “one of $B$ and $C$ is a prefix of the other”. This also includes the case $B = C$ .
The notation $B / ⪯ ⪰_{*} C$ means that both $B \neq ⪯_{*} C$ and $C \neq ⪯_{*} B$ . That is, “neither of $B$ and $C$ is a prefix of the other”.

A function $S ⦂ I \to * -block$ is $*$ ‑linear iff for every $t, u ⦂ I$ where $t \leq u$ we have $S (t) ⪯_{*} S (u)$ . (This definition can be applied to time series where $I = Time$ , or to sequences of $*$ ‑blocks where values of $I$ are indices.)

Lemma: Linear prefix

If $A ⪯_{*} C$ and $B ⪯_{*} C$ then $A ⪯ ⪰_{*} B$ .

Proof: The chain of ancestors of $C$ is $*$ -linear, and $A$ , $B$ are both on that chain.

The notation $[f (X) for X ⪯_{*} Y]$ means the sequence of $f (X)$ for each $*$ ‑block $X$ in chain order from genesis up to and including $Y$ . ( $X$ is a bound variable within this construct.)

$TODO:$ remove this if not used:

We use $log ⪯ log^{'}$ (without a subscript on $⪯$ ) to mean that the transaction ledger $log$ is a prefix of $log^{'}$ . Similarly to $⪯ ⪰_{*}$ above, $log ⪯ ⪰ log^{'}$ means that either $log ⪯ log^{'}$ or $log^{'} ⪯ log$ ; that is, “one of $log$ and $log^{'}$ is a prefix of the other”.

Views

In the simplest case, a block‑chain protocol $Π_{*}$ provides a single “view” that, for a given $*$ ‑execution, provides each $*$ ‑node with a time series on $*$ ‑chains. More generally a protocol may define several “views” that provide each $*$ ‑node with time series on potentially different chain types.

We model a $*$ ‑view as a function $V ⦂ Node \times Time \to * -chain$ . By convention, we will write the node index $i$ as a subscript and the time $t$ as a superscript: $V_{i}^{t} = V (i, t)$ .

Definition: Agreement on a view

An execution of $Π$ has Agreement on the view $V ⦂ Node \times Time \to * -chain$ iff for all times $t$ , $u$ and all $Π$ nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have $V_{i}^{t} ⪯ ⪰_{*} V_{j}^{u}$ .

Subprotocols

As in Snap‑and‑Chat, we depend on a BFT protocol $Π_{origbft}$ , and a best‑chain protocol $Π_{origbc}$ .

Info

See this terminology note for why we do not call $Π_{origbc}$ a “longest‑chain” protocol.

We modify $Π_{origbft}$ (resp. $Π_{origbc}$ ) to give $Π_{bft}$ (resp. $Π_{bc}$ ) by adding structural elements, changing validity rules, and changing the specified behaviour of honest nodes.

A Crosslink 2 node must participate in both $Π_{bft}$ and $Π_{bc}$ ; that is, it must maintain a view of the state of each protocol. Acting in more specific roles such as bft‑proposer, bft‑validator, or bc‑block‑producer is optional, but we assume that all such actors are Crosslink 2 nodes.

Model for BFT protocols (Π_{{origbft,bft}})

A $*$ bft‑node’s view includes a set of $*$ bft‑block chains each rooted at a fixed genesis $*$ bft‑block $O_{* bft}$ . There is a $*$ bft‑block‑validity rule (specified below), which depends only on the content of the block and its ancestors. A non‑genesis block can only be $*$ bft‑block‑valid if its parent is $*$ bft‑block‑valid. A $*$ bft‑valid‑chain is a chain of $*$ bft‑block‑valid blocks.

Execution proceeds in a sequence of epochs. In each epoch, an honest proposer for that epoch may make a $*$ bft‑proposal.

A $*$ bft‑proposal refers to a parent $*$ bft‑block, and specifies the proposal’s epoch. The content of a proposal is signed by the proposer using a strongly unforgeable signature scheme. We consider the proposal to include this signature. There is a $*$ bft‑proposal‑validity rule, depending only on the content of the proposal and its parent block, and the validity of the proposer’s signature.

We extend the $⌈^{k}$ notation to $*$ bft‑proposals in the obvious way: if $k \geq 1$ , $P$ is a $*$ bft‑proposal and $B$ its parent $*$ bft‑block, then $P ⌈_{* bft}^{k} := B ⌈_{* bft}^{k - 1}$ .

Terminology

We will shorten “ $*$ bft‑block‑valid $*$ bft‑block” to “ $*$ bft‑valid‑block”, and “ $*$ bft‑proposal‑valid $*$ bft‑proposal” to “ $*$ bft‑valid‑proposal”.

For each epoch, there is a fixed number of voting units distributed between the $*$ bft‑nodes, which they use to vote for a $*$ bft‑proposal. We say that a voting unit has been cast for a $*$ bft‑proposal $P$ at a given time in a $*$ bft‑execution, if and only if $P$ is $*$ bft‑proposal‑valid and a ballot for $P$ authenticated by the holder of the voting unit exists at that time.

Using knowledge of ballots cast for a $*$ bft‑proposal $P$ that collectively satisfy a notarization rule at a given time in a $*$ bft‑execution, and only with such knowledge, it is possible to obtain a valid $*$ bft‑notarization‑proof $proof_{P}$ . The notarization rule must require at least a two‑thirds absolute supermajority of voting units in $P$ ’s epoch to have been cast for $P$ . It may also require other conditions.

A voting unit is cast non‑honestly for an epoch’s proposal iff:

it is cast other than by the holder of the unit (due to key compromise or any flaw in the voting protocol, for example); or
it is double‑cast (i.e. there are at least two ballots casting it for distinct proposals); or
the holder of the unit following the conditions for honest voting in $Π_{* bft}$ , according to its view, should not have cast that vote.

Note that a unit should be considered to be cast non-honestly in the case of key compromise, because it is then effectively under the control of an adversary. The key compromise may or may not be attributable to another flaw in the protocol, but such a flaw would not be a break of the consensus mechanism per se.

Definition: One‑third bound on non‑honest voting

An execution of $Π_{bft}$ has the one‑third bound on non‑honest voting property iff for every epoch, strictly fewer than one third of the total voting units for that epoch are ever cast non‑honestly.

Info

It may be the case that a ballot cast for $P$ is not in honest view when it is used to create a notarization proof for $P$ . Since we are not assuming synchrony, it may also be the case that such a ballot is in honest view but that any given node has not received it (and perhaps will never receive it).

There may be multiple distinct ballots or distinct ballot messages attempting to cast a given voting unit for the same proposal; this is undesirable for bandwidth usage, but it is not necessary to consider it to be non‑honest behaviour for the purpose of security analysis, as long as such ballots are not double‑counted toward the two‑thirds threshold.

Security caveat

The one‑third bound on non‑honest voting property considers all ballots cast in the entire execution. In particular, it is possible that a validator’s key is compromised and then used to cast its voting units for a proposal of an epoch long finished. If the number of voting units cast non-honestly for any epoch ever reaches one third of the total voting units for that epoch during an execution, then the one‑third bound on non‑honest voting property is violated for that execution.

Therefore, validator keys of honest nodes must remain secret indefinitely. Whenever a key is rotated, the old key must be securely deleted. For further discussion and potential improvements, see tfl-book issue #140.

A $*$ bft‑block consists of $(P, proof_{P})$ re‑signed by the same proposer using a strongly unforgeable signature scheme. It is $*$ bft‑block‑valid iff:

$P$ is $*$ bft‑proposal‑valid; and
$proof_{P}$ is a valid proof that some subset of ballots cast for $P$ are sufficient to satisfy the notarization rule; and
the proposer’s outer signature on $(P, proof_{P})$ is valid.

A $*$ bft‑proposal’s parent reference hashes the entire parent $*$ bft‑block, i.e. proposal, proof, and outer signature.

Info

Neither $proof_{P}$ nor the proposer’s outer signature is unique for a given $P$ . The proposer’s outer signature is however third‑party nonmalleable, by definition of a strongly unforgeable signature scheme. An “honest $*$ bft‑proposal” is a $*$ bft‑proposal made for a given epoch by a proposer who is honest in that epoch. Such a proposer will only create one proposal and only sign at most once for each epoch, and so there will be at most one “honestly submitted” $*$ bft‑block for each epoch.

It is possible for there to be multiple $*$ bft‑valid‑blocks for the same proposal, with different notarization proofs and/or outer signatures, if the proposer is not honest. However, the property that there will be at most one “honestly submitted” $*$ bft‑block for each epoch is important for liveness, even though we cannot guarantee that any particular proposer for an epoch is honest.

$TODO:$ check that we are correctly using this in the liveness analysis.

There is an efficiently computable function $* bft-last-final ⦂ * bft-block \to * bft-block \cup {⊥}$ . For a $*$ bft‑block‑valid input block $C$ , this function outputs the last ancestor of $C$ that is final in the context of $C$ .

Info

The chain of ancestors is unambiguously determined because a $*$ bft‑proposal’s parent reference hashes the entire parent $*$ bft‑block; each $*$ bft‑block commits to a proposal; and the parent hashes are collision‑resistant. This holds despite the caveat mentioned above that there may be multiple $*$ bft‑valid‑blocks for the same proposal.

$* bft-last-final$ must satisfy all of the following:

$* bft-last-final (C) = ⊥ ⟺ C$ is not $*$ bft‑block‑valid.
If $C$ is $*$ bft‑block‑valid, then:
- $* bft-last-final (C) ⪯_{* bft} C$ (and therefore it must also be $*$ bft‑block‑valid);
- for all $*$ bft‑valid‑blocks $D$ such that $C ⪯_{* bft} D$ , $* bft-last-final (C) ⪯_{* bft} * bft-last-final (D)$ .
$* bft-last-final (O_{* bft}) = O_{* bft}$ .

Info

It is correct to talk about the “last final block” of a given chain (that is, each $*$ bft‑valid-block $C$ unambiguously determines a $*$ bft‑valid-block $* bft-last-final (C)$ ), but it is not correct to refer to a given $*$ bft‑block as objectively “ $*$ bft‑final”.

A particular BFT protocol might need adaptations to fit it into this model for $Π_{origbft}$ , before we apply the Crosslink 2 modifications to obtain $Π_{bft}$ . Any such adaptions are necessarily protocol-specific. In particular:

origbft‑proposal‑validity should correspond to the strongest property of an origbft‑proposal that is objectively and feasibly verifiable from the content of the proposal and its parent origbft‑block at the time the proposal is made. It must include verification of the proposer’s signature.
origbft‑block‑validity should correspond to the strongest property of an origbft‑block that is objectively and feasibly verifiable from the content of the block and its ancestors at the time the block is added to an origbft‑chain. It should typically include all of the relevant checks from origbft‑proposal‑validity that apply to the created block (or equivalent checks). It must also include verification of the notarization proof and the proposer’s outer signature.
If a node observes an origbft‑valid block $C$ , then it should be infeasible for an adversary to cause a rollback in that node’s view past $origbft-last-final (C)$ , and the view of the chain up to $origbft-last-final (C)$ should agree with that of all other honest nodes. This is formalized in the next section.

Safety of $Π_{* bft}$

The intuition behind the following safety property is that:

For $Π_{* bft}$ to be safe, it should never be the case that two honest nodes observe (at any time) $*$ bft‑blocks $B$ and $B^{'}$ respectively that they each consider final in some context, but $B ⪯ ⪰_{* bft} B^{'}$ does not hold.
By definition, an honest node observes a $*$ bft‑block to be final in the context of another $*$ bft‑block $C$ , iff $B ⪯_{* bft} * bft-last-final (C)$ .

We say that a $*$ bft‑block is “in honest view” if a party observes it at some time at which that party is honest.

Definition: Final Agreement

An execution of $Π_{* bft}$ has Final Agreement iff for all $*$ bft‑valid blocks $C$ in honest view at time $t$ and $C^{'}$ in honest view at time $t^{'}$ , we have $* bft-last-final (C) ⪯ ⪰_{* bft} * bft-last-final (C^{'})$ .

Note that it is possible for this property to hold for an execution of a BFT protocol in an asynchronous communication model. As previously mentioned, if the one‑third bound on non‑honest voting property is ever broken at any time in an execution, then it may not be possible to maintain Final Agreement from that point on.

Adapting the Streamlet BFT protocol.

Streamlet as described in [CS2020] has three possible states of a block in a player’s view:

“valid” (but not notarized or final);
“notarized” (but not final);
“final”.

By “valid” the Streamlet paper means just that it satisfies the structural property of being part of a block chain with parent hashes. The role of $*$ bft‑block‑validity in our model corresponds roughly to Streamlet’s “notarized”. It turns out that with some straightforward changes relative to Streamlet, we can identify “origbft‑block‑valid” with “notarized” and consider an origbft‑valid‑chain to only consist of notarized blocks. This is not obvious, but is a useful simplification.

Here is how the paper defines “notarized”:

When a block gains votes from at least $2 n /3$ distinct players, it becomes notarized. A chain is notarized if its constituent blocks are all notarized.

This implies that blocks can be added to chains independently of notarization. However, the paper also says that an honest leader always proposes a block extending from a notarized chain. Therefore, only notarized chains really matter in the protocol.

In unmodified Streamlet, the order in which a player sees signatures might cause it to view blocks as notarized out of order. Streamlet’s security analysis is in a synchronous model, and assumes for liveness that any vote will have been received by all players (Streamlet nodes) within two epochs.

In Crosslink 2, however, we need origbft‑block‑validity to be an objectively and feasibly verifiable property. We also would prefer reliable message delivery within bounded time not to be a basic assumption of our communication model. (This does not dictate what assumptions about message delivery are made for particular security analyses.) If we did not make a modification to the protocol to take this into account, then some Crosslink 2 nodes might receive a two‑thirds absolute supermajority of voting messages and consider a BFT block to be notarized, while others might never receive enough of those messages.

Obviously a proposal cannot include signatures on itself — but the block formed from it can include proofs about the proposal and signatures. We can therefore say that when a proposal gains a two‑thirds absolute supermajority of signatures, a block is created from it that contains a proof (such as an aggregate signature) that it had such a supermajority. For example, we can have the proposer itself make this proof once it has enough votes, sign the resulting $(P, proof_{P})$ to create a block, then submit that block in a separate message. (The proposer has most incentive to do this in order to gain whatever reward attaches to a successful proposal; it can outsource the proving task if needed.) Then the origbft‑block‑validity rule can require a valid supermajority proof, which is objectively and feasibly verifiable. Players that see an origbft‑valid‑block can immediately consider it notarized.

Note that for the liveness analysis to be unaffected, we need to assume that the combined latency of messages, of collecting and aggregating signatures, and of block submission is such that all adapted‑Streamlet nodes will receive a notarized block corresponding to a given proposal (rather than just all of the votes for the proposal) within two epochs. Alternatively we could re‑do the timing analysis.

With this change, “origbft‑block‑valid” and “notarized” do not need to be distinguished.

Streamlet’s finality rule is:

If in any notarized chain, there are three adjacent blocks with consecutive epoch numbers, the prefix of the chain up to the second of the three blocks is considered final. When a block becomes final, all of its prefix must be final too.

We can straightforwardly express this as an $origbft-last-final$ function of a context block $C$ , as required by the model:

For an origbft‑valid‑block $C$ , $origbft-last-final (C)$ is the last origbft‑valid‑block $B ⪯_{origbft} C$ such that either $B = O_{origbft}$ or $B$ is the second block of a group of three adjacent blocks with consecutive epoch numbers.

Note that “When a block becomes final, all of its prefix must be final too.” is implicit in the model.

Model for best-chain protocols (Π_{origbc,bc})

A node’s view in $Π_{* bc}$ includes a set of $*$ bc‑block chains each rooted at a fixed genesis $*$ bc‑block $O_{* bc}$ . There is a $*$ bc‑block‑validity rule (often described as a collection of “consensus rules”), depending only on the content of the block and its ancestors. A non‑genesis block can only be $*$ bc‑block‑valid if its parent is $*$ bc‑block‑valid. By “ $*$ bc‑valid‑chain” we mean a chain of $*$ bc‑block‑valid blocks.

Terminology

The terminology commonly used in the block‑chain community does not distinguish between rules that are part of the consensus protocol proper, and rules required for validity of the economic computation supported by the block chain. Where it is necessary to distinguish, the former can be called “L0” consensus rules, and the latter “L1” consensus rules.

The definition of $*$ bc‑block‑validity is such that it is hard for a block producer to extend a $*$ bc‑valid‑chain unless they are selected by a random process that chooses a block producer in proportion to their resources with an approximately known and consistent time distribution, subject to some assumption about the total proportion of resources held by honest nodes.

There is a function $score ⦂ * bc-valid-chain \to Score$ , with $<$ a strict total ordering on $Score$ . An honest node will choose one of the $*$ bc‑valid‑chains with highest score as the $*$ bc‑best‑chain in its view. Any rule can be specified for breaking ties.

The $score$ function is required to satisfy $score (H ⌈_{* bc}^{1}) < score (H)$ for any non‑genesis $*$ bc‑valid‑chain $H$ .

Info

For a Proof‑of‑Work protocol, the score of a $*$ bc‑chain should be its accumulated work.

Unless an adversary is able to censor knowledge of other chains from a node’s view, it should be difficult to cause the node to switch to a chain with a last common ancestor more than $σ$ blocks back from the tip of its previous $*$ bc‑best‑chain.

Let $ch$ be a view such that $ch_{i}^{t}$ is node $i$ ’s $*$ bc‑best‑chain at time $t$ . (This matches the notation used in [NTT2020].) We define $ch_{i}^{0}$ to be $O_{bc}$ .

A $*$ bc‑valid‑block is assumed to commit to a collection (usually, a sequence) of $*$ bc‑transactions. Unlike in Crosslink 1 or Snap-and-Chat, we do not need to explicitly model $*$ bc‑transaction validity or impose any additional constraints on it. The consensus rules applying to $*$ bc‑transactions are entirely unchanged, including any rules that depend on $*$ bc‑block height or previous $*$ bc‑blocks. This is because Crosslink 2 never reorders or selectively “sanitizes” transactions as Snap-and-Chat does. If a bc‑block is included in a Crosslink 2 block chain then its entire parent bc‑block chain is included just as it would have been in $Π_{origbc}$ (only modified by the structural additions described later), so block heights are also preserved.

A “coinbase transaction” is a $*$ bc‑transaction that only distributes newly issued funds and has no inputs.

Define $is-coinbase-only-block ⦂ * bc-block \to boolean$ so that $is-coinbase-only-block (B) = true$ iff $B$ has exactly one transaction that is a coinbase transaction.

Each $*$ bc‑block is summarized by a $*$ bc‑header that commits to the block. There is a notion of $*$ bc‑header‑validity that is necessary, but not sufficient, for validity of the block. We will only make the distinction between $*$ bc‑headers and $*$ bc‑blocks when it is necessary to avoid ambiguity.

Header validity for Proof‑of‑Work protocols.

In a Proof‑of‑Work protocol, it is normally possible to check the Proof‑of‑Work of a block using only the header. There is a difficulty adjustment function that determines the target difficulty for a block based on its parent chain. So, checking that the correct difficulty target has been used relies on knowing that the header’s parent chain is valid.

Checking header validity before expending further resources on a purported block can be relevant to mitigating denial‑of‑service attacks that attempt to inflate validation cost.

Typically, Bitcoin‑derived best chain protocols do not need much adaptation to fit into this model. The model still omits some details that would be important to implementing Crosslink 2, but distracting for this level of abstraction.

Safety of $Π_{* bc}$

We make an assumption on executions of $Π_{origbc}$ that we will call Prefix Consistency (introduced in [PSS2016, section 3.3] as just “consistency”):

Definition: Prefix Consistency

An execution of $Π_{* bc}$ has Prefix Consistency at confirmation depth $σ$ , iff for all times $t \leq u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have that $ch_{i}^{t} ⌈_{* bc}^{σ} ⪯_{* bc} ch_{j}^{u}$ .

Explain the confusion in the literature about what variants of this property are called.

The literature uses the same name, “common‑prefix property”, for two different properties of very different strength.

[PSS2016, section 3.3] introduced the stronger variant. That paper first describes the weaker variant, calling it the “common‑prefix property by Garay et al [GKL2015].” Then it explains what is essentially a bug in that variant, and describes the stronger variant which it just calls “consistency”:

The common‑prefix property by Garay et al [GKL2015], which was already considered and studied by Nakamoto [Nakamoto2008], requires that in any round $r$ , the record chains of any two honest players $i$ , $j$ agree on all, but potentially the last $T$ , records. We note that this property (even in combination with the other two desiderata [of Chain Growth and Chain Quality]) provides quite weak guarantees: even if any two honest parties perfectly agree on the chains, the chain could be completely different on, say, even rounds and odd rounds. We here consider a stronger notion of consistency which additionally stipulates players should be consistent with their “future selves”.

Let $consistent^{T} (view) = 1$ iff for all rounds $r \leq r^{'}$ , and all players $i$ , $j$ (potentially the same) such that $i$ is honest at $view^{r}$ and $j$ is honest at $view^{r^{'}}$ , we have that the prefixes of $C_{i}^{r} (view)$ and $C_{j}^{r^{'}} (view)$ consisting of the first $ℓ = ∣ C_{i}^{r} (view) ∣ - T$ records are identical.

Unfortunately, [GKL2020], which is a revised version of [GKL2015], switches to the stronger variant without changing the name.

(The eprint version history may be useful; the change was made in version 20181013:200033, page 17.)

Note that [GKL2020] uses an adaptive‑corruption model, “meaning that the adversary is allowed to take control of parties on the fly”, and so their wording in Definition 3:

... for any pair of honest players $P_{1}$ , $P_{2}$ adopting the chains $C_{1}$ , $C_{2}$ at rounds $r_{1} \leq r_{2}$ in view $_{Π, A, Z}^{t, n}$ respectively, it holds that $C_{1}^{⌈ k} ⪯ C_{2}$ .

is intended to mean the same as our

... for all times $t \leq u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have that $ch_{i}^{t} ⌈_{* bc}^{σ} ⪯_{* bc} ch_{j}^{u}$ .

The latter is closer to [PSS2016].

Incidentally, this property does not seem to be mentioned in [Nakamoto2008], contrary to the [PSS2016] authors’ assertion. Maybe implicitly, but it’s a stretch.

Discussion of [GKL2020]’s communication model and network partition.

When Prefix Consistency is taken to hold of typical PoW-based block‑chain protocols like Bitcoin (as it often is), this implies that, in the relevant executions, the network of honest nodes is never partitioned — unless any partition lasts only for a short length of time relative to $σ$ block times. If node $i$ is on one side of a full partition and node $j$ on the other, then after node $i$ ’s best chain has been extended by more than $σ$ blocks, $ch_{i}^{t} ⌈_{* bc}^{σ}$ will contain information that has no way to get to node $j$ . And even if the partition is incomplete, we cannot guarantee that the Prefix Consistency property will hold for any given pair of nodes.

It might be possible to maintain Prefix Consistency if the honest nodes on one side of the partition knew that they should not continue building on their chain until the partition has healed, but it is unclear how that would be done in general without resorting to a BFT protocol (as opposed to in specific cases like a single node being unable to connect to the rest of the network). Certainly there is no mechanism to explicitly detect and respond to partitions in protocols derived from Bitcoin.

And yet, [GKL2020] claims to prove Prefix Consistency from other assumptions. So we know that those assumptions must also rule out a long partition between honest nodes. In fact the required assumption is implicit in the communication model:

A synchronous network cannot be partitioned.
A partially synchronous network —that is, providing reliable delivery with bounded but unknown delay— cannot be partitioned for longer than the delay.

We might be concerned that these implicit assumptions are stronger than we would like. In practice, the peer‑to‑peer network protocol of Bitcoin and Zcash attempts to flood blocks to all nodes. This protocol might have weaknesses, but it is not intended to (and plausibly does not) depend on all messages being received. (Incidentally, Streamlet also implicitly floods messages to all nodes.)

Also, Streamlet and many other BFT protocols do not assume for safety that the network is not partitioned. That is, BFT protocols can be safe in a fully asynchronous communication model with unreliable messaging. That is why we avoid taking synchrony or partial synchrony as an implicit assumption of the communication model, or else we could end up with a protocol with weaker safety properties than $Π_{origbft}$ alone.

This leaves the question of whether the Prefix Consistency property is still too strong, even if we do not rely on it for the analysis of safety when $Π_{bft}$ has not been subverted. In particular, if a particular node $h$ is not well-connected to the rest of the network, then that will inevitably affect node $h$ ’s security, but should not affect other honest nodes’ security.

Fortunately, it is not the case that disconnecting a single node $h$ from the network causes the security assumption to be voided. The solution is to view $h$ as not honest in that case (even though it would follow the protocol if it could). This achieves the desired effect within the model, because other nodes can no longer rely on $h$ ’s honest input. Although viewing $h$ as potentially adversarial might seem conservative from the point of view of other nodes, bear in mind that an adversary could censor an arbitrary subset of incoming and outgoing messages from the node, and this may be best modelled by considering it to be effectively controlled by the adversary.

Prefix Consistency compares the $σ$ -truncated chain of some node $i$ with the untruncated chain of node $j$ . For our analysis of safety of the derived ledgers, we will also need to make an assumption on executions of $Π_{origbc}$ that at any given time $t$ , any two honest nodes $i$ and $j$ agree on their confirmed prefixes — with only the caveat that one may have observed more of the chain than the other. That is:

Definition: Prefix Agreement

An execution of $Π_{* bc}$ has Prefix Agreement at confirmation depth $σ$ iff it has Agreement on the view $(i, t) \mapsto ch_{i}^{t} ⌈_{* bc}^{σ}$ .

Why are this property, and Prefix Consistency above, stated as unconditional properties of protocol executions, rather than as probabilistic assumptions?

Our security arguments that depend on these properties will all be of the form “in an execution where ⟨safety properties⟩ are not violated, ⟨undesirable thing⟩ cannot happen”.

It is not necessary to involve probability in arguments of this form. Any probabilistic reasoning can be done separately.

In particular, if a statement of this form holds, and ⟨safety properties⟩ are violated with probability at most $p$ under certain conditions, then it immediately follows that under those conditions ⟨undesirable thing⟩ happens with probability at most $p$ . Furthermore, ⟨undesirable thing⟩ can only happen after ⟨safety properties⟩ have been violated, because the execution up to that point has been an execution in which ⟨safety properties⟩ are not violated.

With few exceptions, involving probability in a security argument is best done only to account for nondeterministic choices in the protocol itself. This is opinionated advice, but a lot of security proofs would likely be simpler if inherently probabilistic arguments were more distinctly separated from unconditional ones.

In the case of the Prefix Agreement property, an alternative approach would be to prove that Prefix Agreement holds with some probability given Prefix Consistency and some other chain properties. This is what [NTT2020] does in its Theorem 2, which essentially says that under certain conditions Prefix Agreement holds except with probability $e^{- Ω (σ)}$ .

The conclusions that can be obtained from this approach are necessarily probabilistic, and depending on the techniques used, the proof may not be tight; that is, the proof may obtain a bound on the probability of failure that is (either asymptotically or concretely) higher than needed. This is the case for [NTT2020, Theorem 2]; footnote 10 in that paper points out that the expression for the probability can be asymptotically improved:

Using the recursive bootstrapping argument developed in [DKT+2020, Section 4.2], it is possible to bring the error probability $e^{- Ω (σ)}$ as close to an exponential decay as possible. In this context, for any $ϵ > 0$ , it is possible to find constants $A_{ϵ}$ , $a_{ϵ}$ such that $Π_{lc} (p)$ is secure after C $(max (GST, GAT))$ with confirmation time $T_{lc} = σ$ except with probability $A_{ϵ} e^{- a_{ϵ} σ^{1 - ϵ}}$ .

(Here $p$ is the probability that any given node gets to produce a block in any given time slot.)

In fact none of the proofs of security properties for Snap‑and‑Chat depend on the particular expression $e^{- Ω (σ)}$ ; for example in the proofs of Lemma 5 and Theorem 1, this probability just “passes through” the proof from the premisses to the conclusion, because the argument is not probabilistic. The same will be true of our safety arguments.

Talking about what is possible in particular executions has further advantages:

It sidesteps the issue of how to interpret results in the GST model of partial synchrony, when we do not know what C $(max (GST, GAT))$ is. See also the critique of applying the GST model to block‑chain protocols under “Discussion of [GKL2020]’s communication model and network partition” above. (This is not an inherent problem with analyzing the protocol in the partially synchronous setting, but only with inappropriate use of the GST model of that setting.)
We do not require $Π_{bc}$ to be a Nakamoto‑style Proof‑of‑Work block chain protocol. Some other kind of protocol could potentially satisfy Prefix Consistency and Prefix Agreement.
It is not clear whether a $e^{- Ω (σ)}$ probability of failure would be concretely adequate. That would depend on the value of $σ$ and the constant hidden by the $Ω$ notation. The asymptotic property using $Ω$ tells us whether a sufficiently large $σ$ could be chosen, but we are more interested in what needs to be assumed for a given concrete choice of $σ$ .
If a violation of a required safety property occurs in a given execution, then the safety argument for Crosslink that depended on the property fails for that execution, regardless of what the probability of that occurrence was. This approach therefore more precisely models the consequences of such violations.

Why, intuitively, should we believe that Prefix Agreement and Prefix Consistency for a large enough confirmation depth hold with high probability for executions of a PoW‑based best‑chain protocol?

Roughly speaking, the intuition behind both properties is as follows:

Honest nodes are collectively able to find blocks faster than an adversary, and communication between honest nodes is sufficiently reliable that they act as a combined network racing against that adversary. Then by the argument in [Nakamoto2008], modified by [GP2020] to correct an error in the concrete analysis, a private mining attack that attempts to cause a $σ$ ‑block rollback will, with high probability, fail for large enough $σ$ . A private mining attack is optimal by the argument in [DKT+2020].

Any further analysis of the conditions under which these properties hold should be done in the context of a particular $Π_{* bc}$ .

Why is the quantification in Prefix Agreement over two different times t and t′?

This strengthens the security property, relative to quantifying over a single time. The question can then be split into several parts:

What does the strengthened property mean, intuitively? Consider the full tree of $*$ bc‑valid-blocks that honest nodes have considered to be part of their $*$ bc‑best-chain at any times during the execution. This property holds iff, when we strip off all branches of length up to and including $σ$ blocks, the resulting tree is $*$ bc‑linear.
Why is the strengthening needed? Suppose that time were split into periods such that honest nodes agreed on one chain in odd periods, and a completely different chain in even periods. This would obviously not satisfy the intent, but it would satisfy a version of the property that did not quantify over different times $t$ and $u$ .
Why should we expect the strengthened property to hold? If node $j$ were far ahead, i.e. $u ≫ t$ , then it is obvious that $ch_{i}^{t} ⌈_{* bc}^{σ} ⪯_{* bc} ch_{j}^{u} ⌈_{* bc}^{σ}$ should hold. Conversely, if node $i$ were far ahead then it is obvious that $ch_{j}^{u} ⌈_{* bc}^{σ} ⪯_{* bc} ch_{i}^{t} ⌈_{* bc}^{σ}$ should hold. The case where $t = u$ is the same as quantifying over a single time. By considering intermediate cases where $t$ and $u$ converge from the extremes or where they diverge from being equal, you should be able to convince yourself that the property holds for any relative values of $t$ and $u$ , in executions of a reasonable best‑chain protocol.

Definition of Crosslink 2

Parameters

Crosslink 2 is parameterized by a bc‑confirmation‑depth $σ \in N^{+}$ (as in Snap‑and‑Chat), and also a finalization gap bound $L \in N^{+}$ with $L$ significantly greater than $σ$ .

Each node $i$ always uses the fixed confirmation depth $σ$ to obtain its view of the finalized chain $fin_{i}^{t} ⦂ bc-valid-chain$ . Unlike in Snap‑and‑Chat or Crosslink 1, this is just a block chain; because we do not need sanitization, there is no need to express it as a log of transactions rather than blocks.

Each node $i$ chooses a potentially different bc‑confirmation‑depth $μ \in N$ where $0 < μ \leq σ$ to obtain its view of the bounded‑available ledger at time $t$ , $(ba_{μ})_{i}^{t} ⦂ bc-valid-chain$ . (We make the restriction $μ \leq σ$ because there is no reason to choose a larger $μ$ .)

Security caveat

Choosing $μ < σ$ is at the node’s own risk and may increase the risk of rollback attacks against $(ba_{μ})_{i}^{t}$ (it does not affect $fin_{i}^{t}$ ). Using small values of $μ$ is not recommended. The default should be $μ = σ$ .

Stalled Mode

Consider, roughly speaking, the number of bc‑blocks that are not yet finalized at time $t$ (a more precise definition will be given in the section on $Π_{bc}$ changes from $Π_{origbc}$ ). We call this the “finality gap” at time $t$ . Under an assumption about the distribution of bc‑block intervals, if this gap stays roughly constant then it corresponds to the approximate time that transactions take to be finalized after being included in a bc‑block (if they are finalized at all) just prior to time $t$ .

As explained in detail by The Arguments for Bounded Availability and Finality Overrides, if this bound exceeds a threshold $L$ , then it likely signals an exceptional or emergency condition, in which it is undesirable to keep accepting user transactions that spend funds into new bc‑blocks. In practice, $L$ should be at least $2 σ$ .

The condition that the network enters in such cases will be called “Stalled Mode”. For a given higher‑level transaction protocol, we can define a policy for which bc‑blocks will be accepted in Stalled Mode. This will be modelled by a predicate $is-stalled-block ⦂ bc-block \to boolean$ . A bc‑block for which $is-stalled-block$ returns $true$ is called a “stalled block”.

Caution

A bc‑block producer is only constrained to produce stalled blocks while, roughly speaking, its view of the finalization point is not advancing. In particular an adversary that has subverted the BFT protocol in a way that does not keep the finalization point from advancing, can always avoid being constrained by Stalled Mode.

The desired properties of stalled blocks and a possible Stalled Mode policy for Zcash are discussed in the How to block hazards section of The Arguments for Bounded Availability and Finality Overrides.

In practice a node's view of the finalized chain, $fin_{i}^{t}$ , is likely to lag only a few blocks behind $ba_{i, σ}^{t}$ (depending on the latency overhead imposed by $Π_{bft}$ ), unless the chain has entered Stalled Mode. So when $μ = σ$ , the main factor influencing the choice of a given application to use $fin_{i}^{t}$ or $(ba_{μ})_{i}^{t}$ is not the average latency, but rather the desired behaviour in the case of a finalization stall: i.e. stall immediately, or keep processing user transactions until $L$ blocks have passed.

Structural additions

Each bc‑header has, in addition to origbc‑header fields, a $context_bft$ field that commits to a bft‑block.
Each bft‑proposal has, in addition to origbft‑proposal fields, a $headers_bc$ field containing a sequence of exactly $σ$ bc‑headers (zero‑indexed, deepest first).
Each non‑genesis bft‑block has, in addition to origbft‑block fields, a $headers_bc$ field containing a sequence of exactly $σ$ bc‑headers (zero-indexed, deepest first). The genesis bft‑block has $headers_bc = \emptyset$ .

For a bft‑block or bft‑proposal $B$ , define $LF (H) snapshot (B) := {O_{bc}, B . headers_bc [0] ⌈_{bc}^{1}, if B . headers_bc = \emptyset otherwise.$ For a bc‑block $H$ , define $snapshot (B) LF (H) candidate (H) := bft-last-final (H . context_bft) := last-common-ancestor (snapshot (LF (H)), H ⌈_{bc}^{σ})$

When $H$ is the tip of a node’s bc‑best‑chain, $candidate (H)$ will give the candidate finalization point, subject to a condition described below that prevents local rollbacks.

Use of the headers_bc field, and its relation to the ch field in Snap‑and‑Chat.

For a bft‑proposal or bft‑block $B$ , the role of the bc‑chain snapshot referenced by $B . headers_bc [0] ⌈_{bc}^{1}$ is comparable to the $Π_{lc}$ snapshot referenced by $B . ch$ in the Snap‑and‑Chat construction from [NTT2020]. The motivation for the additional headers is to demonstrate, to any party that sees a bft‑proposal (resp. bft‑block), that the snapshot had been confirmed when the proposal (resp. the block’s proposal) was made.

Typically, a node that is validating an honest bft‑proposal or bft‑block will have seen at least the snapshotted bc‑block (and possibly some of the subsequent bc‑blocks in the $headers_bc$ chain) before. For this not to be the case, the validator’s bc‑best‑chain would have to be more than $σ$ bc‑blocks behind the honest proposer’s bc‑best‑chain at a given time, which would violate the Prefix Consistency property of $Π_{bc}$ .

If the headers do not connect to any bc‑valid‑chain known to the validator, then the validator should be suspicious that the proposer might not be honest. It can assign a lower priority to validating the proposal in this case, or simply drop it. The latter option could drop a valid proposal, but this does not in practice cause a problem as long as a sufficient number of validators are properly synced (so that Prefix Consistency holds for them).

If the headers do connect to a known bc‑valid‑chain, it could still be the case that the whole header chain up to and including $B . headers_bc [σ - 1]$ is not a bc‑valid‑chain. Therefore, to limit denial‑of‑service attacks the validator should first check the Proofs‑of‑Work and difficulty adjustment —which it can do locally using only the headers— before attempting to download and validate any bc‑blocks that it has not already seen. This is why we include the full headers rather than just the block hashes. Nodes may “trim” (i.e. not explicitly store) headers in a bft‑block that overlap with those referred to by its ancestor bft‑block(s).

Why is a distinguished value needed for the headers_bc field in the genesis bft‑block?

It would be conceptually nice for $O_{bft} . headers [0] ⌈_{bc}^{1}$ to refer to $O_{bc}$ , as well as $O_{bc} . context_bft$ being $O_{bft}$ so that $bft-last-final (O_{bc} . context_bft) = O_{bft}$ . That reflects the fact that we know “from the start” that neither genesis block can be rolled back.

This is not literally implementable using block hashes because it would involve a hash cycle, but we achieve the same effect by defining a $snapshot$ function that allows us to “patch” $snapshot (O_{bft})$ to be $O_{bc}$ . We do it this way around rather than “patching” the link from a bc‑block to a bft‑block, because the genesis bft‑block already needs a special case since there are not $σ$ bc‑headers available.

Why is the context_bft field needed? Why not use a final_bft field to refer directly to the last final bft‑block before the context block?

The finality of some bft‑block is only defined in the context of another bft‑block. One possible design would be for a bc‑block to have both $final_bft$ and $context_bft$ fields, so that the finality of $final_bft$ could be checked objectively in the context of $context_bft$ .

However, specifying just the context block is sufficient information to determine its last final ancestor. There would never be any need to give a context block and a final ancestor that is not the last one. The $bft-last-final$ function can be computed efficiently for typical BFT protocols. Therefore, having just the $context_bft$ field is sufficient.

Locally finalized chain

Each node $i$ keeps track of a “locally finalized” bc‑chain $fin_{i}^{t}$ at time $t$ . Each node’s locally finalized bc‑chain starts at $fin_{i}^{0} = O_{bc}$ . However, this chain state should not be exposed to clients of the node until it has synced.

Definition: Local finalization linearity

Node $i$ has Local finalization linearity up to time $t$ iff the time series of bc‑blocks $fin_{i}^{r \leq t}$ is bc‑linear.

When node $i$ ’s bc‑best‑chain view is updated from $ch_{i}^{s}$ to $ch_{i}^{t}$ , the node’s $fin_{i}^{t}$ will become $candidate (ch_{i}^{t})$ if and only if this is a descendant of $fin_{i}^{s}$ . Otherwise $fin_{i}^{t}$ will stay at $fin_{i}^{s}$ . This guarantees Local finalization linearity by construction.

If when making this update, $candidate (ch_{i}^{t}) / ⪯ ⪰ fin_{i}^{s}$ (i.e. $candidate (ch_{i}^{t})$ and $fin_{i}^{s}$ are on different forks), then the node should record a finalization safety hazard. This can only happen if global safety assumptions are violated. Note that Local finalization linearity on each node is not sufficient for Assured Finality, but it is necessary.

This can be expressed by the following state update algorithm, where $s$ is the time of the last update and $t$ is the time of the current update:

$update-fin (mut fin_{i}, ch_{i}^{t}, s) : let N = candidate (ch_{i}^{t}) if N ⪰ fin_{i}^{s} : fin_{i}^{t} \leftarrow N else: fin_{i}^{t} \leftarrow fin_{i}^{s} if N \neq ⪯ fin_{i}^{s} : record finalization safety hazard$

A safety hazard record should include $ch_{i}^{t}$ and the history of $fin_{i}$ updates including and since the last one that was an ancestor of $N$ .

Lemma: Local fin‑depth

In any execution of Crosslink 2, for any node $i$ that is honest at time $t$ , there exists a time $r \leq t$ such that $fin_{i}^{t} ⪯ ch_{i}^{r} ⌈_{bc}^{σ}$ .

Proof: By the definition of $candidate$ we have $candidate (ch_{i}^{u}) ⪯ ch_{i}^{u} ⌈_{bc}^{σ}$ for all times $u$ . Let $r \leq t$ be the last time at which $fin_{i}^{t}$ changed, or the genesis time $0$ if it has never changed. Then for $r > 0$ we have $fin_{i}^{t} = fin_{i}^{r} = candidate (ch_{i}^{r}) ⪯ ch_{i}^{r} ⌈_{bc}^{σ}$ , and for $r = 0$ we have $fin_{i}^{t} = fin_{i}^{r} = O_{bc} ⪯ ch_{i}^{r} ⌈_{bc}^{σ}$ (because $ch_{i}^{0} = O_{bc}$ , and truncating $O_{bc}$ always yields $O_{bc}$ ).

Why does fin_i need to be maintained using local state?

When a node’s view of the bc‑best‑chain reorgs to a different fork (even if the reorg is shorter than $σ$ blocks), it may be the case that $candidate (ch_{i}^{t})$ rolls back. If Final Agreement of $Π_{bft}$ holds up to time $t$ , the new snapshot should in that case be an ancestor of the old one. If all is well then this snapshot will subsequently roll forward along the same path. However, we do not want applications using the node to see the temporary rollback.

Assured Finality is our main safety goal for Crosslink 2. It is essentially the same goal as Final Agreement but applied to nodes’ locally finalized bc‑chains; intuitively it means that honest nodes never see conflicting locally finalized chains. We intend to prove that this goal holds under reasonable assumptions about either $Π_{origbft}$ or $Π_{origbc}$ .

Definition: Assured Finality

An execution of Crosslink 2 has Assured Finality iff for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have $fin_{i}^{t} ⪯ ⪰_{bc} fin_{j}^{u}$ .

Note that if an execution of Crosslink 2 has Assured Finality, then all nodes that are honest for that execution have Local finalization linearity. That is because the restriction of Assured Finality to the case $i = j$ is equivalent to Local finalization linearity for node $i$ up to any time at which node $i$ is honest.

Why do we need to use candidate(H) rather than snapshot(LF(H))?

This ensures that the candidate is at least $σ$ ‑confirmed.

In practice $candidate (H)$ will rarely differ from $snapshot (LF (H))$ , but using the former patches over a potential gap in the safety proof. The Last Final Snapshot rule specified later will guarantee that $snapshot (LF (H)) ⪯_{bc} H$ , and this ensures that $snapshot (LF (H)) ⪯ ⪰_{bc} H ⌈_{bc}^{σ}$ . However, the depth of $snapshot (LF (H))$ relative to $H$ is not guaranteed to be $\geq σ$ . For the proof we will need $candidate (H) ⪯_{bc} H ⌈_{bc}^{σ}$ , so that we can use the Local fin‑depth lemma together with Prefix Agreement of $Π_{bc}$ at confirmation depth $σ$ to prove Assured Finality.

An alternative would be to change the Last Final Snapshot rule to directly require $snapshot (LF (H)) ⪯_{bc} H ⌈_{bc}^{σ}$ .

$TODO:$ Choose between these options based on what works well for the security proofs and finalization latency.

Locally bounded‑available chain

Define the locally bounded‑available chain on node $i$ for bc‑confirmation‑depth $μ$ , as $(ba_{μ})_{i}^{t} = {ch_{i}^{t} ⌈_{bc}^{μ}, fin_{i}^{t}, if fin_{i}^{t} ⪯ ch_{i}^{t} ⌈_{bc}^{μ} otherwise.$

Like the locally finalized bc‑chain, this chain state should not be exposed to clients of the node until it has synced.

Theorem: Ledger prefix property

For any node $i$ that is honest at time $t$ , and any confirmation depth $μ$ , $fin_{i}^{t} ⪯ (ba_{μ})_{i}^{t}$ .

Proof: By construction of $(ba_{μ})_{i}^{t}$ .

Lemma: Local ba‑depth

In any execution of Crosslink 2, for any confirmation depth $μ \leq σ$ and any node $i$ that is honest at time $t$ , there exists a time $r \leq t$ such that $(ba_{μ})_{i}^{t} ⪯_{bc} ch_{i}^{r} ⌈_{bc}^{μ}$ .

Proof: Either $(ba_{μ})_{i}^{t} = fin_{i}^{t}$ , in which case the result follows by the Local fin‑depth lemma since $μ \leq σ$ , or $(ba_{μ})_{i}^{t} = ch_{i}^{t} ⌈_{bc}^{μ}$ in which case it follows trivially with $r = t$ .

Our security goal for $ba_{μ}$ will be Agreement on $ba_{μ}$ as already defined.

Why is the ‘otherwise’ case in the definition of (ba_μ)_i^t necessary?

Assume for this discussion that $Π_{bc}$ uses PoW.

Depending on the value of $σ$ , the timestamps of bc‑blocks, and the difficulty adjustment rule, it can be the case that if $fin_{i}$ switches to a different fork, the difficulty on that fork is greater than on the chain of the previous snapshot. Then, the new bc‑chain could reach a higher score than the previous chain in fewer than $σ$ blocks from the fork point, and so $ch_{i}^{t} ⌈_{bc}^{μ}$ might not be a descendant of $fin_{i}^{t}$ (which is more likely if $μ = σ$ ). This can occur even when all safety assumptions are satisfied.

For Zcash’s difficulty adjustment algorithm, the difficulty of each block is adjusted based on the median timestamps and difficulty target thresholds over a range of $PoWAveragingWindow = 17$ blocks, where each median is taken over $PoWMedianBlockSpan = 11$ blocks. Other damping factors and clamps are applied in order to prevent instability and to reduce the influence that adversarially chosen timestamps can have on difficulty adjustment. This makes it unlikely that an adversary could gain a significant advantage by manipulating the difficulty adjustment. So it is safe to use $fin_{i}^{t}$ in this case: even though it does not have $μ$ confirmations relative to $ch_{i}^{t}$ , it does have at least the required amount of work “on top” of it.

Defining $(ba_{μ})_{i}^{t}$ this way also has the advantage of making the proof of the Ledger prefix property trivial.

Syncing and checkpoints

It is recommended that node implementations “bake in” a checkpointed bft‑block $B_{checkpoint}$ to each released version, and that node $i$ should only expose $fin_{i}^{t}$ and $(ba_{μ})_{i}^{t}$ to its clients once it is “synced”, that is:

$B_{checkpoint} ⪯_{bft} LF (ch_{i}^{t})$ ; and
$snapshot (B_{checkpoint}) ⪯_{bc} fin_{i}^{t}$ ; and
the timestamp of $fin_{i}^{t}$ is within some threshold of the current time.

Π_bft changes from Π_origbft

Π_bft proposal and block validity

Genesis bft‑block rule: $O_{bft}$ is bft‑block‑valid.

A bft‑proposal (resp. non‑genesis bft‑block) $B$ is bft‑proposal‑valid (resp. bft‑block‑valid) iff all of the following hold:

Inherited origbft rules: The corresponding origbft‑proposal‑validity (resp. origbft‑block‑validity) rules hold for $B$ .
Linearity rule: $snapshot (B ⌈_{bft}^{1}) ⪯_{bc} snapshot (B)$ .
Tail Confirmation rule: $B . headers_bc$ form the $σ$ ‑block tail of a bc‑valid‑chain.

The “corresponding validity rules” are assumed to include the Parent rule that $B$ ’s parent is bft‑valid.

Note: origbft‑block‑validity rules may be different to origbft‑proposal‑validity rules. For example, in adapted Streamlet, a origbft‑block needs evidence that it was voted for by a supermajority, and an origbft‑proposal doesn’t. Such differences also apply to bft‑block‑validity vs bft‑proposal‑validity.

Why have validity rules been separated from the honest voting condition below?

The reason to separate the validity rules from the honest voting condition, is that the validity rules are objective: they don’t depend on an observer’s view of the bc‑best‑chain. Therefore, they can be checked independently of validator signatures. Even a proposal voted for by 100% of validators will not be considered bft‑proposal‑valid by other nodes unless it satisfies the above rules. If more than two thirds of voting units are cast for an invalid proposal, something is seriously and visibly wrong; in any case, the block will not be accepted as a bft‑valid‑block. Importantly, a purportedly valid bft‑block will not be recognized as such by any honest Crosslink 2 node even if it includes a valid notarization proof, if it does not meet other bft‑block‑validity rules.

This is essential to making the finalized chain safe against a flaw in $Π_{bft}$ or its security assumptions (even, say, a complete break of the validator signature algorithm), as long as $Π_{bc}$ remains safe.

What does the Linearity rule do?

This rule is key to combining simplicity with strong security properties in Crosslink 2. It essentially says that, in a given bft‑valid‑chain, the snapshots pointed to by blocks in that chain cannot roll back.

This allows the informal safety argument for Crosslink 2 to be rather intuitive.

Informally, if $Π_{bft}$ has Final Agreement, then all nodes see only one consistent bft‑linear chain (restricting to bft‑blocks that are final in the context of some bft‑block in honest view). Within such a bft‑chain, the Linearity rule ensures by construction that the sequence of referenced bc‑chain snapshots is bc‑linear. This implies Assured Finality, without needing to assume any safety property of $Π_{bc}$ .

We will also be able to prove safety of the finalized snapshots based only on safety of $Π_{bc}$ (for a confirmation depth of $σ$ ), without needing to assume any safety property of $Π_{bft}$ . Informally, that is because each node sees each candidate final snapshot at a given time as a $σ$ -confirmed prefix of its bc‑best‑chain at that time (this can be proven based on the Last Final Snapshot rule and the fact that a snapshot includes $σ$ subsequent headers), and Prefix Agreement implies that honest nodes agree on this prefix. We will leave a more detailed argument until after we have presented the $Π_{bc}$ changes from $Π_{origbc}$ .

The Linearity rule replaces the “Increasing Score rule” used in Crosslink 1. The Increasing Score rule required that each snapshot in a bft‑valid‑chain either be the same snapshot, or a higher-scoring snapshot to that of its parent block. Since scores strictly increase within a bc‑valid‑chain, the Linearity rule implies the Increasing Score rule. It retains the same or stronger positive effects:

It prevents potential attacks that rely on proposing a bc‑valid‑chain that forks from a much earlier block. This is necessary because the difficulty (or stake threshold) at that point could have been much lower.
It limits the extent of disruption an adversary can feasibly cause to the bounded‑available chain $ch_{i}$ , even if it has subverted $Π_{bft}$ . Informally, because the finalized chain is a $Π_{bc}$ chain, its safety is no worse than $Π_{bc}$ alone for a rollback of any depth.
It ensures that either progress is made (the snapshot advances relative to that of the parent bft‑block), or there is no further validation that needs to be done for the snapshot because it was already validated.

Note that the adversary could take advantage of an “accidental” fork and start its attack from the base of that fork, so that not all of this work is done by it alone. This is also possible in the case of a standard “private mining” attack, and is not so much of a problem in practice because accidental forks are expected to be short. In any case, $σ$ should be chosen to take it into account.

The Linearity rule is also critical to removing the need for one of the most complex elements of Snap‑and‑Chat and Crosslink 1, “sanitization”. In those protocols, because bc‑chain snapshots could be unrelated to each other, it was necessary to sanitize the chain formed from these snapshots to remove transactions that were contextually invalid (e.g. because they double‑spend). The negative consequences of this are described in Notes on Snap‑and‑Chat; avoiding it is much simpler.

The linearity property is intentionally always relative to the snapshot of the parent bft‑block, even if it is not final in the context of the current bft‑block. This is because the rule needs to hold if and when it becomes final in the context of some descendant bft‑block.

$TODO:$ PoS Desideratum: we want leader selection with good security / performance properties that will be relevant to this rule. (Suggested: PoSAT.)

Why does the Linearity rule allow keeping the same snapshot as the parent?

This is necessary in order to preserve liveness of $Π_{bft}$ relative to $Π_{origbft}$ . Liveness of $Π_{origbft}$ might require honest proposers to make proposals at a minimum rate. That requirement could be consistently violated if it were not always possible to make a valid proposal. But given that it is allowed to repeat the same snapshot as in the parent bft‑block, neither the Linearity rule nor the Tail Confirmation rule can prevent making a valid proposal — and all other rules of $Π_{bft}$ affecting the ability to make valid proposals are the same as in $Π_{origbft}$ . (In principle, changes to voting in $Π_{bft}$ could also affect its liveness; we’ll discuss that in the liveness proof later.)

For example, Streamlet requires three notarized blocks in consecutive epochs in order to finalize a block [CS2020, section 1.1]. Its proof of liveness depends on the assumption that in each epoch for which the leader is honest, that leader will make a proposal, and that during a “period of synchrony” this proposal will be received by every node [CS2020, section 3.6]. This argument can also be extended to adapted‑Streamlet.

We could alternatively have allowed to always make a “null” proposal, rather than to always make a proposal with the same snapshot as the parent. We prefer the latter because the former would require specifying the rules for null proposals in $Π_{origbft}$ .

As a clarification, no BFT protocol that uses leader election can require a proposal in each epoch, because the leader might be dishonest. The above issue concerns liveness of the protocol when assumptions about the attacker’s share of bft‑validators or stake are met, so that it can be assumed that sufficiently long periods with enough honest leaders to make progress (5 consecutive epochs in the case of Streamlet), will occur with significant probability.

Π_bft block finality in context

The finality rule for bft‑blocks in a given context is unchanged from origbft‑finality. That is, $bft-last-final$ is defined in the same way as $origbft-last-final$ (modulo referring to bft‑block‑validity and $O_{bft}$ ).

Π_bft honest proposal

An honest proposer of a bft‑proposal $P$ chooses $P . headers_bc$ as the $σ$ ‑block tail of its bc‑best‑chain, provided that it is consistent with the Linearity rule. If it would not be consistent with that rule, it sets $P . headers_bc$ to the same $headers_bc$ field as $P$ ’s parent bft‑block. It does not make proposals until its bc‑best‑chain is at least $σ + 1$ blocks long.

Why σ + 1?

If the length were less than $σ + 1$ blocks, it would be impossible to construct the $headers_bc$ field of the proposal.

Note that when the length of the proposer’s bc‑best‑chain is exactly $σ + 1$ blocks, the snapshot must be of $O_{bc} .$ But this does not violate the Linearity rule, because $O_{bc}$ matches the previous snapshot by $O_{bft}$ .

How is it possible that the Linearity rule would not be satisfied by choosing headers from an honest proposer’s bc‑best‑chain?

As in the answer to Why is the ‘otherwise’ case in the definition of $(ba_{μ})_{i}^{t}$ necessary? above, after a reorg on the bc‑chain, the $σ$ -confirmed block on the new chain might not be a descendant of the $σ$ -confirmed block on the old chain, which could break the Linearity rule.

Π_bft honest voting

An honest validator considering a proposal $P$ , first updates its view of both subprotocols with the bc‑headers given in $P . headers_bc$ , downloading bc‑blocks for these headers and checking their bc‑block‑validity.

For each downloaded bc‑block, the bft‑chain referenced by its $context_bft$ field might need to be validated if it has not been seen before.

Wait what, how much validation is that?

In general the entire referenced bft‑chain needs to be validated, not just the referenced block — and for each bft‑block, the bc‑chain in $headers_bc$ needs to be validated, and so on recursively. If this sounds overwhelming, note that:

We should check the requirement that a bft‑valid‑block must have been voted for by a two‑thirds absolute supermajority of validators, and any other non‑recursive bft‑validity rules, first.
Before validating a bc‑chain referenced by a $headers_bc$ field, we check that it connects to an already-validated bc‑chain and that the Proofs‑of‑Work are valid. This implies that the amount of bc‑block validation is constrained by how fast the network can find valid Proofs‑of‑Work.
The Linearity rule reduces the worst‑case validation effort, by ensuring that only one bc‑chain needs to be validated for any bft‑chain. Assuming safety of $Π_{bft}$ and that the adversary does not have an overwhelming advantage in computing the Proof‑of‑Work, this is effectively only one bc‑chain overall with, at most, short side branches.

In summary, the order of validation is important to avoid denial‑of‑service — but it already is in Bitcoin and Zcash.

After updating its view, the validator will vote for a proposal $P$ only if:

Valid proposal criterion: it is bft‑proposal‑valid, and
Confirmed best‑chain criterion: $snapshot (P)$ is part of the validator’s bc‑best‑chain at a bc‑confirmation‑depth of at least $σ$ .

Blocks in a bc‑best‑chain are by definition bc‑block‑valid. If we’re checking the Confirmed best‑chain criterion, why do we need to have separately checked that the blocks referenced by the headers are bc‑block‑valid?

The Confirmed best‑chain criterion is quite subtle. It ensures that $snapshot (P) = P . headers_bc [0] ⌈_{bc}^{1}$ is bc‑block‑valid and has $σ$ bc‑block‑valid blocks after it in the validator’s bc‑best‑chain. However, it need not be the case that $P . headers_bc [σ - 1]$ is part of the validator’s bc‑best‑chain after it updates its view. That is, the chain could fork after $snapshot (P)$ .

The bft‑proposal‑validity rule must be objective; it can’t depend on what the validator’s bc‑best‑chain is. The validator’s bc‑best‑chain may have been updated to $P . headers_bc [σ - 1]$ (if it has the highest score), but it also may not.

However, if the validator’s bc‑best‑chain was updated, that makes it more likely that it will be able to vote for the proposal.

In any case, if the validator does not check that all of the blocks referenced by the headers are bc‑block‑valid, then its vote may be invalid.

How does this compare to Snap‑and‑Chat?

Snap‑and‑Chat already had the voting condition:

An honest node only votes for a proposed BFT block $B$ if it views $B . ch$ as confirmed.

but it did not give the headers potentially needed to update the validator’s view, and it did not require a proposal to be for an objectively confirmed snapshot as a matter of validity.

If a Crosslink‑like protocol were to require an objectively confirmed snapshot but without including the bc‑headers in the proposal, then validators would not immediately know which bc‑blocks to download to check its validity. This would increase latency, and would be likely to lead proposers to be more conservative and only propose blocks that they think will already be in at least a two‑thirds absolute supermajority of validators’ best chains.

That is, showing $P . headers_bc$ to all of the validators is advantageous to the proposer, because the proposer does not have to guess what blocks the validators might have already seen. It is also advantageous for the protocol goals in general, because it improves the trade‑off between finalization latency and security.

Π_bc changes from Π_origbc

Π_bc block validity

Genesis bc‑block rule: For the genesis bc‑block we must have $O_{bc} . context_bft = O_{bft}$ , and therefore $bft-last-final (O_{bc} . context_bft) = O_{bft}$ .

A bc‑block $H$ is bc‑block‑valid iff all of the following hold:

Inherited origbc rules: $H$ satisfies the corresponding origbc‑block‑validity rules.
Valid context rule: $H . context_bft$ is bft‑block‑valid.
Extension rule: $LF (H ⌈_{bc}^{1}) ⪯_{bft} LF (H)$ .
Last Final Snapshot rule: $snapshot (LF (H)) ⪯_{bc} H$ .
Finality depth rule: Define: $finality-depth (H) := height (H) - height (snapshot (LF (H)))$ Then either $finality-depth (H) \leq L$ or $is-stalled-block (H)$ .

Explain the definition of finality‑depth.

The finality depth must be objectively defined, since it is used in a consensus rule. Therefore it should measure the height of $H$ relative to $snapshot (LF (H))$ , which is an objectively defined function of $H$ , rather than relative to $fin_{i}^{t}$ . (These will only differ for $H = ch_{i}^{t}$ when node $i$ has just reorged, and only then in corner cases.)

Note that the Last Final Snapshot rule ensures that it is meaningful to simply use the difference in heights, since $snapshot (LF (H)) ⪯_{bc} H$ .

Π_bc contextual validity

The consensus rule changes above are all non-contextual. Modulo these changes, contextual validity in $Π_{bc}$ is the same as in $Π_{origbc}$ .

Π_bc honest block production

An honest producer of a bc‑block $H$ must follow the consensus rules under $Π_{bc}$ block validity above. In particular, it must produce a stalled block if required to do so by the Finality depth rule.

To choose $H . context_bft$ , the producer considers a subset of the tips of bft‑valid‑chains in its view: ${T : T is bft‑block‑valid and LF (H ⌈_{bc}^{1}) ⪯_{bft} bft-last-final (T)}$ It chooses one of the longest of these chains, $C$ , breaking ties by maximizing $score (snapshot (bft-last-final (C)))$ , and if there is still a tie then by taking $C$ with the smallest hash.

The honest block producer then sets $H . context_bft$ to $C$ .

Attention

An honest bc‑block‑producer must not use information from the BFT protocol, other than the specified consensus rules, to decide which bc‑valid‑chain to follow. The specified consensus rules that depend on $Π_{bft}$ have been carefully constructed to preserve safety of $Π_{bc}$ relative to $Π_{origbc}$ . Imposing any additional constraints could potentially allow an adversary that is able to subvert $Π_{bft}$ , to influence the evolution of the bc‑best‑chain in ways that are not considered in the safety argument.

Why not choose T such that H ⌈¹_bc . context_bft ⪯_bft bft‑last‑final(T )?

The effect of this would be to tend to more often follow the last bft‑block seen by the producer of the parent bc‑block, if there is a choice. It is not always possible to do so, though: the resulting set of candidates for $C$ might be empty.

Also, it is not clear that giving the parent bc‑block‑producer the chance to “guide” what bft‑block should be chosen next is beneficial, since that producer might be adversarial and the resulting incentives are difficult to reason about.

Why choose the longest C, rather than the longest bft‑last‑final(C )?

We could have instead chosen $C$ to maximize the length of $bft-last-final (C)$ . The rule we chose follows Streamlet, which builds on the longest notarized chain, not the longest finalized chain. This may call for more analysis specific to the chosen BFT protocol.

Why this tie‑breaking rule?

Choosing the bft‑chain that has the last final snapshot with the highest score, tends to inhibit an adversary’s ability to finalize its own chain if it has a lesser score. (If it has a greater score, then it has already won a hash race and we cannot stop the adversary chain from being finalized.)

For discussion of potentially unifying the roles of bc‑block producer and bft‑proposer, see What about making the bc‑block‑producer the bft‑proposer? in Potential changes to Crosslink.

At this point we have completed the definition of Crosslink 2. In Security Analysis of Crosslink 2, we will prove it secure.

Security Analysis of Crosslink 2

This document analyzes the security of Crosslink 2 in terms of liveness and safety. It assumes that you have read The Crosslink 2 Construction which defines the protocol.

Liveness argument

First note that Crosslink 2 intentionally sacrifices availability if there is a long finalization stall.

Info

This is technically independent of the other changes; you can omit the Finality depth rule and the protocol would still have security advantages over Snap‑and‑Chat, as well as being much simpler and solving its “spending from finalized outputs” issue. In that case the incentives to “pull” the finalization point forward to include new final bft‑blocks would be weaker, but honest bc‑block‑producers would still do it.

It would still be a bug if there were any situation in which $Π_{bc}$ failed to be live, though, because that would allow tail‑thrashing attacks.

Crosslink 2 involves a bidirectional dependence between $Π_{bft}$ and $Π_{bc}$ . The Ebb‑and‑Flow paper [NTT2020] argues in Appendix E ("Bouncing Attack on Casper FFG") that it can be more difficult to reason about liveness given a bidirectional dependence between protocols:

To ensure consistency among the two tiers [of Casper FFG], the fork choice rule of the blockchain is modified to always respect ‘the justified checkpoint of the greatest height [*]’ [22]. There is thus a bidirectional interaction between the block proposal and the finalization layer: blocks proposed by the blockchain are input to finalization, while justified checkpoints constrain future block proposals. This bidirectional interaction is intricate to reason about and a gateway for liveness attacks.

Info

[*] The quotation changes this to “[depth]”, but our terminology is consistent with Ethereum here and not with [NTT2020]’s idiosyncratic use of “depth” to mean “block height”.

The argument is correct as far as it goes. The main reason why this does not present any great difficulty to proving liveness of Crosslink, is due to a fundamental difference from Casper FFG: in Crosslink 2 the fork‑choice rule of $Π_{bc}$ is not modified.

Let $S$ be the subset of bc‑blocks ${B : is-coinbase-only-block (B) and is-stalled-block (B)}$ . Assume that $S$ is such that a bc‑block‑producer can always produce a block in $S$ .

In that case it is straightforward to convince ourselves that the additional bc‑block‑validity rules are never an obstacle to producing a new bc‑block in $S$ :

The changes to the definition of contextual validity do not interfere with liveness, since they do not affect coinbase transactions, and therefore do not affect blocks in $S$ . That is, the bc‑block‑producer can always just omit non‑coinbase transactions that are contextually invalid.
The Genesis bc‑block rule doesn’t apply to new bc‑blocks.
The Valid context rule, Extension rule, and Last Final Snapshot rule are always satisfiable by referencing the same context bft‑block as the parent bc‑block.
The Finality depth rule always allows the option of producing a stalled block, and therefore does not affect blocks in $S$ .

The instructions to an honest bc‑block‑producer allow it to produce a block in $S$ . Therefore, $Π_{bc}$ remains live under the same conditions as $Π_{origbc}$ .

The additional bft‑proposal‑validity, bft‑block‑validity, bft‑finality, and honest voting rules are also not an obstacle to making, voting for, or finalizing bft‑proposals:

Because $Π_{bc}$ is live, there will always be some point in time at which a fresh valid bc‑header chain that satisfies both the Linearity rule and the Tail confirmation rule exists for use in the $P . headers_bc$ field.
If no fresh valid bc‑header chain is available, the Linearity rule and Tail confirmation rule allow an honest bft‑proposer to choose $headers_bc$ to be the same as in the previous bft‑block. So, if liveness of $Π_{origbft}$ depends on an honest proposer always being able to make a proposal (as it does in adapted‑Streamlet for example), then this requirement will not be violated.
The changes to voting are only requiring a vote to be for a proposal that could have been honestly proposed.
The bft‑finality rules are unchanged from origbft‑finality.

Therefore, $Π_{bft}$ remains live under the same conditions as $Π_{origbft}$ .

The only other possibility for a liveness issue in Crosslink 2 would be if the change to the constructions of $fin_{i}$ or $(ba_{μ})_{i}$ could cause either of them to stall, even when $Π_{bft}$ and $Π_{bc}$ are both still live.

However, liveness of $Π_{bft}$ and the Linearity rule together imply that at each point in time, provided there are sufficient honest bft‑proposers/validators, eventually a new bft‑block with a higher-scoring snapshot will become final in the context of the longest bft‑valid‑chain. $TODO:$ make that more precise.

Because of the Extension rule, this new bft‑block must be a descendent of the previous final bft‑block in the context visible to bc‑block‑producers. Therefore, the new finalized chain will extend the old finalized chain.

Finally, we need to show that Stalled Mode is only triggered when it should be; that is, when the assumptions needed for liveness of $Π_{bft}$ are violated. Informally, that is the case because, as long as there are sufficient honest bc‑block‑producers and sufficient honest bft‑proposers/validators, the finalization point implied by the $context_bft$ field at the tip of the bc‑best chain in any node’s view will advance fast enough for the finalization gap bound $L$ not to be hit. This depends on the value of $L$ relative to $σ$ , the network delay, the hash rate of honest bc‑block‑producers, the number of honest bft‑proposers and the proportion of voting units they hold, and other details of the BFT protocol. $TODO:$ more detailed argument needed, especially for the dependence on $L$ .

Safety argument

$TODO:$ Not updated for Crosslink 2 below this point.

Discussion

Recall the definition of Assured Finality.

Definition: Assured Finality

First we prove that Assured Finality is implied by Prefix Agreement of $Π_{bc}$ .

Definition: Prefix Agreement

An execution of $Π_{* bc}$ has Prefix Agreement at confirmation depth $σ$ , iff for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have $ch_{i}^{t} ⌈_{* bc}^{σ} ⪯ ⪰_{* bc} ch_{j}^{u} ⌈_{* bc}^{σ}$ .

Safety Theorem: Prefix Agreement of Π_bc implies Assured Finality

In an execution of Crosslink 2 for which the $Π_{bc}$ subprotocol has Prefix Agreement at confirmation depth $σ$ , that execution has Assured Finality.

Proof: Suppose that we have times $t$ , $u$ and nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ . Then, by the Local fin-depth lemma applied to each of node $i$ and node $j$ , there exist times $r$ at which node $i$ is honest and $r^{'}$ at which node $j$ is honest, such that $fin_{i}^{t} ⪯_{bc} ch_{i}^{r} ⌈_{bc}^{σ}$ and $fin_{j}^{u} ⪯_{bc} ch_{j}^{r^{'}} ⌈_{bc}^{σ}$ . By Prefix Agreement at confirmation depth $σ$ , we have $ch_{i}^{r} ⌈_{bc}^{σ} ⪯ ⪰_{bc} ch_{j}^{r^{'}} ⌈_{bc}^{σ}$ . Wlog due to symmetry, suppose $ch_{i}^{r} ⌈_{bc}^{σ} ⪯_{bc} ch_{j}^{r^{'}} ⌈_{bc}^{σ}$ . Then $fin_{i}^{t} ⪯_{bc} ch_{j}^{r^{'}} ⌈_{bc}^{σ}$ (by transitivity of $⪯_{bc}$ ) and $fin_{j}^{u} ⪯_{bc} ch_{j}^{r^{'}} ⌈_{bc}^{σ}$ (as above), so $fin_{i}^{t} ⪯ ⪰_{bc} fin_{j}^{u}$ by the Linear prefix lemma.

Then we prove that Assured Finality is also implied by Final Agreement of $Π_{bft}$ .

Definition: Final Agreement

Safety Theorem: Final Agreement of Π_bft implies Assured Finality

In an execution of Crosslink 2 for which the $Π_{bft}$ subprotocol has Final Agreement, that execution has Assured Finality.

Safety Theorem: Prefix Consistency of Π_bc implies Prefix Consistency of ba

By the Local ba-depth lemma, we have:

In any execution of Crosslink 2, for any confirmation depth $μ \leq σ$ and any node $i$ that is honest at time $t$ , there exists a time $r \leq t$ such that $(ba_{μ})_{i}^{t} ⪯_{bc} ch_{i}^{r} ⌈_{bc}^{μ}$ .

Renaming $t$ to $r$ and $σ$ to $μ$ in the definition of Prefix Consistency gives:

An execution of $Π_{bc}$ has Prefix Consistency at confirmation depth $μ$ , iff for all times $r \leq u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $r$ and $j$ is honest at time $u$ , we have that $ch_{i}^{r} ⌈_{bc}^{μ} ⪯_{bc} ch_{j}^{u}$ .

Since any node $i$ that is honest at time $t$ is also honest at time $r \leq t$ , and by transitivity of $⪯_{bc}$ , we therefore have:

In any execution of Crosslink 2 that has Prefix Consistency at confirmation depth $μ$ , for all times $t \leq u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have that $(ba_{μ})_{i}^{t} ⪯_{bc} ch_{j}^{u}$ .

----

The Extension rule ensures that, informally, if a given node $i$ ’s view of its bc‑best‑chain at a depth of $σ$ blocks does not roll back, then neither does its view of the bft‑final block referenced by its bc‑best‑chain, and therefore neither does its view of $LOG_{fin, i}^{t}$ .

This does not by itself imply that all nodes are seeing the “same” confirmed bc‑best‑chain (up to propagation timing), or the same $LOG_{fin, i}^{t}$ . If the network is partitioned and $Π_{bft}$ is subverted, it could be that the nodes on each side of the partition follow a different fork, and the adversary arranges for each node’s view of the last final bft‑block to be consistent with the fork it is on. It can potentially do this if it has more than one third of validators, because if the validators are partitioned in the same way as other nodes, it can vote with an additional one third of them on each side of the fork.

This is, if you think about it, unavoidable. $Π_{bc}$ doesn’t include the mechanisms needed to maintain finality under partition; it takes a different position on the CAP trilemma. In order to maintain finality under partition, we need $Π_{bft}$ not to be subverted (and to actually work!)

So what is the strongest security property we can realistically get? It is stronger than what Snap‑and‑Chat provides. Snap‑and‑Chat is unsafe even without a partition if $Π_{bft}$ is subverted. Ideally we would have a protocol with safety that is only limited by attacks “like” the unavoidable attack described above — which also applies to $Π_{bc}$ used on its own.

Proof of safety for LOG_fin

In order to capture the intuition that it is hard in practice to cause a consistent partition of the kind described in the previous section, we will need to assume that the Prefix Agreement safety property holds for the relevant executions of $Π_{bc}$ . The structural and consensus modifications to $Π_{bc}$ relative to $Π_{origbc}$ seem unlikely to have any significant effect on this property, given that we proved above that they do not affect liveness. ==TODO: that is a handwave; we should be able to do better, as we do for $Π_{bft}$ below.== So, to the extent that it is reasonable to assume that Prefix Agreement holds for executions of $Π_{origbc}$ under some conditions, it should also be reasonable to assume it holds for executions of $Π_{bc}$ under the same conditions.

Recall that $LF (H) := bft-last-final (H . context_bft)$ .

Prefix Lemma

If $H_{1}$ , $H_{2}$ are bc‑valid blocks with $H_{1} ⪯_{bc} H_{2}$ , then $LF (H_{1}) ⪯_{bft} LF (H_{2})$ .

Proof: Using the Extension rule, by induction on the distance between $H_{1}$ and $H_{2}$ .

Using the Prefix Lemma once for each direction, we can transfer the Prefix Agreement property to the referenced bft‑blocks:

Prefix Agreement Lemma

In an execution of $Π_{bc}$ that has Prefix Agreement at confirmation depth $σ$ , for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , we have $LF (ch_{i}^{t} ⌈_{bc}^{σ}) ⪯ ⪰_{bft} LF (ch_{j}^{u} ⌈_{bc}^{σ})$ .

Let $chain-txns (ch)$ be the sequence of transactions in the given chain $ch$ , starting from genesis.

Recall that $san-ctx san-ctx (S) fin fin (H) fin-ctx (H) ba-ctx (H, μ) LOG_{fin, i}^{t} LOG_{ba, μ, i}^{t} ⦂ := ⦂ := := := := := [bc-chain] \to bc-context sanitize (concat ([chain-txns (C) for C in S])) bc-block \to [bc-chain] [snapshot (B) for B ⪯_{bft} LF (H ⌈_{bc}^{σ})] san-ctx (fin (H)) san-ctx (fin (H) ∣∣ [H ⌈_{bc}^{μ}]) context-txns (fin-ctx (ch_{i}^{t})) context-txns (ba-ctx (ch_{i}^{t}, μ))$

Because $fin$ takes the form $fin (H) := [f (X) for X ⪯_{bft} g (H)]$ , we have that $g (H) ⪯_{bft} g (H^{'}) ⟹ fin (H) ⪯ fin (H^{'})$ . (This would be true for any well‑typed $f$ and $g$ .)

By this observation and the Prefix Agreement Lemma, we also have that, in an execution of Crosslink 2 where $Π_{bc}$ has the Prefix Agreement safety property at confirmation depth $σ$ , for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , $fin (ch_{i}^{t}) ⪯ ⪰ fin (ch_{j}^{u})$ .

Because $sanitize$ only considers previous state, $context-txns$ ∘ $san-ctx$ must be a prefix-preserving map; that is, if $S_{1} ⪯ S_{2}$ then $context-txns (san-ctx (S_{1})) ⪯ context-txns (san-ctx (S_{2}))$ . Therefore:

Theorem: LOG_fin Safety (from Prefix Agreement of Π_bc)

In an execution of Crosslink 2 where $Π_{bc}$ has Prefix Agreement at confirmation depth $σ$ , for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , $LOG_{fin, i}^{t} ⪯ ⪰ LOG_{fin, j}^{u}$ .

Notice that this does not depend on any safety property of $Π_{bft}$ , and is an elementary proof. ([NTT2020, Theorem 2] is a much more complicated proof that takes nearly 3 pages, not counting the reliance on results from [PS2017].)

In addition, just as in Snap‑and‑Chat, safety of $LOG_{fin}$ can be inferred from safety of $Π_{bft}$ , which follows from safety of $Π_{origbft}$ . We prove this based on the Final Agreement property for executions of $Π_{origbft}$ :

Definition: Final Agreement

An execution of $Π_{origbft}$ has the Final Agreement safety property iff for all origbft‑valid blocks $C$ in honest view at time $t$ and $C^{'}$ in honest view at time $t^{'}$ , we have $origbft-last-final (C) ⪯ ⪰_{origbft} origbft-last-final (C^{'})$ .

The changes in $Π_{bft}$ relative to $Π_{origbft}$ only add structural components and tighten bft‑block‑validity and bft‑proposal‑validity rules. So for any legal execution of $Π_{bft}$ there is a corresponding legal execution of $Π_{origbft}$ , with the structural additions erased and with the same nodes honest at any given time. A safety property, by definition, only asserts that executions not satisfying the property do not occur. Safety properties of $Π_{origbft}$ necessarily do not refer to the added structural components in $Π_{bft}$ . Therefore, for any safety property of $Π_{origbft}$ , including Final Agreement, the corresponding safety property holds for $Π_{bft}$ .

By the definition of $fin$ as above, in an execution of Crosslink 2 where $Π_{bft}$ has Final Agreement, for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , $fin (ch_{i}^{t}) ⪯ ⪰ fin (ch_{j}^{u})$ . Therefore, by an argument similar to the one above using the fact that $context-txns$ ∘ $san-ctx$ is a prefix-preserving map:

Theorem: LOG_fin Safety (from Final Agreement of Π_bft or Π_origbft)

In an execution of Crosslink 2 where $Π_{bft}$ has Final Agreement, for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , $LOG_{fin, i}^{t} ⪯ ⪰ LOG_{fin, j}^{u}$ .

Proof of safety for LOG_ba

From the two $LOG_{fin}$ Safety theorems and the Ledger prefix property, we immediately have:

Theorem: LOG_ba does not roll back past the agreed LOG_fin

Let $μ_{i}$ be an arbitrary choice of $LOG_{ba}$ confirmation depth for each node $i$ . Consider an execution of Crosslink 2 where either $Π_{bc}$ has Prefix Agreement at confirmation depth $σ$ or $Π_{bft}$ has Final Agreement.

In such an execution, for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , either $LOG_{fin, i}^{t} ⪯_{bc} LOG_{fin, j}^{u} ⪯_{bc} LOG_{ba, μ_{j}, j}^{u}$ or $LOG_{fin, j}^{u} ⪯_{bc} LOG_{fin, i}^{t} ⪯_{bc} LOG_{ba, μ_{i}, i}^{t}$ .

Corollary: Under the same conditions, if wlog $LOG_{fin, i}^{t} ⪯_{bc} LOG_{fin, j}^{u}$ , then $LOG_{fin, i}^{t} ⪯_{bc} {LOG_{ba, μ_{i}, i}^{t}, LOG_{ba, μ_{j}, j}^{u}}$ .

The above property is not as strong as we would like for practical uses of $LOG_{ba}$ , because it does not say anything about rollbacks up to the finalization point, and such rollbacks may be of unbounded length. (Loosely speaking, the number of non‑Stalled Mode bc‑blocks after the consensus finalization point is bounded by $L$ , but we have also not proven that so far.)

As documented in the Model for BFT protocols section of The Crosslink 2 Construction):

For each epoch, there is a fixed number of voting units distributed between the players, which they use to vote for a $*$ bft‑proposal. We say that a voting unit has been cast for a $*$ bft‑proposal $P$ at a given time in a $*$ bft‑execution, if and only if $P$ is $*$ bft‑proposal‑valid and a ballot for $P$ authenticated by the holder of the voting unit exists at that time.

Using knowledge of ballots cast for a $*$ bft‑proposal $P$ that collectively satisfy a notarization rule at a given time in a $*$ bft‑execution, and only with such knowledge, it is possible to obtain a valid $*$ bft‑notarization‑proof $proof_{P}$ . The notarization rule must require at least a two‑thirds absolute supermajority of voting units in $P$ ’s epoch to have been cast for $P$ . It may also require other conditions.

A voting unit is cast non‑honestly for an epoch’s proposal iff:

it is cast other than by the holder of the unit (due to key compromise or any flaw in the voting protocol, for example); or

it is double‑cast (i.e. there are two ballots casting it for distinct proposals); or

the holder of the unit following the conditions for honest voting in $Π_{* bft}$ , according to its view, should not have cast that vote.

Definition: One‑third bound on non‑honest voting

Theorem: On bft‑valid blocks for a given epoch in honest view

By a well known argument often used to prove safety of BFT protocols, in an execution of Crosslink 2 where $Π_{bft}$ has the one‑third bound on non‑honest voting property (and assuming soundness of notarization proofs), any bft‑valid block for a given epoch in honest view must commit to the same proposal.

Proof (adapted from [CS2020, Lemma 1]): Suppose there are two bft‑proposals $P$ and $P^{'}$ , both for epoch $e$ , such that $P$ is committed to by some bft‑block‑valid block $B$ , and $P^{'}$ is committed to by some bft‑block‑valid block $B^{'}$ . This implies that $B$ and $B^{'}$ have valid notarization proofs. Let the number of voting units for epoch $e$ be $n_{e}$ . Assuming soundness of the notarization proofs, it must be that at least $2 n_{e} /3$ voting units for epoch $e$ , denoted as the set $S$ , were cast for $P$ , and at least $2 n_{e} /3$ voting units for epoch $e$ , denoted as the set $S^{'}$ , were cast for $P^{'}$ . Since there are $n_{e}$ voting units for epoch $e$ , $S \cap S^{'}$ must have size at least $n_{e} /3$ . In an execution of Crosslink 2 where $Π_{bft}$ has the one‑third bound on non‑honest voting property, $S \cap S^{'}$ must therefore include at least one voting unit that was cast honestly. Since a voting unit for epoch $e$ that is cast honestly is not double-cast, it must be that $P = P^{'}$ .

Info

In the case of a network partition, votes may not be seen on both/all sides of the partition. Therefore, it is not necessarily the case that honest nodes can directly see double‑voting. The above argument does not depend on being able to do so.

Therefore, in an execution of Crosslink 2 for which $Π_{bft}$ has the one‑third bound on non‑honest voting property, for each epoch $e$ there will be at most one bft‑proposal‑valid proposal $P_{e}$ , and at least one third of honestly cast voting units must have been cast for it. Let $I_{e}$ be the (necessarily nonempty) set of nodes that cast these honest votes; then, $snapshot (P_{e}) ⪯_{bc} ch_{i}^{t_{e, i}} ⌈_{bc}^{σ}$ for all $i \in I_{e}$ at the times $t_{e, i}$ of their votes in epoch $e$ . (For simplicity, we assume that for each honest node $i$ there is only one time $t_{e, i}$ at which it obtains a successful check for the voting condition in epoch $e$ , which it uses for any votes that it casts in that epoch.)

Let $B$ be any bft‑block for epoch $e$ such that $B ⪯_{bft} bft-last-final (C)$ , where $C$ is some bft‑block‑valid block. Since $B ⪯_{bft} C$ , $B$ is bft‑block‑valid. So by the argument above, $B$ commits to the only bft‑proposal‑valid proposal $P_{e}$ for epoch $e$ , and $snapshot (B) = snapshot (P_{e})$ was voted for in that epoch by a nonempty subset of honest nodes $I_{e}$ .

Let $H$ be any bc‑valid block. We have by definition: $fin (H) = = [snapshot (B) for B ⪯_{bft} LF (H ⌈_{bc}^{σ})] [snapshot (B) for B ⪯_{bft} bft-last-final (H ⌈_{bc}^{σ} . context_bft)]$ So, taking $C = H ⌈_{bc}^{σ} . context_bft$ , each $snapshot (B)$ for $B$ of epoch $e$ in the result of $fin (H)$ satisfies $snapshot (B) ⪯_{bc} ch_{i}^{t_{e, i}} ⌈_{bc}^{σ}$ for all $i$ in some nonempty honest set of nodes $I_{e}$ .

For an execution of Crosslink 2 in which $Π_{bc}$ has the Prefix Consistency property at confirmation depth $σ$ , for every epoch $e$ , for every such $(i, t_{e, i})$ , for every node $j$ that is honest at any time $u \geq t_{e, i}$ , we have $ch_{i}^{t_{e, i}} ⌈_{bc}^{σ} ⪯_{bc} ch_{j}^{u}$ . Let $t_{e} = min {t_{e, i} : i \in I_{e}}$ . Then, by transitivity of $⪯_{bc}$ :

Theorem: On snapshots in LOG_fin

In an execution of Crosslink 2 where $Π_{bft}$ has the one‑third bound on non‑honest voting property and $Π_{bc}$ has the Prefix Consistency property at confirmation depth $σ$ , every bc‑chain $snapshot (B)$ in $fin (ch_{i}^{t})$ (and therefore every snapshot that contributes to $LOG_{fin, i}^{t}$ ) is, at any time $u \geq t_{e}$ , in the bc‑best‑chain of every node $j$ that is honest at time $u$ (where $B$ commits to $P_{e}$ at epoch $e$ and $t_{e}$ is the time of the first honest vote for $P_{e}$ ).

A similar (weaker) statement holds if we replace $u \geq t_{e}$ with $u \geq t$ , since the time of the first honest vote for $P$ necessarily precedes the time at which the signed $(P, proof_{P})$ is submitted as a bft‑block, which necessarily precedes $t$ . (Whether or not the notarization proof depends on the first honest vote for $B$ ’s proposal $P_{e}$ , it must depend on some honest vote for that proposal that was not made earlier than $t_{e}$ .)

Furthermore, we have $ba-ctx (H, μ) LOG_{ba, μ, i}^{t} = = san-ctx (fin (H) ∣∣ [H ⌈_{bc}^{μ}]) context-txns (ba-ctx (ch_{i}^{t}, μ))$

So in an execution of Crosslink 2 where $Π_{bc}$ has the Prefix Consistency property at confirmation depth $μ$ , if node $i$ is honest at time $t$ then $H ⌈_{bc}^{μ}$ is also, at any time $u \geq t$ , in the bc‑best‑chain of every node $j$ that is honest at time $u$ .

If an execution of $Π_{bc}$ has the Prefix Consistency property at confirmation depth $μ \leq σ$ , then it necessarily also has it at confirmation depth $σ$ . Therefore we have:

Theorem: On snapshots in LOG_ba

In an execution of Crosslink 2 where $Π_{bft}$ has the one‑third bound on non‑honest voting property and $Π_{bc}$ has the Prefix Consistency property at confirmation depth $μ \leq σ$ , every bc‑chain snapshot in $fin (ch_{i}^{t}) ∣∣ [ch_{i}^{t} ⌈_{bc}^{μ}]$ (and therefore every snapshot that contributes to $LOG_{ba, μ, i}^{t}$ ) is, at any time $u \geq t$ , in the bc‑best‑chain of every node $j$ that is honest at time $u$ .

Sketch: we also need the sequence of snapshots output from fin to only be extended in the view of any node. In that case we can infer that the node does not observe a rollback in LOG_ba.

Recall that in the proof of safety for $LOG_{fin}$ , we showed that in an execution of Crosslink 2 where $Π_{bft}$ (or $Π_{origbft}$ ) has Final Agreement, for all times $t$ , $u$ and all nodes $i$ , $j$ (potentially the same) such that $i$ is honest at time $t$ and $j$ is honest at time $u$ , $fin (ch_{i}^{t}) ⪯ ⪰ fin (ch_{j}^{u})$ .

What we want to show is that, under some conditions on executions, ...

Disadvantages of Crosslink

More invasive changes

Unlike Snap‑and‑Chat, Crosslink 2 requires structural and consensus rule changes to both $Π_{bc}$ and $Π_{bft}$ . On the other hand, several of those changes are arguably necessary to fix a show‑stopping bug in Snap‑and‑Chat (not being able to spend some finalized outputs).

Finalization latency

For a given choice of $σ$ , the finalization latency is higher. The snapshot of the BFT chain used to obtain $LOG_{fin, μ, i}^{t}$ is obtained from the block at depth $μ$ on node $i$ ’s best $Π_{bc}$ chain, which will on average lead to a finalized view that is about $μ + 1 + σ$ blocks back (in $Π_{bc}$ ), rather than \sigma_{\sac}} blocks in Snap‑and‑Chat. This is essentially the cost of ensuring that safety is given by the stronger of the safety of $Π_{bc}$ (at $μ$ confirmations) and the safety of $Π_{bft}$ .

On the other hand, the relative increase in expected finalization latency is only \frac{\mu + 1 + \sigma}{\sigma_{\sac}}}, i.e. at most slightly more than a factor of 2 for the case \mu = \sigma = \sigma_{\sac}}.

More involved liveness argument

See the Liveness section above.

Every rule in Crosslink 2 is individually necessary

Warning

In order to show that Crosslink 2 is at a local optimum in the security/complexity trade‑off space, for each rule we show attacks on safety and/or liveness that could be performed if that rule were omitted or simplified.

Edit: some rules, e.g. the Linearity rule, only contribute heuristically to security in the analysis so far.

Questions about Crosslink

Why don’t we have a bc‑block‑validity rule snapshot(LF(H)) ⪯_bc H ?

Rationale: Can we outright prevent rollbacks > $σ$ from ever appearing in $Ch_{i}^{t}$ ?

Info

This document analyzes the effect of this rule on its own. For the effect in combination with an additional Linearity rule, see the Linearity and Last Final Snapshot rules section of Potential changes for Crosslink 2.

Daira-Emma: In a variant of Crosslink with this rule, an adversary’s strategy would be to keep the $context_bft$ fields in its blocks as $C$ such that $B = bft-last-final (C)$ when the attack starts, and then fork from the bc‑best‑chain that extends $snapshot (B)$ . If its private chain falls behind the public bc‑best‑chain, it resets, just like in a conventional private mining attack.

Note that the proposed rule does not prevent the adversary’s private chain from just staying at the same $context_bft$ block. The reason is that Crosslink does not change the fork-choice rule of $Π_{bc}$ . That is, even if the adversary’s chain has a $context_bft$ that is far behind the current bft‑block, it is still allowed to become the bc‑best‑chain.

(Eventually the adversary’s chain using this strategy will hit the finality gap bound of $L$ blocks. But that must be significantly greater than $σ$ , to avoid availability problems. So it does not prevent the adversary from performing a rollback longer than $σ$ blocks before they hit the bound. Also, going into Stalled Mode for new blocks does not prevent the attacker’s chain from having included harmful transactions before that point.)

It is possible to change the fork-choice rule, for example so that the bc‑best‑chain for a node $i$ is required to extend $snapshot (B)$ where $B$ is the last final block for any bft-chain in node $i$ ’s view.

This would break the current safety and liveness arguments for Crosslink. But for the sake of argument, suppose we did it anyway.

The adversary’s strategy would change slightly: it resets if either its private chain falls behind the public bc‑best‑chain, or its private chain is invalidated because it forks before $snapshot (B)$ for some new last final block $B$ of a bft-chain. During the attack, it also attempts to impede progress of the BFT protocol as far as possible.

In that case, the proposed rule still does not preclude a rollback of more than $σ$ blocks, for several reasons:

In general we can’t say anything about how many bc‑blocks are mined in any given interval, so it could be the case that more than $σ$ blocks are mined on both the honest chain and the adversary’s chain before it would be realistically possible to go through even a single round of the BFT protocol.
Nor can we say anything about how quickly those blocks are finalized, unless we enforce it, which we don’t. (In Crosslink we do enforce a finalization gap bound $L$ , but as explained above $L$ must be significantly greater than $σ$ , so that doesn’t really help.)
- In particular, the adversary could be suppressing publication of final bft‑blocks, or attacking the liveness of the BFT protocol in other ways. An attack against BFT liveness is potentially easier than an attack against BFT safety, and it would be difficult to characterize exactly how much this rule gains you in terms of security given that (at best) it’s dependent on that.
$snapshot (LF (H))$ will typically be at least $σ + 1$ blocks back from $H$ . The argument for that goes:

None of the block hashes in $LF (H) . headers_bc$ can point to $H$ because that would be a hash cycle. In a typical case where no block withholding and no other rollback (not caused by the adversary) occurs on the honestly mined chain, the proposer of the last final block before a context bft‑block that $H$ can point to will have, at the latest, $H ⌈_{bc}^{1}$ as $tip (B)$ . Under these conditions, $snapshot (LF (H))$ will point, at the latest, to $σ$ blocks before $H ⌈_{bc}^{1}$ , i.e. $σ + 1$ blocks before $H$ .

This means that by the time $snapshot (LF (H))$ could catch up to $H$ , on average $σ + 1$ block times will have occurred. So, roughly speaking, the rule that $snapshot (LF (H)) ⪯_{bc} H$ does not usefully constrain the adversary until after $σ + 1$ block times. Wlog let’s assume $Π_{bc}$ uses PoW: if $σ$ is chosen reasonably, then being able to do a $σ + 1$ -block rollback at all probably requires having somewhere close to 50% of network hash rate. And so in $σ + 1$ block times the adversary has a significant chance of being able to do the required rollback before the suggested rule “kicks in”. (It also has however much additional latency is added by the BFT protocol, which is simultaneously under attack to maximize this latency.)

All that said, does the suggested rule help? First we have to ask whether it introduces any weaknesses.

One potential issue is that the rule cuts both ways: if an adversarial rollback of more than $σ$ blocks does occur, then the adversary can make a proposal that will “lock in” its success. But it can be argued that this is intended behaviour anyway: the adversary has a confirmed chain, and is entitled to propose to finalize it.
What happens if $Π_{bft}$ is broken?
- Edit: the answer below refers to Crosslink before the Increasing Score rule was added.
- This could definitely be a problem. If either an adversary has a two‑thirds supermajority of validators, or $Π_{bft}$ is completely broken (e.g. by a bug in the implementation of the validation signature scheme), then they can add a final block to the bft-chain with a $headers_bc$ field that does not satisfy the honest voting condition. They do not need this to be bft-valid (although greater vulnerability to bugs in the implementation of bft-validity are also an issue). Then they can use the proposed rule to prevent an arbitrary bc-valid-chain from being extended (e.g. by mining $σ + 1$ blocks from $O_{bc}$ at the genesis difficulty, then getting that chain into a final bft‑block). [Edit: they would no longer be able to do this using a chain mined at a much lower difficulty, because of the Increasing Score rule.]
  - In Crosslink, this is very carefully avoided. The only similar rule that depends on $snapshot (LF (H))$ and that could potentially affect liveness of $Π_{bc}$ is the Finality depth rule. But that rule always allows the alternative of producing a Stalled Mode block on the current bc‑best‑chain — and honest block producers will do so. Therefore, the effect of trying to exploit even a catastrophic break in $Π_{bft}$ in order to cause a rollback of $LOG_{fin}$ , as long as $Π_{bc}$ has not also been broken, is to go into Stalled Mode.
  - This does not mean that a break of $Π_{bft}$ is not a problem for Crosslink. In particular, an adversary that can violate safety of $Π_{bft}$ can violate safety of $LOG_{ba}$ (and of $LOG_{fin}$ if there is also a $σ$ ‑block rollback in some node’s bc‑best‑chain).
  - The difference is that a safety violation of $Π_{bft}$ can be directly observed by nodes without any chance of false positives, which is not necessarily the case for all possible attacks against $Π_{bft}$ . (The attack described above does not violate safety of $Π_{bft}$ ; it just adds a final bft‑block with a suspiciously long $Π_{bc}$ rollback. It could alternatively have added a block with a less suspiciously long rollback, say exactly $σ + 1$ blocks. That is, in pursuit of preventing an attack against $Π_{bc}$ , we have enabled attacks against $Π_{bft}$ to achieve the same effect — precisely what Crosslink is designed to prevent.)
  - This raises an interesting idea: if any node sees a rollback in the chain of final bft‑blocks, it could provide objective evidence of that rollback in the form of a “bft-final-vee”: two final bft‑blocks with the same parent. Similarly, if any node sees more than one third of stake vote for conflicting blocks in a given epoch, then the assumption bounding the adversary’s stake must have been violated. This evidence can be posted in a transaction to the bc-chain. In that case, any node that is synced to the bc-chain can see that the bft-chain suffered a violation of safety or of a safety assumption, without needing to have seen that violation itself. This can be generalized to allow other proofs of flaws in $Π_{bft}$ . Optionally, a bc-chain that has posted such a proof could be latched into Stalled Mode until manual intervention can occur. (Obviously we need to make sure that this cannot be abused for denial-of-service.)
  - This is now described in Potential changes to Crosslink.

Okay, but is it a good idea to make that change to the fork-choice rule anyway?

Probably not. I don’t know how to repair the safety and liveness arguments.

The change was that the bc‑best‑chain for a node $i$ would be required to extend $snapshot (B)$ where $B$ is the last final bft‑block in node $i$ ’s view.

From the point of view of any modular analysis that treats $Π_{bft}$ as potentially subverted, we cannot say anything useful about $snapshot (B)$ . It seems as though any repair would have to assume much more about the BFT protocol than is desirable.

In general, changes to fork‑choice rules are tricky; it was a fork-choice rule problem that allowed the liveness attack against Casper FFG described in [NTT2020, Appendix E].

What if validators who see that a long rollback occurred, refuse to vote for it?

Yep that is allowed. The rule is “An honest validator will only vote for a proposal $P$ if ...” (not if‑and‑only‑if). If an honest validator sees a “good” reason not to vote for a proposal, including reasons based on out‑of‑band information, they should not. The Complementarity argument made in The Argument for Bounded Availability and Finality Overrides actually depends on this. Obviously, it may affect BFT liveness (and that’s okay).

The only reason why we don’t make this part of the voting condition is that it’s a stateful rule. A new validator could come along and wouldn’t have the state needed to enforce it. Perhaps that could be fixed.

Potential Changes to Crosslink

This page documents suggestions that have not had the same attention to security analysis as the Crosslink 2 construction. Some of them are broken. Some of them also increase the complexity of the protocol (while some simplify it or have a mixed effect on complexity), and so we need to consider the security/complexity trade‑off of each suggestion before we could include it.

This page is out-of-date

This page has not yet been updated for the changes from Crosslink 1 to Crosslink 2.

Attempts to improve safety or to simplify the protocol

[Recommended] Recording more info about the bft‑chain in bc‑blocks

We can allow honest bc‑block‑producers to record information about every proposed and notarized bft‑block, rather than just the one in the $context_bft$ field.

Duplicate information that has already been given in an ancestor bc‑block would be omitted.

This would automatically expose the following shenanigans to public view (as long as enough bc‑block‑proposers are honest, which is already assumed):

any attempt to double‑propose in the same epoch;
any successful attempt to double‑notarize.

We could also expose attempts to double‑vote.

Note that double‑proposal and double‑voting could be a sign that a proposer or validator’s private key is compromised, rather than that it belongs to the adversary per se. However, the security analysis must treat such a proposer/validator as non‑honest in any case.

Changing the Increasing Score rule to require the score of the tip (rather than the score of the snapshot) to increase

The current Increasing Score rule concerns the score of the snapshot:

Increasing Snapshot Score rule: Either $score (snapshot (B ⌈_{bft}^{1})) < score (snapshot (B))$ or $snapshot (B ⌈_{bft}^{1}) = snapshot (B)$ .

We could instead require the score of $tip (B) = B . headers_bc [σ - 1]$ to increase:

Increasing Tip Score rule: Either $score (tip (B ⌈_{bft}^{1})) < score (tip (B))$ or $tip (B ⌈_{bft}^{1}) = tip (B)$ .

Pros:

This more directly reflects the fork‑choice rule in $Π_{bc}$ .
In Crosslink 1, an honest bft‑proposer uses its bc‑best‑chain tip with the highest score provided that it is consistent with the Increasing Snapshot Score rule. This change removes the caveat, simplifying honest bft‑proposer behaviour.
As a result of removing that caveat, we always know about an honest bft‑proposer’s bc‑best‑chain.

Con:

The score of the snapshot would not necessarily increase. As a result, it is technically possible that the new snapshot can be an ancestor of a previous snapshot. Whether this can actually happen depends on the value of $σ$ and the difficulty adjustment rule. This causes no particular harm other than adding a corner case in ledger sanitization, but is inelegant.
- This con is removed if we either use the Combined Increasing Score rule variant described below, or we also apply the “Making bc‑rollbacks more difficult” change in the next section.

Apart from the above con, the original motivations for the Increasing Snapshot Score rule also apply to the Increasing Tip Score rule. In particular,

it still prevents potential attacks that rely on proposing a bc‑valid‑chain that forks from a much earlier block;
it still limits the extent of disruption an adversary can feasibly cause to \LOG_{\bda}};
it still always allows a proposal to be made, which may be needed to preserve liveness of $Π_{bft}$ relative to $Π_{origbft}$ ;
it still prevents potential validation cost DoS attacks due to switching between snapshots with the same score.

If we switch to using the Increasing Tip Score rule, then it would be more consistent for block producers to also change the tie‑breaking rule for choosing $context_bft$ to use the tip score, i.e. $score (tip (bft-last-final (C)))$ .

A variation on this suggestion effectively keeps both the Increasing Snapshot Score rule and the Increasing Tip Score rule:

Combined Increasing Score rule: Either ( $score (snapshot (B ⌈_{bft}^{1})) < score (snapshot (B))$ and $score (tip (B ⌈_{bft}^{1})) < score (tip (B))$ ), or $tip (B ⌈_{bft}^{1}) = tip (B)$ .

Note that if $tip (B ⌈_{bft}^{1}) = tip (B)$ , both scores are necessarily equal.

This variation does not simplify honest bft‑proposer behaviour.

Making exploitation of bc‑rollbacks more difficult

Basic idea: Detect the case where the bc‑snapshot is rolling back, and impose a longer confirmation depth to switch to the new bc‑chain. Also temporarily stall finalization of the existing bc‑chain until the conflict has been resolved.

Let $baseline_snapshot$ be the Crosslink 1 definition of $snapshot$ , i.e. $baseline_snapshot (B) = {O_{bc}, B . headers_bc [0] ⌈_{bc}^{1}, if B . headers_bc = \emptyset otherwise.$

When $snapshot (B ⌈_{bft}^{1}) \neq ⪯_{bc} baseline_snapshot (B)$ , we want to go into a mode where we require a longer confirmation depth $Σ$ , say $2 σ$ . Because we don’t know in this situation whether the old bc‑chain or the new bc‑chain will win, we stop finalizing both until a winner is clear.

The simplest option is to record the state saying that we are in this mode explicitly, and add a consensus rule requiring it to be correct. That is, add an $bc_is_forked$ field to bft‑proposals and bft‑blocks, and add a bft‑proposal and bft‑block validity rule as follows:

Is Forked rule: $B . bc_is_forked = (enter_forked (B)$ or $B ⌈_{bft}^{1} . bc_is_forked)$ and not $exit_forked (B)$

where:

$enter_forked (B) := snapshot (B ⌈_{bft}^{1}) \neq ⪯_{bc} baseline_snapshot (B)$
$exit_forked (B) := score (tip (B) ⌈_{bc}^{Σ}) > score (snapshot (B ⌈_{bft}^{1}))$ .

It is intentional that $exit_forked$ takes precedence over $enter_forked$ .

Then redefine $snapshot$ as follows: $snapshot (B) = {snapshot (B ⌈_{bft}^{1}), baseline_snapshot (B), if B . bc_is_forked otherwise.$

Since $O_{bft} . bc_is_forked = false$ , the recursion will terminate.

Note that there is an interaction between the Increasing Snapshot Score rule and this change: the Increasing Snapshot Score rule should arguably use $baseline_snapshot$ instead of $snapshot$ . The Increasing Tip Score rule, on the other hand, works fine as‑is, and so it makes sense to use both of these changes together. The combination of both changes also fixes the con discussed above for the Increasing Tip Score rule; it ensures that the score of the snapshot must increase.

Pros:

If the winning chain is the chain that was first snapshotted, then there ends up being no disruption whatsoever to \LOG_{\bda}}.
It becomes extremely difficult for an adversary with less than 50% hash power to get the finalized snapshot to switch between two competing chains more than once.

Cons:

It is potentially easier to cause a temporary finalization stall. An adversary could try to provoke this situation on the honest chain, either as a DoS attack, or so that its own chain that is not so encumbered can finalize more quickly than the honest chain.
- This does not seem like a practical attack, because such a stall can only happen when the adversary can cause a $σ$ ‑block rollback or has subverted $Π_{bft}$ .
The definition of $snapshot$ becomes more complicated, and there is a risk of this complexity introducing problems.
It can be argued that, unless $σ$ is chosen too small, an adversary that can cause a $σ$ ‑block rollback likely has 50% of hash power, and therefore can cause a $Σ$ ‑block rollback. That is, increasing confirmation depth does not help beyond a certain point, and therefore (it could be argued) this change will also not help.
- This argument may hold for private mining attacks, but does not necessarily hold for partitioning attacks, as discussed in the next section.

Using tip information to detect rollbacks and partitions

In the case of a private mining attack, the adversary will typically conceal the existence of the overtaking chain until it can be used to cause a rollback in \LOG_{\bda}}. So the approach used in the previous section seems to be all we can do against such an attack.

In the case of a partitioning attack, on the other hand, the adversary relies on honest nodes to do mining work on each side of the partition. This relies on the successful miners on each side knowing about their chain, but not the chain on the other side. Subtly, it does not rely on a perfect network partition. An adversary could, for example, attempt to create partitions around the most successful mining pools. Occasional leaks of information across a partition also do not necessarily foil the attack unless that information gets to a successful miner. Therefore, measures that constrain the adversary’s ability to make use of an incomplete partition can be useful.

This also has the benefit of making the protocol more robust against non‑malicious incomplete partitions.

Given that in such an attack the competing chains may be visible to some proposers, there is the possibility of detecting a potential rollback even before it gets snapshotted, by using the fact that previous bft‑blocks created by honest bft‑proposers have been recording the bc‑best‑chain tip $σ$ blocks ahead. Also, depending on what proportion of validators an adversary has, they may rely on honest validators on each side to ensure that a snapshot of each chain appears in a bft‑valid block; in that case, including information about competing chains in validators' votes (see the next subsection) may be useful.

It is still possible that if an adversary has several consecutive proposal slots, it can get its chain snapshotted. However, if there is an intervening slot with an honest proposer, we can potentially compare its tip with the adversary’s tip and anticipate the need to go into $bc_is_forked$ mode.

In order to get this to work, we need to propose a definition to identify bc‑chains that are competing with the current best chain, such that there is some risk of a “long” rollback to a competing chain. Let $δ$ be a measure of how close (in terms of bc‑blocks) a competing chain’s score needs to be to that of the bc‑best‑chain, and let $μ > δ$ be a lower bound on the rollback depth we would consider significant if the competing chain were to immediately catch up. (The condition $μ > δ$ is necessary to avoid false positives that might only be a single‑block fork.)

A node $i$ identifies $(δ, μ)$ ‑competing chains as follows based on its current view at time $t$ :

A bc‑block is a tip if it has no known descendants.
Let $ch_{i}^{t}$ be node $i$ ’s bc‑best‑chain.
Identify all of the tips $T$ such that $score (T) \geq score (ch_{i}^{t} ⌈_{bc}^{δ})$ and $last-common-ancestor (T, ch_{i}^{t}) ⪯_{bc} ch_{i}^{t} ⌈_{bc}^{μ}$ .

$TODO:$ Details, including how to modify the $enter_forked$ and $exit_forked$ conditions.

$TODO:$ For now we will assume that all of the competing chain information in a bft‑block has to be checked as bc‑block‑valid in order for that block to be bft‑block‑valid. This might introduce validation DoS attacks and needs to be considered more carefully.

Allowing validators to signal the existence of a competing chain in their votes

This complements the above idea by letting a validator that has seen a competing chain signal it in its signed vote. Then, as long as the adversary is reliant on some votes from honest validators that are signalling the existence of competing chains, we would go into $bc_is_forked$ mode without relying on honest proposers to have an intervening slot.

The notarization proof that appears in a bft‑block would need to be modified to preserve these signals. More precisely, it is necessary for a bft‑block $B$ to preserve at least:

the best bc‑chain that credibly competes with $tip (B)$ , if any.
the best bc‑chain that does not credibly compete with $tip (B)$ (this necessarily exists because $tip (B)$ does not credibly compete with itself).

This is also motivated by the suggested change in the next section.

Enforcing this is relatively straightforward if the evidence is a SNARK. It can also be enforced with aggregate signatures even for schemes that only allow aggregation of signatures over a common message: we just collect the distinct messages (corresponding to either “no competing chain” or each distinct competing chain) and aggregate them separately.

Strengthening the Increasing Tip Score rule

Assume that votes include competing chain information as discussed above. We can assume that an honest proposer has read all of this information from its parent bft‑block. Therefore, we can require the tip score of its proposal to have at least the score of the best tip implied by that information:

Let $best-tip (B)$ be the tip mentioned in bft‑block $B$ with the highest score. A bft‑block $B$ “mentions” the two best tips defined in the previous section.

Strong Increasing Tip Score rule: Either $score (best-tip (B ⌈_{bft}^{1})) < score (tip (B))$ or $best-tip (B ⌈_{bft}^{1}) = tip (B)$ .

Note that this rule is really quite constraining for a potential adversary, especially in partitioning attacks. It means that if the adversary does not want to acknowledge the existence of a given chain, it cannot use any votes or build on any previous bft‑block that signals the existence of that chain. Essentially, a partitioning adversary with control over only the minimum one‑third of the stake would have to have ensure a perfectly complete partition; it could not get away with any information leakage between honest validators.

Attempts to improve finalization latency

[Broken] Adjusting the last snapshot definition

The Crosslink 1 design imposes a finalization latency of at least $2 σ + 1$ block times. Intuitively, this is because in $fin (H) := [snapshot (B) for B ⪯_{bft} bft-last-final (H ⌈_{bc}^{σ} . context_bft)],$ $snapshot (bft-last-final (H ⌈_{bc}^{σ} . context_bft))$ is at least $σ + 1$ blocks back from $H ⌈_{bc}^{σ}$ (as argued in Questions about Crosslink 1), and therefore $2 σ + 1$ blocks back from $H$ . So the total finalization latency is $σ$ block times + BFT overhead + $(σ + 1)$ block times + snapshot overhead.

However, the snapshot headers contain information about the proposer’s bc‑best‑chain.

Define $LF (H) := bft-last-final (H . context_bft)$ . Although it is not guaranteed, normally $snapshot (LF (H ⌈_{bc}^{σ}))$ will be an ancestor of $H ⌈_{bc}^{σ}$ . What if we were to optimistically allow the last snapshot to be taken as $S (H) := {last-common-ancestor (H ⌈_{bc}^{σ}, tip (LF (H ⌈_{bc}^{σ})), snapshot (LF (H ⌈_{bc}^{σ})), if it extends snapshot (LF (H ⌈_{bc}^{σ})) otherwise$ ? After all, we know that $last-common-ancestor (H ⌈_{bc}^{σ}, tip (LF (H ⌈_{bc}^{σ}))$ is confirmed.

Oh, this won’t work. The problem is that we want safety of $LOG_{fin}$ not to depend on safety of $Π_{bc}$ . So we cannot assume (for this purpose) that nodes see the same $H ⌈_{bc}^{σ}$ .

Replacing \LOG_{\bda}} with \LOG_{\opt}}

What if we instead take this to be the definition of \LOG_{\opt}}, replacing \LOG_{\bda}} ("opt" meaning optimistic)?

As stated, a malicious proposer can try to maximize the latency of \LOG_{\opt}} (subject to the Increasing Score rule). For example, if there exists a fork of length $μ$ , the malicious proposer can force the latency of \LOG_{\opt}} to be $(σ + μ + 1)$ block times + BFT overhead. However, this can be improved by applying the idea to each bft‑block in turn after the one pointed to by the best confirmed bc‑block. Then a malicious proposer cannot do anything that it could not do anyway (keeping the finalization point at its current position).

Pros:

This is always more conservative in terms of safety than the current design.
The latency of \LOG_{\opt}} will typically be $σ + 1$ bc‑block times, rather than $2 σ + 1$ block times.

Cons:

\LOG_{\opt}} is not dynamically available in any sense. It just has lower latency and different security characteristics.
Even under optimistic conditions, \LOG_{\opt}} will lag slightly behind where it would be for the Crosslink 1 design, because $H ⌈_{bc}^{σ}$ will necessarily be ahead of $tip (LF (H ⌈_{bc}^{σ}))$ .

[Broken] Using snapshots from the last‑seen bft‑chain when it is consistent with the bc‑best‑chain

The following idea is broken for safety when $Π_{bft}$ has been subverted:

Info

We have two potential sources of information about blocks that could plausibly be considered finalized:

$H ⌈_{bc}^{σ}$
the snapshots on the chain of the last seen final bft‑block, $lsf$ .

We cannot rely only on 1. because we want assured finalization even under partition. We cannot rely only on 2. because if $Π_{bft}$ has been subverted, then the chain of final bft‑blocks could fork.

But intuitively, if we combine these sources of information, using them over the Crosslink 1 finalization only when they are consistent, the resulting protocol should still be as safe as the safer of $Π_{bft}$ and $Π_{bc}$ . In particular, 2. will not roll back unless $Π_{bft}$ has been subverted.

If this idea were to pan out, it could improve the latency of finalization by $σ$ block times.

This approach is essentially a hybrid of Snap‑and‑Chat and Crosslink 1:

the Snap‑and‑Chat construction gives a finalized ledger under the assumption that $Π_{bft}$ has not been subverted;
the main crosslink idea is used to make sure that outputs from all finalized transactions are eventually spendable;
safety is still only dependent on the stronger of the safety of $Π_{bft}$ and $Π_{bc}$ , because we use the additional information from snapshots in final bft‑blocks only up to the point at which they agree with the best confirmed bc‑block.

To explain the safety problem with this idea: suppose that $Π_{bft}$ has been subverted. In that case it is possible for a snapshot to be finalized without having being confirmed as in any honest node’s bc‑best‑chain; that is, it is possible for $LOG_{fin}$ to include transactions $T$ from a snapshot $S$ in bft‑block $A$ such that $S$ is not on the consensus bc‑best‑chain. And, because $Π_{bft}$ has been subverted, it is also possible that a conflicting final bft‑block $B$ omits $S$ . And so a node that has seen $B$ will think that it is consistent with the best bc‑chain (so that its $LOG_{fin}$ does not include $T$ but does include later transactions on the consensus bc‑best‑chain), but a node that has seen $A$ will compute a $LOG_{fin}$ that does include $T$ .

More detailed specification of the above broken idea.

Define $LF (H) := bft-last-final (H . context_bft)$ as before.

For simplicity assume that $lsf$ extends $LF (H ⌈_{bc}^{σ})$ by only one bft‑block. (This assumption could have been removed if the idea had panned out.)

Then this proposal was to consider this bc‑block as contributing the last finalized snapshot: $S (H) := {last-common-ancestor (H ⌈_{bc}^{σ}, snapshot (lsf)), snapshot (LF (H ⌈_{bc}^{σ})), if it extends snapshot (LF (H ⌈_{bc}^{σ})) otherwise$

There is no need for a tie‑breaking rule for 2.: if we ever see two context bft‑blocks for which the last‑final blocks are conflicting, we know that $Π_{bft}$ has been subverted, so we should stall or crash.

Caveat: for a given node, $H ⌈_{bc}^{σ}$ can in theory roll back past $snapshot (lsf)$ , therefore $S (H)$ can also roll back. It is okay if we keep state here and refuse to roll back. We should set a “crisis flag”, and unset it if at any point $LF (H ⌈_{bc}^{σ})$ extends $lsf_at_crisis$ . (If $Π_{bft}$ is safe and live, it will.)

A similar rule that would give the same result in almost all circumstances is: $S (H) := ⎩ ⎨ ⎧ H ⌈_{bc}^{σ}, snapshot (lsf), snapshot (LF (H ⌈_{bc}^{σ})), if snapshot (LF (H ⌈_{bc}^{σ})) ⪯_{bc} H ⌈_{bc}^{σ} ⪯_{bc} snapshot (lsf) if snapshot (LF (H ⌈_{bc}^{σ})) ⪯_{bc} snapshot (lsf) ⪯_{bc} H ⌈_{bc}^{σ} otherwise$

What about making the bc‑block‑producer the bft‑proposer?

The answer given for this question at The Crosslink 2 Construction is:

If this were enforced, it could be an alternative way of ensuring that every bft‑proposal snapshots a new bc‑block with a higher score than previous snapshots, potentially making the Increasing Score rule redundant. However, it would require merging bc‑block‑producers and bft‑proposers, which could have concerning knock‑on effects (such as concentrating security into fewer participants).

This may have been too hasty. It is not clear that merging bc‑block‑producers and bft‑proposers actually does “concentrate security into fewer participants” in a way that can have any harmful effect.

Remember, the job of a bft‑proposer in Crosslink is primarily to snapshot the bc‑best‑chain (even more so if the Increasing Tip Score rule is adopted). An honest miner by definition is claiming to build on the best chain, and miners have a strong economic incentive to do so. Therefore, it is entirely reasonable for every newly produced block to be treated as a bft‑proposal. This arguably decentralizes the task of proposing bft‑blocks more effectively than using a leader election protocol would — especially given that in a hybrid protocol we necessarily still rely on there being sufficient honest miners.

[DKT2021], for example, argues for the importance of “the complete unpredictability of who will get to propose a block next, even by the winner itself.” The main basis of this argument is that it makes subversion of the proposer significantly more difficult. A PoW protocol has that property, and most PoS protocols do not. (It is not that PoS protocols are unable to provide this property; indeed, [DKT2021] constructs a PoS protocol, “PoSAT”, that provides it.)

So let’s explore this in more detail. A newly produced bc‑block would implicitly be a bft‑proposal with itself as the tip. The $bc_headers$ field is therefore not needed. The Tail Confirmation rule goes away since its intent is automatically satisfied. This is already a significant simplification.

The inner proposer signature is also not needed (since the bc‑header is self-authenticating), but the block producer would have to include a public key $H . pubkey$ that can be used to verify its outer signature. It would sign the notarized bft‑block with the corresponding private key. This change is a wash in terms of protocol complexity.

Considered as a bft‑proposal, a bc‑block needs to refer to a parent bft‑block, which requires a $parent_bft$ field in the bc‑header. With some caveats depending on the design of $Π_{origbft}$ , it might be possible to merge this with the $context_bft$ field, but for now we will assume that it is not merged.

What are the caveats?

If we are in an execution where Final Agreement holds for $Π_{bft}$ , then it is possible to show that merging the two fields has no negative effect, provided that $Π_{origbft}$ has no additional rules that could disallow it in some cases.

This is because, by Final Agreement, $bft-last-final (H^{'} . context_bft) ⪯ ⪰_{* bft} bft-last-final (C)$ for any potential bft‑block $C$ that the bc‑block‑producer of a new block $H$ could choose as $H . context_bft$ . Suppose that the bc‑block‑producer freely chooses $H . parent_bft$ according to the desired honest behaviour for a bft‑proposer in $Π_{bft}$ , and then chooses $C$ to be the same block (which is always reasonable as long as it is allowed).

In the case $bft-last-final (H^{'} . context_bft) ⪯_{* bft} bft-last-final (C)$ , we are done, because this choice of $H . context_bft$ is allowed by the Extension rule.

In the case $bft-last-final (H^{'} . context_bft) ⪲_{* bft} bft-last-final (C)$ , we can argue that $H^{'} . context_bft$ would be a better choice than $C$ for $H . parent_bft$ as well as for $H . context_bft$ , because it has a later final ancestor. This is where the argument might fall down if $Π_{origbft}$ (and therefore $Π_{bft}$ ) has any additional rules that could disallow this choice. For now let’s suppose that situtation does not arise, but it is one of the caveats.

Another potential problem is that in an execution where Final Agreement does not hold for $Π_{bft}$ , we can no longer infer that either $bft-last-final (H^{'} . context_bft) ⪯_{* bft} bft-last-final (C)$ or $bft-last-final (H^{'} . context_bft) ⪲_{* bft} bft-last-final (C)$ . In particular it could be the case that the producer of $H^{'}$ was adversarial, and chose $H^{'} . context_bft$ in such a way as to favour its own bft‑block that is final in that context.

However, in that situation it must be possible for the bc‑block‑producer to see (and prove) that the bft‑chain has a final fork. That is, it can produce a witness $C$ to the violation of Final Agreement, showing that $bft-last-final (H^{'} . context_bft) ⪯ ⪰_{* bft} bft-last-final (C)$ does not hold, as discussed in the section Recording more info about the bft‑chain in bc‑blocks above.

The second caveat is that in that situation, we still need to set $H . parent_bft$ and $H . context_bft$ in order to be able to recover, and they typically should not be the same in order to do so.

The Increasing Tip Score rule is still needed, but it can be simplified. A newly produced bc‑block $H$ is also a bft‑proposal such that $snapshot (H) = H$ . This would yield the following bft‑proposal / bc‑block validity rule:

[Candidate rule for discussion] Either $score (tip (H . parent_bft)) < score (H)$ or $tip (H . parent_bft) = H$ .

except that $tip (H . parent_bft)$ cannot be $H$ , because $H$ is newly produced. It turns out we can just drop that part:

Increasing Tip Score (producer = proposer) rule: $score (tip (H . parent_bft)) < score (H)$ .

This works because, if $H$ does not have a higher score than the bc‑block $tip (H . parent_bft)$ , the bc‑block‑producer should instead have built on top of that bc‑block — which was necessarily known to the producer in order for it to set $H . parent_bft$ in the header of the new block.

The voting would be the same, performed by the same parties. Therefore, there is no concentration of voting into fewer parties. There is no change in the producer/proposer’s incentive to make the bft‑notarization‑proof or its soundness properties. Everything else is roughly the same, including the use of the $context_bft$ field of a bc‑block and the validity rules related to it. As far as I can see, all of the security analysis goes through essentially unchanged.

There may be some complication due to the fact that BFT protocols are typically designed to use epochs with a fixed period, whereas bc‑blocks are found at less predictable intervals. However, as long as BFT messages are labelled with the bc‑block they apply to, it seems like most BFT protocols would be tolerant to this change. In fact the adaptations of Snap‑and‑Chat to Hotstuff and PBFT in [NTT2020] already assume that BFT messages can be queued and processed at a later time, and rely on those BFT protocols' tolerance to this.

Info

In most PoS protocols, the requirement to have a minimum amount of stake in order to make a proposal acts as a gatekeeping filter on proposals, and potentially allows parties that make invalid proposals to be slashed.

Strictly speaking, whether there is a stake requirement to make a proposal is independent of whether bc‑block‑producers (e.g. miners) are merged with bft‑proposers. It could be, for example, that a miner is still able to produce bc‑blocks, but is not able to make them into a proposal unless they satisfy a stake requirement. (This would have significant effects on the economics of mining that would need to be analyzed, and that might have governance consequences.)

In a system that uses PoS, validators by definition need to have stake in order to control the ability to vote. This also allows validators to be slashed.

On the other hand, there is no technical reason why the ability to make a bft‑proposal has to be gatekept by a stake requirement — given the situation of Zcash in which we already have a mining infrastructure, and that in a Snap‑and‑Chat or Crosslink‑style hybrid protocol we necessarily still rely on miners not to censor transactions. The potential to make proposals that are expensive to validate as a denial of service is made sufficiently difficult by proof‑of‑work. This option has probably been underexplored by previous PoS protocols because they cannot assume an existing mining infrastructure.

It could be argued that this approach goes less far toward a pure PoS‑based block‑chain protocol, leaving more to be done in the second stage. However, there is a clear route to that second stage, by replacing PoW with a protocol like PoSAT that has full unpredictabilty and dynamic availability. PoSAT does this using a VDF, and as it happens, Halo 2 is a strong candidate to be used to construct such a VDF.

If the arguments in [DKT2021] about the need for proposer unpredictability are persuasive, then this approach defers the complexity of requiring a VDF without losing any security, since Zcash’s PoW is already unpredictable.

Do we need an explicit bft‑chain at all?

Building on the previous idea, we can try to eliminate the explicit bft‑chain by piggybacking the information it would hold onto a bc‑block (in the header and/or the block contents). In the previous section we merged the concepts of a bft‑proposal and a bc‑block; the $parent_{bft} (P)$ and $P . context_bft$ fields of a bft‑proposal were moved into the $H . parent_bft$ and $H . context_bft$ fields of a bc‑header respectively. A field $H . pubkey$ was also added to hold the producer’s public key, so that the producer can sign the bft‑block constructed from it using the corresponding private key.

This left the concept of a bft‑block intact. Recall that in Crosslink 1, a bft‑block $B$ consists of $(P, proof_{P})$ signed by the proposer. So in “Crosslink with proposer = producer”, a bft‑block consists of $(H, proof_{H})$ signed by the producer.

What if a bc‑block $H$ were to “inline” its parent and context bft‑blocks rather than referring to them? I.e. a bc‑block $H$ with $H . parent_bft$ referring to $(H^{'}, proof_{H^{'}})$ signed for $H^{'} . pubkey$ , would instead literally include (either in the header or the coinbase transaction) $(H^{'}, proof_{H^{'}})$ signed for $H^{'} . pubkey$ — and similarly for $H . context_bft$ .

It would still be necessary to have the message type that the proposer/producer previously used to submit a notarized bft‑block. (It cannot be merged with a bc‑block announcement: the producer of a new block is not in general the producer of its parent, and their incentives may differ; also we cannot wait until a new block is found before publishing the previous notarization.) It would also still be necessary for Crosslink nodes to keep track of notarizations that have not made it into any bc‑block. Nevertheless, this is a potential simplification.

Note that unless notarization proofs are particularly short and constant-length, it would not be appropriate to include them in the bc‑block headers, and so they would have to go into the coinbase transaction or another similar variable-length data structure. In that case we would still have an indirection to obtain the bft‑block information; it would just be merged with the indirection to obtain a coinbase transaction (or similar) — which is already needed in order to check validity of the bc‑block.

As discussed under Recording more info about the bft‑chain in bc‑blocks above, we might in any case want to record information about other proposed and notarized bft‑blocks, and the data structure needed for this would necessarily be variable-length. The complexity burden of doing so would be shared between these two changes.

It would be possible to save some space in headers (while keeping them fixed-length), by inlining only one of $H . parent_bft$ and $H . context_bft$ in the header and keeping the other as a hash. As discussed under “What are the caveats?” above, the only reason for the two bft‑blocks referred to by these fields to be different, is that the bc‑block producer has observed a violation of Final Agreement in $Π_{bft}$ . In that case, we can include an inlining of the other $H . *_bft$ block, and any other information needed to prove that a violation of Final Agreement has occurred, in a variable-length overflow structure.

Pros:

No additional mechanism or messages are needed to obtain bft‑blocks given their hashes.
It could be a performance/latency improvement to not have to separately fetch bft‑blocks.

Con:

Additional complexity of the variable-length overflow mechanism suggested above, if it is used.
Assumes that notarization proofs are not too large.

Linearity and Last Final Snapshot rules

A potential simplification can be obtained by combining the following two ideas:

Str4d suggested that the snapshot of each bft‑block should descend from the snapshot of its parent bft‑block.
Nate suggested that each bc‑block $H$ should descend from $snapshot (LF (H))$ .

Str4d’s suggestion can be written as:

Linearity rule: $snapshot (B ⌈_{bft}^{1}) ⪯_{bc} snapshot (B)$ .

Notice that this implies the existing Increasing Score rule in Crosslink 1, because score necessarily increases within a bc‑valid‑chain. Therefore it would in practice be a replacement for the Increasing Score rule. It does not imply the Increasing Tip Score rule discussed above, and in fact it could make sense to enforce both the Linearity rule and the Increasing Tip Score rule.

The Linearity rule implies that it is no longer possible for a bft‑valid‑chain to snapshot a bc‑chain that rolls back relative to the previous snapshot. This makes it unnecessary to sanitize $LO G_{fin}$ : the sequence of snapshots considered by $san-ctx$ is linear, and so the “sanitization” would just return the transactions in the last snapshot.

To remove the need to sanitize $LO G_{bda}$ as well, we need a further modification to $Π_{bc}$ . Recall that in Crosslink 1 we define: $fin (H) bda-ctx (H, μ) := := [snapshot (B) for B ⪯_{bft} LF (H ⌈_{bc}^{σ})] san-ctx (fin (H) ∣∣ [H ⌈_{bc}^{μ}]) .$ The Linearity rule ensures that $fin (H)$ is a linear sequence of snapshots, but for $fin (H) ∣∣ [H ⌈_{bc}^{μ}]$ to be linear, we also need $last (fin (H)) = snapshot (LF (H ⌈_{bc}^{σ})) ⪯_{bc} H ⌈_{bc}^{μ}$ . In order for this to hold for any choice of $μ$ with $0 \leq μ \leq σ$ , we require the strongest version of this condition with $μ = σ$ , i.e. $snapshot (LF (H ⌈_{bc}^{σ})) ⪯_{bc} H ⌈_{bc}^{σ}$ .

Since we can only enforce that this holds for $H ⌈_{bc}^{σ}$ by enforcing that it holds for an arbitrary bc‑valid‑block $H$ , the rule becomes:

Last Final Snapshot rule: $snapshot (LF (H)) ⪯_{bc} H$ .

This is exactly Nate’s suggestion discussed in Questions about Crosslink. In that document we argued against this rule, but that argument was made in the context of a protocol without the Linearity rule (and originally, even without the Increasing Score rule).

Combining the Linearity rule and Last Final Snapshot rule, on the other hand, completely eliminates the need for sanitization. This could be a huge simplification — and potentially safer, since it would avoid breaking assumptions that may be made by existing Zcash node implementations and other consumers of the Zcash block chain.

To spell out the resulting simplifications to the definitions of $LOG_{fin, i}^{t}$ and \LOG^t_{\bda,\mu,i}, we would just have: \begin{array}{rcl} \LOG_{\fin,i}^t &:=& \snapshotlf{\ch_i^t \trunc_{\bc}^\sigma} \ \LOG_{\bda,\mu,i}^t &:=& \ch_i^t \trunc_{\bc}^\mu \end{array}

Here it is no longer necessary to define $LOG_{fin, i}^{t}$ and \LOG^t_{\bda,\mu,i} as sequences of transactions, since the final and bounded-available chains are both just bc‑valid‑chains.

The definition of $finality-depth$ in the Finality depth rule becomes much simpler: $finality-depth (H) := height (H) - height (snapshot (LF (H ⌈_{bc}^{σ})))$ As before, either $finality-depth (H) \leq L$ or $is-stalled-block (H)$ .

Avoiding sanitization also means that the bug we described in Snap‑and‑Chat, that could prevent spending outputs from a snapshotted chain after a $σ$ -block rollback, cannot occur by construction. That is, the changes in $Π_{bc}$ contexual validity relative to $Π_{origbc}$ are not needed any more.

This almost seems too simple, and indeed we should be skeptical, because the security analysis essentially has to be redone. The reason why Snap‑and‑Chat didn’t take this approach is that it requires a more complicated argument to show that it is reasonable to believe in the safety assumptions of $Π_{bc}$ whenever it is reasonable to believe in the corresponding assumptions for $Π_{origbc}$ . This is because the ... We will need to do some work to show that the changes are benign.

Security Analysis

The key observation needed for this analysis is that neither the Linearity rule nor the Last Final Snapshot rule affect the evolution of $Π_{bc}$ unless we are in a situation where its Prefix Consistency or Prefix Agreement properties would be violated.

This implies that any safety property that we can prove given Prefix Consistency plus ****.

References

Ebb-and-Flow Protocols

Ebb-and-Flow Protocols: A Resolution of the Availability-Finality Dilemma

Zcash Trailing Finality Layer - v0.1.0