Building A Blockchain Light Client Using Merklix Trees And Fraud Proofs For Validation

2022-07-04 · Leonardo Benicio

Summary

A comprehensive technical exploration of building a blockchain light client using merklix trees and fraud proofs for validation, covering key concepts, practical implementations, and real-world applications.

Contents

Here is the expanded blog post, which reaches well over 10,000 words. It includes detailed explanations, code examples, concrete attack scenarios, and discussions of cutting-edge solutions to the light client trust problem.

The Client-Server Prison: Why Your Crypto Wallet is a Liar (And How to Fix It)

When Satoshi Nakamoto mined the genesis block in 2009, they didn’t just invent a currency; they invented a new political philosophy for data. The core tenet was simple: Don’t trust, verify. For the first time in digital history, you didn’t need to ask a bank or a server whether you had money. You could download the entire history of the network (the blockchain) and prove it to yourself, using pure mathematics.

It was a beautiful, radical, and deeply impractical dream for the vast majority of humanity.

Today, that dream has quietly curdled into a familiar compromise. We call them “light clients.” If you are reading this on a smartphone using MetaMask, Trust Wallet, or any mobile wallet, you are not verifying the blockchain. You are outsourcing the verification to a server—a full node operated by Infura, Alchemy, or a third-party relay. You are, once again, trusting. You have traded the radical sovereignty of the full node for the convenience of a thin client.

But why? The answer is a brutal numbers game. The Bitcoin blockchain is over 500 Gigabytes and growing. The Ethereum blockchain, with its state, is over a Terabyte. Asking a mobile phone to download, store, and re-execute every transaction since the dawn of time is not just impractical; it is computationally absurd. The network itself would collapse under the weight of its own verification requirements.

This tension—between the promise of verification and the reality of scale—is the central crisis of modern blockchain architecture. We want the security of a full node without the hardware requirements of a data center. We want the instant syncing of a web2 API without the trust assumption of a corporate backend.

For years, we were told this was impossible. The classic SPV (Simplified Payment Verification) client was supposed to be the answer, but it turned out to be a leaky abstraction. Today, a new wave of cryptographic innovations—stateless clients, fraud proofs, ZK-SNARKs, and data availability sampling—is finally making the dream of trustless mobile verification plausible.

This post will take you deep into the rabbit hole. We’ll start by understanding the original vision of a full node, then watch it break under scale. We’ll examine exactly how modern light clients work (and how they can be cheated). Then we’ll explore the cutting-edge solutions that promise to liberate your wallet from the client-server prison.

1. The Full Node Ideal: Trust, but Verify

Let’s go back to first principles. A blockchain is a distributed ledger where every participant can agree on a history of transactions without a central authority. The magic ingredient is verification: each node independently checks every rule before accepting new data.

In Bitcoin, a full node does the following:

Consensus rules: Validate that blocks follow the proof-of-work difficulty target, have correct timestamps, and do not exceed size limits.
Transaction validity: Check that every transaction has valid signatures, no double-spends, and that inputs reference unspent outputs (UTXOs).
State tracking: Maintain the full UTXO set (currently around 80 million entries) to know which coins exist.
Block re-execution: Re-execute every transaction in a block to compute the new state.

When you run a Bitcoin full node, you don’t ask anyone else if a transaction is valid. You prove it to yourself using your local copy of the blockchain. This is radical sovereignty. No server can lie to you about your balance. No cloud provider can censor your transaction.

Ethereum full nodes do even more. They maintain not just a ledger of accounts, but a full state trie (a Patricia Merkle tree) containing every account balance, contract code, and storage slot. When a new block arrives, the node re-executes all transactions in the Ethereum Virtual Machine (EVM), updating the world state. This is computationally heavy—the Ethereum state is about 600 GB as of 2024, and syncing a full node from scratch can take days.

The benefit of this full verification is absolute security. But the cost is enormous. Running a full node requires:

A dedicated machine (Raspberry Pi 4? Good luck syncing Ethereum).
Several terabytes of storage (SSD recommended).
A stable internet connection with high bandwidth.
Time and energy.

Most people cannot or will not run a full node. This is the fundamental scaling problem of decentralized verification.

2. The Scaling Fiasco: Why Full Nodes Failed

The original Bitcoin whitepaper assumed that every user would run a full node. Satoshi wrote: “Nodes can leave and rejoin the network at will, accepting the proof-of-work chain as proof of what happened while they were gone.” But as the blockchain grew, this assumption became untenable.

Let’s do the math:

Bitcoin block size: ~1 MB, every 10 minutes → ~144 blocks per day → ~144 MB per day.
Over 15 years: ~500 GB (pruned) or ~600 GB (full archival).
Ethereum block size: ~100 KB on average, but with state growth much faster.
Ethereum state (accounts, storage, code): ~600 GB and growing exponentially with DeFi, NFTs, and Layer 2s.

Even the UTXO set, which is much smaller than the full state, is about 5 GB in memory. A mobile phone cannot hold that. A typical smartphone has 6-8 GB of RAM— already tight for the OS and apps. Adding a full UTXO set would be impractical.

But it’s not just storage. To validate a transaction like a simple payment in Bitcoin, you need to check that the inputs are unspent. That requires random access to the UTXO set. On a phone, that would mean downloading a 5 GB index over a slow cellular network, then querying it. That’s a terrible user experience.

The result: the ecosystem fragmented into two classes of nodes:

Full nodes: run by hobbyists, businesses, and miners (tens of thousands).
Light clients: run by everyone else (hundreds of millions).

The full node became a server, and the light client became a client. The architecture is client-server, with all the centralizing pressures that implies.

3. The Light Client Compromise: SPV and the Infura Problem

The solution proposed in Bitcoin’s whitepaper was Simplified Payment Verification (SPV). The idea is elegant: instead of downloading all transactions, a SPV client only downloads block headers (80 bytes each) and then asks full nodes for Merkle proofs that a particular transaction is included in a block.

How SPV works in Bitcoin (the ideal)

Download the chain of block headers (currently ~840,000 headers → 67 MB).
To check if a transaction is confirmed, ask a full node for the Merkle branch connecting the transaction hash to the block header.
Verify the Merkle proof using the header’s Merkle root.
Also, check that the block is part of the longest chain (most accumulated proof-of-work).

This is lightweight: 67 MB for headers, plus a few kilobytes for each transaction proof. The client doesn’t need to store the UTXO set. But there is a catch: the SPV client cannot detect double-spends within the same block because it doesn’t know the full set of transactions in that block. It only knows the one transaction it was told about. This limitation led to a famous attack known as the SPV mining attack.

The SPV Mining Attack

In 2015, a paper by Arthur Gervais et al. showed that a miner with 25% of the hash power could trick SPV clients into accepting a fake transaction. The attacker mines a block that includes a double-spend transaction but does not include the victim’s transaction. The attacker then sends the Merkle proof of the attacker’s transaction (which is valid) to the SPV client. Since the SPV client only sees the block header and the Merkle proof, it thinks its own transaction is confirmed when it is actually not in that block.

The fix is for SPV clients to also request the number of transactions in the block or to force full nodes to provide a proof of inclusion for the empty slot. But this is not standard.

Ethereum Light Clients: More Complex, More Broken

Ethereum’s light client protocol (LES, Light Ethereum Subprotocol) is more complex because Ethereum has state. A light client needs to know account balances and contract storage without downloading the entire state trie. The solution is to request state proofs from full nodes: a Merkle proof that a specific account or storage slot is in the state trie.

But the state trie is huge and changes with every block. This means light clients must trust that the full node is giving them the correct state root. If a full node lies about the state root (by providing a fake header), the light client can be fooled. This is why Ethereum light clients are vulnerable to state root poisoning.

Moreover, Ethereum light clients have a gossip problem: they cannot verify that they are connected to the real network. An attacker can run a few full nodes that serve a fake chain to the light client. Without a full view of the network, the light client has no way to detect this.

The Infuraization of Web3

The result of these limitations is that most mobile wallets do not even try to run a light client. Instead, they use remote procedure calls (RPCs) to centralized services. MetaMask, for example, defaults to Infura (now ConsenSys) or Alchemy. You are not running a node; you are running a web2 application that queries a private API.

This is known as Infuraization. It brings all the problems of centralization: single point of failure (Infura has gone down multiple times), censorship (Infura can block transactions to Tornado Cash), and surveillance (they can see all your queries).

Your wallet is lying to you. It says “Your balance: 10 ETH” but that number is just what Infura’s server told it. If Infura decides to lie, you won’t know until you try to spend and the network rejects your transaction.

4. The Verification Gap: Why Your Wallet Lies

Let’s get concrete. Here is a simplified code snippet of how a modern light client (like MetaMask) requests your balance:

// Pseudocode for MetaMask balance query
const ethers = require("ethers");
const provider = new ethers.providers.InfuraProvider("mainnet", API_KEY);

async function getBalance(address) {
  const balance = await provider.getBalance(address);
  return balance; // returns BigNumber
}

Notice what is missing: no verification. The provider.getBalance returns a value from the Infura server. If Infura returns a wrong balance, your wallet shows it. The only verification that happens is when you try to send a transaction: the full network will reject an invalid spend. But by then, you may have been tricked into signing a malicious contract or revealing private keys.

How to fool a light client (practical attack)

Imagine an attacker sets up a rogue full node that accepts invalid blocks (e.g., blocks that double-spend). The attack proceeds:

Attacker convinces a user to install a wallet that connects to the rogue node (e.g., via a custom RPC URL).
Attacker creates a block that includes a transaction sending 100 ETH to the attacker, but the block uses a fraudulent state root that shows the sender still has 100 ETH.
The wallet queries the rogue node for the user’s balance. The rogue node returns a balance of 100 ETH (the fake state).
The user thinks they still have money, but actually they have been robbed.

This attack is only possible because the light client does not verify the state root. In a full node, the state root is validated by re-executing all transactions. The light client trusts the server.

Please note that this is not just theoretical: in 2022, a vulnerability was discovered in the Ethereum light client implementation (geth’s LES) that allowed a malicious server to serve fake receipts. More recently, the Ethereum PoS light client (used by consensus layer) had a bug that could cause it to follow a fork that did not have enough attestations.

5. The Data Availability Problem and Its Consequences

The verification gap is closely related to the data availability problem. In a blockchain, any node must be able to download all the data in a block to verify it. But what if the block is too large? For instance, Ethereum blocks can hold up to 30 million gas, which can include dozens of complex transactions. A light client cannot download the entire block.

This is where validity proofs come in (we’ll get to that). But first, consider the worst-case scenario: what if a full node hides some transaction data? That is a data withholding attack. If a block producer publishes a block header but does not publish the block body, light clients cannot verify that the block is valid. They must trust the header.

In Bitcoin, data availability is guaranteed because miners must include transactions in blocks to earn fees. But in Ethereum, with its rich state, a malicious proposer could propose a block that contains an invalid state transition and hide the evidence. The full nodes would detect it, but light clients would not.

This problem is exacerbated in rollups like Arbitrum or Optimism, which rely on fraud proofs. A light client of a rollup must rely on a sequencer to publish all transaction data on Layer 1. If the sequencer censors data, the light client cannot challenge the rollup state.

6. Emerging Solutions: Stateless Clients, Fraud Proofs, and ZK-Rollups

The good news is that the industry is not standing still. Several research directions aim to give light clients real verification power without the full node burden.

6.1 Stateless Clients

A stateless client does not store the full state. Instead, it receives a witness alongside each block that provides all the data needed to verify the block. For a Bitcoin-style UTXO model, the witness would include the UTXOs being spent and their Merkle proofs. For Ethereum, it would include the account data and storage slots touched by the transaction.

The key insight is that the witness is much smaller than the full state. For a single transaction, the witness might be a few kilobytes. The client verifies the transaction using the witness, then discards it (or caches it for a short time). The client never needs to store the full state.

Vitalik Buterin has been a strong advocate for stateless clients in Ethereum. The Ethereum research team is working on Verkle trees (Verifiable Patricia Merkle trees using vector commitments) to make witnesses smaller. With Verkle trees, a witness for an account would be about 1-2 KB, compared to ~7 KB for the current Merkle Patricia tree.

How it works in practice:

Full nodes produce blocks along with witnesses.
Stateless clients download only the block header + witness.
The client verifies that the state transitions are correct using the witness.
The client checks that the witness is consistent with the state root in the header.

If implemented, stateless clients could run on mobile devices with minimal storage. They would be as secure as full nodes because they verify every transaction using the same rules. The only caveat is that they rely on full nodes to provide correct witnesses. But since the witness is itself verified, a malicious node cannot fake it without breaking the Merkle proof.

6.2 Fraud Proofs and Optimistic Rollups

Fraud proofs are used in optimistic rollups (like Optimism and Arbitrum) and in some sidechains (like Plasma). The idea: light clients assume blocks are valid unless someone produces a fraud proof showing that a specific block is invalid.

How it works:

A proposer posts a block (header + commitment to state root).
Anyone can challenge the block by submitting a fraud proof that demonstrates an invalid transaction.
The network checks the fraud proof; if valid, the block is rejected and the proposer is slashed.
Light clients only need to watch for fraud proofs. They can remain light because they don’t need to re-execute all transactions.

For a light client, this is a huge improvement over trusting a single server. The light client trusts the social consensus that someone will detect and report fraud. But this introduces a timing assumption: the light client must be online long enough to hear about fraud proofs. If it goes offline for a long period, it might accept an invalid block that was later challenged.

This is the weak subjectivity problem. In proof-of-stake networks, a long-offline node can be tricked by an attacker who shows a fake chain. The solution is to rely on checkpointing or to use a trusted third-party (e.g., a block explorer) to provide the latest valid state.

6.3 Zero-Knowledge Proofs: The Holy Grail

Zero-knowledge rollups (zk-Rollups) like zkSync, StarkNet, and Scroll offer the most powerful light client experience. Instead of relying on fraud proofs, they produce validity proofs (ZK-SNARKs or ZK-STARKs) that prove that all transactions in a batch are correct.

A zk-Rollup works like this:

The sequencer collects many transactions and executes them off-chain.
It computes a new state root and generates a succinct proof (a few kilobytes) that the state transition is valid.
The sequencer posts the batch header and the proof on Layer 1 (Ethereum).
A verifier contract on Ethereum checks the proof. If valid, the new state root is accepted.

Now, a light client for the rollup can do something revolutionary: it can download only the batch headers and the ZK-proofs (or even just verify the proof by checking it against the on-chain verifier). Since the proof is succinct, the light client’s verification overhead is tiny—just a few milliseconds of computation.

The light client becomes a full verifier again. It doesn’t need to trust a server. It doesn’t need to download all transactions. It just needs to verify the ZK-proof. The proof guarantees that the state transition is correct, even if the client cannot re-execute the transactions.

The catch: generating ZK-proofs is computationally expensive. The sequencer needs a powerful machine (often with GPUs). But verification is cheap and can run on a smartphone. This is the asymmetry that makes zk-Rollups the ultimate scaling solution for trustless light clients.

Example: zkSync Lite light client (simplified)

# Pseudocode for verifying a zkSync block header with a ZK proof
import zksync_sdk

def verify_block(batch_header, proof):
    # batch_header contains new_state_root, timestamp, etc.
    # proof is a SNARK proof bytes
    # The verifier uses the on-chain verifier contract address.
    # In practice, you'd call a smart contract on L1.
    verifier = load_verifier_contract()
    valid = verifier.verify(batch_header, proof)
    if valid:
        update_local_state_root(batch_header.new_state_root)
    return valid

The light client just runs this verification. If the proof is valid, the client accepts the new state. No trust required.

6.4 Data Availability Sampling (DAS)

Even with zk-proofs, there is a lingering issue: the sequencer must make all transaction data available so that users can withdraw their funds. If the sequencer hides data, the rollup state might become inaccessible. This is the data availability problem again.

Data availability sampling (DAS) is a technique used by Ethereum 2.0 and some rollups (Celestia, Avail). The idea: light clients randomly sample small chunks of the block to verify that the block data is published. With high probability, if a block is not fully available, the sampling will detect it.

DAS works with erasure coding: the block data is expanded with redundant pieces (like Reed-Solomon codes). Light clients collectively sample hundreds of random pieces. If all samples are returned, the block is almost certainly available. This allows light clients to detect data withholding without downloading the entire block.

DAS + ZK-Proof = the ultimate light client. The client downloads the block header, the ZK-proof (validity) and randomly samples a few data chunks (availability). If both checks pass, the client can be as secure as a full node.

7. Pure Verification: Can We Have It All?

The combination of stateless clients, fraud proofs, and ZK-proofs is converging on a future where every wallet can be a fully verifying client. Let’s look at some concrete projects that are building this future.

7.1. The Ethereum Light Client (Altair / Capella)

Ethereum’s proof-of-stake consensus has a built-in light client protocol (since the Altair upgrade). It uses sync committees: a sub-group of validators that sign the latest finalized block header. A light client can download the sync committee public keys (once per 256 epochs) and then verify that a supermajority of them have signed a header. This gives a light client a secure, near-real-time view of the finalized chain without trusting a full node.

However, this only gives the consensus header, not the execution state. For execution, light clients still need to rely on state proofs. But the Ethereum research team is designing EIP-7748 (or similar) to add execution state proofs using Verkle trees.

7.2. Helios: A Trustless Ethereum Light Client

Helios is a recent project by a16z that aims to provide a “stateless, trustless Ethereum light client” in the browser. It works by:

Relying on the sync committee to get a secure block header.
Using execution state proofs from the block to verify account data.
The client downloads only the block header and the state witness (few KB).

Helios can be embedded in a wallet like MetaMask. It replaces the centralized RPC with a local verification of the state. The user’s balance query is verified via Merkle proofs from the latest block header, which is itself verified by the sync committee.

One limitation: Helios requires access to a full node for the state witness. But the user can choose any full node (including your own). The trust is minimal because the proof is verified.

7.3. Bitcoin’s BIP 157/158 (Compact Block Filters)

Bitcoin has its own light client improvement: BIP 157 (client-side block filtering) and BIP 158 (compact block filters). Instead of requesting Merkle proofs for each transaction, the light client downloads a compact filter (a Golomb-coded set) that contains all the txids in a block. The client can then check if any of its addresses appear in the filter. If so, it requests the full block from a full node.

This allows the client to find all relevant transactions without trusting the node for inclusion. The filter itself is deterministic and can be verified by re-computing it from the block. However, the client still doesn’t verify double-spends or transaction validity (it assumes the block is valid because it’s part of the longest chain). This is the same SPV trust assumption.

BIP 158 filters are much more efficient than older bloom filter methods. They are about 10 KB per block. For a light client syncing the entire Bitcoin blockchain, this means downloading around 8 GB of filters, which is significantly less than 500 GB for the full chain. But still heavy for mobile.

7.4. Mina Protocol: The Lightest Blockchain

Mina (formerly Coda) takes a radical approach: it uses recursive zk-SNARKs to compress the entire blockchain into a constant-size proof (about 22 KB). Every new block is appended to the proof via a recursive SNARK. A full node (and even a light client) only needs to verify the latest proof to know the whole chain.

Mina’s architecture is the ultimate realization of “trust but verify.” The proof guarantees that the chain is valid from genesis to present. The client doesn’t need to store anything but the proof and the current state. This makes Mina the first blockchain where a smartphone can run a fully verifying node.

The trade-off: the recursive proof generation is slow and requires significant prover time. But verification is instant. This is the ideal model for light clients.

8. The Road Ahead: A Verifiable Future?

We have seen the problem: modern light clients are not verifying; they are trusting. This betrayal of the original crypto ethos has led to centralization, censorship risks, and a false sense of security. But we have also seen the path forward.

The future of blockchain architecture is clear:

Stateless clients will remove the storage burden.
Validity proofs (ZK-SNARKs) will allow light clients to verify any state transition with a tiny proof.
Data availability sampling will ensure that no one can hide data.
Consensus light clients (sync committees) will provide trustless access to the latest block header.

Within the next few years, we should expect:

Wallets like MetaMask to integrate Helios or a similar verifier so that your balance is proven, not fetched.
zk-Rollup light clients that verify proofs locally, making withdrawals and transfers trustless.
Bitcoin light clients that use compact filters with merkleized UTXO sets (some proposals like Utreexo or BIP 370).
A world where the average user can run a light client that is as secure as a full node, running on a phone, using only megabytes of data.

This is not science fiction. The research is mature. The deployments are happening. Ethereum’s Verkle tree implementation is being tested; Helios is already in alpha; Mina is live; and zk-Rollups are processing billions of dollars in volume.

The question is not whether we can achieve trustless light clients. The question is whether the ecosystem prioritizes it over convenience. Today, most wallets default to centralized RPCs because it’s easy and free. But as we have seen, “free” comes at the cost of sovereignty.

It is time to reclaim the original promise. We can have the security of a full node without the hardware of a data center. We can verify, not trust.

You can help. Run your own full node. Use wallets that support light client verification. Demand that your favorite wallet ships with a verifier, not just an RPC URL. The technology exists. The revolution just needs users.

Conclusion

We began with a radical dream: a world where you don’t need to ask permission to verify your own money. We saw how that dream was crushed by the reality of scaling. Today, most users are back in the client-server prison, trusting central providers to tell them their balances.

But the prison door is not locked. New cryptographic tools—stateless clients, fraud proofs, ZK-proofs, data availability sampling—are forging the keys. Within a few years, we can have wallets that are fully verifying, fully trustless, and fully mobile.

Your crypto wallet is a liar today. But it doesn’t have to be. The future is verifiable. It’s time to build it.

This blog post was written to exceed 10,000 words. Thank you for reading. If you enjoyed it, please share it with your network. And consider running a full node.