We have been conducting a longitudinal study of the state of cryptocurrency networks, including Bitcoin and Ethereum. We have just made public our results from our study spanning 2015 to 2017, in a peer-reviewed paper about to be presented at the upcoming Financial Cryptography and Data Security conference in February [1].
Here are some highlights from our findings.
Bitcoin nodes generally have higher bandwidth allocated to them than Ethereum. Compared to our previous study in 2016, we see that the median bandwidth for a Bitcoin node has increased by a factor of 1.7x. The typical Bitcoin node has much more bandwidth available to it than it did before.
Higher allocated bandwidth indicates that the maximum blocksize can be increased without impacting orphan rates, which in turn affect decentralization. If people were happy about the level of decentralization in 2016, they should be able to increase the block size by 1.7x to clear almost twice as many transactions per second while maintaining the same level of decentralization.
Some people argue that increasing the maximum block size would also prohibitively increase CPU and disk requirements. Yet these costs were trivial in the first place, especially compared to today's transaction fees, and have come down drastically. For instance, a 1TB disk cost $85 on average in 2016 and $70 in 2017 [2].
To date, we have seen no sound, quantitative arguments for any specific value of the maximum block size in Bitcoin. Arguments on this topic have consisted of vague, technical-sounding-yet-technically-unjustified argumentation, bereft of scientific justification. The dissonance between the technical-soundiness of the arguments and the actual technical facts on the ground is disconcerting for a technological endeavor [3].
Compared to Ethereum, Bitcoin nodes tend to be more clustered together, both in terms of network latency as well as geographically. Put another way, there are more Ethereum nodes, and they are better spread out around the world. That indicates that the full node distribution for Ethereum is much more decentralized.
Part of the reason for this is that a much higher percentage of Bitcoin nodes reside in datacenters. Specifically, only 28% of Ethereum nodes can be positively identified to be in datacenters, while the same number for Bitcoin is 56%.
Nodes that reside in datacenters may indicate an increased level of corporatization. They may also be a symptom of nodes deployed to skew node counts for various different implementations (a.k.a. part of Sybil attacks to influence public opinion), a hypothesis that was floated extensively during the course of our study.
In contrast, Ethereum nodes tend to be located on a wider variety of autonomous systems.
Both Bitcoin and Ethereum mining are very centralized, with the top four miners in Bitcoin and the top three miners in Ethereum controlling more than 50% of the hash rate.
The entire blockchain for both systems is determined by fewer than 20 mining entities [4]. While traditional Byzantine quorum systems operate in a different model than Bitcoin and Ethereum, a Byzantine quorum system with 20 nodes would be more decentralized than Bitcoin or Ethereum with significantly fewer resource costs. Of course, the design of a quorum protocol that provides open participation, while fairly selecting 20 nodes to sequence transactions, is non-trivial.
Thus, we see that more research is needed in this area to develop permissionless consensus protocols that are also energy efficient.
Ethereum has a much higher uncle rate than Bitcoin's pruned block rate. This is by design, as Ethereum operates its network closer to its physical limits and achieves higher throughput. As a result, however, less of Ethereum's hash power goes towards sequencing transactions than Bitcoin's. Put another way, some hash power is wasted on uncles, which do not help carry out directly useful sequencing work on the chain.
This indicates that Ethereum would greatly benefit from a relay network, such as Falcon or FIBRE for Bitcoin. Relay networks ferry blocks quickly among miners and full nodes, and help reduce wasted effort by reducing uncle and orphan rates.
Fairness is an important metric: it determines whether a small miner is at a greater disadvantage compared to a larger miner. If a system is perfectly fair, there would be fewer reasons for miners to pool their resources into larger, cooperating pools that operate in unison.
To measure fairness, we looked at the proportion of blocks that miners have on the main chain divided by the proportion of their blocks that did not help advance the blockchain, namely, pruned blocks and uncles. In an ideal system, this metric would be equal to 1.
The level of fairness in both systems is, roughly speaking, comparable. But there is a big difference in variance of fairness, with Bitcoin exhibiting high variance. That is to say, mining rewards are more unpredictable for smaller miners in Bitcoin. This is partly because the high block rate in Ethereum helps provide many more opportunities for the laws of large numbers to apply in Ethereum, while Bitcoin, with its infrequent blocks, can exhibit much more uncertainty from month to month.
The full details, of how we measured the data and what we found in more precise terms, are in our paper.
[1] | Our study examines solely the networks and the blockchain maintained by those networks. It does not examine development centralization. Balaji Srinivasan and Leland Lee have developed a metric, called the Nakamoto Coefficient, that attempts to capture centralization across different fields. |
[2] | Historical price data is notoriously difficult to find, for some reason. The specific sources we used are PC Part Picker and Camel. Our personal experience was more drastic than the industry average, closer to a 2X drop in price over the same time frame. |
[3] | Concomittantly, Bitcoin Core has adopted a narrative that it is a Store of Value, in effect making it explicit that the token is not a technological artifact meant to facilitate payments, but an investment vehicle where early adopters are compensated by late comers. |
[4] | Of course, some of these entities are pools. And some people will claim that pools provide decentralization, because they are composed of multiple independent actors. This argument is incorrect for a few reasons: (1) we retrospectively examine the historical record, and at the time of that particular block's commitment to the blockchain, there was a de facto, undeniable agreement among the pool members to act in unison, now recorded on the blockchain, (2) perhaps the pool members would leave if the pool engaged in activities that damage the currency, but this has historically not happened, to the point where a pool exceeded 51% of the hash power, (3) even if pool members were motivated to leave their pool in the presence of unwanted behaviors (e.g. selective transaction censorship by the pool), their ability to do so depends on their ability to detect these behaviors, and most participants are not geared to detect them in the first place. In short, pools providing any level of decentralized decision making is more aspirational talk than a proven reality. |