Hacker News | twokei's comments

Yep! Just to clarify, this is entirely hypothetical! I want to know the value-add people would put on such a database if it were to exist.

From what I've seen, there's little insight in the database market into what people really want most out of a database, or what makes a database "appealing" to people apart from its reputation for being time-tested.


Gotcha! Makes sense to me :). What price tag would you put on this software?

From what I've seen, there's been a pretty interesting trend toward annual licensing fees for enterprise features, with the code open-sourced.


It would be possible if, say, the database were fully replicated in a masterless fashion.

The tradeoff is slower writes (though there are ways to keep write latency from growing as you replicate across more nodes!).

Reads will scale linearly with the number of nodes you replicate across.

So, the novelty in some sense is fast replication of data in a masterless fashion (without any leader, as in, for example, Raft or Paxos).

If this protocol were surprisingly simple (which would reduce the complexity of the software significantly), would you pay for this sort of database?
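A toy sketch of the tradeoff described above (the latencies and throughput numbers are hypothetical, not from any real system): in fully replicated, masterless designs where a write must reach every replica, write latency is dominated by the slowest replica, while any single replica can serve a read, so read capacity grows linearly with node count.

```python
import random

random.seed(0)

def replica_latency():
    # Hypothetical per-replica network latency, in milliseconds.
    return random.uniform(1.0, 5.0)

def write_latency(n_replicas):
    # A fully replicated write must reach every replica,
    # so its latency is the slowest replica's latency.
    return max(replica_latency() for _ in range(n_replicas))

def read_capacity(n_replicas, per_node_reads_per_sec=10_000):
    # Any replica can serve a read, so read throughput
    # scales linearly with the number of replicas.
    return n_replicas * per_node_reads_per_sec

for n in (1, 3, 9):
    print(n, round(write_latency(n), 2), read_capacity(n))
```

This also hints at why write latency need not grow with node count: writes can be sent to all replicas in parallel, so the cost is the slowest replica, not the sum.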


Yeah, no, the database being replicated "in a masterless fashion" doesn't help. The problem is that under some network partition conditions you can't perform some writes, or any writes at all, unless you want to sacrifice resilience and/or consensus.


Wouldn't there only be failures under a complete network partition? So long as one node in one partition can communicate with a node in another partition, writes can still be performed.

Availability would be what's sacrificed in the event that a node is partitioned away from the main network.


Only when that's true of those two nodes and no other disjoint subset of nodes.


A fork could be created just by taking the existing longest chain and appending a block to the frontier block's parent, or even to the frontier block itself. It's pretty simple. The time between blocks is still non-deterministic, and realistically there often isn't a clear winner.

Having n forks, even at a bare minimum 3 forks of the same height, is very common, especially when you consider how many miners there are in the entirety of the Bitcoin network.
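The fork scenario above can be sketched with a toy chain (the block structure, payloads, and miner names here are made up for illustration): a competing miner builds on the frontier block's parent, leaving two tips at the same height.

```python
import hashlib

def block(parent_hash, payload):
    # A minimal block: identified by the hash of its parent's hash + payload.
    h = hashlib.sha256((parent_hash + payload).encode()).hexdigest()
    return {"hash": h, "parent": parent_hash}

genesis = block("", "genesis")
a = block(genesis["hash"], "miner-A")  # the frontier block
# A competing miner appends to the frontier block's *parent*,
# producing a second chain tip at the same height: a fork.
b = block(genesis["hash"], "miner-B")

tips = [a, b]
print(len(tips), "tips at the same height")
```

Which tip "wins" is only decided later, when one branch is extended first, which is exactly why the outcome is non-deterministic.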


I tried to find details about the current mini-forks but couldn't. Do you have a link? I'd guess the number is small because the mining pools are well connected and none of them want to waste resources (because they lose money).

3 forks is probably a good estimate, but I'd like to see some data. Anyway, it's a linear problem that is adjusted automatically by the difficulty of finding the next block. It's not an exponential problem that makes the protocol impractical at scale.


The difficulty adjustment algorithm for Bitcoin is unfortunately very naive; there's a reason block timestamps are allowed to deviate by up to 2 hours.

For some data, look into orphan blocks: https://www.blockchain.com/btc/orphaned-blocks
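For reference, Bitcoin's retargeting rule is a single clamped linear correction, roughly as sketched below (simplified: the real implementation operates on the compact "bits" target encoding, and the constants are from the actual protocol):

```python
TARGET_TIMESPAN = 14 * 24 * 60 * 60  # two weeks, in seconds
RETARGET_INTERVAL = 2016             # blocks between adjustments

def retarget(old_target, actual_timespan):
    # Bitcoin's adjustment is one linear correction every 2016 blocks,
    # clamped to 4x in either direction -- part of why it reacts
    # slowly to sudden hash-rate swings.
    actual_timespan = max(TARGET_TIMESPAN // 4,
                          min(actual_timespan, TARGET_TIMESPAN * 4))
    return old_target * actual_timespan // TARGET_TIMESPAN

# Blocks arriving twice as fast as intended => target halves (mining gets harder).
print(retarget(1_000_000, TARGET_TIMESPAN // 2))
```

A lower target means a harder proof-of-work, so halving the target here compensates for blocks that arrived twice as fast as the 10-minute goal.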


Interesting - would love your thoughts, why is it that DAGs aren't very scalable?


DAGs aren't very scalable because, as with git, you have to store the whole history.

In some scenarios, you can rebase/snapshot to clean up history, but these usually require a type of centralization or consensus, which defeats the point of using something like git.

As a result, DAGs can only be used to replace a subset of apps/tools out there.

The most important/used apps, though, are indexed lists. Things like:

- Google rankings

- Reddit homepages

- AirBnB listings

- Ubers nearby

If you're updating a geo-index hundreds of times a second, as with cars' GPS locations, then a DAG just wastes resources on history you'll never use, and you wind up bottlenecking the system, preventing it from scaling beyond a certain threshold. I've dealt with this in practice, and it was no fun.
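A toy illustration of the storage difference being described (the car IDs, coordinates, and update counts are made up): an append-only DAG-style log keeps every GPS update forever, while a mutable geo-index keeps only the latest position per car.

```python
# Append-only DAG-style log vs. a mutable index keyed by car ID.
dag_history = []   # keeps every update forever
geo_index = {}     # one entry per car, overwritten in place

def report_position(car_id, lat, lon):
    dag_history.append((car_id, lat, lon))  # history grows without bound
    geo_index[car_id] = (lat, lon)          # stays at one entry per car

# Hundreds of updates per second add up quickly.
for tick in range(1000):
    for car in ("car-1", "car-2"):
        report_position(car, 37.77 + tick * 1e-6, -122.41)

print(len(dag_history), "entries in the append-only log")
print(len(geo_index), "entries in the mutable index")
```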

We switched off DAGs, and our biggest production deployment now serves 15 million monthly users. This is far, far beyond the scale of any of the previous systems.


Unfortunately not, but thanks for the tidbit on Noise (the crypto protocol).

We're open to integrating with best practices in a manner that makes the library more secure, and everyone happy.


Halfway through, the paper describes a new approach to creating trustless, decentralized cloud computing markets via cryptographic resource attestation.

To simplify, the resource attestation model allows one to bind a virtual currency to some amount of computational time and resources.

It allows users to securely rent out their idle smart devices to developers, researchers, startups, and enterprises who need large amounts of compute power at low cost.

