As a long-time user and developer of databases, I would suggest isolation failures are not actually the source of most data-related bugs. Most bugs I deal with come from other failure modes:
* We didn't think about how we would retry this operation when something fails or times out (idempotency)
* We didn't put the appropriate checksums in the right place (corruption)
* We couldn't handle the load, often because we tried to provide stronger guarantees than the application needed, and went down, losing operations (performance bottlenecks)
* We deployed bad software to the app or database, causing irreparable corruption that can't be fixed because we already purged the relevant commit/redo logs + snapshots.
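On the corruption point, the idea is just "store a checksum next to the bytes and verify it on read". A minimal sketch using Python's stdlib `zlib.crc32` (real systems typically checksum at the page/block level, not per record):

```python
import zlib

def store(record: bytes) -> tuple[bytes, int]:
    """Persist a record alongside its CRC32 checksum."""
    return record, zlib.crc32(record)

def load(record: bytes, checksum: int) -> bytes:
    """Verify the checksum before trusting the bytes."""
    if zlib.crc32(record) != checksum:
        raise ValueError("record is corrupt")
    return record

data, crc = store(b"balance=100")
assert load(data, crc) == b"balance=100"

# A corrupted copy is caught on read instead of silently propagating:
corrupted = b"balance=900"
try:
    load(corrupted, crc)
except ValueError:
    print("corruption detected")
```

The "right place" part of the complaint is the hard bit: the checksum has to travel with the data through every hop (network, cache, disk), or corruption introduced at an unchecked hop goes unnoticed.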
I legitimately don't understand the calls for "SERIALIZABLE is the only valid isolation level" - I have not typically (ever that I can recall) seen at-scale production systems pay that cost for writes _and_ reads. Almost all applications I've seen (including banking/payment software) are fine with eventually consistent reads, as long as the staleness period is understood and reasonably bounded in time. Once you move past a single geographic datacenter, serializable writes become extremely expensive unless you can automatically home users to the appropriate leader datacenter, which most engineering teams can't guarantee.
The key is typically not isolation; it's modeling your application in an idempotent fashion that doesn't require isolation to be correct, and keeping snapshots and those idempotent operation logs for a good few weeks at minimum. Maybe the Java analogy would be "if you can design it to not need locks, do that".
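A minimal sketch of that idempotent style (sqlite3 purely for illustration; the `op_log` table and client-generated op ids are my invention, not a standard): each operation carries an id, and applying it twice has the same effect as applying it once, so a retry after a timeout is always safe.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE accounts (id TEXT PRIMARY KEY, balance INTEGER)")
db.execute("CREATE TABLE op_log (op_id TEXT PRIMARY KEY)")
db.execute("INSERT INTO accounts VALUES ('alice', 100)")

def credit(op_id: str, account: str, amount: int) -> None:
    """Apply a credit at most once per op_id; replays are no-ops."""
    with db:  # one transaction: log entry and effect commit together
        seen = db.execute(
            "SELECT 1 FROM op_log WHERE op_id = ?", (op_id,)
        ).fetchone()
        if seen:
            return  # already applied; the retry changes nothing
        db.execute("INSERT INTO op_log VALUES (?)", (op_id,))
        db.execute(
            "UPDATE accounts SET balance = balance + ? WHERE id = ?",
            (amount, account),
        )

credit("op-1", "alice", 50)
credit("op-1", "alice", 50)  # retry after a timeout: no double-credit
balance = db.execute(
    "SELECT balance FROM accounts WHERE id = 'alice'"
).fetchone()[0]
print(balance)  # → 150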
Serializable is easy to reason about, and it also moves the problems of distributed systems into the database, where they can be handled more appropriately imo.
It is by no means a silver bullet and depending on your application it may not be the right choice.
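For what it's worth, that "move the problem into the database" behavior is observable even in SQLite, which executes every transaction serializably: a second writer that would conflict is refused by the engine itself, with no coordination logic in the application. A small sketch (the file path and timeout settings are incidental):

```python
import sqlite3, tempfile, os

# Two connections to the same on-disk database (":memory:" is per-connection).
# isolation_level=None means autocommit; we issue BEGIN ourselves.
path = os.path.join(tempfile.mkdtemp(), "demo.db")
a = sqlite3.connect(path, timeout=0, isolation_level=None)
b = sqlite3.connect(path, timeout=0, isolation_level=None)
a.execute("CREATE TABLE counters (name TEXT PRIMARY KEY, n INTEGER)")
a.execute("INSERT INTO counters VALUES ('hits', 0)")

# Writer A opens a write transaction and holds it open
a.execute("BEGIN IMMEDIATE")
a.execute("UPDATE counters SET n = n + 1 WHERE name = 'hits'")

# Writer B's conflicting write transaction is rejected by the database
conflict = False
try:
    b.execute("BEGIN IMMEDIATE")
except sqlite3.OperationalError:  # "database is locked"
    conflict = True

a.execute("COMMIT")
print(conflict)  # → True: the DB, not the app, arbitrated the conflict
```

The cost question from upthread still stands, of course: a single-file lock is cheap, while serializable writes across datacenters are not.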
The whole point of the RDBMS revolution in the 70s and 80s was to try to bring about a world where developers did not have to care about how their data was stored, and could rely on consistency (and data representation independence).
The way this should all have gone down is that the caching story should have been something that DB vendors resolved, rather than something pushed into the application tier. But the push towards three tier architectures, and OOP and ORMs, meant this wasn't feasible.
What would be ideal is a single consistent data retrieval model, which extends from the physical retrieval of relations, all the way up to the presentation layer, all one transaction, and handles caching for you. There is already caching happening within the DBMS, for example...
I'm saying the line between the two is largely of our own making. The push towards OO and component models meant a strong separation between the two layers -- this was and is accepted as the "right" way to model things. But it comes with the cost of leaky abstractions, potentially broken isolation models, and high non-essential complexity by nature of the constant transition between components.
If it wasn't for this, we could be looking at DB architectures in which application logic co-habits with the DB. This doesn't imply application logic in the DB, but means that the DB's view of the data moves its way up into the application. Where the logic gets executed isn't as much the concern as what that logic operates on, and that the data isolation model is consistent.
I am also of the opinion that the relational model, with its predicate-logic view of the world, is a richer way to model information than objects. So that's my bias.
A lot of this is straight out of the "Out of the Tarpit" paper, FWIW.
Something like Hibernate in Java will fetch data from the database once, populate objects (potentially making cycles and complex relationships between Java objects), and then let your business logic deal with those long-running, persistent Java objects (as opposed to objects that you deallocate as soon as you're done with them after querying the database).
This means that if you ever happen to reuse such an object in another context without making a new query, you risk dealing with stale data. And this happens all the time, because querying the db is seen as "expensive" and reusing model objects is "cheap".
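The Hibernate specifics aside, the failure mode is easy to reproduce with any cached model object. A language-agnostic sketch (plain Python and sqlite3 standing in for the ORM, so the mechanics are visible):

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
db.execute("INSERT INTO users VALUES (1, 'old@example.com')")

class User:
    """A model object populated once from a query, then kept around."""
    def __init__(self, row):
        self.id, self.email = row

# First context: query once, build the object
user = User(db.execute("SELECT id, email FROM users WHERE id = 1").fetchone())

# Meanwhile, something else updates the row
db.execute("UPDATE users SET email = 'new@example.com' WHERE id = 1")

# Second context: reusing the "cheap" cached object yields stale data
print(user.email)  # → old@example.com

# Only a fresh query (in Hibernate terms, a Session.refresh) sees the update
fresh = User(db.execute("SELECT id, email FROM users WHERE id = 1").fetchone())
print(fresh.email)  # → new@example.com
```

Nothing warned us that `user` was stale; the object and the row simply diverged the moment another writer touched the database.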