Data Models for SQL and NoSQL: Horseshoe Pattern

eyelidlessness · on Nov 8, 2021

> If your data has a one-to-many relationship, then Document Database (NoSQL) is more appropriate.

Maybe. Probably not. But what happens when you want to query the inverse many-to-one relation? Unless your document store is effectively indexing these one-to-many relations as another document store with inverse relations (at which point you have an RDBMS regardless of what you call it), it’s probably going to be wildly inefficient.

esclerofilo · on Nov 8, 2021

Article agrees, check out the picture at the end. It has "many to one" listed as a relational strength.

eyelidlessness · on Nov 8, 2021

I know, that’s part of why I found the quote so off and felt like it should be questioned directly.

DenisM · on Nov 8, 2021

TLDR: SQL databases add JSON support becoming no-sql'ey, while NoSQL databases are adding better joins, becoming more sql'ey. Hence, convergence.

I'm not sold on the argument myself, but I did not keep track recently.

kumarvvr · on Nov 8, 2021

Evidence of a database convergence is clear in the case of postgresql.

I guess the logical convergence point is not an sql or nosql db, but a datastore, that allows for mixed use.

smitty1e · on Nov 8, 2021

SQL focuses on the plural set of tuples. I want fields and indices. Joins and types on the fields are part and parcel of the engine.

NoSql focuses on individial key/value pairs. Indices and types and joins have to be stapled onto the opaque value.

SQL is more mature and general. NoSql has its place. Eningeering is grasping the virtues of both and implementing what is appropriate.

Convergence is going to land on the SQL side.

kumarvvr · on Nov 8, 2021

Absolutely.

Take the example of eCommerce. I tend to categorize eCom data into three units. Definitions, Transactions, State.

Definitions : Data that describe products, brands, categories, customers, vendors, etc. This has incredible variation and constantly evolving, with perhaps some core tenets being common across different business use cases.

Transactions : These are your sales, purchases, discounts, returns, etc. Not too much variation, more or less similar across various business verticals.

State : Inventory, stores, location of goods, etc. Least variance, can be referenced to Definitions and Transactions. Operations are standard and mostly fixed.

Choosing a PostgreSQL database, allows me to model Definitions as collections of JSONB items, with a few columns for id, time stamps, etc.

Transactions and state are best modeled as relational tables, with references to definitions.

raihansaputra · on Nov 8, 2021

How does the app code deal with the State/Transaction data? I assume the State/Transaction data will have columns that will comply with a Definition? How does the State/Transaction change after a Definition is changed?

kumarvvr · on Nov 8, 2021

State is referenced to object id, state changes are stored as ledger entries, transactions are also ledger entries.

All are referenced to object ids only.

pphysch · on Nov 8, 2021

Yup. Postgres feels very good with latest JSON type features. Even a really unpredictable JSON document has some metadata that you can stick in typed columns and get the full power of SQL over your little "NoSQL" column. Timestamps, author, etc.

GaelFG · on Nov 8, 2021

I read in the article a common writting :

Schema flexibility is another major differentiating factors between SQL and NoSQL. The relational world imposes a clear schematic structure during the design phase itself and any change in the structure is generally considered expensive. NoSQL offers Schema-on-Read. This makes it a lot easier to change the data format as data evolves.

I only worked briefly with a nosql 'real' production database but we needed to change the data shema, and migration was a pain (like any migration). Was it because some special user case ? Is migrating data really simplier with nosql for a reason I don't see ?

Naively I find them even worse because you can have multiple shema at once on the same table, so each read on an important document must include a sanity check 'what is the current shema of this particular entry, should we migrate it ?'. What do I miss ?

cipher_system · on Nov 8, 2021

I've found Postgraphile to be an excellent solution for the "Impedance mismatch" for Postgres. All the strengths of SQL and in the app have access to a smooth graphql api without any effort.