Looking Ahead to Postgres 19

(snowflake.com)

98 points | by thinkingemote 1 hour ago

12 comments

sega_sai 31 minutes ago
Speaking as long-term (>15 years) user of Postgres in science, I am getting worried about the lack of columnar type of storage in Postgresql. As the datasets become bigger and bigger, the limitations of PG's storage are becoming more and more significant. I know there are various extensions (i.e. cetus) that may offer such functionality, but then you depend on that extension being supported in the future, as well additional complexity.
[-]
- tyre 27 minutes ago
  You might be using the wrong database if that’s what you’re hoping for. Columnar databases are a separate category.
  It’s like saying that you’re getting worried Apple doesn’t sell washing machines.
  [-]
  - atombender 4 minutes ago
    Postgres's whole origin story is basically to think outside the box and don't be constrained by existing thinking. Stonebraker thought existing databases were too limited in terms of their data types and expressiveness. He started Postgres as an evolution of Ingres (Postgres = Post Ingres) that added rich custom data types and a rewriting system based on rules.
    Columnar and all the other fun stuff (JSON, GIS, inverted indexes, embedding vectors) is a natural progression of that thinking. With TimescaleDB, Hydra, Citus, pg_mooncake, etc. becoming very popular the last few years, there is a clear demand for an integrated experience.
    (Stonebraker also thought one database shouldn't do everything, as described in his early 2000s "One Size Does Not Fit All" paper, and Stonebraker branched out into HStore/Vertica for columnar. In hindsight, I think that was appropriate for the time, but no longer a significant concern.)
  - sega_sai 10 minutes ago
    PostgreSQL was a right tool for my task for many years. It is a question for PostgreSQL can adapt to a new reality of much bigger datasets or I have to switch to a new tool. And I am not the only user of Postgresql in this context. So it is easy to say in vacuum 'you are using the wrong database', but it's not something that can be easily changed with 100s of Tb of data, existing user workflows etc.
    [-]
    - chucksmash 0 minutes ago
      Just because it's easy to say doesn't mean it's wrong.
      By the same logic, you could say Microsoft Access should have all the capabilities of Postgres because it's painful for small businesses to move off of it when it's no longer a good fit for their needs.
    - brookst 2 minutes ago
      Age old product question. The Honda Civic is much larger today than the original, because the original targeted young people on a budget. As they aged and had kids, they needed more room in the car. Most peoples’ insinct is to buy a newer version of the car they love, so the civic grew to accommodate the aging market.
      No real point here other than an observation about how the installed base’s needs change, across industries.
  - mickeyp 18 minutes ago
    I mean I don't disagree with you, but they did just add a graph database feature, which is about as orthogonal to relational database design as you can get.
- segmondy 21 minutes ago
  Fine by me, as a 25 years user. More is not necessarily better, see redis.
- znpy 10 minutes ago
  I was about to post something similar.
  There was a big fanfare about orioledb a while ago, and i think it got bought by people that wanted to push that into mainstream postgres?
  Did it die somewhere along the road?
  [-]
  - eatonphil 3 minutes ago
    Supabase customers run it (marked alpha).
    https://supabase.com/blog/orioledb-launch
    And they continue to work on it.
    https://supabase.com/blog/orioledb-patent-free
aljgz 1 hour ago
I've used Postgres, Oracle, MsSql Server, and MySql in serious projects, no extensive experience with Sqlite, which I know is an amazing player.
These days, I do myself a favor and always avoid Oracle and MySql/MariaDB.
Postgres is amazing, and the two big things I wished it had:
1. lightweight connection; connection bouncers improve the situation, but you still have an unreasonably high memory footprint per concurrent connection.
2. Synchronously updated materialized views (Sql Server calls them indexed views). These are incredible tools in complex data situations. I saw a project struggle with complex technical implementations that would be elegant, trivial and always correct with indexed views.
Sql Server can be costly, but in many cases the benefits it provides are totally worth the cost.
Choosing the data store carefully prevents lots of future trouble.
[-]
- bob1029 42 minutes ago
  SQLite and MSSQL are my two solutions for relational storage problems.
  If I am going to use a "free" provider, SQLite is impossible to beat. They cover a majority of use cases today. SQLite starts to fall apart with backup, replication and tooling. If I am on the hook for things like system availability and disaster recovery, I don't have a problem spending money to cover my ass.
  If I am going to pay any amount of money at all, I am going all the way. The developer experience around MSSQL is untouchable. SSMS and VS with sql projects runs circles around contemporary entity framework crap. Sprinkle in 3rd party tools from vendors like RedGate and you can replace multi-million dollar consulting packages.
  I wouldn't ever advocate for standing up a new Oracle or DB2 machine, but if one was already in place I'd probably die on the hill of not trying to refactor it away. These databases typically come with multi-volume ghost stories attached. Reinventing all those weird effects on a new engine will typically kill the business if there are no other options available.
- asah 40 minutes ago
  two techniques I use with pg:
  1. "materialize" the view as a full table, then index that. Any reasonable pipeline/ETL tool can provide incremental updates between tables. Obviously, anything materialized requires considerations around storage, replication, backup/restore, I/O, etc.
  2. use a regular VIEW and index (precisely) the underlying expressions mentioned in the view, i.e. so when the view is used, then the indexes get used.
  Both require rewriting SQL, though I've used VIEWs to make the change transparent.
- d125q 49 minutes ago
  Care to share some examples where SQL Server's indexed views would shine?
  In my eyes they're similar to triggers, which incur a high performance overhead in OLTP systems and are shunned by developers. In OLAP systems custom ETL code will likely outperform them.
  [-]
  - aljgz 8 minutes ago
    Indexed views are much faster than trying to achieve the same result with triggers. Triggers have serious concurrency limitations, and you do recalculations even when the fields you depend on are not touched.
    Indexed views are not much worse than indexes. Of course, when they refer to other tables there are underlying data lookups, but in our experience when we moved from triggers to indexed views, large scale data ingestion went way faster.
    Where we used it: While revamping a large scale sales program, we stored the warehouse in/out in one table, and several things like current stock were calculated using indexed views.
    Bonus: Using Snapshot concurrency control, you can do many things concurrently, and only when they both updates to a certain product in the same store you'll get the second transaction failing (which could be retried on the backend).
    The fact that they are completely in-sync with your data is amazing.
  - mickeyp 16 minutes ago
    These things exist to eliminate the risk of ever serving stale information from a materialised view. I.e., their benefit is political/reputational as much as they are technical in the sense that they save you effort like remembering to invalidate a MV after an ingest operation.
    Stale MV is a thing you only ever burn your fingers on once. Like how "It's not DNS" is a common meme in networking.
- ksec 45 minutes ago
  >I do myself a favor and always avoid Oracle and MySql/MariaDB.
  So what's wrong with MySQL or MariaDB?
  [-]
  - rmunn 5 minutes ago
    Don't know of anything wrong with MariaDB, but there used to be plenty wrong with MySQL. To give the most egregious example (THANKFULLY fixed in MariaDB, but was present in MySQL for the longest time), inserting the value 128 into a TINYINT column (signed 8-bit int) would clamp the value rather than returning an error. Which might be what you want... except if that was a primary key column. Marvel at the following, which used to be how MySQL behaved:
    Note: the below taken nearly verbatim from https://sql-info.de/mysql/referential-integrity.html#3_5
```
  CREATE DATABASE foo;
  USE foo;
  CREATE TABLE one ( id TINYINT NOT NULL PRIMARY KEY ) TYPE=InnoDB ;
  CREATE TABLE two (
    id TINYINT NOT NULL PRIMARY KEY,
    INDEX (id),
    CONSTRAINT id_fkey FOREIGN KEY (id) REFERENCES one(id)
  ) TYPE=InnoDB ;
```
    Now that we've created both tables, let's insert a record into table one:
```
  INSERT INTO one VALUES (127);
```
    And now let's insert a record with a different primary key into table two:
```
  INSERT INTO two VALUES (128);
```
    MariaDB will give you an error at this point (ERROR 1264 (22003): Out of range value for column 'id' at row 1), but MySQL (at least back when I tried this about ten years ago, which was the last time I was forced to work with MySQL — and I am so glad I never have to go back!) would return no error message and just say "Query OK, 1 row affected (0.009 sec)".
    Now let's select the value we inserted into table "two":
```
  SELECT * FROM two;
```
    And what do we see? The value 127, even though we inserted 128. Which has created a foreign-key relationship to table "one" that we never intended to put in there.
    There are other reasons why MySQL was inadequate, but I no longer remember them. Probably MariaDB has fixed them by now. But I no longer have to use MySQL/MariaDB for anything, and I never want to go back. I have a VERY strong averse reaction, caused by past pain, when I think of using MariaDB. (I actually spun up a virtual machine to test what I wrote here, because there's no way I was going to install MariaDB on my primary work machine).
  - phamilton 19 minutes ago
    Just a set of things too minor to move off of it but annoying enough to not want to start with it.
    My list:
    No `explain (analyze,buffers)`. Instant DDL has some warts (e.g. fk, metadata locks). Query planning bugs (actually... query planning in general is disappointing). Exiting the repl doesn't stop queries. Implicit type casting. Replication lag from large DDL (e.g. creating an index). Lack of two phase DDL (creating constraints NOT VALID and then VALIDATE later). Lack of extensions (e.g. pg_vector). No safe access to inspect buffer cache. AWS Aurora seems to only add shiny new things to Postgres. And more.
    Again, none of this is quite enough to migrate off of it for an established system, but certainly enough to avoid it on a new project.
- cwnyth 59 minutes ago
  What's wrong with MariaDB?
  [-]
  - chuckadams 23 minutes ago
    Last I looked, MariaDB still implemented JSON columns as LONGTEXT under the covers, making it a non-starter for any serious use of said type.
- pbronez 57 minutes ago
  I am currently fighting my way off SQL Server towards PostgreSQL.
  Windows Server is a real pain to operate and the SQL Server ecosystem expects you to run a lot of add-ons on the server alongside your database. Those don’t translate to managed database services, so you lose a lot of functionality if you jump to RDS or similar.
  The first party tools are also aging poorly. SSIS and SSRS are not fun. SSMS is ok for what it is but can’t compete with the ecosystem around PostgreSQL.
  Maybe I’m missing something but I can’t wait to ditch it.
  [-]
  - aljgz 6 minutes ago
    Agree about Windows Server. You can run SqlServer on Linux though. I'm not aware of your specific addons, but the Sql Server itself works perfectly well on Linux.
  - hotsauceror 41 minutes ago
    What are some of the add-ons that you run on the server? We run ours in a pretty bare-bones manner so I'm interested to hear what you're doing.
mickeyp 20 minutes ago
The graph database feature looks interesting, but I wonder...
SELECT customer_name FROM GRAPH_TABLE (myshop MATCH (c IS customers)-[IS customer_orders]->(o IS orders WHERE o.ordered_when = current_date) COLUMNS (c.name AS customer_name));
That is _awful_ syntax; it is reminiscent of neo4j, which is surely not a tool anyone serious should copy from outright in 2026.
And of course the final thing I am left wondering is if it's fast. Row-level security is such a useful feature and yet only a fool would contemplate building anything serious with Postgres', as the planner goes haywire and does per-row-matching, nuking performance.
magnio 1 hour ago
i like the COPY and logical replication improvements. Currently I back up my PG database with a sidecar Databasus instance that is heavier than my entire backend + DB + Caddy!
(LLM writing rant below)
---
> That alone tells you something: Users had a real need, and the ecosystem filled the gap.
> This sounds straightforward, but it solves a real operational problem.
> None of these change the world. All of them make day-to-day data workflows better.
> The easy thing to do here is list planner changes and call it done. But the more useful takeaway is this: Postgres keeps getting better at recognizing the shape of common queries and doing less unnecessary work.
> [Proceed to list planner changes]
If Orwell were alive today, he might declare himself illiterate in English and learn Klingon just to avoid having to read these.
mkurz 1 hour ago
No word that PostgreSQL 19 introduces native application-time temporal data support based on the SQL:2011 standard? https://www.depesz.com/2026/04/02/waiting-for-postgresql-19-...
[-]
- mickeyp 14 minutes ago
  They're a cool feature but honestly a bit tricky to use well, IMHO. And be careful with PII lingering in a temporal void somewhere for a long time :-)
- unfocso 58 minutes ago
  Wow. Incredible how this was not mentioned in the OP. I had done it with tcn triggers and adding "_archive" shadown tables manually with tcn (https://www.postgresql.org/docs/current/tcn.html), but doing it natively is gonna be, as per most postgresql implementations, wonderful.
- m_w_ 1 hour ago
  Nor more than a mention of Query Hints, which had some interesting discussion under a similarly-titled submission.
  https://news.ycombinator.com/item?id=48413655
aynyc 14 minutes ago
I'm wondering. Just wondering? Will they ever support multiple storage engines like MariaDB? Having a storage engine that support OLTP or OLAP or append-only would be cool. I totally understand if they don't want to do that.
Jysix 39 minutes ago
I'm dreaming of block compression in Postgresql, instead of only row compression, too limited to be effective. I know you can store your data on a Zfs pool with block compression, but having it native would remove the burdain of setting this up and maybe better perf.
tehlike 39 minutes ago
I am looking forward to the day it supports table access methods that enables variety of use cases out of box.
Something like rocksdb as PG backend would be fantastic. Yugabyte does this but it's not PG.
[-]
- prpl 29 minutes ago
  Salesforce runs Salesforce on postgres with LSM:
  https://vldb.org/cidrdb/2026/a-multi-tenant-relational-oltp-...
  [-]
  - tehlike 11 minutes ago
    Likely their own fork, not plain PG.
- Guillaume86 23 minutes ago
  OrioleDB seems interesting, not sure it's what you're after but worth a look.
  [-]
  - tehlike 12 minutes ago
    It is, but it's not usable right now without some patches to postgres. At least that was the case as of PG17, and very likely 18 too.
    Context: https://www.orioledb.com/docs#:~:text=OrioleDB%20currently%2...
breakingcups 1 hour ago
I can't decide whether this person writes in the type of style that was apparently overrepresented in LLM training, or whether they heavily used AI to spruce up their writing. I'm learning towards the latter.
[-]
- tux3 1 hour ago
  Spruce up is unreasonably charitable. I'm more irritated that the authorship information is misleading. craig-kerstiens is not available on Huggingface, and yet not a single sentence in this article seems to have been typed on a keyboard.
  When Claude writes things like "as someone who has spent a lot of time doing X", I think this is also a kind of failure of alignment. LLMs shouldn't write as if they had personal experience. It's something a person might say in the training data, but I just think LLMs shouldn't claim life experience they don't have, even if that's a statistically likely sequence of tokens.
- iterateoften 1 hour ago
  These low effort constant comments about style or formatting are against Hackernews guidelines for discussions and something needs to be done to clean up the comment section. Getting to a ridiculous point
  [-]
  - theappsecguy 1 hour ago
    It's not about style or formatting, people are tired of reading slop.
    [-]
    - horsawlarway 54 minutes ago
      These comments are worse than the slop.
      [-]
      - bspammer 21 minutes ago
        They aren't actually. They make me feel like I'm not going crazy - that I'm not the only person noticing that the quality of the average article on hacker news has dropped off a cliff in the last 6 months. Links from different people with different cultures, life experiences, and languages have the same tone of voice, the same sentence structure, and the same breathless, boring, staccato yet arrhythmic, emotive yet soulless style.
        I hope everyone keeps pointing it out. Even better, change the site guidelines to make AI generated articles a flaggable offense. It's already been done for comments.
- eatonphil 1 hour ago
  Pangram says the text is entirely AI generated but I don't know how trustworthy Pangram is. (I would love to hear what others think about it.)
  [-]
  - Chu4eeno 49 minutes ago
    It has extremely low false positive rate in various tests (e. g. https://arxiv.org/html/2501.15654v1), I've never seen it mark something I knew was human written as AI.
- __s 45 minutes ago
  You don't need to be charitable, Snowflake laid off technical writers citing AI to replace them: https://snowflake.help/snowflake-layoffs-2026-technical-writ...
- Waterluvian 1 hour ago
  I get it's in vogue to take stabs at whether or not AI was used. But I think the more useful approach is to instead be critical of the end product, if you have criticisms of it.
  [-]
  - bronson 1 hour ago
    This IS a criticism of the end product.
    [-]
    - Waterluvian 1 hour ago
      This actually gave me an amusing idea: a book review club that strictly reviews the cover art, book binding, hand feel, paper weight, font, etc. of the book.
      [-]
      - red_trumpet 1 hour ago
        How weird a book club that actively ignores the style of writing would be!
        [-]
        chuckadams 18 minutes ago
        That's most vinyl collectors I know of these days, they collect for the album art and might never land a needle on the actual record.
      - throwaway613746 1 hour ago
        [dead]
ing33k 1 hour ago
The Authors avatar looked familiar, crunchydata ?
[-]
- eatonphil 1 hour ago
  Snowflake acquired Crunchy Data yes and Craig was at Crunchy Data.
  [-]
  - pbronez 54 minutes ago
    Interesting, I was wondering why Snowflake was investing in PostgreSQL. Looks like Snowflake bought Crunchy Data and Databricks bought NEON… so the two leading DWaaS companies have managed PostgreSQL offerings now.
    [-]
    - __s 47 minutes ago
      & ClickHouse has managed Postgres in open beta (which I work on)
klaussilveira 47 minutes ago
Oh my, finally a reason to upgrade from 16.
[-]
- fourseventy 43 minutes ago
  The Async I/O in PG18 is what got me to jump from 16
felix-the-cat 1 hour ago
This looks amazing, we've been using Postgres in production for the last two years on a fairly high-volume system and it's been fantastic.