Check out our weekly series: "5mins of Postgres" (published every week), where pganalyze founder and CEO Lukas Fittl walks through interesting Postgres content from the previous 7 days. We also publish Postgres presentations, our conference talks, product walk-throughs, and tutorials.
About pganalyze:
pganalyze provides deep, actionable insights into Postgres. Specializing in PostgreSQL database monitoring and optimization, it gives automatic insights into Postgres query plans, helps improve query performance with its Index Advisor, and lets you perform query drill-down analysis, observe per-query statistics, and conduct trend analysis. It integrates with self-managed Postgres servers as well as Database-as-a-Service providers like Amazon RDS.
DBAs and developers use pganalyze because it surfaces opportunities to optimize Postgres performance and gives recommendations that help speed up queries. It comes with helpful security features like PII filtering and SSO integration.
Comments
This is top-notch Postgres content. Thanks for presenting it!
Great info! Is this still included in the Postgres 17 release?
Thanks, great content as always
Hello. I guess 25% was recommended as a starting point because it's better than the default value. And of course there may not be enough memory left for other processes. As a result, 45-50% is the upper limit.
thanks for sharing
You're very welcome! Thanks for your interest!
What is a CTE?
CTE = Common Table Expression, or "WITH [name] AS ( SELECT ...) SELECT" in a query. See www.postgresql.org/docs/current/queries-with.html
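To make the syntax concrete, here's a minimal CTE example (the `orders` table and column names are made up for illustration):

```sql
-- A CTE names a subquery that the outer query can then reference like a table
WITH recent_orders AS (
    SELECT customer_id, count(*) AS order_count
    FROM orders
    WHERE created_at > now() - interval '30 days'
    GROUP BY customer_id
)
SELECT customer_id
FROM recent_orders
WHERE order_count > 5;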
Damn, that's a great explanation in 5 mins. You rocked it Lukas
That's a booby trap if I've ever seen one. Luckily the fix seems pretty easy!
What are the demerits of setting random_page_cost as low as 1.1? If there are some queries on my DB that benefit from lowering random_page_cost, are there other queries that would be hampered by this?
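One low-risk way to evaluate this question for a specific workload: the setting can be changed per-session first, so you can compare plans before touching the server-wide default (table and column names here are placeholders):

```sql
-- Lower the cost estimate for random I/O in this session only
SET random_page_cost = 1.1;
EXPLAIN ANALYZE SELECT * FROM my_table WHERE some_indexed_col = 42;

-- Revert this session to the server default and compare the plan
RESET random_page_cost;
EXPLAIN ANALYZE SELECT * FROM my_table WHERE some_indexed_col = 42;
```

Queries that could regress are typically large sequential scans that the planner now mistakenly converts to index scans, so comparing EXPLAIN ANALYZE output on both settings is the safest check.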
@pganalyze, can the same prepared transaction be utilized across multiple connections in postgres?
Interesting! Thank You!
Would prepared transactions be of no benefit in OLTP workloads, assuming that each client will only be executing a single block of transactions as opposed to multiple blocks of transactions?
You did a great thing there. Thanks for the great content, as always!
Currently spending the next 6 weekends purging data from a large, complex 3NF database with UUID FKs between all of the tables. On an extremely large instance (192GB RAM, 48 vCPUs) I'm getting <31 deletes/sec with all the ugly UUID FK lookups.
That sounds like a lot of work! I assume you have an index covering the foreign key lookup? (since you commented on an episode where we talk about indexing UUIDs, I assume you're running into the exact problem with indexes on UUIDs being bloated?)
@LukasFittl Correct, and all those complex index lookups for all the referencing tables are slowing down deletes, even though we've handled (rolled off and purged) all the referencing rows. All referencing tables have been fully vacuumed as well after cleanup, so they should be fresh. Still super slow.
@awksedgreep Makes sense. If you haven't done so already, it might be worth trying a REINDEX CONCURRENTLY on the affected indexes that support the foreign key. Even after a regular vacuum, there would be lots of inefficient index structure left behind that a reindex can fix up.
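For reference, a sketch of what that looks like (the index name is hypothetical; REINDEX ... CONCURRENTLY requires Postgres 12 or newer):

```sql
-- Rebuild a bloated index without blocking writes on the table
REINDEX INDEX CONCURRENTLY orders_customer_id_idx;

-- Check the index size before and after to confirm the bloat reduction
SELECT pg_size_pretty(pg_relation_size('orders_customer_id_idx'));
```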
Never heard about that +0 solution. Genius! Thanks for sharing it.
Indeed.
Woo hoo. Nice job to everyone involved on this as well, including you Lukas for helping raise awareness of this and connect everything together. In Active Record Rails apps, I’ve seen problematic IN clauses with a large amount of values, showing up in PGSS. My feeling is this is going to help a lot when this becomes available.
Yeah, I'm excited about what this will bring for the typical Rails app - these overly long IN lists are very much a Rails pattern that should see a noticeable performance benefit here with 17.
very effective 5 minutes -- thanks for the great vid!
I am very glad to have found this channel! Got some performance concerns when working with a correlated subquery and this was very enlightening! Thanks
Happy to hear you found it useful!
Awesome!
Nice boost. Vacuum can never be fast enough ❤
Thanks for doing these Lukas! I really appreciate your summaries of the features/functionality in commit logs
Thanks Wulf! It's an excellent excuse to actually test the new functionality :)
This is a gem!
Great video, very well explained!
Very interesting! Curious: does the unpartitioned table have an index on the partition-by column? If not, is the join by index better than a partition-wise join?
Thanks, Lukas! Very interesting overview, I was not aware of many details. BTW there will be a talk on this topic by Dilip at PGConf this year. Looking forward to it.
Thanks Lukas ❤
Nice examples Lukas showing the SQL to set up ranges using the new functions.
Thanks Andy!
Thanks Lukas! For this kind of excellent, but densely filled blog post, on high value planner optimizations with example queries that benefit, I appreciate having this overview of the items to start to digest them. It's easier to slot the quick audio into my day, then I can go back into the nitty gritty details of the post and try the examples out. Thanks to David, Melanie, and all the contributors for these improvements!
Thanks Andy, happy to hear you found it useful!
Congratulations on 100th episode Lukas!
Erwin, the Stack Overflow legend. On almost every post about relational databases you're going to find his answer.
Very useful, as usual. Ty!
Very good summary. Thank you.
Thank you for the excellent post and the detailed examples! Definitely had a hard time editing this down (started with almost 20 minutes of walking through your post) :)
Loved it, very informative 🎉
Wow, pg_input_is_valid fills a long-missing gap :) Thanks!
Just to add complexity, you've also now got Postgres in a container... on Kubernetes
thanks for sharing, keep up the good work 🔥
It often works even better if you group the data on item ID and put a chunk of time (say a day or a week) of that item's data together. Works especially well for physical measurements, as they tend to change more gradually. This is basically what TimescaleDB does, reaching over 90% compression if configured properly.
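A rough sketch of that grouping idea in plain Postgres, assuming a hypothetical `measurements` table (TimescaleDB automates this; here the table is simply rewritten in the desired order):

```sql
-- Rewrite the table so each item's rows for a given day sit physically together,
-- which helps compression and sequential access patterns
CREATE TABLE measurements_sorted AS
SELECT *
FROM measurements
ORDER BY item_id, date_trunc('day', recorded_at);
```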
very informative, thanks for sharing
Awesome! Thanks for the video!
We also have a huge table with 6 billion rows, totaling 450GB. We found that hash partitioning on the id column into 128 partitions of ~3.5GB each immediately showed decreased response times and way faster index creation. Simple select queries are way faster.
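For readers who want to try this, a minimal sketch of hash partitioning (the `events` table and its columns are made up; one CREATE TABLE per remainder, 0 through 127, is needed):

```sql
-- Parent table, hash-partitioned on id
CREATE TABLE events (
    id      bigint NOT NULL,
    payload jsonb
) PARTITION BY HASH (id);

-- One child table per hash slot; repeat for REMAINDER 0..127
CREATE TABLE events_p0 PARTITION OF events
    FOR VALUES WITH (MODULUS 128, REMAINDER 0);
CREATE TABLE events_p1 PARTITION OF events
    FOR VALUES WITH (MODULUS 128, REMAINDER 1);
```

Generating the 128 child-table statements with a small script (or psql's \gexec) avoids writing them by hand.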
Simply brilliant!
Incredible video. Great explanation and example
Thanks Lukas!
Brilliant. Thanks Lukas!
This seems problematic to me. What if you need OpenSSL in FIPS mode for security purposes, but want md5 for non-security-related work? Seems like the FIPS capability needs to be more tunable.
From the FIPS-related conversations I've personally been part of (unrelated to Peter's work on this in Postgres), the requirement I've heard is that it actually has to be removed/turned off completely to fulfill the requirements. That's been in the context of operating a managed service that needed to be FIPS-compliant. I'm sure there can be nuances to this - maybe this could be addressed by having an explicit "pg_md5" module that one could load if needed. All that said, in my experience md5 isn't a particularly good hash function for non-security purposes either - I personally wish we had something like xxh3 hash built-in with Postgres.
Excellent content as always!
Upload the entire series on Postgres, from beginner to advanced level
Neat!
You don't touch existing PK or UNIQUE indexes (which sounds logical), but how about supporting indexes for FKs? Seems like low-hanging fruit to me.
Good question! There is no reason indexes supporting deletes on foreign keys couldn't be considered in the model (you could consider them a scan, just indirectly running), but so far we haven't integrated it yet. Definitely on the roadmap though, and something we'll likely support in the near future.
Detecting them seems relatively obvious, if they are registered in the schema/catalogs. @LukasFittl
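Indeed, foreign keys are registered in the catalogs, so the referencing columns can be enumerated and indexed. A sketch (the `line_items`/`order_id` names are hypothetical):

```sql
-- Foreign keys live in pg_constraint with contype = 'f',
-- so candidate referencing tables can be listed from the catalogs
SELECT conrelid::regclass AS referencing_table,
       conname            AS fk_name
FROM pg_constraint
WHERE contype = 'f';

-- An index on the referencing column makes FK checks on DELETE fast
CREATE INDEX CONCURRENTLY line_items_order_id_idx
    ON line_items (order_id);
```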
Hi Lukas, nice video. I'm the original author of Aurora's QPM, and I'd be happy to tell you more about it. Many people think of QPM as you described it - as a way to force the planner to use a known-good plan or to avoid a known-bad plan, but it's actually a fully cost-based mechanism. Multiple plans may be approved, and the planner will choose the minimum cost plan among them for the current bind variables and literals. If you think about prepared statements with bind variables, it's immediately apparent that a single optimal plan may not be enough, but the same is true with literals. The ability to evolve the set of known-good plans enables plans to just get better and better with time as the planner discovers new and better plans, especially across major version upgrades.
Thanks for providing the additional context! And yes, makes a lot of sense to allow better costed plans vs just a single fixed plan.