Overview

It’s common knowledge that PostgreSQL is one of the most widely used relational databases in the world, so I decided to put together a collection of important configurations to help you get more out of it.

My first encounter with many of these settings was during the "Rinha de Backend" challenge by zanfranceschi, where participants competed to build a system capable of handling a massive number of concurrent requests without bottlenecks or failures. The challenge had no restrictions, and many people disabled or tweaked Postgres settings to squeeze out more performance (even if that would be less safe in a real production environment).

In practice, everything I cover here comes from the official PostgreSQL documentation (which I read for the first time while writing this post). It’s a large doc, though, and many settings have very specific use cases, so I think it’s worth summarizing just the most important ones.

The goal here isn’t to cover what you’d normally learn about relational databases (indexes, SQL, constraints, procedures), but to talk about what nobody taught me in college: networking, advanced configuration, and extensions.


About Postgres

PostgreSQL, or just Postgres, is a relational database. It’s a mature and very flexible one, capable of storing nested structures or defining complex custom data types. PostgreSQL’s performance depends heavily on proper configuration, since the defaults are conservative and designed for compatibility, not performance.

Settings can be placed in a postgresql.conf file, like in the example below for a 16GB RAM server:

# Memory Settings
shared_buffers = 4GB # 25% of RAM
work_mem = 16MB # Memory per sort/hash operation
maintenance_work_mem = 1GB # For VACUUM, CREATE INDEX
effective_cache_size = 12GB # 75% of RAM (hint for the query planner)
# Connection Settings
max_connections = 200 # Use connection pooling for more
listen_addresses = '*' # Accept remote connections
# Workers and Parallelism
max_worker_processes = 20
max_parallel_workers = 12
max_parallel_workers_per_gather = 4
# WAL and Checkpoint
wal_level = replica # For replication
max_wal_size = 4GB
checkpoint_timeout = 15min
checkpoint_completion_target = 0.9
# Autovacuum
autovacuum = on
autovacuum_max_workers = 4
autovacuum_naptime = 30s
# Connection and Session
tcp_keepalives_idle = 300 # Detect dead connections
idle_session_timeout = 30min # Close idle sessions
# Extensions
shared_preload_libraries = 'pg_stat_statements, pg_cron'

I’ll walk through some of these settings and what they mean. There are many more out there, but if I listed them all, you’d be better off just reading the docs. The idea is to highlight the ones that matter most for improving your database’s performance.


Memory Settings

shared_buffers

Description: PostgreSQL’s primary cache. All data read from disk is first loaded into shared buffers, and writes are also staged here before being flushed to disk.

Why It Matters: This is your database’s working memory. Every read operation checks this cache first. If the cache is too small, you’ll get high cache miss rates and constant disk I/O to fetch data. If it’s too large, you’ll waste resources unnecessarily.

Sweet Spot: 15–25% of total RAM is the standard recommendation, though it can vary depending on your workload.

# For a server with 16GB RAM
shared_buffers = 4GB
# For a server with 64GB RAM
shared_buffers = 16GB

Beyond 25% of RAM, you get diminishing returns because PostgreSQL also relies on the OS filesystem cache. The OS is generally better at caching than PostgreSQL’s own buffer pool.
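To get a rough sense of whether the cache is sized adequately, you can check the buffer cache hit ratio in pg_stat_database (a read-only query; the "good" threshold is workload-dependent):

-- Share of reads served from shared_buffers per database;
-- persistently low values on a hot working set suggest the cache is too small
SELECT datname,
round(blks_hit::numeric / nullif(blks_hit + blks_read, 0), 4) AS cache_hit_ratio
FROM pg_stat_database
WHERE datname IS NOT NULL;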

work_mem

Description: The memory each internal sort or hash operation may use before PostgreSQL switches to processing the data in temporary files on disk. Spilling is slow, but it keeps a single heavy operation from consuming unbounded memory or failing with an out-of-memory (OOM) error. This is called a disk spill.

Keep in mind the limit applies per operation, not per query or per connection, so a value that looks harmless for one query can multiply into a huge amount of memory under concurrency. It also influences the planner: with more memory available, it will favor in-memory sorts and hash joins.

Sweet Spot: 4–32MB (context-dependent)

Why It Matters: work_mem that’s too low causes queries to spill to disk, drastically slowing things down. work_mem that’s too high can cause OOM errors.

work_mem = 16MB

A single complex query can spawn multiple sort/hash/join operations. The formula is:

Potential Memory = work_mem Γ— max_connections Γ— operations_per_query

With 100 connections, 16MB work_mem, and an average of 3 operations per query, you could consume 4.8GB.
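To check whether a specific query is spilling, EXPLAIN ANALYZE reports the sort method. A minimal sketch (the orders table is hypothetical):

-- Look for "Sort Method: external merge  Disk: ..." in the output
EXPLAIN (ANALYZE, BUFFERS)
SELECT * FROM orders ORDER BY total DESC;
-- Raising work_mem for the session can turn it into an in-memory "quicksort"
SET work_mem = '64MB';
-- In postgresql.conf, log_temp_files = 0 logs every query that writes temp files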

maintenance_work_mem

Description: Memory used for maintenance operations like VACUUM, CREATE INDEX, ALTER TABLE, and FOREIGN KEY constraint checks.

Default: 64MB

Sweet Spot: 256MB – 2GB

Why It Matters: Higher values significantly speed up index creation, vacuuming, and other maintenance tasks. Unlike work_mem, typically only one maintenance operation runs at a time.

# For servers with 16GB+ RAM
maintenance_work_mem = 1GB
# For large databases (100GB+)
maintenance_work_mem = 2GB

This doesn’t affect normal query performance, but it dramatically improves the speed of bulk operations and routine maintenance.
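Since it’s a per-session setting, you can also raise it only for a heavy maintenance task. A small sketch (table and index names are hypothetical):

-- Boost only this session before a large index build, then reset
SET maintenance_work_mem = '2GB';
CREATE INDEX CONCURRENTLY idx_orders_created_at ON orders (created_at);
RESET maintenance_work_mem;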

Connection and Worker Settings

max_connections

Description: The maximum number of simultaneous connections allowed to the database. If an application is hitting bottlenecks between the backend and the database, the culprit might be a connection limit that is too low.

Default: 100

Sweet Spot: 100–400 (highly dependent on workload)

Why It Matters: Each connection consumes memory (several MB) for connection overhead. Too many connections create contention and resource exhaustion.

# For web apps with connection pooling
max_connections = 200

Setting max_connections very high wastes memory and creates contention, while running out of slots causes "too many connections" errors for new clients.

Instead of opening lots of simultaneous connections, consider connection pooling (PgBouncer, pgpool-II). This way, instead of constantly creating new connections (which have overhead), you keep connections alive with a max idle time before they’re closed.
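To see how close you are to the limit (and how many connections are just sitting idle), pg_stat_activity is enough:

-- Connections in use vs. the configured limit
SELECT count(*) AS current_connections,
current_setting('max_connections') AS max_connections
FROM pg_stat_activity;
-- Breakdown by state: 'idle' and 'idle in transaction' are pooling candidates
SELECT state, count(*) FROM pg_stat_activity GROUP BY state;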

max_worker_processes

Description: The maximum number of background worker processes the system can support. This is the total budget for all background workers.

Default: 8

Why It Matters: This is the foundation for parallelism. Parallel queries, autovacuum workers, logical replication workers, and background workers all draw from this pool.

max_worker_processes = 16

max_parallel_workers

Description: The maximum number of workers that can be used for parallel query execution across the entire system.

Default: 8

Sweet Spot: 50–75% of CPU cores

Why It Matters: Allows PostgreSQL to parallelize query execution, significantly speeding up large scans and aggregations.

# For a 16-core server
max_parallel_workers = 12

Constraint: Cannot exceed max_worker_processes.
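To confirm a query actually runs in parallel, the plan shows a Gather node along with how many workers were planned and launched. A sketch with a hypothetical table:

-- Look for "Gather", "Workers Planned" and "Workers Launched" in the plan
EXPLAIN (ANALYZE)
SELECT count(*) FROM big_table;
-- Per-query parallelism is further capped by max_parallel_workers_per_gather
SET max_parallel_workers_per_gather = 4;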

WAL and Checkpoint Settings

WAL → A technique used by Postgres and other databases to guarantee durability and consistency. Every change is recorded in an append-only log before it is considered committed, so every committed transaction has its log entry safely on disk. This is what makes it possible to stream changes to read replicas (which need to know what happened on the primary) and to recover from crashes and disasters.
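You can watch the WAL advance as writes happen; on PostgreSQL 10+ these are read-only checks:

-- Current write position in the WAL and the configured level
SELECT pg_current_wal_lsn();
SHOW wal_level;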


wal_level

Description: Determines how much information is written to the Write-Ahead Log (WAL).

Default: replica

Sweet Spot:

  • minimal for standalone databases without replication (fastest)
  • replica for streaming replication
  • logical for logical replication

Why It Matters: Lower levels mean less WAL overhead but disable replication features.

# For production with replication
wal_level = replica
# For bulk loads only (note: changing wal_level requires a restart)
wal_level = minimal

max_wal_size

Description: Maximum size of WAL files before triggering a checkpoint.

Default: 1GB

Sweet Spot: 2GB – 16GB (depends on write volume)

Why It Matters: Larger values reduce checkpoint frequency, improving write performance but increasing recovery time.

# For moderate write loads
max_wal_size = 4GB
# For heavy write loads
max_wal_size = 8GB

Larger values mean longer crash recovery time.
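On PostgreSQL 14+ you can measure how much WAL your workload actually generates before picking a value (assuming the pg_stat_wal statistics haven't been reset recently):

-- Total WAL generated since the statistics were last reset
SELECT wal_bytes, stats_reset FROM pg_stat_wal;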

checkpoint_timeout

Description: Maximum time between automatic WAL checkpoints.

Default: 5 minutes

Sweet Spot: 10–30 minutes

Why It Matters: Checkpoints cause I/O spikes. Less frequent checkpoints smooth out I/O but increase recovery time.

# For heavy write loads
checkpoint_timeout = 15min
# For very heavy write loads
checkpoint_timeout = 30min
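To check whether checkpoints are being forced by WAL volume instead of the timer (a sign that max_wal_size is too small), the counters live in pg_stat_bgwriter on PostgreSQL 16 and earlier, and moved to pg_stat_checkpointer on 17+:

-- PostgreSQL 16 and earlier: timed vs. requested checkpoints
SELECT checkpoints_timed, checkpoints_req FROM pg_stat_bgwriter;
-- PostgreSQL 17+
-- SELECT num_timed, num_requested FROM pg_stat_checkpointer;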

Autovacuum Settings

VACUUM → If you delete 10,000 rows from your database, your database files don’t shrink. Databases like Postgres simply mark those rows as dead, because a fast delete/update (physically removing or rewriting rows immediately would be slow) is preferable. This is part of MVCC (Multi-Version Concurrency Control): rows aren’t overwritten in place; instead a new version is created so concurrent transactions don’t conflict. What VACUUM does is reclaim the space occupied by dead row versions so it can be reused for new data (only VACUUM FULL actually shrinks the files on disk).
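To see how much dead space autovacuum has to deal with, the statistics views already track it (read-only):

-- Tables with the most dead tuples and when autovacuum last touched them
SELECT relname, n_live_tup, n_dead_tup, last_autovacuum
FROM pg_stat_user_tables
ORDER BY n_dead_tup DESC
LIMIT 10;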

autovacuum

Description: Enables the autovacuum daemon for automatic database maintenance β€” in other words, it automates VACUUM.

Default: on

Sweet Spot: on (always)

Why It Matters: Autovacuum prevents transaction ID wraparound, removes dead tuples, and updates statistics. Disabling it is almost never recommended.

autovacuum = on

Disabling autovacuum leads to table bloat and severe performance degradation, and eventually to transaction ID wraparound, at which point PostgreSQL refuses new writes to protect your data.

autovacuum_max_workers

Description: Maximum number of autovacuum processes that can run simultaneously.

Default: 3

Sweet Spot: 3–6

Why It Matters: More workers can vacuum multiple tables concurrently, which matters for databases with many active tables.

# For databases with many tables
autovacuum_max_workers = 6

More workers consume more resources but keep tables cleaner under heavy write loads.

autovacuum_naptime

Description: Minimum delay between autovacuum runs on any given database.

Default: 1 minute

Sweet Spot: 30 seconds – 1 minute

Why It Matters: Controls how often autovacuum checks whether there’s work to do.

# For heavy write loads
autovacuum_naptime = 30s

Connection and Session Settings

tcp_keepalives_idle

Description: Time before sending a TCP keepalive packet to detect dead connections.

Default: 0 (uses OS default, typically 2 hours)

Sweet Spot: 60–600 seconds

Why It Matters: Detects and closes dead connections faster, preventing connection slot exhaustion.

# Detect dead connections within 5 minutes
tcp_keepalives_idle = 300

idle_session_timeout

Description: Automatically terminates sessions that have been idle for the specified duration.

Default: 0 (disabled)

Sweet Spot: 10–60 minutes (application-dependent)

Why It Matters: Prevents idle connections from holding connection slots indefinitely.

# Close connections idle for 30 minutes
idle_session_timeout = 30min

Useful when applications don’t close connections properly.
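Before enabling it, you can check which sessions would be affected and how long they have been idle:

-- Idle sessions, longest idle first
SELECT pid, usename, application_name, now() - state_change AS idle_for
FROM pg_stat_activity
WHERE state = 'idle'
ORDER BY idle_for DESC;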


PostgreSQL Extensions

PostgreSQL’s extension system lets you add functionality without modifying the database core. Extensions are pre-packaged modules that can be installed and enabled per database.

How to Install and Enable Extensions

-- Check available extensions
SELECT * FROM pg_available_extensions;
-- Enable an extension
CREATE EXTENSION IF NOT EXISTS extension_name;
-- Check installed extensions (\dx is a psql meta-command, not SQL)
\dx

pg_trgm

Description: Provides trigram-based text similarity matching, plus GIN/GiST index support that makes LIKE/ILIKE and similarity searches fast.

Use Cases:

  • Fuzzy text search (finding similar strings)
  • Autocomplete functionality
  • Typo-tolerant search
  • Fast LIKE/ILIKE queries with pattern matching

Why It Matters: Enables high-performance similarity searches that would otherwise require external search engines. Particularly useful for user-facing search features.

-- Enable the extension
CREATE EXTENSION IF NOT EXISTS pg_trgm;
-- Create a GIN index for fast similarity searches
CREATE INDEX idx_users_name_trgm ON users USING GIN (name gin_trgm_ops);
-- Find similar names (fuzzy matching)
SELECT name, similarity(name, 'John Doe') AS sim
FROM users
WHERE similarity(name, 'John Doe') > 0.3
ORDER BY sim DESC;
-- Fast pattern matching with index support
SELECT * FROM users
WHERE name ILIKE '%john%';
-- Find records with typos
SELECT * FROM products
WHERE name % 'iPone'; -- Will match 'iPhone'

Key Functions:

  • similarity(text, text) - Returns similarity score (0-1)
  • word_similarity(text, text) - Word-based similarity
  • text % text - Similarity operator (configurable threshold; see below)
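The threshold used by the % operator defaults to 0.3 and can be tuned per session via pg_trgm.similarity_threshold:

-- Make % matching more permissive for this session
SET pg_trgm.similarity_threshold = 0.2;
SHOW pg_trgm.similarity_threshold;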

pgvector

Description: Adds vector data types and similarity search capabilities for AI/ML applications, enabling efficient storage and querying of embeddings.

Use Cases:

  • Semantic search
  • Recommendation systems
  • Image similarity search
  • AI-powered applications using OpenAI, Cohere, etc. embeddings

Why It Matters: Essential for modern AI applications. Lets you store embeddings directly in PostgreSQL and perform efficient nearest-neighbor searches without external vector databases.

-- Enable the extension
CREATE EXTENSION IF NOT EXISTS vector;
-- Create a table with a vector column
CREATE TABLE documents (
id SERIAL PRIMARY KEY,
content TEXT,
embedding vector(1536) -- OpenAI ada-002 embedding dimension
);
-- Create an index for fast similarity search
CREATE INDEX ON documents USING ivfflat (embedding vector_cosine_ops)
WITH (lists = 100);
-- Or use HNSW for better query performance (requires pgvector 0.5.0+)
CREATE INDEX ON documents USING hnsw (embedding vector_cosine_ops);
-- Find similar documents (cosine similarity)
SELECT id, content,
1 - (embedding <=> '[0.1, 0.2, ...]'::vector) AS similarity
FROM documents
ORDER BY embedding <=> '[0.1, 0.2, ...]'::vector
LIMIT 10;
-- Available distance operators:
-- <-> (L2 distance)
-- <=> (cosine distance)
-- <#> (inner product)

pgcrypto

Description: Provides cryptographic functions for encryption, hashing, and random data generation.

Use Cases:

  • Password hashing
  • Data encryption at rest
  • Secure token generation
  • PGP encryption/decryption

Why It Matters: Enables secure data storage without application-level encryption. Built-in cryptographic functions ensure consistent security practices.

-- Enable the extension
CREATE EXTENSION IF NOT EXISTS pgcrypto;
-- Hash passwords (bcrypt)
INSERT INTO users (email, password_hash)
VALUES ('user@example.com', crypt('user_password', gen_salt('bf')));
-- Verify password
SELECT * FROM users
WHERE email = 'user@example.com'
AND password_hash = crypt('user_password', password_hash);
-- Generate random UUIDs
SELECT gen_random_uuid();
-- Encrypt/decrypt data
-- Symmetric encryption
SELECT pgp_sym_encrypt('sensitive data', 'encryption_key');
SELECT pgp_sym_decrypt(encrypted_column, 'encryption_key') FROM table_name;
-- Generate secure random bytes
SELECT gen_random_bytes(32);
-- Hash functions
SELECT digest('data', 'sha256');
SELECT encode(digest('data', 'sha512'), 'hex');

citext

Description: A case-insensitive text type that behaves like regular text but with case-insensitive comparisons and indexing.

Use Cases:

  • Email addresses
  • Usernames
  • Case-insensitive unique constraints
  • Search without ILIKE performance penalties

Why It Matters: Simplifies case-insensitive operations without needing LOWER() functions everywhere. Preserves the original case while comparing case-insensitively.

-- Enable the extension
CREATE EXTENSION IF NOT EXISTS citext;
-- Create a table with case-insensitive columns
CREATE TABLE users (
id SERIAL PRIMARY KEY,
email citext UNIQUE, -- Case-insensitive uniqueness
username citext NOT NULL
);
-- These will be treated as duplicates
INSERT INTO users (email, username)
VALUES ('User@Example.com', 'JohnDoe');
-- This will fail (duplicate email)
INSERT INTO users (email, username)
VALUES ('user@example.com', 'JaneDoe');
-- Case-insensitive comparisons (no ILIKE needed)
SELECT * FROM users WHERE email = 'USER@EXAMPLE.COM';
-- Index works with case-insensitive searches
CREATE INDEX idx_users_email ON users(email);

Key Benefits:

  • Preserves the original casing in storage
  • Automatic case-insensitive comparisons
  • Works with B-tree indexes
  • No need for functional indexes on LOWER()

uuid-ossp

Description: Generates universally unique identifiers (UUIDs) using various algorithms.

Use Cases:

  • Primary keys for distributed systems
  • Non-sequential identifiers
  • Public-facing IDs (URLs, APIs)
  • Safe identifiers for cross-database merges

Why It Matters: UUIDs prevent ID collisions in distributed systems and hide sequential patterns. Essential for microservices and multi-region deployments.

-- Enable the extension
CREATE EXTENSION IF NOT EXISTS "uuid-ossp";
-- Create a table with a UUID primary key
CREATE TABLE orders (
id UUID PRIMARY KEY DEFAULT uuid_generate_v4(),
customer_id UUID NOT NULL,
total DECIMAL(10,2)
);
-- Available UUID generation functions:
SELECT uuid_generate_v1(); -- Time-based
SELECT uuid_generate_v4(); -- Random (most common)
-- Insert with auto-generated UUID
INSERT INTO orders (customer_id, total)
VALUES ('a0eebc99-9c0b-4ef8-bb6d-6bb9bd380a11', 99.99);

UUID Versions:

  • v1 - Time-based (includes MAC address, potential privacy concern)
  • v4 - Random (recommended for most use cases)

Trade-offs:

  • Pros: Globally unique, non-sequential, safe for merging
  • Cons: 16 bytes vs 4-8 bytes for integers, slightly slower indexes

pg_cron

Description: A simple cron-based task scheduler that runs inside PostgreSQL.

Use Cases:

  • Periodic data cleanup
  • Scheduled aggregations
  • Automated maintenance tasks
  • Recurring report generation

Why It Matters: Eliminates the need for external schedulers for database-centric tasks. Jobs run with database-level guarantees and can use SQL directly.

-- Enable the extension (requires superuser)
CREATE EXTENSION IF NOT EXISTS pg_cron;
-- Schedule a job (daily cleanup at 3 AM)
SELECT cron.schedule(
'cleanup-old-logs',
'0 3 * * *',
'DELETE FROM logs WHERE created_at < NOW() - INTERVAL ''90 days'''
);
-- Schedule aggregation every 15 minutes
SELECT cron.schedule(
'update-stats',
'*/15 * * * *',
'REFRESH MATERIALIZED VIEW CONCURRENTLY user_stats'
);
-- View scheduled jobs
SELECT * FROM cron.job;
-- View job execution history
SELECT * FROM cron.job_run_details
ORDER BY start_time DESC
LIMIT 10;
-- Unschedule a job
SELECT cron.unschedule('cleanup-old-logs');

Configuration: Add to postgresql.conf:

shared_preload_libraries = 'pg_cron'
cron.database_name = 'your_database'

Cron Syntax:

┌───────────── minute (0 - 59)
│ ┌───────────── hour (0 - 23)
│ │ ┌───────────── day of month (1 - 31)
│ │ │ ┌───────────── month (1 - 12)
│ │ │ │ ┌───────────── day of week (0 - 6) (Sunday to Saturday)
│ │ │ │ │
│ │ │ │ │
* * * * *


Connection Pooling and Connectivity

Why Connection Pooling Matters

PostgreSQL creates a new server process for each connection, which consumes significant memory and resources. Without connection pooling, applications can quickly exhaust available connections, leading to "too many connections" errors and performance degradation.

The Problem:

Example: Each connection = ~10MB RAM + CPU overhead
100 connections = ~1GB RAM minimum
1000 connections = System degradation

The Solution: Connection pooling maintains a smaller pool of database connections that are shared among many application connections, without needing to create a new connection for every transaction.

PgBouncer

Description: A lightweight connection pool for PostgreSQL. The most popular and widely used solution.

Why It Matters: PgBouncer can handle thousands of client connections while maintaining a small pool of actual database connections, reducing database overhead.

Pooling Modes:

  1. Session Pooling: Connection assigned to the client for the entire session

  2. Transaction Pooling (Recommended): Connection returned to the pool after each transaction

Configuration example (/etc/pgbouncer/pgbouncer.ini):

[databases]
mydb = host=localhost port=5432 dbname=mydb
[pgbouncer]
listen_addr = 127.0.0.1
listen_port = 6432
auth_type = md5
auth_file = /etc/pgbouncer/userlist.txt
# Connection pool settings
pool_mode = transaction
max_client_conn = 1000
default_pool_size = 25
reserve_pool_size = 5
reserve_pool_timeout = 3
# Performance tuning
server_idle_timeout = 600
query_timeout = 60

Application-Level Pooling

Many frameworks and database access libraries offer built-in pooling. This approach is simpler to set up but less flexible than dedicated solutions like PgBouncer.

Node.js (pg-pool):

const { Pool } = require('pg');
const pool = new Pool({
host: 'localhost',
database: 'mydb',
user: 'username',
password: 'password',
max: 20, // Maximum connections in the pool
idleTimeoutMillis: 30000, // Time before closing an idle connection
connectionTimeoutMillis: 2000,
});
// Usage
const result = await pool.query('SELECT * FROM users WHERE id = $1', [userId]);

Java (HikariCP):

HikariConfig config = new HikariConfig();
config.setJdbcUrl("jdbc:postgresql://localhost:5432/mydb");
config.setUsername("username");
config.setPassword("password");
config.setMaximumPoolSize(20);
config.setMinimumIdle(5);
config.setIdleTimeout(300000);
config.setConnectionTimeout(20000);
HikariDataSource dataSource = new HikariDataSource(config);

Python (SQLAlchemy):

from sqlalchemy import create_engine
engine = create_engine(
"postgresql://username:password@localhost/mydb",
pool_size=10,
max_overflow=20,
pool_timeout=30,
pool_recycle=1800,
)

When to use application pooling vs PgBouncer:

  • Application pooling: Simple, monolithic applications, less operational overhead
  • PgBouncer: Multiple applications, microservices, need for centralized control

Sizing Your Connection Pool

Signs of a poorly sized pool:

  • Pool too small: Connection timeouts, high latency, wait queues
  • Pool too large: High memory usage, CPU contention, too many idle connections

As a rough starting point, the heuristic often cited in the PostgreSQL wiki and the HikariCP pool-sizing guide is around (CPU cores × 2) + number of disks for the database-side pool; benchmark with your own workload and adjust from there.

References