This module focuses on understanding PostgreSQL's transaction management, isolation levels, and common concurrency challenges like deadlocks. For definitions of terms used in this module, refer to our Glossary.
Before starting this module, ensure you understand:
erDiagram
accounts {
bigint id PK
decimal balance
string owner_name
timestamp created_at
timestamp updated_at
}
In this module, you'll explore:
- How PostgreSQL manages transactions
- Transaction isolation levels
- Deadlock detection and prevention
- Best practices for transaction management
Every PostgreSQL transaction follows this lifecycle:
graph TD
A[BEGIN] --> B[Execute Statements]
B --> C{Success?}
C -->|Yes| D[COMMIT]
C -->|No| E[ROLLBACK]
D --> F[End]
E --> F
PostgreSQL supports four isolation levels, each with different guarantees:
graph TB
subgraph "Isolation Levels"
direction TB
RL[Read Uncommitted*] --> RC[Read Committed]
RC --> RR[Repeatable Read]
RR --> S[Serializable]
style RL fill:#fdd,stroke:#333
style RC fill:#dfd,stroke:#333
style RR fill:#ddf,stroke:#333
style S fill:#fdf,stroke:#333
end
note[" Behaves as Read Committed in PostgreSQL"]
note --> RL
For more details on isolation levels, see Isolation Level.
Level | Dirty Read | Non-Repeatable Read | Phantom Read | Serialization Anomaly |
---|---|---|---|---|
Read Uncommitted [1] | Prevented | Possible | Possible | Possible |
Read Committed | Prevented | Possible | Possible | Possible |
Repeatable Read | Prevented | Prevented | Prevented [2] | Possible |
Serializable | Prevented | Prevented | Prevented | Prevented |
- PostgreSQL treats Read Uncommitted as Read Committed
- PostgreSQL's implementation prevents phantom reads
Run ruby transaction_lab.rb
to see the output.
Starting concurrent transfers (this should cause a deadlock)...
👱♀️ Alice initiating transfer...
👨 Bob initiating transfer...
Transferring 100 from Bob to Alice
Deadlock detected (attempt 1/3): PG::TRDeadlockDetected
DETAIL: Process 6338 waits for ShareLock on transaction 48693606; blocked by process 6339.
Process 6339 waits for ShareLock on transaction 48693605; blocked by process 6338.
# Read Committed (default)
Account.transaction do
# Default isolation level
account.update!(balance: new_balance)
end
# Repeatable Read
Account.transaction(isolation: :repeatable_read) do
# Prevents non-repeatable reads
balance = account.balance
sleep(1) # Simulate work
# Will see same balance even if another transaction changed it
account.update!(balance: balance + 100)
end
# Serializable
Account.transaction(isolation: :serializable) do
# Strongest isolation - prevents all concurrency anomalies
# May throw SerializationFailure - application must retry
balance = account.balance
account.update!(balance: balance + 100)
end
-
Dirty Read (prevented in PostgreSQL)
- Transaction reads data written by concurrent uncommitted transaction
-
Non-Repeatable Read (possible in Read Committed)
# Transaction 1 # Transaction 2 read(A) # value = 100 update(A) to 200 commit read(A) # value = 200 # Different result!
-
Phantom Read (possible in Read Committed)
# Transaction 1 # Transaction 2 count(*) # returns 10 insert new row commit count(*) # returns 11 # Different result!
-
Serialization Anomaly (possible in all except Serializable)
# Transaction 1 # Transaction 2 sum = total_balance() sum = total_balance() insert(-100) insert(-100) commit commit # Both transactions saw the same sum but both subtracted, # potentially leading to negative balance
Our transaction_lab.rb
demonstrates a classic deadlock:
sequenceDiagram
participant T1 as Transaction 1
participant A1 as Account 1
participant A2 as Account 2
participant T2 as Transaction 2
T1->>A1: Lock Account 1
T2->>A2: Lock Account 2
T1--xA2: Try to Lock Account 2 (Blocked)
T2--xA1: Try to Lock Account 1 (Blocked)
Note over T1,T2: Deadlock Detected!
T2-->>T2: Transaction Aborted
T1->>A2: Lock Acquired
T1->>T1: Transaction Completes
For more information about deadlocks, see Deadlock.
See ACID Properties in the glossary.
PostgreSQL uses various lock types. For details, see Lock Types.
PostgreSQL automatically detects deadlocks:
- Monitors wait-for graph
- Identifies cycles (deadlocks)
- Chooses a victim transaction
- Rolls back the victim
- Each thread needs its own database connection in multi-threaded scenarios
- Use
ActiveRecord::Base.connection_pool.with_connection
to ensure proper connection handling - Connections are automatically returned to the pool after the block completes
- Deadlocks cause automatic transaction rollback
- Both transactions involved in a deadlock may be affected
- Need to explicitly reload records after deadlock to get current state
- Always verify final state after transaction completion
- Inconsistent lock ordering is a primary cause of deadlocks
- To prevent deadlocks, always acquire locks in a consistent order
- Example: Always lock records in ascending ID order
- Use
Account.lock.where(id: [id1, id2]).order(:id)
pattern
- Reset initial state before running concurrent transactions
- Use explicit transaction blocks
- Handle both specific (Deadlocked) and generic exceptions
- Verify final state through explicit reload
-
Basic Transaction Handling
Account.transaction do # Operations here are atomic account.update!(balance: account.balance + 100) end
-
Deadlock Simulation
# See transaction_lab.rb for implementation TransactionLab.simulate_deadlock
-
Isolation Level Testing
Account.transaction(isolation: :serializable) do # Operations here run in serializable isolation end
=== Basic Transaction Example ===
Initial balance: 0.0
Balance after update: 100.0
Transaction failed: Simulated error
Balance after rollback: 0.0
Key Learnings:
- Proper initialization is crucial - always ensure accounts start with a valid balance
- Transactions properly rollback on error, maintaining data consistency
- Using
reload
after operations ensures we see the actual database state - The
update_column
bypass can be useful for initial setup, but should be used sparingly
Deadlock detected (attempt 1/3): PG::TRDeadlockDetected
DETAIL: Process 10899 waits for ShareLock on transaction 48746007; blocked by process 10898.
Process 10898 waits for ShareLock on transaction 48746008; blocked by process 10899.
Key Learnings:
- Deadlocks are automatically detected by PostgreSQL
- Retry mechanism with exponential backoff helps resolve deadlocks
- Final account balances remain consistent despite deadlocks
- Detailed error messages help diagnose lock conflicts
Initial balance read: 1000.0
Concurrent transaction modified balance (+50)
Balance after concurrent modification: 1050.0
Final balance after our update (+100): 1150.0
- Sees and incorporates concurrent changes
- No serialization failures
- Most permissive but least consistent
Initial balance read: 1000.0
Concurrent transaction modified balance (+50)
Balance after concurrent modification: 1000.0
Serialization failure occurred...
Key Learnings:
- Higher isolation levels prevent reading intermediate states
- Serialization failures require retry logic
- Maximum retry limits prevent infinite retry loops
- Each retry sees the latest committed state
-
Balance Management:
- Always initialize accounts with valid balances
- Use
reload
to ensure fresh data - Handle nil values explicitly
-
Concurrency Handling:
- Implement retry mechanisms with maximum attempts
- Use exponential backoff between retries
- Reset test data between isolation level tests
-
Error Handling:
- Catch specific exceptions (
PG::TRDeadlockDetected
,ActiveRecord::SerializationFailure
) - Provide clear error messages
- Verify final state after errors
- Catch specific exceptions (
-
Testing Strategy:
- Reset state before each test
- Test each isolation level independently
- Verify both success and failure scenarios
These practical exercises demonstrate the importance of proper transaction management and the trade-offs between different isolation levels in a real-world scenario.
After completing this module, you should understand:
- Basic transaction management in PostgreSQL
- Different isolation levels and their implications
- How deadlocks occur and how to prevent them
- Best practices for handling concurrent transactions
- Lock types and their use cases
transaction_lab.rb
: Hands-on exercises with transaction conceptsREADME.md
: This documentation file
- PostgreSQL Documentation: Transaction Management
- PostgreSQL Documentation: Concurrency Control
- PostgreSQL Documentation: Lock Management
- PostgreSQL's MVCC (Multi-Version Concurrency Control) allows for high concurrency
- Deadlocks are automatically detected and resolved
- Proper transaction isolation level selection is crucial for application correctness
- Lock management requires careful consideration to avoid performance issues
- Transaction scope should be as narrow as possible while maintaining consistency
After completing this module, proceed to:
- Query Optimization to learn how transactions affect query performance
- TimescaleDB Extension to understand time-series data management