Fix: ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL #99

moshegood · 2024-02-13T16:44:49Z

Description of changes:
When a lock is not acquired because the ShouldSkipBlockingWait has been set, we cache the data pulled from DynamoDB. In future calls, if the recordVersion matches the cached version, we check is the cached version to see if it is expired rather than the freshly pulled copy, as the freshly pulled copy will never be expired.

lcabancla · 2024-02-13T18:59:58Z

src/main/java/com/amazonaws/services/dynamodbv2/AmazonDynamoDBLockClient.java

+                        if (notMyLocks.containsKey(id) &&
+                              notMyLocks.get(id).getRecordVersionNumber()
+                              .equals(existingLock.get().getRecordVersionNumber())) {
+                          itReallyIsExpired = oldCopy.isExpired();


I think this approach works, with the following downsides:

The very first acquire call on an existing lock will throw, always. There should be a separate check for notMyLocks.containsKey(id).

Ownership changes for expired locks will take twice as long since oldCopy.isExpired() will return true only after lease_duration secs. Then the rest of the acquire will take another lease_duration seconds.

For item 2, maybe try calling upsertAndMonitorExpiredLock() immediately when it expired?

Item 2 fixed in latest push.

Item 1: the first call on an existing lock

there is no way to know how long it has been acquired

the only possible options is to return that the lock is held

There is another option which is to block on the first call. If it returned false, then the following calls should fail fast.

If the user wants to block to wait for a lease, they should not set: ShouldSkipBlockingWait

I am fine with that approach as long as it is made clear (in the README?) that a return value of false does not only mean the lease is held. It may also mean that the client is, with the information that it has, not able to determine whether the lease is owned or not.

Probably outside the scope of this PR, but instead of using the skipBlockingWait property there can be two acquire methods - one for blocking and another for non-blocking.

See #102 for updates to the README.

The message here needs to be updated as well since all acquires during the first LEASE_DURATION seconds will fail for an existing, expired lock.

src/main/java/com/amazonaws/services/dynamodbv2/AmazonDynamoDBLockClient.java

ThePumpingLemma · 2024-02-13T19:30:25Z

src/main/java/com/amazonaws/services/dynamodbv2/AmazonDynamoDBLockClient.java

@@ -233,6 +233,7 @@ public class AmazonDynamoDBLockClient implements Runnable, Closeable {
    private final boolean holdLockOnServiceUnavailable;
    private final String ownerName;
    private final ConcurrentHashMap<String, LockItem> locks;
+    private final ConcurrentHashMap<String, LockItem> notMyLocks;


Do we need to purge entries from the map eventually?

If an app takes out leases with constantly changing names, then this can grow unboundedly.
If the lease names that it attempts to acquire are limited, then not.

…longer than TTL

moshegood changed the title ~~ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL~~ Fix: ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL Feb 13, 2024

lcabancla reviewed Feb 13, 2024

View reviewed changes

src/main/java/com/amazonaws/services/dynamodbv2/AmazonDynamoDBLockClient.java Show resolved Hide resolved

ThePumpingLemma reviewed Feb 13, 2024

View reviewed changes

ShouldSkipBlockingWait should still acquire a dead lock if tried for …

4d31892

…longer than TTL

moshegood force-pushed the moshe/ShouldSkipBlockingWait/eventually.should.work branch from 949cb9b to d592483 Compare February 15, 2024 18:18

Do not wait once we know a lease is expired

dbc76cc

moshegood force-pushed the moshe/ShouldSkipBlockingWait/eventually.should.work branch from d592483 to dbc76cc Compare February 15, 2024 18:19

Add tests for eventually getting the lock when SkipBlockingWait is set

f620e0a

lcabancla mentioned this pull request Feb 16, 2024

Add a withClockSkewUpperBound option when acquiring a lock #88

Open

moshegood force-pushed the moshe/ShouldSkipBlockingWait/eventually.should.work branch from 5fde650 to f620e0a Compare February 20, 2024 14:15

shetsa-amzn self-requested a review October 9, 2024 16:55

shetsa-amzn approved these changes Oct 9, 2024

View reviewed changes

shetsa-amzn merged commit 03b4305 into awslabs:master Oct 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL #99

Fix: ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL #99

moshegood commented Feb 13, 2024 •

edited

Loading

lcabancla Feb 13, 2024 •

edited

Loading

lcabancla Feb 13, 2024

moshegood Feb 15, 2024

lcabancla Feb 16, 2024

moshegood Feb 16, 2024

lcabancla Feb 16, 2024

lcabancla Feb 16, 2024 •

edited

Loading

moshegood Feb 19, 2024

lcabancla Feb 20, 2024

ThePumpingLemma Feb 13, 2024

moshegood Feb 15, 2024

Fix: ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL #99

Fix: ShouldSkipBlockingWait should still acquire a dead lock if tried for longer than TTL #99

Conversation

moshegood commented Feb 13, 2024 • edited Loading

lcabancla Feb 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lcabancla Feb 16, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

moshegood commented Feb 13, 2024 •

edited

Loading

lcabancla Feb 13, 2024 •

edited

Loading

lcabancla Feb 16, 2024 •

edited

Loading