gh-89083: add support for UUID version 7 (RFC 9562) #121119

picnixz · 2024-06-28T09:46:15Z

Based on the discussion in #89083 and https://discuss.python.org/t/rfc-4122-9562-uuid-version-7-and-8-implementation/56725/2, this is the implementation that I suggest for the standard library.

The documentation is still missing because I don't have a good formulation for now.

In this PR, I did not include the following:

mutex guards
timestamp offsets

The reason is that I want to keep the first implementation simple for the sake of review. In addition, we did not give the add mutex for UUIDv1 so I don't want to do it only for v7.

@sergeyprokhorenko I don't know if you have the answer, but is there any safe guards if the timestamp overflows actually? or do we just don't care at all for now? (like, leave the problem for the future generations?)

Issue: Support UUIDv6, UUIDv7, and UUIDv8 from RFC 9562 #89083

📚 Documentation preview 📚: https://cpython-previews--121119.org.readthedocs.build/

sergeyprokhorenko · 2024-06-28T11:12:53Z

@sergeyprokhorenko I don't know if you have the answer, but is there any safe guards if the timestamp overflows actually? or do we just don't care at all for now? (like, leave the problem for the future generations?)

You already have three counter overflow protections:

Very long counter (42 bits)
Counter segment (MSB) initialized to 0
Incremented timestamp on overflow

The timestamp will not be full for about 6900 years. If the system clock stops and the timestamp is used as a counter, it will also last a long time.

You can be absolutely calm

picnixz · 2024-06-28T11:30:48Z

Yes, but I wanted to know whether the RFC actually considered the case when you use your own offset. Let's say we want to generate a future UUID for some obscure reason, I was wondering "is there anything on that topics?" But I think I'll just leave it to future generations.

What I meant is "what do you do if the operation of incrementing the timestamp itself overflows"?

sergeyprokhorenko · 2024-06-28T11:35:21Z

Yes, but I wanted to know whether the RFC actually considered the case when you use your own offset. Let's say we want to generate a future UUID for some obscure reason, I was wondering "is there anything on that topics?" But I think I'll just leave it to future generations.

What I meant is "what do you do if the operation of incrementing the timestamp itself overflows"?

Don't set offsets to 6900 years or minus 2k years, and everything will be OK. Foolproofing is an implementation detail.

sergeyprokhorenko · 2024-06-28T13:09:52Z

When the timestamp goes beyond the upper or lower limit of the acceptable range, a zero offset can be applied. This is how I would do it. The RFC does not cover this issue.

I think timestamp offset could be a competitive advantage of this implementation without significant cost.

UUIDv1 can be forgotten and no longer upgraded. This is an outdated version

Lib/uuid.py

pretoriusdre · 2024-07-22T12:57:23Z

Great job on this PR. One thing...

In the get_counter_and_tail method:
rand = int.from_bytes(os.urandom(10))

Might I suggest to explicitly specify the required byteorder using the byteorder argument?

Running this code in an older python env gives an error:
TypeError: from_bytes() missing required argument 'byteorder' (pos 2)

It seems like some default is now provided, but in my opinion, this could lead to some ambiguity. See below:
https://discuss.python.org/t/what-should-be-the-default-value-for-int-to-bytes-byteorder/10616

There is another usage of int.from_bytes in the same uuid module, perhaps if the above is being addressed, this could be put within same scope.

picnixz · 2024-07-22T13:28:39Z

Running this code in an older python env gives an error:

This feature would only be put in 3.14 or later, so we can ignore this.

but in my opinion, this could lead to some ambiguity

It doesn't matter whether it's little or big endian here since we are only interested in randomness and not actual data. In addition, not specifying it might be a bit faster since the C implementation currently does:

    if (byteorder == NULL)
        little_endian = 0;
    else if (_PyUnicode_Equal(byteorder, &_Py_ID(little)))
        little_endian = 1;
    else if (_PyUnicode_Equal(byteorder, &_Py_ID(big)))
        little_endian = 0;

So, not specifying the byteorder, is equivalent to have byteorder being NULL out there, which saves a string comparison.

jnoring-pw

Just a few minor comments as I looked through this code (I'm interested in uuid7 suport in our project). Thanks for this! It's looking good.

Lib/uuid.py

Davidamgad2 · 2025-01-13T19:33:33Z

Hello,
Thank you so much for your efforts in advance!
May I ask is v7 will be merged soon and I will be able to use it?in django app
Thanks again!

picnixz · 2025-01-13T19:43:42Z

I would be happy to but I would need more core devs to support this. We have roughly until May 2025 to include it for 3.14. Hopefully we'll manage to!

picnixz · 2025-01-13T19:51:01Z

I'll have another look at Rust and other languages' UUIDv7 implementations. Then, I'll ask other core devs to review the PR. And hopefully, we'll have it before the beta release.

Davidamgad2 · 2025-01-13T20:01:15Z

I'm not sure if this allowed to but I found this repo for uuidv7 if this would help, for now

https://github.com/aminalaee/uuid-utils

Thanks again and keep it up! 🔥

picnixz · 2025-01-13T20:22:25Z

I'm not sure if this allowed to but I found this repo for uuidv7 if this would help, for now

I actually found other implementations (see the issue thread) but I want to be as compliant as I can to the RFC and closer to what other standard libraries use. Currently, the implementation follows the implementation of Rust as it was last summer but I'll check if this has changed since then.

picnixz · 2025-02-17T10:25:08Z

Following https://discuss.python.org/t/rfc-4122-9562-uuid-version-7-and-8-implementation/56725/, and considering that many core developers suggested to keep it aligned with the Rust implementation as a first iteration and not PostgreSQL, and that even the author of the UUIDv7 for PostgreSQL recommended that Python aligns itself with Rust to maintain portability on platforms lacking microsecond resolution system clocks, I decided to keep the current implementation.

hugovk · 2025-02-20T09:56:17Z

Lib/uuid.py

+_last_timestamp_v7 = None
+_last_counter_v7 = 0  # 42-bit counter
+
+def uuid7():


Slip in some PEP 8 whitespace:

Suggested change

_last_timestamp_v7 = None

_last_counter_v7 = 0 # 42-bit counter

def uuid7():

_last_timestamp_v7 = None

_last_counter_v7 = 0 # 42-bit counter

def uuid7():

The file is not really PEP-8 friendly because some functions are separated by two empty lines, others aren't. I would prefer either having all uuid functions without PEP-8 or all of them with (it feels a bit weird that uuid1 to uuid4 won't have the double spacing but UUID 6, 7, and 8 would).

Then add the rest too :) PEP 8: "although this is also an opportunity to clean up someone else’s mess (in true XP style)."

I guess I can do it. The module is very very slow for updates so I don't think we'll have issues with backports :)

hugovk · 2025-02-20T09:56:55Z

Lib/uuid.py

+    # by construction, the variant and version bits are already cleared
+    int_uuid_7 |= _RFC_4122_VERSION_7_FLAGS
+    return UUID._from_int(int_uuid_7)
+
 def uuid8(a=None, b=None, c=None):


Suggested change

def uuid8(a=None, b=None, c=None):

def uuid8(a=None, b=None, c=None):

Lib/uuid.py

picnixz added 6 commits June 28, 2024 11:40

add UUIDv7 implementation

42d55b4

add tests

6826fa1

blurb

edc2cab

update CHANGELOG

c6d26b6

update RFC number

2ddb4b8

add TODO in the docs

bcd1417

bedevere-app bot mentioned this pull request Jun 28, 2024

Support UUIDv6, UUIDv7, and UUIDv8 from RFC 9562 #89083

Open

bedevere-app bot added the awaiting review label Jun 28, 2024

This was referenced Jun 28, 2024

gh-89083: support UUID version 7 (monotonous version) (RFC 9562) [abandoned proposal] #120830

Closed

gh-89083: add support for UUID version 6 (RFC 9562) #120650

Draft

picnixz changed the title ~~gh-89083: add ref. impl. for UUID version 7 (RFC 9562)~~ gh-89083: add support for UUID version 7 (RFC 9562) Jun 28, 2024

picnixz mentioned this pull request Jun 30, 2024

[RFE] fields and time_* properties must not be used on UUIDs that are time-agnostic. #120878

Open

mastizada reviewed Jul 7, 2024

View reviewed changes

Lib/uuid.py Outdated Show resolved Hide resolved

sixcare mentioned this pull request Jul 8, 2024

Switch out UUIDv4 with UUIDv7 Turplanlegger/turplanlegger-fastapi#89

Open

Merge branch 'main' into uuid-v7-method-1

4630c8f

jnoring-pw reviewed Aug 20, 2024

View reviewed changes

Lib/uuid.py Outdated Show resolved Hide resolved

Lib/uuid.py Show resolved Hide resolved

picnixz added 6 commits August 21, 2024 13:32

Merge branch 'main' into uuid-v7-89083

cd80afb

add UUIDv8 implementation

c3d4745

add tests

392d289

blurb

26889ea

add What's New entry

44b66e6

add docs

7be6dc4

picnixz changed the title ~~gh-89083: add support for UUID version 7 (RFC 9562)~~ gh-89083: add support for UUID version 7 (RFC 9562) Aug 22, 2024

Improve hexadecimal masks reading

8ba3d8b

picnixz added 8 commits November 14, 2024 10:44

improve test comments

c18d0c4

Merge remote-tracking branch 'upstream/main'

2df6f41

fix lint

6fcb6a1

Merge branch 'main' into uuid-v7-89083

f6048c9

post-merge

be3f024

Merge branch 'main' into uuid-v7-89083

99c6761

use versionchanged instead of versionadded

06befca

Merge branch 'main' into uuid-v7-method-1

2aacadf

edgarrmondragon mentioned this pull request Nov 21, 2024

feature: consider using a sequential UID for a job's run ID meltano/meltano#8919

Open

picnixz added 5 commits December 5, 2024 20:59

Merge branch 'main' into uuid-v7-method-1

f7f536e

improve UUIDv7 tests readability

aee2898

improve UUIDv7 uniqueness tests

1a5ac19

Merge branch 'main' into uuid-v7-method-1

8764b28

Merge branch 'main' into uuid-v7-method-1

af0baef

picnixz added the type-feature A feature request or enhancement label Jan 13, 2025

picnixz added 3 commits January 20, 2025 12:58

Merge branch 'main' into feat/uuid/v7-89083

939b5a8

use UUID._from_int for UUIDv7 and remove divmod usage

ef85b20

Merge branch 'main' into uuid-v7-method-1

2d08821

Mardoxx mentioned this pull request Jan 26, 2025

Update to RFC 9562 YoussefEgla/uuid-v7#14

Open

Merge branch 'main' into uuid-v7-method-1

eaa9ad4

picnixz requested review from vstinner and hugovk February 17, 2025 10:22

hugovk reviewed Feb 20, 2025

View reviewed changes

backport Victor's review on UUIDv6

571d2fe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-89083: add support for UUID version 7 (RFC 9562) #121119

gh-89083: add support for UUID version 7 (RFC 9562) #121119

picnixz commented Jun 28, 2024 •

edited by github-actions bot

Loading

sergeyprokhorenko commented Jun 28, 2024 •

edited

Loading

picnixz commented Jun 28, 2024 •

edited

Loading

sergeyprokhorenko commented Jun 28, 2024 •

edited

Loading

sergeyprokhorenko commented Jun 28, 2024 •

edited

Loading

pretoriusdre commented Jul 22, 2024

picnixz commented Jul 22, 2024

jnoring-pw left a comment

Davidamgad2 commented Jan 13, 2025

picnixz commented Jan 13, 2025

picnixz commented Jan 13, 2025

Davidamgad2 commented Jan 13, 2025

picnixz commented Jan 13, 2025

picnixz commented Feb 17, 2025

hugovk Feb 20, 2025

picnixz Feb 22, 2025

hugovk Feb 22, 2025

picnixz Feb 22, 2025

hugovk Feb 20, 2025

	def uuid8(a=None, b=None, c=None):

	def uuid8(a=None, b=None, c=None):

gh-89083: add support for UUID version 7 (RFC 9562) #121119

Are you sure you want to change the base?

gh-89083: add support for UUID version 7 (RFC 9562) #121119

Conversation

picnixz commented Jun 28, 2024 • edited by github-actions bot Loading

sergeyprokhorenko commented Jun 28, 2024 • edited Loading

picnixz commented Jun 28, 2024 • edited Loading

sergeyprokhorenko commented Jun 28, 2024 • edited Loading

sergeyprokhorenko commented Jun 28, 2024 • edited Loading

pretoriusdre commented Jul 22, 2024

picnixz commented Jul 22, 2024

jnoring-pw left a comment

Choose a reason for hiding this comment

Davidamgad2 commented Jan 13, 2025

picnixz commented Jan 13, 2025

picnixz commented Jan 13, 2025

Davidamgad2 commented Jan 13, 2025

picnixz commented Jan 13, 2025

picnixz commented Feb 17, 2025

hugovk Feb 20, 2025

Choose a reason for hiding this comment

picnixz Feb 22, 2025

Choose a reason for hiding this comment

hugovk Feb 22, 2025

Choose a reason for hiding this comment

picnixz Feb 22, 2025

Choose a reason for hiding this comment

hugovk Feb 20, 2025

Choose a reason for hiding this comment

picnixz commented Jun 28, 2024 •

edited by github-actions bot

Loading

sergeyprokhorenko commented Jun 28, 2024 •

edited

Loading

picnixz commented Jun 28, 2024 •

edited

Loading

sergeyprokhorenko commented Jun 28, 2024 •

edited

Loading

sergeyprokhorenko commented Jun 28, 2024 •

edited

Loading