Timestamps and ACL caching #37

kjetilk · 2019-02-18T01:20:14Z

I started thinking about the design of an ACL cache, and figured some metadata set on the authorizations, using DC properties could help with that.

This proposal helps both with caching individual authorizations, like a specialized reverse proxy or a Solid app could, as well as recommendations for Solid servers to implement so that legacy HTTP caches may cache ACL resources.

I think this should be in the WAC spec, so I submit it as a proposal for future consideration.

gobengo · 2019-03-28T11:59:19Z

wrt adding timestamp rdf properties, using them for HTTP cache headers: I agree, and is this unique to Authorizations? To me it seems like this advice is good for any type of thing coming out of a solid server. Curious if others agree.

Without blocking this particular PR, should this advice be 'lifted' into a more general part of the solid specs?

kjetilk · 2019-03-28T22:05:10Z

wrt adding timestamp rdf properties, using them for HTTP cache headers: I agree, and is this unique to Authorizations? To me it seems like this advice is good for any type of thing coming out of a solid server. Curious if others agree.

Yeah, actually, I agree. :-) But, I note that for the WAC spec, we can specify how the RDF graph that specifies the authorization looks, but for other data, that is not so much the case...

Without blocking this particular PR, should this advice be 'lifted' into a more general part of the solid specs?

...so, in the interest of orthogonal specifications, I think this should be considered independently from other specs.

michielbdejong · 2019-04-25T06:58:15Z

I agree that any RDF source should produce an ETag header, so that clients can request it with an If-None-Match header, or use the similar less granular mechanism based on Last-Modified and if-Modified-Since.

I don't see why you would put those timestamps inside the data, though?

TallTed · 2019-04-25T13:37:02Z

@michielbdejong - It's worth noting that many systems do not properly track "Modified" dates for files, blurring lines with "Touched" and "Opened" (among other actions). Tracking modification datetime info explicitly within the data thus can have value.

That said, having worked with multiple systems that use such internal tracking, I can also say that relying on humans to (remember to) (accurately) do the work of changing those dates is similarly fraught with peril, so it would be good if increasingly intelligent technology could be brought to bear on it.

dmitrizagidulin · 2019-04-25T21:30:59Z

README.md

+and usability. Since servers must always use the most recent
+authorizations for operations, discrepancies between a client/proxy
+cache and what the server uses may arise if an application uses a
+stale authorization. That will not be security critical (since the the


Tiny typo - 'the' twice

dmitrizagidulin · 2019-04-25T21:35:06Z

@kjetilk I like this proposal. Especially the 'issued' and 'modified' attributes.

I'm a bit unsure about the 'valid' predicate, though. Is the intention to just use it for cache control? In which case, maybe an explicit cache control header from an http-headers ontology would be clearer?

If the intention is broader, I suspect this might be a bit confusing in terms of user interface and usability. What's the pain point that the 'valid' term is solving?

michielbdejong · 2019-04-26T10:21:29Z

many systems do not properly track "Modified" dates for files

That's irrelevant for the Solid spec, right? We can just warn against that in the spec, saying, beware if you implement your storage layer directly on a file system, looking at the mtime might not be good enough to implement proper ETags. The server should probably generate the ETag in code, and store it explicitly alongside the data?

I'm not (yet) convinced that the possible advantages of the changes suggested in this PR merit their cost.

In any case, and in a separate note, I think we should use a versioned spec, not a living document, so the 0.7 spec as it stands now will forever keep pointing at its snapshot versions of the various sub-specs, and at some point we need to do a triage round to establish which proposals would be eligible for going into the 0.8 spec (and I'm hoping we can postpone that until at least the end of 2018).

kjetilk · 2019-04-26T19:18:09Z

I don't see why you would put those timestamps inside the data, though?

I tried to explain that in the spec itself, but I'll be happy to further clarify. There are a few reasons for this: It gives increased granularity, as you can cache individual authorizations, not just at a "ACL file level". It also provides orthogonality to the HTTP protocol, you don't need to rely on things being served over HTTP to use caching. I think this will be very important soon in an IoT world. Another aspect of this is that you don't need a separate layer of storage to manage these times, they are right there in the authorizations.

kjetilk · 2019-04-26T19:31:02Z

many systems do not properly track "Modified" dates for files

That's irrelevant for the Solid spec, right? We can just warn against that in the spec, saying, beware if you implement your storage layer directly on a file system, looking at the mtime might not be good enough to implement proper ETags. The server should probably generate the ETag in code, and store it explicitly alongside the data?

I think we should keep mtime and ETag completely separate. They are two different things. Etags can be computed in many ways, and it is very important to get them right, but it is orthogonal to mtime.

So, there are two topics of importance here, one is caching generally, and one is an implementation detail of an ACL cache.

As I said above, caching generally should not have to rely on the "ACL file" as the smallest unit of caching, it should be possible for a specialized cache to consider each individual authorization as the smallest unit.

For the ACL cache that needs to be present for performance reasons in the actual authorization process, it is also as a matter of practicality, you don't want to look up the mtime on the backend if you can rely on that they are correct in your ACL cache, but you ACL cache should not consider your backend. The ACL cache should basically be a memory quad store that can be queried for authorizations really fast, and getting the mtimes from the ACL cache should also be a really fast operation.

So, either you store it with the authorization itself, or you store it in a separate resource, but I think that would be a bad design. To say that an authorization itself has been modified at a certain time is exactly what you should say, and this is saying it.

kjetilk · 2019-04-26T19:38:38Z

@kjetilk I like this proposal. Especially the 'issued' and 'modified' attributes.

I'm a bit unsure about the 'valid' predicate, though. Is the intention to just use it for cache control? In which case, maybe an explicit cache control header from an http-headers ontology would be clearer?

If the intention is broader, I suspect this might be a bit confusing in terms of user interface and usability. What's the pain point that the 'valid' term is solving?

Yeah, well, I'm not sure the max-age has a good place in the HTTP headers ontology, but if it did, it would indeed be more precise.

So, I mostly chose the dct:valid predicate for consistency with the others, and possibly some unexpected reuse. You never know, what people might use it for if an authorization is said to be valid up to a certain time. :-)

Now, it was motivated from the observation I had that parsing a simple ACL file came at about 200 ms cost. That's quite a lot. We will be looking up ACL files for pretty much everything, it will be a situation where every millisecond counts when we get to the point that UX is based on the integration of a lot of resources. To not have to look up an ACL at all, but to use it without further ado from an ACL cache could mean a lot of milliseconds. :-)

It is certainly possible to use a different predicate for it, but I think it is a good fit myself.

TallTed · 2019-07-31T21:48:24Z

Needs a round of conflict resolution, @kjetilk

kjetilk added 14 commits February 12, 2019 15:01

Add namespace prefix for dc

c2a4030

Describe the use of timestamps in relation to authorizations

3843ee2

forgot syntax quotes

47e6c23

On using the timestamp metadata for caching

ef0b35a

On setting dc:valid in clients

9abb4ad

Servers must use current authorization

7d6770b

Comment on heuristic freshness (which could cause inconsistencies)

d3a24a8

minor change

0b7c77f

Note on group listings

1774c82

Update ToC

afbcb98

Improve language

6563a87

Elaborate on stale caches

3b64400

skip namespace for clarity and brevity

d05be54

Minor improvements

50546b6

kjetilk requested review from timbl, dmitrizagidulin, csarven and RubenVerborgh February 18, 2019 01:20

kjetilk added this to the Spec Pull Requests milestone Apr 15, 2019

dmitrizagidulin reviewed Apr 25, 2019

View reviewed changes

kjetilk mentioned this pull request Sep 25, 2019

Metadata to help cache individual authorizations solid/authorization-panel#42

Open

csarven self-assigned this May 5, 2021

csarven mentioned this pull request Jun 17, 2021

Initial Editor’s Draft of the WAC specification #83

Merged

This was referenced Jul 1, 2021

Clarify what happens with semantically invalid WebAC resources solid/specification#56

Closed

Add time constraints to WAC rules #87

Open

Babydolljenny1985 approved these changes Apr 2, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Timestamps and ACL caching #37

Timestamps and ACL caching #37

kjetilk commented Feb 18, 2019 •

edited

Loading

gobengo commented Mar 28, 2019 •

edited

Loading

kjetilk commented Mar 28, 2019

michielbdejong commented Apr 25, 2019

TallTed commented Apr 25, 2019

dmitrizagidulin Apr 25, 2019

dmitrizagidulin commented Apr 25, 2019

michielbdejong commented Apr 26, 2019

kjetilk commented Apr 26, 2019

kjetilk commented Apr 26, 2019

kjetilk commented Apr 26, 2019

TallTed commented Jul 31, 2019

Timestamps and ACL caching #37

Are you sure you want to change the base?

Timestamps and ACL caching #37

Conversation

kjetilk commented Feb 18, 2019 • edited Loading

gobengo commented Mar 28, 2019 • edited Loading

kjetilk commented Mar 28, 2019

michielbdejong commented Apr 25, 2019

TallTed commented Apr 25, 2019

dmitrizagidulin Apr 25, 2019

Choose a reason for hiding this comment

dmitrizagidulin commented Apr 25, 2019

michielbdejong commented Apr 26, 2019

kjetilk commented Apr 26, 2019

kjetilk commented Apr 26, 2019

kjetilk commented Apr 26, 2019

TallTed commented Jul 31, 2019

kjetilk commented Feb 18, 2019 •

edited

Loading

gobengo commented Mar 28, 2019 •

edited

Loading