Discover and use the cache authorization token #24

bbockelm · 2025-01-11T17:26:49Z

If a cache authorization token file is provided in the plugin's configuration, then periodically read it out and use it in the generated curl requests.

If the plugin is configured to use the cache authorization token, then read it periodically from the file and add it to the HTTP request via the query parameters. Includes relevant unit and integration tests.
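As an illustration of the "read it periodically from the file" behavior described above, here is a minimal sketch. It is not the code from this PR: the class name `TokenFile`, the refresh-interval parameter, and the choice to keep the previous token when a read fails are all assumptions made for the example. The sketch is also not thread-safe; a real plugin would presumably need locking around the cached value.

```cpp
#include <chrono>
#include <fstream>
#include <iterator>
#include <string>
#include <utility>

// Hypothetical helper (not from the PR): caches the contents of a token
// file and re-reads it once the refresh interval has elapsed.
class TokenFile {
  public:
    using Clock = std::chrono::steady_clock;

    TokenFile(std::string path, Clock::duration refresh_interval)
        : m_path(std::move(path)), m_interval(refresh_interval) {}

    // Return the cached token, re-reading the file if it was never loaded
    // or if at least one refresh interval has passed. `now` is a parameter
    // so callers (and tests) can control the clock.
    const std::string &Get(Clock::time_point now = Clock::now()) {
        if (!m_loaded || now - m_last_read >= m_interval) {
            Reload();
            m_last_read = now;
            m_loaded = true;
        }
        return m_token;
    }

  private:
    void Reload() {
        std::ifstream in(m_path);
        if (!in) {
            return; // Assumption: keep the previous token if the read fails.
        }
        std::string contents((std::istreambuf_iterator<char>(in)),
                             std::istreambuf_iterator<char>());
        // Trim surrounding whitespace, e.g. a trailing newline.
        const auto first = contents.find_first_not_of(" \t\r\n");
        if (first == std::string::npos) {
            m_token.clear();
            return;
        }
        const auto last = contents.find_last_not_of(" \t\r\n");
        m_token = contents.substr(first, last - first + 1);
    }

    std::string m_path;
    Clock::duration m_interval;
    Clock::time_point m_last_read{};
    bool m_loaded{false};
    std::string m_token;
};
```

A caller would construct one `TokenFile` per configured path and call `Get()` each time a curl request is prepared; the file is only touched when the interval has expired.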

jhiemstrawisc

I only had a few light questions/comments, so I'll approve ahead of time.

jhiemstrawisc · 2025-01-20T18:30:21Z

src/CurlWorker.hh

+    // Configure a curl handle to use the current cache token
+    //
+    // Adds the token to the curl handle URL's query string
+    // parameter `access_token`, as specified in RFC 6750, Sec 2.3.


Two sets of questions here:

Section 2.3 of the RFC you link to warns:

Because of the security weaknesses associated with the URI method, including the high likelihood that the URL containing the access token will be logged, it SHOULD NOT be used unless it is impossible to transport the access token in the "Authorization" request header field or the HTTP request entity-body.

My assumption is that you're using the query param instead of the Authz header because the Authz header likely contains a JWT for the origin. Can you confirm or deny that with a brief comment? If I'm not correct, what's the logic for doing something the RFC explicitly warns against?

Does this imply the origin receives the two tokens, one client token potentially in the Authz header and one cache token as a URL query param under access_token? Why not use a simple JWT like we use everywhere else in Pelican Authz?

bbockelm · 2025-01-30

We need to get two tokens to the origin -- one identifying the cache, one showing that there was at least one valid user request to the cache.

I started with the idea of putting both in the Authorization header (as suggested in the RFC text you quote). However, that ran aground in upstream XRootD because sending two Authorization headers is not kosher: a header field is only supposed to appear twice in very limited cases (where the header is explicitly allowed to repeat or where its value is a comma-separated list).

So, eliminating that, this was the remaining option. The "good news" here is that we already have log scrubbing for tokens, meaning there's mitigation for the explicit issue called out. This URL is also not exposed to the browser, meaning there's no likelihood of a user copy/pasting it.
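To make the log-scrubbing mitigation concrete, here is a rough sketch of what redacting the `access_token` query parameter from a URL before logging could look like. This is not the existing scrubbing code referred to above, which is not shown in this PR; the function name `ScrubAccessToken` and the `REDACTED` placeholder are made up for the example.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical helper (not the project's scrubber): replace the value of
// every `access_token` query parameter in `url` with "REDACTED".
std::string ScrubAccessToken(std::string_view url) {
    constexpr std::string_view kParam = "access_token=";
    std::string out;
    std::size_t pos = 0;
    while (pos < url.size()) {
        const auto hit = url.find(kParam, pos);
        if (hit == std::string_view::npos) {
            break;
        }
        const auto value_start = hit + kParam.size();
        // Copy everything up to and including "access_token=".
        out.append(url.substr(pos, value_start - pos));
        // Only redact when the match starts a parameter, so that names
        // such as `my_access_token` are left untouched.
        const bool at_param_start =
            hit > 0 && (url[hit - 1] == '?' || url[hit - 1] == '&');
        if (!at_param_start) {
            pos = value_start;
            continue;
        }
        // The value runs until the next parameter, a fragment, or the end.
        auto value_end = url.find_first_of("&#", value_start);
        if (value_end == std::string_view::npos) {
            value_end = url.size();
        }
        out.append("REDACTED");
        pos = value_end;
    }
    out.append(url.substr(pos));
    return out;
}
```

With this sketch, a URL ending in `?a=1&access_token=abc&b=2` would be logged as `?a=1&access_token=REDACTED&b=2`.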

jhiemstrawisc · 2025-01-20T18:42:13Z

src/CurlUtil.cc

+    std::string_view url{url_char};
+    auto has_query_string = url.find('?') != std::string::npos;
+    std::string final_url{url};
+    final_url += has_query_string ? "&" : "?";
+    final_url += "access_token=";
+    final_url += token;


This type of URL parsing feels like it should be common enough (if not already, then eventually) to make a utility function.

bbockelm · 2025-01-31

Agreed!

This has a lot of the characteristics of a generic URL parser -- but only targets extracting a single parameter. I think I'm going to keep it as-is for this PR but tackle things next time we need a "one-off".
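For reference, the kind of utility function discussed in this thread might look roughly like the sketch below. It is not part of this PR; the names `PercentEncode` and `AppendQueryParam` are hypothetical. Compared with the inline code in the diff, it also percent-encodes the key and value and keeps any `#fragment` at the end of the URL. Those are additions for the sake of the example, not behavior the PR claims to have.

```cpp
#include <string>
#include <string_view>

// RFC 3986 "unreserved" characters never need percent-encoding.
inline bool IsUnreserved(unsigned char c) {
    return (c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z') ||
           (c >= '0' && c <= '9') || c == '-' || c == '.' || c == '_' ||
           c == '~';
}

// Percent-encode every byte that is not an unreserved character.
inline std::string PercentEncode(std::string_view in) {
    static constexpr char kHex[] = "0123456789ABCDEF";
    std::string out;
    out.reserve(in.size());
    for (unsigned char c : in) {
        if (IsUnreserved(c)) {
            out.push_back(static_cast<char>(c));
        } else {
            out.push_back('%');
            out.push_back(kHex[c >> 4]);
            out.push_back(kHex[c & 0x0F]);
        }
    }
    return out;
}

// Hypothetical helper: append `key=value` to the query string of `url`,
// adding '?' or '&' as needed and leaving any fragment at the end.
inline std::string AppendQueryParam(std::string_view url, std::string_view key,
                                    std::string_view value) {
    const auto frag_pos = url.find('#');
    const std::string_view base = url.substr(0, frag_pos);
    const std::string_view fragment =
        frag_pos == std::string_view::npos ? std::string_view{}
                                           : url.substr(frag_pos);
    std::string result{base};
    if (base.find('?') == std::string_view::npos) {
        result += '?';
    } else if (base.back() != '?' && base.back() != '&') {
        result += '&';
    }
    result += PercentEncode(key);
    result += '=';
    result += PercentEncode(value);
    result += fragment;
    return result;
}
```

With such a helper, the call site would reduce to something like `AppendQueryParam(url_char, "access_token", token)`. For a token made up only of unreserved characters (as a base64url-encoded JWT is), the encoding step leaves the value unchanged.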

bbockelm · 2025-01-31T00:52:04Z

Looks like GitHub was having A Morning today and managed to duplicate my comments a few times! Let's see what happens when I merge... 😆

bbockelm added 2 commits January 11, 2025 11:22

Discover and use the cache authorization token

a448f83

If the plugin is configured to use the cache authorization token, then read it periodically from the file and add it to the HTTP request via the query parameters. Includes relevant unit and integration tests.

Early exit of setup/teardown if the pelican process is dead

f7334f2

bbockelm requested a review from jhiemstrawisc January 11, 2025 17:31

bbockelm mentioned this pull request Jan 18, 2025

Enable cache token request code PelicanPlatform/pelican#1920

Open

jhiemstrawisc approved these changes Jan 20, 2025

bbockelm merged commit 7bb9680 into PelicanPlatform:main Jan 31, 2025
1 check passed

jhiemstrawisc mentioned this pull request Feb 21, 2025

Add federation token endpoint in Director, and implement routine for caches to fetch one PelicanPlatform/pelican#1985

Merged
