Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standardize POST canonicalization query strings with pywb #68

Merged
merged 2 commits into from
Aug 18, 2024

Conversation

tw4l
Copy link
Member

@tw4l tw4l commented Apr 3, 2024

Fixes #58

This PR modifies canonicalization of non-GET request bodies into query strings to be consistent across Webrecorder projects. It introduces a new test case that is also in pywb to help ensure parity.

This is part of a cross-repo effort to standardize how POST canonicalization works in Webrecorder tools, and document this in a Webrecorder specfiication.

Todo

Still needs testing against fuzzy matching to ensure that there aren't unintended side effects to the change.

tw4l added 2 commits April 2, 2024 13:02
This is part of bringing consistency between pywb and warcio.js
with regard to how query parameters are constructed from request
bodies for non-GET requests.

The more complicated test is duplicated in pywb to help ensure
that the results are consistent across our toolsets/langauges.
@tw4l tw4l changed the title Issue 58 request body standardization Standardize POST canonicalization query strings with pywb Apr 3, 2024
@tw4l tw4l requested a review from ikreymer April 8, 2024 15:32
@tw4l tw4l marked this pull request as ready for review April 8, 2024 15:32
@ikreymer
Copy link
Member

Tested with fuzzy matching, the keys are more accurate now, so should only help with fuzzy matching in wabac.js!

@ikreymer ikreymer merged commit f151081 into main Aug 18, 2024
6 checks passed
@ikreymer ikreymer deleted the issue-58-request-body-standardization branch August 24, 2024 04:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Make POST/non-GET URL canonicalization consistent with pywb
2 participants