Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix type hints #132

Merged
merged 4 commits into from
Feb 13, 2024
Merged

Fix type hints #132

merged 4 commits into from
Feb 13, 2024

Conversation

benoit74
Copy link
Collaborator

@benoit74 benoit74 commented Feb 12, 2024

Fix #109
Fix #112

Rationale

Type hints were too restrictive / incorrect

Changes

  • Item
    • for **kwargs, the type specified must be the one of individual param values, not the whole dictionary
    • the Union[str, bool, bytes] type hint was replaced by Any, because the goal of this param is precisely to pass any type which could be further used
    • FileItem and URLItem were modified as well because they were very close to StaticItem
    • no change was necessary in StaticItem in fact
    • references to callback in docstring has been removed because it has been moved to the add_item method quite few years ago
  • convert_image
    • src and dst can be either a Pathlib.Path or an io.BytesIO (maybe even typing.BinaryIO or typing.IO[bytes], but I did not find a convenient way to test this to confirm that adherence to typing.IO[bytes] contract is sufficient or if we need something additional from io.BytesIO)
    • for **params, the type specified must be the one of individual param values, not the whole dictionary ; based on existing codebase across all scrapers I assumed that str is sufficient.
    • save_image had to be changed as well to reflect changes made in convert_image

@benoit74 benoit74 self-assigned this Feb 12, 2024
Copy link

codecov bot commented Feb 12, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (5d95d59) 100.00% compared to head (cd25285) 100.00%.

❗ Current head cd25285 differs from pull request most recent head 824504f. Consider uploading reports for the commit 824504f to get more accurate results

Additional details and impacted files
@@            Coverage Diff            @@
##              main      #132   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           32        32           
  Lines         1345      1370   +25     
  Branches       229       237    +8     
=========================================
+ Hits          1345      1370   +25     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@benoit74 benoit74 marked this pull request as ready for review February 13, 2024 07:42
@benoit74 benoit74 requested a review from rgaudin February 13, 2024 07:42
Copy link
Member

@rgaudin rgaudin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good ; please see my comments regarding kwargs

@@ -106,7 +104,7 @@ def download_for_size(url, on_disk, tmp_dir=None):
size, _ = stream_file(url.geturl(), fpath=fpath, byte_stream=stream)
return fpath or stream, size

def __init__(self, url: str, **kwargs):
def __init__(self, url: str, **kwargs: Any):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think that all expected (for which we have processing in the class) values should be individual optionnal keyword arguments with they expected type. We can leave Any for the rest

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same logic would apply to classes inheriting Item

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can only agree :)

@benoit74
Copy link
Collaborator Author

Changes done

Nota: please do not merge, I will rebase/push-force before merge first since another PR has been merged in the mean time.

@benoit74 benoit74 requested a review from rgaudin February 13, 2024 14:41
Copy link
Member

@rgaudin rgaudin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me but we're now storing several variables per object for stuff that might never be set.
Given those Item are created in very huge number and only released at an unknown/uncontrolled point in the future (when libzim is done copying to disk) I wonder if it's a good idea.

@benoit74
Copy link
Collaborator Author

I don't know ... you are right, but somehow this is the price to pay to have proper typing in the whole class, otherwise we will still rely on get_attr and mostly untyped values.

Should we give it a try or rollback the change?

@benoit74
Copy link
Collaborator Author

By rollbacking the change I mean changing only the method signatures, it is already a step forward.

@rgaudin
Copy link
Member

rgaudin commented Feb 13, 2024

Yes the method signature is what's most important because that's the public API.

Could you use the getattr way for the optional stuff in an independent commit so we have a better view at both options?

@benoit74
Copy link
Collaborator Author

I pushed one more commit to "rollback" what is necessary to keep the get_attr approach while still enhancing the public API.

I tend to admit I prefer this "middle-ground" approach.

@benoit74 benoit74 requested a review from rgaudin February 13, 2024 16:02
Copy link
Member

@rgaudin rgaudin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep, I also think I prefer this one. Good to me

We might have many Items, some of them long-lived, so it is maybe better
to not store many properties with a None value because they are unused
and hence keep a lean memory footprint
@benoit74
Copy link
Collaborator Author

Nota: last push is just a rebase on main last commit.

@benoit74 benoit74 merged commit 5558962 into main Feb 13, 2024
@benoit74 benoit74 deleted the fix_type_hints branch February 13, 2024 16:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Type hints of convert_image is not correct. Type hints of __init__ in StaticItem are not correct
2 participants