Skip to content
  • Sponsor pandas-dev/pandas-stubs

  • Notifications You must be signed in to change notification settings
  • Fork 140
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use UnknownSeries in series.pyi #1177

Merged
merged 2 commits into from
Mar 18, 2025

Conversation

MarcoGorelli
Copy link
Member

@MarcoGorelli MarcoGorelli commented Mar 18, 2025

ref #1171

  • Closes #xxxx (Replace xxxx with the Github issue number)
  • Tests added: Please use assert_type() to assert the type of any return value

These all look like cases where:

  • value-dependent behaviour makes narrowing the inner type untractable (e.g. in align, 'int64' might get upcast to 'float64' depending on the values, which aren't known statically)
  • another generic is instantiated, like in Rolling. .rolling_sum. Currently these all return 'float64', but it's not clear to me that they should? For nullable types, for example, 'int' could be preserved? Given that this might be considered a bug in pandas, I think just keeping UnknownSeries is safest for now? pandas issue for reference: BUG(?): rolling sum with pyarrow types results in float64 instead of preserving integer type pandas#61144

@MarcoGorelli MarcoGorelli marked this pull request as ready for review March 18, 2025 15:02

Unverified

This commit is not signed, but one or more authors requires that any commit attributed to them is signed.
@MarcoGorelli MarcoGorelli force-pushed the unknown-series-in-series branch from 2ecc29a to b3b608e Compare March 18, 2025 15:06
Copy link
Collaborator

@Dr-Irv Dr-Irv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we may want to add a test like this (maybe more tests - not sure):

def test_unknown() -> None:
    s = pd.Series(pd.NA)

    def foo(sf: pd.Series):
        pass

    foo(s)

    check(assert_type(s + pd.Series([1]), pd.Series), pd.Series)

Idea is to make sure that an UnknownSeries can be passed and that arithmetic works. I'm fine for now if you just add the above.

Eventually, we need to deal with this comment in a PR: #1093 (comment) (and other comments in that PR).

But the incremental approach is fine. Would be interesting to convert all references in the stubs from plain Series to UnknownSeries.

Copy link
Collaborator

@Dr-Irv Dr-Irv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks @MarcoGorelli

@Dr-Irv Dr-Irv merged commit 7531ef8 into pandas-dev:main Mar 18, 2025
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants