From 531561709e551cca6d4bc8e8ce03980ba438470b Mon Sep 17 00:00:00 2001
From:  <cparmet@gmail.com>
Date: Thu, 27 Jun 2024 12:39:14 -0400
Subject: [PATCH] Deployed 221c5eb with MkDocs version: 1.6.0

---
 API reference/DataFrameChecks/index.html | 3 ++-
 API reference/SeriesChecks/index.html    | 3 ++-
 search/search_index.json                 | 2 +-
 3 files changed, 5 insertions(+), 3 deletions(-)
diff --git a/API reference/DataFrameChecks/index.html b/API reference/DataFrameChecks/index.html
index c1681a8..bc3f81c 100644
--- a/API reference/DataFrameChecks/index.html	
+++ b/API reference/DataFrameChecks/index.html	
@@ -3181,8 +3181,9 @@ <h2 id="pandas_checks.DataFrameChecks.DataFrameChecks.assert_type" class="doc do
       </thead>
       <tbody>
           <tr class="doc-section-item">
-            <td><code>type</code></td>
+            <td><code>dtype</code></td>
             <td>
+                  <code><span title="typing.Type">Type</span>[<span title="typing.Any">Any</span>]</code>
             </td>
             <td>
               <div class="doc-md-description">
diff --git a/API reference/SeriesChecks/index.html b/API reference/SeriesChecks/index.html
index c73068a..4b57961 100644
--- a/API reference/SeriesChecks/index.html	
+++ b/API reference/SeriesChecks/index.html	
@@ -2496,8 +2496,9 @@ <h2 id="pandas_checks.SeriesChecks.SeriesChecks.assert_type" class="doc doc-head
       </thead>
       <tbody>
           <tr class="doc-section-item">
-            <td><code>type</code></td>
+            <td><code>dtype</code></td>
             <td>
+                  <code><span title="typing.Type">Type</span>[<span title="typing.Any">Any</span>]</code>
             </td>
             <td>
               <div class="doc-md-description">
diff --git a/search/search_index.json b/search/search_index.json
index 841b445..a21104d 100644
--- a/search/search_index.json
+++ b/search/search_index.json
@@ -1 +1 @@
-{"config":{"lang":["en"],"separator":"[\\s\\-]+","pipeline":["stopWordFilter"]},"docs":[{"location":"","title":"About","text":""},{"location":"#introduction","title":"Introduction","text":"<p>Pandas Checks is a Python library for data science and data engineering. It adds non-invasive health checks for Pandas method chains.</p>"},{"location":"#what-are-method-chains","title":"What are method chains?","text":"<p>Method chains are one of the coolest features of the Pandas library! They allow you to write more functional code with fewer intermediate variables and fewer side effects. If you're familiar with R, method chains are Python's version of dplyr pipes.</p>"},{"location":"#why-use-pandas-checks","title":"Why use Pandas Checks?","text":"<p>Pandas Checks adds the ability to inspect and validate your Pandas data at any point in the method chain, without modifying the underlying data. Think of Pandas Checks as a drone you can send up to check on your pipeline, whether it's in exploratory data analysis, prototyping, or production.</p> <p>That way you don't need to chop up a method chain, or create intermediate variables, every time you need to diagnose, treat, or prevent problems with your data processing pipeline.</p> <p>As Fleetwood Mac says, you would never break the chain.</p> <p></p>"},{"location":"#giving-feedback-and-contributing","title":"Giving feedback and contributing","text":"<p>If you run into trouble or have questions, I'd love to know. Please open an issue.</p> <p>Contributions are appreciated! Please open an issue or submit a pull request. Pandas Checks uses the wonderful libraries poetry for package and dependency management, nox for test automation, and mkdocs for docs.</p>"},{"location":"#license","title":"License","text":"<p>Pandas Checks is licensed under the BSD-3 License.</p> <p>\ud83d\udc3c\ud83e\ude7a</p>"},{"location":"usage/","title":"Usage","text":""},{"location":"usage/#installation","title":"Installation","text":"<p>First make Pandas Check available in your environment.</p> <pre><code>pip install pandas-checks\n</code></pre> <p>Then import it in your code. It works in Jupyter, IPython, and Python scripts run from the command line.</p> <pre><code>import pandas_checks\n</code></pre> <p>After importing, you don't need to access the <code>pandas_checks</code> module directly.</p> <p>\ud83d\udca1 Tip: You can import Pandas Checks either before or after your code imports Pandas. Just somewhere. \ud83d\ude01</p>"},{"location":"usage/#basic-usage","title":"Basic usage","text":"<p>Pandas Checks adds <code>.check</code> methods to Pandas DataFrames and Series. </p> <p>Say you have a nice function.</p> <pre><code>\ndef clean_iris_data(iris: pd.DataFrame) -&gt; pd.DataFrame:\n    \"\"\"Preprocess data about pretty flowers.\n\n    Args:\n        iris: The raw iris dataset.\n\n    Returns:\n        The cleaned iris dataset.\n    \"\"\"\n\n    return (\n        iris\n        .dropna() # Drop rows with any null values\n        .rename(columns={\"FLOWER_SPECIES\": \"species\"}) # Rename a column\n        .query(\"species=='setosa'\") # Filter to rows with a certain value\n    )\n</code></pre> <p>But what if you want to make the chain more robust? Or see what's happening to the data as it flows down the pipeline? Or understand why your new <code>iris</code> CSV suddenly makes the cleaned data look weird? </p> <p>You can add some <code>.check</code> steps.</p> <pre><code>\n(\n    iris\n    .dropna()\n    .rename(columns={\"FLOWER_SPECIES\": \"species\"})\n\n    # Validate assumptions\n    .check.assert_positive(subset=[\"petal_length\", \"sepal_length\"])\n\n    # Plot the distribution of a column after cleaning\n    .check.hist(column='petal_length') \n\n    .query(\"species=='setosa'\")\n\n    # Display the first few rows after cleaning\n    .check.head(3)  \n)\n</code></pre> <p>The <code>.check</code> methods will display the following results:</p> <p></p> <p>The <code>.check</code> methods didn't modify how the <code>iris</code> data is processed by your code. They just let you check the data as it flows down the pipeline. That's the difference between Pandas <code>.head()</code> and Pandas Checks <code>.check.head()</code>.</p>"},{"location":"usage/#features","title":"Features","text":""},{"location":"usage/#check-methods","title":"Check methods","text":"<p>Here's what's in the doctor's bag.</p> <p>Describe     - Standard Pandas methods:         - <code>.check.columns()</code> - DataFrame | Series         - <code>.check.dtypes()</code> for DataFrame | <code>.check.dtype()</code> for Series         - <code>.check.describe()</code> - DataFrame | Series         - <code>.check.head()</code> - DataFrame | Series         - <code>.check.info()</code> - DataFrame | Series         - <code>.check.memory_usage()</code> - DataFrame | Series         - <code>.check.nunique()</code> - DataFrame | Series         - <code>.check.shape()</code> - DataFrame | Series         - <code>.check.tail()</code> - DataFrame | Series         - <code>.check.unique()</code> - DataFrame | Series         - <code>.check.value_counts()</code> - DataFrame | Series     - New functions in Pandas Checks:         - <code>.check.function()</code>: Apply an arbitrary lambda function to your data and see the result - DataFrame | Series         - <code>.check.ncols()</code>: Count columns - DataFrame | Series         - <code>.check.ndups()</code>: Count rows with duplicate values - DataFrame | Series         - <code>.check.nnulls()</code>: Count rows with null values - DataFrame | Series         - <code>.check.print()</code>: Print a string, a variable, or the current dataframe - DataFrame | Series</p> <ul> <li> <p>Export interim files</p> <ul> <li><code>.check.write()</code>: Export the current data, inferring file format from the name - DataFrame | Series</li> </ul> </li> <li> <p>Time your code</p> <ul> <li><code>.check.print_time_elapsed(start_time)</code>: Print the execution time since you called <code>start_time = pdc.start_timer()</code> - DataFrame | Series</li> <li> <p>\ud83d\udca1 Tip:  You can also use this stopwatch outside a method chain, anywhere in your Python code:  </p> <p>```python from pandas_checks import print_elapsed_time, start_timer</p> <p>start_time = start_timer() ... print_elapsed_time(start_time) ```</p> </li> </ul> </li> <li> <p>Turn off Pandas Checks</p> <ul> <li><code>.check.disable_checks()</code>: Don't run checks, for production mode etc. By default, still runs assertions. - DataFrame | Series</li> <li><code>.check.enable_checks()</code>: Run checks - DataFrame | Series</li> </ul> </li> <li> <p>Validate </p> <ul> <li>General<ul> <li><code>.check.assert_data()</code>: Check that data passes an arbitrary condition - DataFrame | Series</li> </ul> </li> <li>Types<ul> <li><code>.check.assert_datetime()</code> - DataFrame | Series</li> <li><code>.check.assert_float()</code> - DataFrame | Series</li> <li><code>.check.assert_int()</code> - DataFrame | Series</li> <li><code>.check.assert_str()</code> - DataFrame | Series</li> <li><code>.check.assert_timedelta()</code> - DataFrame | Series</li> <li><code>.check.assert_type()</code> - DataFrame | Series</li> </ul> </li> <li>Values<ul> <li><code>.check.assert_less_than()</code> - DataFrame | Series</li> <li><code>.check.assert_greater_than()</code> - DataFrame | Series</li> <li><code>.check.assert_negative()</code> - DataFrame | Series</li> <li><code>.check.assert_not_null()</code> - DataFrame | Series</li> <li><code>.check.assert_null()</code> - DataFrame | Series</li> <li><code>.check.assert_positive()</code> - DataFrame | Series</li> <li><code>.check.assert_unique()</code> - DataFrame | Series</li> </ul> </li> </ul> </li> <li> <p>Visualize</p> <ul> <li><code>.check.hist()</code>: A histogram - DataFrame | Series</li> <li><code>.check.plot()</code>: An arbitrary plot you can customize - DataFrame | Series</li> </ul> </li> </ul>"},{"location":"usage/#customizing-a-check","title":"Customizing a check","text":"<p>You can use Pandas Checks methods like the regular Pandas methods. They accept the same arguments. For example, you can pass: * <code>.check.head(7)</code> * <code>.check.value_counts(column=\"species\", dropna=False, normalize=True)</code> * <code>.check.plot(kind=\"scatter\", x=\"sepal_width\", y=\"sepal_length\")</code></p> <p>Also, most Pandas Checks methods accept 3 additional arguments: 1. <code>check_name</code>: text to display before the result of the check 2. <code>fn</code>: a lambda function that modifies the data displayed by the check 3. <code>subset</code>: limit a check to certain columns</p> <pre><code>(\n    iris\n    .check.value_counts(column='species', check_name=\"Varieties after data cleaning\")\n    .assign(species=lambda df: df[\"species\"].str.upper()) # Do your regular Pandas data processing, like upper-casing the values in one column\n    .check.head(n=2, fn=lambda df: df[\"petal_width\"]*2) # Modify the data that gets displayed in the check only\n    .check.describe(subset=['sepal_width', 'sepal_length'])  # Only apply the check to certain columns\n)\n</code></pre> <p></p>"},{"location":"usage/#configuring-pandas-check","title":"Configuring Pandas Check","text":""},{"location":"usage/#global-configuration","title":"Global configuration","text":"<p>You can change how Pandas Checks works everywhere. For example:</p> <pre><code>import pandas_checks as pdc\n\n# Set output precision and turn off the cute emojis\npdc.set_format(precision=3, use_emojis=False)\n\n# Don't run any of the calls to Pandas Checks, globally. \npdc.disable_checks()\n</code></pre> <p>Run <code>pdc.describe_options()</code> to see the arguments you can pass to <code>.set_format()</code>.</p> <p>\ud83d\udca1 Tip: By default, <code>disable_checks()</code> and <code>enable_checks()</code> do not change whether Pandas Checks will run assertion methods (<code>.check.assert_*</code>). </p> <p>To turn off assertions too, add the argument <code>enable_asserts=False</code>, such as: <code>disable_checks(enable_asserts=False)</code>.</p>"},{"location":"usage/#local-configuration","title":"Local configuration","text":"<p>You can also adjust settings within a method chain by bookending the chain, like this:</p> <pre><code># Customize format during one method chain\n(\n    iris\n    .check.set_format(precision=7, use_emojis=False)\n    ... # Any .check methods in here will use the new format\n    .check.reset_format() # Restore default format\n)\n\n# Turn off Pandas Checks during one method chain\n(\n    iris\n    .check.disable_checks()\n    ... # Any .check methods in here will not be run\n    .check.enable_checks() # Turn it back on for the next code\n)\n</code></pre>"},{"location":"usage/#hybrid-eda-production-data-processing","title":"Hybrid EDA-Production data processing","text":"<p>Exploratory Data Analysis is often taught as a one-time step we do to plan our production data processing. But sometimes EDA is a cyclical process we go back to for deeper inspection during debugging, code edits, or changes in the input data. If explorations were useful in EDA, they may be useful again.</p> <p>Unfortunately, it's hard to go back to EDA. It's too out of sync. The prod data processing pipeline has usually evolved too much, making the EDA code a historical artifact full of cobwebs that we can't easily fire up again. </p> <p>But if you use Pandas Checks during EDA, you could roll your <code>.check</code> methods into your first production code. Then in prod mode, disable Pandas Checks when you don't need it, to save compute and streamline output. When you ever need to pull out those EDA tools, enable Pandas Checks globally or locally.  </p> <p>This can make your prod pipline more transparent and easier to inspect.  </p>"},{"location":"API%20reference/DataFrameChecks/","title":"DataFrame methods","text":""},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks._obj","title":"<code>_obj = pandas_obj</code>  <code>instance-attribute</code>","text":""},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.__init__","title":"<code>__init__(pandas_obj)</code>","text":""},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_data","title":"<code>assert_data(condition, subset=None, pass_message=' \u2714\ufe0f Assertion passed ', fail_message=' \u3128 Assertion failed ', raise_exception=True, exception_to_raise=DataError, message_shows_condition=True, verbose=False)</code>","text":"<p>Tests whether Dataframe meets condition. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>condition</code> <code>Callable</code> <p>Assertion criteria in the form of a lambda function, such as <code>lambda df: df.shape[0]&gt;10</code>.</p> required <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. Applied after fn. Subsetting can also be done within the <code>condition</code>, such as <code>lambda df: df['column_name'].sum()&gt;10</code></p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assertion passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assertion failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>message_shows_condition</code> <code>bool</code> <p>Whether the fail/pass message should also print the assertion criteria</p> <code>True</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_datetime","title":"<code>assert_datetime(subset=None, pass_message=' \u2714\ufe0f Assert datetime passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is datetime or timestamp. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert datetime passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_float","title":"<code>assert_float(subset=None, pass_message=' \u2714\ufe0f Assert float passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is floats. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert float passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_greater_than","title":"<code>assert_greater_than(min, or_equal_to=True, subset=None, pass_message=' \u2714\ufe0f Assert minimum passed ', fail_message=' \u3128 Assert minimum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is &gt; or &gt;= a value. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>min</code> <code>Any</code> <p>the minimum value to compare DataFrame to. Accepts any type that can be used in &gt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &gt;= min (True) or &gt; min (False)</p> <code>True</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert minimum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert minimum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_int","title":"<code>assert_int(subset=None, pass_message=' \u2714\ufe0f Assert integeer passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is integers. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert integeer passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_less_than","title":"<code>assert_less_than(max, or_equal_to=True, subset=None, pass_message=' \u2714\ufe0f Assert maximum passed ', fail_message=' \u3128 Assert maximum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is &lt; or &lt;= a value. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>max</code> <code>Any</code> <p>the max value to compare DataFrame to. Accepts any type that can be used in &lt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &lt;= min (True) or &lt; max (False)</p> <code>True</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert maximum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert maximum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_negative","title":"<code>assert_negative(subset=None, assert_not_null=True, pass_message=' \u2714\ufe0f Assert negative passed ', fail_message=' \u3128 Assert negative failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has all negative values. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against.`</p> <code>None</code> <code>assert_not_null</code> <code>bool</code> <p>Whether to also enforce that data has no nulls.</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert negative passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert negative failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_not_null","title":"<code>assert_not_null(subset=None, pass_message=' \u2714\ufe0f Assert no nulls passed ', fail_message=' \u3128 Assert no nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has no nulls. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert no nulls passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert no nulls failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_null","title":"<code>assert_null(subset=None, pass_message=' \u2714\ufe0f Assert all nulls passed ', fail_message=' \u3128 Assert all nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has all nulls. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert all nulls passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert all nulls failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_positive","title":"<code>assert_positive(subset=None, assert_not_null=True, pass_message=' \u2714\ufe0f Assert positive passed ', fail_message=' \u3128 Assert positive failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has all positive values. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>assert_not_null</code> <code>bool</code> <p>Whether to also enforce that data has no nulls.</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert positive passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert positive failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_str","title":"<code>assert_str(subset=None, pass_message=' \u2714\ufe0f Assert string passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is strings. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert string passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_timedelta","title":"<code>assert_timedelta(subset=None, pass_message=' \u2714\ufe0f Assert timedelta passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is of type timedelta. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert timedelta passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_type","title":"<code>assert_type(dtype, subset=None, pass_message=' \u2714\ufe0f Assert type passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns meets type assumption. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>type</code> <p>The required variable type</p> required <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert type passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_unique","title":"<code>assert_unique(subset=None, pass_message=' \u2714\ufe0f Assert unique passed ', fail_message=' \u3128 Assert unique failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has no duplicate rows. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert unique passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert unique failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.columns","title":"<code>columns(fn=lambda df: df, subset=None, check_name='\ud83c\udfdb\ufe0f Columns')</code>","text":"<p>Prints the column names of a DataFrame, without modifying the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before printing columns. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before printing their names. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83c\udfdb\ufe0f Columns'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.describe","title":"<code>describe(fn=lambda df: df, subset=None, check_name='\ud83d\udccf Distributions', **kwargs)</code>","text":"<p>Displays descriptive statistics about a DataFrame without modifying the DataFrame itself.</p> <p>See Pandas docs for describe() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas describe(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas describe(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\udccf Distributions'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas describe() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.disable_checks","title":"<code>disable_checks(enable_asserts=True)</code>","text":"<p>Turns off Pandas Checks globally, such as in production mode. Calls to .check functions will not be run. Does not modify the DataFrame itself.</p> <p>Args     enable_assert: Optionally, whether to also enable or disable assert statements</p> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.dtypes","title":"<code>dtypes(fn=lambda df: df, subset=None, check_name='\ud83d\uddc2\ufe0f Data types')</code>","text":"<p>Displays the data types of a DataFrame's columns without modifying the DataFrame itself.</p> <p>See Pandas docs for dtypes for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas dtypes. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas .dtypes. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\uddc2\ufe0f Data types'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.enable_checks","title":"<code>enable_checks(enable_asserts=True)</code>","text":"<p>Globally enables Pandas Checks. Subequent calls to .check methods will be run. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Optionally, whether to globally enable or disable calls to .check.assert_data().</p> <code>True</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.function","title":"<code>function(fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Applies an arbitrary function on a DataFrame and shows the result, without modifying the DataFrame itself.</p> Example <p>.check.function(fn=lambda df: df.shape[0]&gt;10, check_name='Has at least 10 rows?') which will result in 'True' or 'False'</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>A lambda function to apply to the DataFrame. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas describe(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.get_mode","title":"<code>get_mode(check_name='\ud83d\udc3c\ud83e\ude7a Pandas Checks mode')</code>","text":"<p>Displays the current values of Pandas Checks global options enable_checks and enable_asserts. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check. Will be used as a preface the printed result.</p> <code>'\ud83d\udc3c\ud83e\ude7a Pandas Checks mode'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.head","title":"<code>head(n=5, fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Displays the first n rows of a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for head() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>The number of rows to display.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas head(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas head(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.hist","title":"<code>hist(fn=lambda df: df, subset=[], check_name=None, **kwargs)</code>","text":"<p>Displays a histogram for the DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for hist() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas hist(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas hist(). Applied after fn.</p> <code>[]</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas hist() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>If more than one column is passed, displays a grid of histograms</p> <p>Only renders in interactive mode (IPython/Jupyter), not in terminal</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.info","title":"<code>info(fn=lambda df: df, subset=None, check_name='\u2139\ufe0f Info', **kwargs)</code>","text":"<p>Displays summary information about a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for info() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas info(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas info(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2139\ufe0f Info'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas info() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.memory_usage","title":"<code>memory_usage(fn=lambda df: df, subset=None, check_name='\ud83d\udcbe Memory usage', **kwargs)</code>","text":"<p>Displays the memory footprint of a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for memory_usage() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas memory_usage(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas memory_usage(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcbe Memory usage'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas info() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>Include argument <code>deep=True</code> to get further memory usage of object dtypes in the DataFrame. See Pandas docs for memory_usage() for more info.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.ncols","title":"<code>ncols(fn=lambda df: df, subset=None, check_name='\ud83c\udfdb\ufe0f Columns')</code>","text":"<p>Displays the number of columns in a DataFrame, without modifying the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of columns. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before counting the number of columns. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83c\udfdb\ufe0f Columns'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.ndups","title":"<code>ndups(fn=lambda df: df, subset=None, check_name=None, **kwargs)</code>","text":"<p>Displays the number of duplicated rows in a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for duplicated() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of duplicates. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before counting duplicate rows. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas duplicated() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.nnulls","title":"<code>nnulls(fn=lambda df: df, subset=None, by_column=True, check_name='\ud83d\udc7b Rows with NaNs')</code>","text":"<p>Displays the number of rows with null values in a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for isna() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of rows with a null. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before counting nulls.</p> <code>None</code> <code>by_column</code> <code>bool</code> <p>If True, count null values with each column separately. If False, count rows with a null value in any column. Applied after fn.</p> <code>True</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udc7b Rows with NaNs'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.nrows","title":"<code>nrows(fn=lambda df: df, subset=None, check_name='\u2630 Rows')</code>","text":"<p>Displays the number of rows in a DataFrame, without modifying the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of rows. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are considered when counting rows. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2630 Rows'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.nunique","title":"<code>nunique(column, fn=lambda df: df, check_name=None, **kwargs)</code>","text":"<p>Displays the number of unique rows in a single column, without modifying the DataFrame itself.</p> <p>See Pandas docs for nunique() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>column</code> <code>str</code> <p>The name of a column to count uniques in. Applied after fn.</p> required <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas nunique(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas nunique() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.plot","title":"<code>plot(fn=lambda df: df, subset=None, check_name='', **kwargs)</code>","text":"<p>Displays a plot of the DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for plot() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas plot(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are plotted. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional title for the plot.</p> <code>''</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas plot() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>Plots are only displayed when code is run in IPython/Jupyter, not in terminal.</p> <p>If you pass a 'title' kwarg, it becomes the plot title, overriding check_name</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.print","title":"<code>print(object=None, fn=lambda df: df, subset=None, check_name=None, max_rows=10)</code>","text":"<p>Displays text, another object, or (by default) the current DataFrame's head. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>object</code> <code>Any</code> <p>Object to print. Can be anything printable: str, int, list, another DataFrame, etc. If None, print the DataFrame's head (with <code>max_rows</code> rows).</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before printing <code>object</code>. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are printed. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>max_rows</code> <code>int</code> <p>Maximum number of rows to print if object=None.</p> <code>10</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.print_time_elapsed","title":"<code>print_time_elapsed(start_time, lead_in='Time elapsed', units='auto')</code>","text":"<p>Displays the time elapsed since start_time.</p> <p>Parameters:</p> Name Type Description Default <code>start_time</code> <code>float</code> <p>The index time when the stopwatch started, which comes from the Pandas Checks start_timer()</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to print before the elapsed time.</p> <code>'Time elapsed'</code> <code>units</code> <code>str</code> <p>The units in which to display the elapsed time. Can be \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <code>'auto'</code> <p>Raises:</p> Type Description <code>ValueError</code> <p>If <code>units</code> is not one of \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.reset_format","title":"<code>reset_format()</code>","text":"<p>Globally restores all Pandas Checks formatting options to their default \"factory\" settings. Does not modify the DataFrame itself.</p> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.set_format","title":"<code>set_format(**kwargs)</code>","text":"<p>Configures selected formatting options for Pandas Checks. Does not modify the DataFrame itself.</p> <p>Run pandas_checks.describe_options() to see a list of available options.</p> <p>For example, .check.set_format(check_text_tag= \"h1\", use_emojis=False`) will globally change Pandas Checks to display text results as H1 headings and remove all emojis.</p> <p>Parameters:</p> Name Type Description Default <code>**kwargs</code> <code>Any</code> <p>Pairs of setting name and its new value.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.set_mode","title":"<code>set_mode(enable_checks, enable_asserts)</code>","text":"<p>Configures the operation mode for Pandas Checks globally. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_checks</code> <code>bool</code> <p>Whether to run any Pandas Checks methods globally. Does not affect .check.assert_data().</p> required <code>enable_asserts</code> <code>bool</code> <p>Whether to run calls to Pandas Checks .check.assert_data() statements globally.</p> required <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.shape","title":"<code>shape(fn=lambda df: df, subset=None, check_name='\ud83d\udcd0 Shape')</code>","text":"<p>Displays the Dataframe's dimensions, without modifying the DataFrame itself.</p> <p>See Pandas docs for shape for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas <code>shape</code>. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are considered when printing the shape. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcd0 Shape'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>See also .check.nrows() and .check.ncols()</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.tail","title":"<code>tail(n=5, fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Displays the last n rows of the DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for tail() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>Number of rows to show.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas tail(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are displayed. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.unique","title":"<code>unique(column, fn=lambda df: df, check_name=None)</code>","text":"<p>Displays the unique values in a column, without modifying the DataFrame itself.</p> <p>See Pandas docs for unique() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>column</code> <code>str</code> <p>Column to check for unique values.</p> required <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before calling Pandas unique(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p><code>fn</code> is applied to the dataframe before selecting <code>column</code>. If you want to select the column before modifying it, set <code>column=None</code> and start <code>fn</code> with a column selection, i.e. <code>fn=lambda df: df[\"my_column\"].stuff()</code></p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.value_counts","title":"<code>value_counts(column, fn=lambda df: df, max_rows=10, check_name=None, **kwargs)</code>","text":"<p>Displays the value counts for a column, without modifying the DataFrame itself.</p> <p>See Pandas docs for value_counts() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>column</code> <code>str</code> <p>Column to check for value counts.</p> required <code>max_rows</code> <code>int</code> <p>Maximum number of rows to show in the value counts.</p> <code>10</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas value_counts(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas value_counts() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p><code>fn</code> is applied to the dataframe before selecting <code>column</code>. If you want to select the column before modifying it, set <code>column=None</code> and start <code>fn</code> with a column selection, i.e. <code>fn=lambda df: df[\"my_column\"].stuff()</code></p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.write","title":"<code>write(path, format=None, fn=lambda df: df, subset=None, verbose=False, **kwargs)</code>","text":"<p>Exports DataFrame to file, without modifying the DataFrame itself.</p> <p>Format is inferred from path extension like .csv.</p> <p>This functions uses the corresponding Pandas export function such as to_csv(). See Pandas docs for those functions for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>path</code> <code>str</code> <p>Path to write the file to.</p> required <code>format</code> <code>Union[str, None]</code> <p>Optional file format to force for the export. If None, format is inferred from the file's extension in <code>path</code>.</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before exporting. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are exported. Applied after fn.</p> <code>None</code> <code>verbose</code> <code>bool</code> <p>Whether to print a message when the file is written.</p> <code>False</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional keyword arguments to pass to the Pandas export function (.to_csv).</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>Exporting to some formats such as Excel, Feather, and Parquet may require you to install additional packages.</p>"},{"location":"API%20reference/SeriesChecks/","title":"Series methods","text":""},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks._obj","title":"<code>_obj = pandas_obj</code>  <code>instance-attribute</code>","text":""},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.__init__","title":"<code>__init__(pandas_obj)</code>","text":""},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_data","title":"<code>assert_data(condition, pass_message=' \u2714\ufe0f Assertion passed ', fail_message=' \u3128 Assertion failed ', raise_exception=True, exception_to_raise=DataError, message_shows_condition=True, verbose=False)</code>","text":"<p>Tests whether Series meets condition. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>condition</code> <code>Callable</code> <p>Assertion criteria in the form of a lambda function, such as <code>lambda s: s.shape[0]&gt;10</code>.</p> required <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assertion passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assertion failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>message_shows_condition</code> <code>bool</code> <p>Whether the fail/pass message should also print the assertion criteria</p> <code>True</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_datetime","title":"<code>assert_datetime(pass_message=' \u2714\ufe0f Assert datetime passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is datetime or timestamp. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert datetime passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_float","title":"<code>assert_float(pass_message=' \u2714\ufe0f Assert float passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is floats. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert float passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_greater_than","title":"<code>assert_greater_than(min, or_equal_to=True, pass_message=' \u2714\ufe0f Assert minimum passed ', fail_message=' \u3128 Assert minimum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series is &gt; or &gt;= a value. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>min</code> <code>Any</code> <p>the minimum value to compare Series to. Accepts any type that can be used in &gt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &gt;= min (True) or &gt; min (False)</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert minimum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert minimum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_int","title":"<code>assert_int(pass_message=' \u2714\ufe0f Assert integeer passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is integers. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_less_than","title":"<code>assert_less_than(max, or_equal_to=True, pass_message=' \u2714\ufe0f Assert maximum passed ', fail_message=' \u3128 Assert maximum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series is &lt; or &lt;= a value. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>max</code> <code>Any</code> <p>the max value to compare Series to. Accepts any type that can be used in &lt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &lt;= min (True) or &lt; max (False)</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert maximum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert maximum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_negative","title":"<code>assert_negative(assert_not_null=True, pass_message=' \u2714\ufe0f Assert negative passed ', fail_message=' \u3128 Assert negative failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has all negative values. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>assert_not_null</code> <code>bool</code> <p>Whether to also enforce that data has no nulls.</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert negative passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert negative failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_not_null","title":"<code>assert_not_null(pass_message=' \u2714\ufe0f Assert no nulls passed ', fail_message=' \u3128 Assert no nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has no nulls. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_null","title":"<code>assert_null(pass_message=' \u2714\ufe0f Assert all nulls passed ', fail_message=' \u3128 Assert all nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has all nulls. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_positive","title":"<code>assert_positive(assert_not_null=True, pass_message=' \u2714\ufe0f Assert positive passed ', fail_message=' \u3128 Assert positive failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has all positive values. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>assert_not_null: Whether to also enforce that data has no nulls.\npass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_str","title":"<code>assert_str(pass_message=' \u2714\ufe0f Assert string passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is strings. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_timedelta","title":"<code>assert_timedelta(pass_message=' \u2714\ufe0f Assert timedelta passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is of type timedelta. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_type","title":"<code>assert_type(dtype, pass_message=' \u2714\ufe0f Assert type passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series meets type assumption. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>type</code> <p>The required variable type</p> required <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert type passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_unique","title":"<code>assert_unique(pass_message=' \u2714\ufe0f Assert unique passed ', fail_message=' \u3128 Assert unique failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has no duplicate rows. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.describe","title":"<code>describe(fn=lambda s: s, check_name='\ud83d\udccf Distribution', **kwargs)</code>","text":"<p>Displays descriptive statistics about a Series, without modifying the Series itself.</p> <p>See Pandas docs for describe() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas describe(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\udccf Distribution'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas describe() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.disable_checks","title":"<code>disable_checks(enable_asserts=True)</code>","text":"<p>Turns off Pandas Checks globally, such as in production mode. Calls to .check functions will not be run. Does not modify the Series itself.</p> <p>Args     enable_assert: Optionally, whether to also enable or disable assert statements</p> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.dtype","title":"<code>dtype(fn=lambda s: s, check_name='\ud83d\uddc2\ufe0f Data type')</code>","text":"<p>Displays the data type of a Series, without modifying the Series itself.</p> <p>See Pandas docs for .dtype for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas dtype. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\uddc2\ufe0f Data type'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.enable_checks","title":"<code>enable_checks(enable_asserts=True)</code>","text":"<p>Globally enables Pandas Checks. Subequent calls to .check methods will be run. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Optionally, whether to globally enable or disable calls to .check.assert_data().</p> <code>True</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.function","title":"<code>function(fn=lambda s: s, check_name=None)</code>","text":"<p>Applies an arbitrary function on a Series and shows the result, without modifying the Series itself.</p> Example <p>.check.function(fn=lambda s: s.shape[0]&gt;10, check_name='Has at least 10 rows?') which will result in 'True' or 'False'</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>The lambda function to apply to the Series. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.get_mode","title":"<code>get_mode(check_name='\u2699\ufe0f Pandas Checks mode')</code>","text":"<p>Displays the current values of Pandas Checks global options enable_checks and enable_asserts. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check. Will be used as a preface the printed result.</p> <code>'\u2699\ufe0f Pandas Checks mode'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.head","title":"<code>head(n=5, fn=lambda s: s, check_name=None)</code>","text":"<p>Displays the first n rows of a Series, without modifying the Series itself.</p> <p>See Pandas docs for head() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>The number of rows to display.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas head(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.hist","title":"<code>hist(fn=lambda s: s, check_name=None, **kwargs)</code>","text":"<p>Displays a histogram for the Series's distribution, without modifying the Series itself.</p> <p>See Pandas docs for hist() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas head(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas hist() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Plots are only displayed when code is run in IPython/Jupyter, not in terminal.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.info","title":"<code>info(fn=lambda s: s, check_name='\u2139\ufe0f Series info', **kwargs)</code>","text":"<p>Displays summary information about a Series, without modifying the Series itself.</p> <p>See Pandas docs for info() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas info(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2139\ufe0f Series info'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas info() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.memory_usage","title":"<code>memory_usage(fn=lambda s: s, check_name='\ud83d\udcbe Memory usage', **kwargs)</code>","text":"<p>Displays the memory footprint of a Series, without modifying the Series itself.</p> <p>See Pandas docs for memory_usage() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas memory_usage(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcbe Memory usage'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas memory_usage() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Include argument <code>deep=True</code> to get further memory usage of object dtypes. See Pandas docs for memory_usage() for more info.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.ndups","title":"<code>ndups(fn=lambda s: s, check_name=None, **kwargs)</code>","text":"<p>Displays the number of duplicated rows in the Series, without modifying the Series itself.</p> <p>See Pandas docs for duplicated() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before counting the number of duplicates. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas duplicated() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.nnulls","title":"<code>nnulls(fn=lambda s: s, check_name='\ud83d\udc7b Rows with NaNs')</code>","text":"<p>Displays the number of rows with null values in the Series, without modifying the Series itself.</p> <p>See Pandas docs for isna() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before counting rows with nulls. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udc7b Rows with NaNs'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.nrows","title":"<code>nrows(fn=lambda s: s, check_name='\u2630 Rows')</code>","text":"<p>Displays the number of rows in a Series, without modifying the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before counting the number of rows. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2630 Rows'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.nunique","title":"<code>nunique(fn=lambda s: s, check_name=None, **kwargs)</code>","text":"<p>Displays the number of unique rows in a Series, without modifying the Series itself.</p> <p>See Pandas docs for nunique() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas nunique(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas nunique() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.plot","title":"<code>plot(fn=lambda s: s, check_name='', **kwargs)</code>","text":"<p>Displays a plot of the Series, without modifying the Series itself.</p> <p>See Pandas docs for plot() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas plot(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional title for the plot.</p> <code>''</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas plot() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Plots are only displayed when code is run in IPython/Jupyter, not in terminal.</p> <p>If you pass a 'title' kwarg, it becomes the plot title, overriding check_name</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.print","title":"<code>print(object=None, fn=lambda s: s, check_name=None, max_rows=10)</code>","text":"<p>Displays text, another object, or (by default) the current DataFrame's head. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>object</code> <code>Any</code> <p>Object to print. Can be anything printable: str, int, list, another DataFrame, etc. If None, print the Series's head (with <code>max_rows</code> rows).</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before printing <code>object</code>. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>max_rows</code> <code>int</code> <p>Maximum number of rows to print if object=None.</p> <code>10</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.print_time_elapsed","title":"<code>print_time_elapsed(start_time, lead_in='Time elapsed', units='auto')</code>","text":"<p>Displays the time elapsed since start_time.</p> <p>Args: start_time: The index time when the stopwatch started, which comes from the Pandas Checks start_timer() lead_in: Optional text to print before the elapsed time. units: The units in which to display the elapsed time. Can be \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <p>Raises:</p> Type Description <code>ValueError</code> <p>If <code>units</code> is not one of \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.reset_format","title":"<code>reset_format()</code>","text":"<p>Globally restores all Pandas Checks formatting options to their default \"factory\" settings. Does not modify the Series itself.</p> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.set_format","title":"<code>set_format(**kwargs)</code>","text":"<p>Configures selected formatting options for Pandas Checks. Run pandas_checks.describe_options() to see a list of available options. Does not modify the Series itself</p> <p>For example, .check.set_format(check_text_tag= \"h1\", use_emojis=False`) will globally change Pandas Checks to display text results as H1 headings and remove all emojis.</p> <p>Parameters:</p> Name Type Description Default <code>**kwargs</code> <code>Any</code> <p>Pairs of setting name and its new value.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.set_mode","title":"<code>set_mode(enable_checks, enable_asserts)</code>","text":"<p>Configures the operation mode for Pandas Checks globally. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_checks</code> <code>bool</code> <p>Whether to run any Pandas Checks methods globally. Does not affect .check.assert_data().</p> required <code>enable_asserts</code> <code>bool</code> <p>Whether to run calls to Pandas Checks .check.assert_data() globally.</p> required <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.shape","title":"<code>shape(fn=lambda s: s, check_name='\ud83d\udcd0 Shape')</code>","text":"<p>Displays the Series's dimensions, without modifying the Series itself.</p> <p>See Pandas docs for <code>shape</code> for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas <code>shape</code>. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcd0 Shape'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>See also .check.nrows()</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.tail","title":"<code>tail(n=5, fn=lambda s: s, check_name=None)</code>","text":"<p>Displays the last n rows of the Series, without modifying the Series itself.</p> <p>See Pandas docs for tail() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>Number of rows to show.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas tail(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.unique","title":"<code>unique(fn=lambda s: s, check_name=None)</code>","text":"<p>Displays the unique values in a Series, without modifying the Series itself.</p> <p>See Pandas docs for unique() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas unique(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.value_counts","title":"<code>value_counts(fn=lambda s: s, max_rows=10, check_name=None, **kwargs)</code>","text":"<p>Displays the value counts for a Series, without modifying the Series itself.</p> <p>See Pandas docs for value_counts() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>max_rows</code> <code>int</code> <p>Maximum number of rows to show in the value counts.</p> <code>10</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas value_counts(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas value_counts() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.write","title":"<code>write(path, format=None, fn=lambda s: s, verbose=False, **kwargs)</code>","text":"<p>Exports Series to file, without modifying the Series itself.</p> <p>Format is inferred from path extension like .csv.</p> <p>This functions uses the corresponding Pandas export function such as to_csv(). See Pandas docs for those functions for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>path</code> <code>str</code> <p>Path to write the file to.</p> required <code>format</code> <code>Union[str, None]</code> <p>Optional file format to force for the export. If None, format is inferred from the file's extension in <code>path</code>.</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before exporting. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>verbose</code> <code>bool</code> <p>Whether to print a message when the file is written.</p> <code>False</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional keyword arguments to pass to the Pandas export function (.to_csv).</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Exporting to some formats such as Excel, Feather, and Parquet may require you to install additional packages.</p>"},{"location":"API%20reference/display/","title":"Display","text":"<p>Utilities for displaying text, tables, and plots in Pandas Checks in both terminal and IPython/Jupyter environments.</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_check","title":"<code>_display_check(data, name=None)</code>","text":"<p>Renders the result of a Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>The data to display.</p> required <code>name</code> <code>Union[str, None]</code> <p>The optional name of the check.</p> <code>None</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_line","title":"<code>_display_line(line, lead_in=None, colors={})</code>","text":"<p>Displays a line of text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>The optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color options for the text and lead-in text. See syntax in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_plot","title":"<code>_display_plot()</code>","text":"<p>Renders the active Pandas Checks matplotlib plot object in an IPython/Jupyter environment with an optional indent.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p> Note <p>It assumes the plot has already been drawn by another function, such as with .plot() or .hist().</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_plot_title","title":"<code>_display_plot_title(line, lead_in=None, colors={})</code>","text":"<p>Displays a plot title with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The title text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to display before the title.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color settings for the text and lead-in text. See details in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_table","title":"<code>_display_table(table)</code>","text":"<p>Renders a Pandas DataFrame or Series in an IPython/Jupyter environment with an optional indent.</p> <p>Parameters:</p> Name Type Description Default <code>table</code> <code>Union[DataFrame, Series]</code> <p>The DataFrame or Series to display.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_table_title","title":"<code>_display_table_title(line, lead_in=None, colors={})</code>","text":"<p>Displays a table title with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The title text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to display before the title.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optiona dictionary containing color options for the text and lead-in text. See details in docstring for _render_text()</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._filter_emojis","title":"<code>_filter_emojis(text)</code>","text":"<p>Removes emojis from text if user has globally forbidden them.</p> <p>Parameters:</p> Name Type Description Default <code>text</code> <code>str</code> <p>The text to filter emojis from.</p> required <p>Returns:</p> Type Description <code>str</code> <p>The text with emojis removed if the user's global settings do not allow emojis. Else, the original text.</p>"},{"location":"API%20reference/display/#pandas_checks.display._format_background_color","title":"<code>_format_background_color(color)</code>","text":"<p>Applies a background color to text used being displayed in the terminal.</p> <p>Parameters:</p> Name Type Description Default <code>color</code> <code>str</code> <p>The background color to format. See syntax in docstring for _render_text().</p> required <p>Returns:</p> Type Description <code>str</code> <p>The formatted background color.</p>"},{"location":"API%20reference/display/#pandas_checks.display._lead_in","title":"<code>_lead_in(lead_in, foreground, background)</code>","text":"<p>Formats a lead-in text with colors.</p> <p>Parameters:</p> Name Type Description Default <code>lead_in</code> <code>Union[str, None]</code> <p>The lead-in text to format.</p> required <code>foreground</code> <code>str</code> <p>The foreground color for the lead-in text. See syntax in docstring for _render_text().</p> required <code>background</code> <code>str</code> <p>The background color for the lead-in text. See syntax in docstring for _render_text().</p> required <p>Returns:</p> Type Description <code>str</code> <p>The formatted lead-in text.</p>"},{"location":"API%20reference/display/#pandas_checks.display._print_table_terminal","title":"<code>_print_table_terminal(table)</code>","text":"<p>Prints a Pandas table in a terminal with an optional indent.</p> <p>Parameters:</p> Name Type Description Default <code>table</code> <code>Union[DataFrame, Series]</code> <p>A DataFrame or Series.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._render_html_with_indent","title":"<code>_render_html_with_indent(object_as_html)</code>","text":"<p>Renders HTML with an optional indent.</p> <p>Parameters:</p> Name Type Description Default <code>object_as_html</code> <code>str</code> <p>The HTML to render.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._render_text","title":"<code>_render_text(text, tag, lead_in=None, colors={})</code>","text":"<p>Renders text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>text</code> <code>str</code> <p>The text to render.</p> required <code>tag</code> <code>str</code> <p>The HTML tag to use for rendering.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>Optional colors for the text and lead-in text. Keys include:     - text_color: The foreground color of the main text.     - text_background_color: The background or highlight color of the main text.     - lead_in_text_color: The foreground color of lead-in text.     - lead_in_background_color: The background color of lead-in text. Color values are phrased such as \"blue\" or \"white\". They are passed to either HTML     for Jupyter/IPython outputs and to <code>termcolor</code> when code is run in terminal.     For color options when code is run in terminal, see         https://github.com/termcolor/termcolor.</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._warning","title":"<code>_warning(message, lead_in='\ud83d\udc3c\ud83e\ude7a Pandas Checks warning', clean_type=False)</code>","text":"<p>Displays a warning message.</p> <p>Parameters:</p> Name Type Description Default <code>message</code> <code>str</code> <p>The warning message to display.</p> required <code>lead_in</code> <code>str</code> <p>Optional lead-in text to display before the warning message.</p> <code>'\ud83d\udc3c\ud83e\ude7a Pandas Checks warning'</code> <code>clean_type</code> <code>bool</code> <p>Optional flag to remove the class type from the message, when running .check.dtype().</p> <code>False</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/","title":"Options","text":"<p>Utilities for configuring Pandas Checks options.</p> <p>This module provides functions for setting and managing global options for Pandas Checks, including formatting and disabling checks and assertions.</p>"},{"location":"API%20reference/options/#pandas_checks.options._initialize_format_options","title":"<code>_initialize_format_options(options=None)</code>","text":"<p>Initializes or resets Pandas Checks formatting options.</p> <p>Parameters:</p> Name Type Description Default <code>options</code> <code>Union[List[str], None]</code> <p>A list of option names to initialize or reset. If None, all formatting options will be initialized or reset.</p> <code>None</code> <p>Returns:     None</p> Note <p>We separate this function from _initialize_options() so user can reset just formatting without changing mode</p>"},{"location":"API%20reference/options/#pandas_checks.options._initialize_options","title":"<code>_initialize_options()</code>","text":"<p>Initializes (or resets) all Pandas Checks options to their default values.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p> Note <p>We separate this function from _initialize_format_options() so user can reset just formatting if desired without changing mode</p>"},{"location":"API%20reference/options/#pandas_checks.options._register_option","title":"<code>_register_option(name, default_value, description, validator)</code>","text":"<p>Registers a Pandas Checks option in the global Pandas context manager.</p> <p>If the option has already been registered, reset its value.</p> <p>This method enables setting global formatting for Pandas Checks results and storing variables that will persist across Pandas method chains, which return newly initialized DataFrames at each method (and so reset the DataFrame's attributes).</p> <p>Parameters:</p> Name Type Description Default <code>name</code> <code>str</code> <p>The name of the option to register.</p> required <code>default_value</code> <code>Any</code> <p>The default value for the option.</p> required <code>description</code> <code>str</code> <p>A description of the option.</p> required <code>validator</code> <code>Callable</code> <p>A function to validate the option value.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p> Note <p>For more details on the arguments, see the documentation for pandas._config.config.register_option()</p>"},{"location":"API%20reference/options/#pandas_checks.options._set_option","title":"<code>_set_option(option, value)</code>","text":"<p>Updates the value of a Pandas Checks option in the global Pandas context manager.</p> <p>Parameters:</p> Name Type Description Default <code>option</code> <code>str</code> <p>The name of the option to set.</p> required <code>value</code> <code>Any</code> <p>The value to set for the option.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p> <p>Raises:</p> Type Description <code>AttributeError</code> <p>If the <code>option</code> is not a valid Pandas Checks option.</p>"},{"location":"API%20reference/options/#pandas_checks.options.describe_options","title":"<code>describe_options()</code>","text":"<p>Prints all global options for Pandas Checks, their default values, and current values.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.disable_checks","title":"<code>disable_checks(enable_asserts=True)</code>","text":"<p>Turns off all calls to Pandas Checks methods and optionally enables or disables check.assert_data(). Does not modify the DataFrame itself.</p> <p>If this function is called, subequent calls to .check functions will not be run.</p> <p>Typically used to     1) Globally switch off Pandas Checks, such as during production. or     2) Temporarily switch off Pandas Checks, such as for a stable part of a notebook.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Whether to also run calls to Pandas Checks .check.assert_data()</p> <code>True</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.enable_checks","title":"<code>enable_checks(enable_asserts=True)</code>","text":"<p>Turns on Pandas Checks globally. Subsequent calls to .check methods will be run.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Whether to also enable or disable check.assert_data().</p> <code>True</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.get_mode","title":"<code>get_mode()</code>","text":"<p>Returns whether Pandas Checks is currently running checks and assertions.</p> <p>Returns:</p> Type Description <code>Dict[str, bool]</code> <p>A dictionary containing the current settings.</p>"},{"location":"API%20reference/options/#pandas_checks.options.reset_format","title":"<code>reset_format()</code>","text":"<p>Globally restores all Pandas Checks formatting options to their default \"factory\" settings.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.set_format","title":"<code>set_format(**kwargs)</code>","text":"<p>Configures selected formatting options for Pandas Checks. Run pandas_checks.describe_options() to see a list of available options.</p> <p>For example, set_format(check_text_tag= \"h1\", use_emojis=False`) will globally change Pandas Checks to display text results as H1 headings and remove all emojis.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p> <p>Parameters:</p> Name Type Description Default <code>**kwargs</code> <code>Any</code> <p>Pairs of setting name and its new value.</p> <code>{}</code>"},{"location":"API%20reference/options/#pandas_checks.options.set_mode","title":"<code>set_mode(enable_checks, enable_asserts)</code>","text":"<p>Configures the operation mode for Pandas Checks globally.</p> <p>Parameters:</p> Name Type Description Default <code>enable_checks</code> <code>bool</code> <p>Whether to run any Pandas Checks methods globally. Does not affect .check.assert_data().</p> required <code>enable_asserts</code> <code>bool</code> <p>Whether to run calls to .check.assert_data() globally.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/run_checks/","title":"Run checks","text":"<p>Utilities for running Pandas Checks data checks.</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks._apply_modifications","title":"<code>_apply_modifications(data, fn=lambda df: df, subset=None)</code>","text":"<p>Applies user's modifications to a data object.</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>May be any Pandas DataFrame, Series, string, or other variable</p> required <code>fn</code> <code>Callable</code> <p>An optional lambda function to modify <code>data</code></p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Columns to subset after applying modifications</p> <code>None</code> <p>Returns:</p> Type Description <code>Any</code> <p>Modified and optionally subsetted data object.  If all arguments are defaults, data is returned unchanged.</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks._check_data","title":"<code>_check_data(data, check_fn=lambda df: df, modify_fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Runs a selected check on a data object</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>A Pandas DataFrame, Series, string, or other variable</p> required <code>check_fn</code> <code>Callable</code> <p>Function to apply to data for checking. For example if we're running .check.value_counts(), this function would appply the Pandas value_counts() method</p> <code>lambda df: df</code> <code>modify_fn</code> <code>Callable</code> <p>Optional function to modify data before checking</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Optional list of columns or name of column to subset data before running check_fn</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>Name to use when displaying check result</p> <code>None</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks._display_check","title":"<code>_display_check(data, name=None)</code>","text":"<p>Renders the result of a Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>The data to display.</p> required <code>name</code> <code>Union[str, None]</code> <p>The optional name of the check.</p> <code>None</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks.get_mode","title":"<code>get_mode()</code>","text":"<p>Returns whether Pandas Checks is currently running checks and assertions.</p> <p>Returns:</p> Type Description <code>Dict[str, bool]</code> <p>A dictionary containing the current settings.</p>"},{"location":"API%20reference/timer/","title":"Timer","text":"<p>Provides a timer utility for tracking the elapsed time of steps within a Pandas method chain.</p> <p>Note that these functions rely on the <code>pdchecks.enable_checks</code> option being enabled in the Pandas configuration, as it is by default.</p>"},{"location":"API%20reference/timer/#pandas_checks.timer._display_line","title":"<code>_display_line(line, lead_in=None, colors={})</code>","text":"<p>Displays a line of text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>The optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color options for the text and lead-in text. See syntax in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/timer/#pandas_checks.timer.get_mode","title":"<code>get_mode()</code>","text":"<p>Returns whether Pandas Checks is currently running checks and assertions.</p> <p>Returns:</p> Type Description <code>Dict[str, bool]</code> <p>A dictionary containing the current settings.</p>"},{"location":"API%20reference/timer/#pandas_checks.timer.print_time_elapsed","title":"<code>print_time_elapsed(start_time, lead_in='\u23f1\ufe0f Time elapsed', units='auto')</code>","text":"<p>Displays the time elapsed since start_time.</p> <p>Parameters:</p> Name Type Description Default <code>start_time</code> <code>float</code> <p>The index time when the stopwatch started, which comes from the Pandas Checks start_timer()</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to print before the elapsed time.</p> <code>'\u23f1\ufe0f Time elapsed'</code> <code>units</code> <code>str</code> <p>The units in which to display the elapsed time. Accepted values: - \"auto\" - \"milliseconds\", \"seconds\", \"minutes\", \"hours\" - \"ms\", \"s\", \"m\", \"h\"</p> <code>'auto'</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p> <p>Raises:</p> Type Description <code>ValueError</code> <p>If <code>units</code> is not one of expected time units</p> Note <p>If you change the default values for this function's argument, change them in <code>.check.print_time_elapsed</code> too in DataFrameChecks and SeriesChecks so they're exposed to the user.</p>"},{"location":"API%20reference/timer/#pandas_checks.timer.start_timer","title":"<code>start_timer(verbose=False)</code>","text":"<p>Starts a Pandas Checks stopwatch to measure run time between operations, such as steps in a Pandas method chain. Use print_elapsed_time() to get timings.</p> <p>Parameters:</p> Name Type Description Default <code>verbose</code> <code>bool</code> <p>Whether to print a message that the timer has started.</p> <code>False</code> <p>Returns:</p> Type Description <code>float</code> <p>Timestamp as a float</p>"},{"location":"API%20reference/utils/","title":"Utils","text":"<p>Utility functions for the pandas_checks package.</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._display_line","title":"<code>_display_line(line, lead_in=None, colors={})</code>","text":"<p>Displays a line of text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>The optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color options for the text and lead-in text. See syntax in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._has_nulls","title":"<code>_has_nulls(data, fail_message, raise_exception=True, exception_to_raise=DataError)</code>","text":"<p>Utility function to check for nulls as part of a larger check</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._is_type","title":"<code>_is_type(data, dtype)</code>","text":"<p>Utility function to check if a dataframe's columns or one series has an expected type. Includes special handling for strings, since 'object' type in Pandas may not mean a string</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._lambda_to_string","title":"<code>_lambda_to_string(lambda_func)</code>","text":"<p>Create a string representation of a lambda function.</p> <p>Parameters:</p> Name Type Description Default <code>lambda_func</code> <code>Callable</code> <p>An arbitrary function in lambda form</p> required <p>Returns:</p> Type Description <code>str</code> <p>A string version of lambda_func</p> Todo <p>This still returns all arguments to the calling function.     They get entangled with the argument when it's a lambda function.     Try other ways to get just the argument we want.</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._series_is_type","title":"<code>_series_is_type(s, dtype)</code>","text":"<p>Utility function to check if a series has an expected type. Includes special handling for strings, since 'object' type in Pandas may not mean a string</p>"}]}
\ No newline at end of file
+{"config":{"lang":["en"],"separator":"[\\s\\-]+","pipeline":["stopWordFilter"]},"docs":[{"location":"","title":"About","text":""},{"location":"#introduction","title":"Introduction","text":"<p>Pandas Checks is a Python library for data science and data engineering. It adds non-invasive health checks for Pandas method chains.</p>"},{"location":"#what-are-method-chains","title":"What are method chains?","text":"<p>Method chains are one of the coolest features of the Pandas library! They allow you to write more functional code with fewer intermediate variables and fewer side effects. If you're familiar with R, method chains are Python's version of dplyr pipes.</p>"},{"location":"#why-use-pandas-checks","title":"Why use Pandas Checks?","text":"<p>Pandas Checks adds the ability to inspect and validate your Pandas data at any point in the method chain, without modifying the underlying data. Think of Pandas Checks as a drone you can send up to check on your pipeline, whether it's in exploratory data analysis, prototyping, or production.</p> <p>That way you don't need to chop up a method chain, or create intermediate variables, every time you need to diagnose, treat, or prevent problems with your data processing pipeline.</p> <p>As Fleetwood Mac says, you would never break the chain.</p> <p></p>"},{"location":"#giving-feedback-and-contributing","title":"Giving feedback and contributing","text":"<p>If you run into trouble or have questions, I'd love to know. Please open an issue.</p> <p>Contributions are appreciated! Please open an issue or submit a pull request. Pandas Checks uses the wonderful libraries poetry for package and dependency management, nox for test automation, and mkdocs for docs.</p>"},{"location":"#license","title":"License","text":"<p>Pandas Checks is licensed under the BSD-3 License.</p> <p>\ud83d\udc3c\ud83e\ude7a</p>"},{"location":"usage/","title":"Usage","text":""},{"location":"usage/#installation","title":"Installation","text":"<p>First make Pandas Check available in your environment.</p> <pre><code>pip install pandas-checks\n</code></pre> <p>Then import it in your code. It works in Jupyter, IPython, and Python scripts run from the command line.</p> <pre><code>import pandas_checks\n</code></pre> <p>After importing, you don't need to access the <code>pandas_checks</code> module directly.</p> <p>\ud83d\udca1 Tip: You can import Pandas Checks either before or after your code imports Pandas. Just somewhere. \ud83d\ude01</p>"},{"location":"usage/#basic-usage","title":"Basic usage","text":"<p>Pandas Checks adds <code>.check</code> methods to Pandas DataFrames and Series. </p> <p>Say you have a nice function.</p> <pre><code>\ndef clean_iris_data(iris: pd.DataFrame) -&gt; pd.DataFrame:\n    \"\"\"Preprocess data about pretty flowers.\n\n    Args:\n        iris: The raw iris dataset.\n\n    Returns:\n        The cleaned iris dataset.\n    \"\"\"\n\n    return (\n        iris\n        .dropna() # Drop rows with any null values\n        .rename(columns={\"FLOWER_SPECIES\": \"species\"}) # Rename a column\n        .query(\"species=='setosa'\") # Filter to rows with a certain value\n    )\n</code></pre> <p>But what if you want to make the chain more robust? Or see what's happening to the data as it flows down the pipeline? Or understand why your new <code>iris</code> CSV suddenly makes the cleaned data look weird? </p> <p>You can add some <code>.check</code> steps.</p> <pre><code>\n(\n    iris\n    .dropna()\n    .rename(columns={\"FLOWER_SPECIES\": \"species\"})\n\n    # Validate assumptions\n    .check.assert_positive(subset=[\"petal_length\", \"sepal_length\"])\n\n    # Plot the distribution of a column after cleaning\n    .check.hist(column='petal_length') \n\n    .query(\"species=='setosa'\")\n\n    # Display the first few rows after cleaning\n    .check.head(3)  \n)\n</code></pre> <p>The <code>.check</code> methods will display the following results:</p> <p></p> <p>The <code>.check</code> methods didn't modify how the <code>iris</code> data is processed by your code. They just let you check the data as it flows down the pipeline. That's the difference between Pandas <code>.head()</code> and Pandas Checks <code>.check.head()</code>.</p>"},{"location":"usage/#features","title":"Features","text":""},{"location":"usage/#check-methods","title":"Check methods","text":"<p>Here's what's in the doctor's bag.</p> <p>Describe     - Standard Pandas methods:         - <code>.check.columns()</code> - DataFrame | Series         - <code>.check.dtypes()</code> for DataFrame | <code>.check.dtype()</code> for Series         - <code>.check.describe()</code> - DataFrame | Series         - <code>.check.head()</code> - DataFrame | Series         - <code>.check.info()</code> - DataFrame | Series         - <code>.check.memory_usage()</code> - DataFrame | Series         - <code>.check.nunique()</code> - DataFrame | Series         - <code>.check.shape()</code> - DataFrame | Series         - <code>.check.tail()</code> - DataFrame | Series         - <code>.check.unique()</code> - DataFrame | Series         - <code>.check.value_counts()</code> - DataFrame | Series     - New functions in Pandas Checks:         - <code>.check.function()</code>: Apply an arbitrary lambda function to your data and see the result - DataFrame | Series         - <code>.check.ncols()</code>: Count columns - DataFrame | Series         - <code>.check.ndups()</code>: Count rows with duplicate values - DataFrame | Series         - <code>.check.nnulls()</code>: Count rows with null values - DataFrame | Series         - <code>.check.print()</code>: Print a string, a variable, or the current dataframe - DataFrame | Series</p> <ul> <li> <p>Export interim files</p> <ul> <li><code>.check.write()</code>: Export the current data, inferring file format from the name - DataFrame | Series</li> </ul> </li> <li> <p>Time your code</p> <ul> <li><code>.check.print_time_elapsed(start_time)</code>: Print the execution time since you called <code>start_time = pdc.start_timer()</code> - DataFrame | Series</li> <li> <p>\ud83d\udca1 Tip:  You can also use this stopwatch outside a method chain, anywhere in your Python code:  </p> <p>```python from pandas_checks import print_elapsed_time, start_timer</p> <p>start_time = start_timer() ... print_elapsed_time(start_time) ```</p> </li> </ul> </li> <li> <p>Turn off Pandas Checks</p> <ul> <li><code>.check.disable_checks()</code>: Don't run checks, for production mode etc. By default, still runs assertions. - DataFrame | Series</li> <li><code>.check.enable_checks()</code>: Run checks - DataFrame | Series</li> </ul> </li> <li> <p>Validate </p> <ul> <li>General<ul> <li><code>.check.assert_data()</code>: Check that data passes an arbitrary condition - DataFrame | Series</li> </ul> </li> <li>Types<ul> <li><code>.check.assert_datetime()</code> - DataFrame | Series</li> <li><code>.check.assert_float()</code> - DataFrame | Series</li> <li><code>.check.assert_int()</code> - DataFrame | Series</li> <li><code>.check.assert_str()</code> - DataFrame | Series</li> <li><code>.check.assert_timedelta()</code> - DataFrame | Series</li> <li><code>.check.assert_type()</code> - DataFrame | Series</li> </ul> </li> <li>Values<ul> <li><code>.check.assert_less_than()</code> - DataFrame | Series</li> <li><code>.check.assert_greater_than()</code> - DataFrame | Series</li> <li><code>.check.assert_negative()</code> - DataFrame | Series</li> <li><code>.check.assert_not_null()</code> - DataFrame | Series</li> <li><code>.check.assert_null()</code> - DataFrame | Series</li> <li><code>.check.assert_positive()</code> - DataFrame | Series</li> <li><code>.check.assert_unique()</code> - DataFrame | Series</li> </ul> </li> </ul> </li> <li> <p>Visualize</p> <ul> <li><code>.check.hist()</code>: A histogram - DataFrame | Series</li> <li><code>.check.plot()</code>: An arbitrary plot you can customize - DataFrame | Series</li> </ul> </li> </ul>"},{"location":"usage/#customizing-a-check","title":"Customizing a check","text":"<p>You can use Pandas Checks methods like the regular Pandas methods. They accept the same arguments. For example, you can pass: * <code>.check.head(7)</code> * <code>.check.value_counts(column=\"species\", dropna=False, normalize=True)</code> * <code>.check.plot(kind=\"scatter\", x=\"sepal_width\", y=\"sepal_length\")</code></p> <p>Also, most Pandas Checks methods accept 3 additional arguments: 1. <code>check_name</code>: text to display before the result of the check 2. <code>fn</code>: a lambda function that modifies the data displayed by the check 3. <code>subset</code>: limit a check to certain columns</p> <pre><code>(\n    iris\n    .check.value_counts(column='species', check_name=\"Varieties after data cleaning\")\n    .assign(species=lambda df: df[\"species\"].str.upper()) # Do your regular Pandas data processing, like upper-casing the values in one column\n    .check.head(n=2, fn=lambda df: df[\"petal_width\"]*2) # Modify the data that gets displayed in the check only\n    .check.describe(subset=['sepal_width', 'sepal_length'])  # Only apply the check to certain columns\n)\n</code></pre> <p></p>"},{"location":"usage/#configuring-pandas-check","title":"Configuring Pandas Check","text":""},{"location":"usage/#global-configuration","title":"Global configuration","text":"<p>You can change how Pandas Checks works everywhere. For example:</p> <pre><code>import pandas_checks as pdc\n\n# Set output precision and turn off the cute emojis\npdc.set_format(precision=3, use_emojis=False)\n\n# Don't run any of the calls to Pandas Checks, globally. \npdc.disable_checks()\n</code></pre> <p>Run <code>pdc.describe_options()</code> to see the arguments you can pass to <code>.set_format()</code>.</p> <p>\ud83d\udca1 Tip: By default, <code>disable_checks()</code> and <code>enable_checks()</code> do not change whether Pandas Checks will run assertion methods (<code>.check.assert_*</code>). </p> <p>To turn off assertions too, add the argument <code>enable_asserts=False</code>, such as: <code>disable_checks(enable_asserts=False)</code>.</p>"},{"location":"usage/#local-configuration","title":"Local configuration","text":"<p>You can also adjust settings within a method chain by bookending the chain, like this:</p> <pre><code># Customize format during one method chain\n(\n    iris\n    .check.set_format(precision=7, use_emojis=False)\n    ... # Any .check methods in here will use the new format\n    .check.reset_format() # Restore default format\n)\n\n# Turn off Pandas Checks during one method chain\n(\n    iris\n    .check.disable_checks()\n    ... # Any .check methods in here will not be run\n    .check.enable_checks() # Turn it back on for the next code\n)\n</code></pre>"},{"location":"usage/#hybrid-eda-production-data-processing","title":"Hybrid EDA-Production data processing","text":"<p>Exploratory Data Analysis is often taught as a one-time step we do to plan our production data processing. But sometimes EDA is a cyclical process we go back to for deeper inspection during debugging, code edits, or changes in the input data. If explorations were useful in EDA, they may be useful again.</p> <p>Unfortunately, it's hard to go back to EDA. It's too out of sync. The prod data processing pipeline has usually evolved too much, making the EDA code a historical artifact full of cobwebs that we can't easily fire up again. </p> <p>But if you use Pandas Checks during EDA, you could roll your <code>.check</code> methods into your first production code. Then in prod mode, disable Pandas Checks when you don't need it, to save compute and streamline output. When you ever need to pull out those EDA tools, enable Pandas Checks globally or locally.  </p> <p>This can make your prod pipline more transparent and easier to inspect.  </p>"},{"location":"API%20reference/DataFrameChecks/","title":"DataFrame methods","text":""},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks._obj","title":"<code>_obj = pandas_obj</code>  <code>instance-attribute</code>","text":""},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.__init__","title":"<code>__init__(pandas_obj)</code>","text":""},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_data","title":"<code>assert_data(condition, subset=None, pass_message=' \u2714\ufe0f Assertion passed ', fail_message=' \u3128 Assertion failed ', raise_exception=True, exception_to_raise=DataError, message_shows_condition=True, verbose=False)</code>","text":"<p>Tests whether Dataframe meets condition. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>condition</code> <code>Callable</code> <p>Assertion criteria in the form of a lambda function, such as <code>lambda df: df.shape[0]&gt;10</code>.</p> required <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. Applied after fn. Subsetting can also be done within the <code>condition</code>, such as <code>lambda df: df['column_name'].sum()&gt;10</code></p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assertion passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assertion failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>message_shows_condition</code> <code>bool</code> <p>Whether the fail/pass message should also print the assertion criteria</p> <code>True</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_datetime","title":"<code>assert_datetime(subset=None, pass_message=' \u2714\ufe0f Assert datetime passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is datetime or timestamp. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert datetime passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_float","title":"<code>assert_float(subset=None, pass_message=' \u2714\ufe0f Assert float passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is floats. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert float passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_greater_than","title":"<code>assert_greater_than(min, or_equal_to=True, subset=None, pass_message=' \u2714\ufe0f Assert minimum passed ', fail_message=' \u3128 Assert minimum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is &gt; or &gt;= a value. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>min</code> <code>Any</code> <p>the minimum value to compare DataFrame to. Accepts any type that can be used in &gt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &gt;= min (True) or &gt; min (False)</p> <code>True</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert minimum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert minimum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_int","title":"<code>assert_int(subset=None, pass_message=' \u2714\ufe0f Assert integeer passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is integers. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert integeer passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_less_than","title":"<code>assert_less_than(max, or_equal_to=True, subset=None, pass_message=' \u2714\ufe0f Assert maximum passed ', fail_message=' \u3128 Assert maximum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is &lt; or &lt;= a value. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>max</code> <code>Any</code> <p>the max value to compare DataFrame to. Accepts any type that can be used in &lt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &lt;= min (True) or &lt; max (False)</p> <code>True</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert maximum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert maximum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_negative","title":"<code>assert_negative(subset=None, assert_not_null=True, pass_message=' \u2714\ufe0f Assert negative passed ', fail_message=' \u3128 Assert negative failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has all negative values. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against.`</p> <code>None</code> <code>assert_not_null</code> <code>bool</code> <p>Whether to also enforce that data has no nulls.</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert negative passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert negative failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_not_null","title":"<code>assert_not_null(subset=None, pass_message=' \u2714\ufe0f Assert no nulls passed ', fail_message=' \u3128 Assert no nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has no nulls. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert no nulls passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert no nulls failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_null","title":"<code>assert_null(subset=None, pass_message=' \u2714\ufe0f Assert all nulls passed ', fail_message=' \u3128 Assert all nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has all nulls. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert all nulls passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert all nulls failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_positive","title":"<code>assert_positive(subset=None, assert_not_null=True, pass_message=' \u2714\ufe0f Assert positive passed ', fail_message=' \u3128 Assert positive failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has all positive values. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>assert_not_null</code> <code>bool</code> <p>Whether to also enforce that data has no nulls.</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert positive passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert positive failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_str","title":"<code>assert_str(subset=None, pass_message=' \u2714\ufe0f Assert string passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is strings. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert string passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_timedelta","title":"<code>assert_timedelta(subset=None, pass_message=' \u2714\ufe0f Assert timedelta passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns is of type timedelta. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert timedelta passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_type","title":"<code>assert_type(dtype, subset=None, pass_message=' \u2714\ufe0f Assert type passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns meets type assumption. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>dtype</code> <code>Type[Any]</code> <p>The required variable type</p> required <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert type passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.assert_unique","title":"<code>assert_unique(subset=None, pass_message=' \u2714\ufe0f Assert unique passed ', fail_message=' \u3128 Assert unique failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Dataframe or subset of columns has no duplicate rows. Optionally raises an exception. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>subset</code> <code>Union[str, List, None]</code> <p>Optional, which column or columns to check the condition against. `</p> <code>None</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert unique passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert unique failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.columns","title":"<code>columns(fn=lambda df: df, subset=None, check_name='\ud83c\udfdb\ufe0f Columns')</code>","text":"<p>Prints the column names of a DataFrame, without modifying the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before printing columns. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before printing their names. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83c\udfdb\ufe0f Columns'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.describe","title":"<code>describe(fn=lambda df: df, subset=None, check_name='\ud83d\udccf Distributions', **kwargs)</code>","text":"<p>Displays descriptive statistics about a DataFrame without modifying the DataFrame itself.</p> <p>See Pandas docs for describe() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas describe(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas describe(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\udccf Distributions'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas describe() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.disable_checks","title":"<code>disable_checks(enable_asserts=True)</code>","text":"<p>Turns off Pandas Checks globally, such as in production mode. Calls to .check functions will not be run. Does not modify the DataFrame itself.</p> <p>Args     enable_assert: Optionally, whether to also enable or disable assert statements</p> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.dtypes","title":"<code>dtypes(fn=lambda df: df, subset=None, check_name='\ud83d\uddc2\ufe0f Data types')</code>","text":"<p>Displays the data types of a DataFrame's columns without modifying the DataFrame itself.</p> <p>See Pandas docs for dtypes for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas dtypes. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas .dtypes. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\uddc2\ufe0f Data types'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.enable_checks","title":"<code>enable_checks(enable_asserts=True)</code>","text":"<p>Globally enables Pandas Checks. Subequent calls to .check methods will be run. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Optionally, whether to globally enable or disable calls to .check.assert_data().</p> <code>True</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.function","title":"<code>function(fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Applies an arbitrary function on a DataFrame and shows the result, without modifying the DataFrame itself.</p> Example <p>.check.function(fn=lambda df: df.shape[0]&gt;10, check_name='Has at least 10 rows?') which will result in 'True' or 'False'</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>A lambda function to apply to the DataFrame. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas describe(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.get_mode","title":"<code>get_mode(check_name='\ud83d\udc3c\ud83e\ude7a Pandas Checks mode')</code>","text":"<p>Displays the current values of Pandas Checks global options enable_checks and enable_asserts. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check. Will be used as a preface the printed result.</p> <code>'\ud83d\udc3c\ud83e\ude7a Pandas Checks mode'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.head","title":"<code>head(n=5, fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Displays the first n rows of a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for head() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>The number of rows to display.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas head(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas head(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.hist","title":"<code>hist(fn=lambda df: df, subset=[], check_name=None, **kwargs)</code>","text":"<p>Displays a histogram for the DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for hist() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas hist(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas hist(). Applied after fn.</p> <code>[]</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas hist() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>If more than one column is passed, displays a grid of histograms</p> <p>Only renders in interactive mode (IPython/Jupyter), not in terminal</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.info","title":"<code>info(fn=lambda df: df, subset=None, check_name='\u2139\ufe0f Info', **kwargs)</code>","text":"<p>Displays summary information about a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for info() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas info(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas info(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2139\ufe0f Info'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas info() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.memory_usage","title":"<code>memory_usage(fn=lambda df: df, subset=None, check_name='\ud83d\udcbe Memory usage', **kwargs)</code>","text":"<p>Displays the memory footprint of a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for memory_usage() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas memory_usage(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before running Pandas memory_usage(). Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcbe Memory usage'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas info() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>Include argument <code>deep=True</code> to get further memory usage of object dtypes in the DataFrame. See Pandas docs for memory_usage() for more info.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.ncols","title":"<code>ncols(fn=lambda df: df, subset=None, check_name='\ud83c\udfdb\ufe0f Columns')</code>","text":"<p>Displays the number of columns in a DataFrame, without modifying the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of columns. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before counting the number of columns. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83c\udfdb\ufe0f Columns'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.ndups","title":"<code>ndups(fn=lambda df: df, subset=None, check_name=None, **kwargs)</code>","text":"<p>Displays the number of duplicated rows in a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for duplicated() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of duplicates. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before counting duplicate rows. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas duplicated() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.nnulls","title":"<code>nnulls(fn=lambda df: df, subset=None, by_column=True, check_name='\ud83d\udc7b Rows with NaNs')</code>","text":"<p>Displays the number of rows with null values in a DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for isna() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of rows with a null. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string to select a subset of columns before counting nulls.</p> <code>None</code> <code>by_column</code> <code>bool</code> <p>If True, count null values with each column separately. If False, count rows with a null value in any column. Applied after fn.</p> <code>True</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udc7b Rows with NaNs'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.nrows","title":"<code>nrows(fn=lambda df: df, subset=None, check_name='\u2630 Rows')</code>","text":"<p>Displays the number of rows in a DataFrame, without modifying the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before counting the number of rows. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are considered when counting rows. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2630 Rows'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.nunique","title":"<code>nunique(column, fn=lambda df: df, check_name=None, **kwargs)</code>","text":"<p>Displays the number of unique rows in a single column, without modifying the DataFrame itself.</p> <p>See Pandas docs for nunique() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>column</code> <code>str</code> <p>The name of a column to count uniques in. Applied after fn.</p> required <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas nunique(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas nunique() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.plot","title":"<code>plot(fn=lambda df: df, subset=None, check_name='', **kwargs)</code>","text":"<p>Displays a plot of the DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for plot() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas plot(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are plotted. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional title for the plot.</p> <code>''</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas plot() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>Plots are only displayed when code is run in IPython/Jupyter, not in terminal.</p> <p>If you pass a 'title' kwarg, it becomes the plot title, overriding check_name</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.print","title":"<code>print(object=None, fn=lambda df: df, subset=None, check_name=None, max_rows=10)</code>","text":"<p>Displays text, another object, or (by default) the current DataFrame's head. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>object</code> <code>Any</code> <p>Object to print. Can be anything printable: str, int, list, another DataFrame, etc. If None, print the DataFrame's head (with <code>max_rows</code> rows).</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before printing <code>object</code>. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are printed. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>max_rows</code> <code>int</code> <p>Maximum number of rows to print if object=None.</p> <code>10</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.print_time_elapsed","title":"<code>print_time_elapsed(start_time, lead_in='Time elapsed', units='auto')</code>","text":"<p>Displays the time elapsed since start_time.</p> <p>Parameters:</p> Name Type Description Default <code>start_time</code> <code>float</code> <p>The index time when the stopwatch started, which comes from the Pandas Checks start_timer()</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to print before the elapsed time.</p> <code>'Time elapsed'</code> <code>units</code> <code>str</code> <p>The units in which to display the elapsed time. Can be \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <code>'auto'</code> <p>Raises:</p> Type Description <code>ValueError</code> <p>If <code>units</code> is not one of \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.reset_format","title":"<code>reset_format()</code>","text":"<p>Globally restores all Pandas Checks formatting options to their default \"factory\" settings. Does not modify the DataFrame itself.</p> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.set_format","title":"<code>set_format(**kwargs)</code>","text":"<p>Configures selected formatting options for Pandas Checks. Does not modify the DataFrame itself.</p> <p>Run pandas_checks.describe_options() to see a list of available options.</p> <p>For example, .check.set_format(check_text_tag= \"h1\", use_emojis=False`) will globally change Pandas Checks to display text results as H1 headings and remove all emojis.</p> <p>Parameters:</p> Name Type Description Default <code>**kwargs</code> <code>Any</code> <p>Pairs of setting name and its new value.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.set_mode","title":"<code>set_mode(enable_checks, enable_asserts)</code>","text":"<p>Configures the operation mode for Pandas Checks globally. Does not modify the DataFrame itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_checks</code> <code>bool</code> <p>Whether to run any Pandas Checks methods globally. Does not affect .check.assert_data().</p> required <code>enable_asserts</code> <code>bool</code> <p>Whether to run calls to Pandas Checks .check.assert_data() statements globally.</p> required <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.shape","title":"<code>shape(fn=lambda df: df, subset=None, check_name='\ud83d\udcd0 Shape')</code>","text":"<p>Displays the Dataframe's dimensions, without modifying the DataFrame itself.</p> <p>See Pandas docs for shape for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas <code>shape</code>. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are considered when printing the shape. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcd0 Shape'</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>See also .check.nrows() and .check.ncols()</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.tail","title":"<code>tail(n=5, fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Displays the last n rows of the DataFrame, without modifying the DataFrame itself.</p> <p>See Pandas docs for tail() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>Number of rows to show.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas tail(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are displayed. Applied after fn.</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.unique","title":"<code>unique(column, fn=lambda df: df, check_name=None)</code>","text":"<p>Displays the unique values in a column, without modifying the DataFrame itself.</p> <p>See Pandas docs for unique() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>column</code> <code>str</code> <p>Column to check for unique values.</p> required <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before calling Pandas unique(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p><code>fn</code> is applied to the dataframe before selecting <code>column</code>. If you want to select the column before modifying it, set <code>column=None</code> and start <code>fn</code> with a column selection, i.e. <code>fn=lambda df: df[\"my_column\"].stuff()</code></p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.value_counts","title":"<code>value_counts(column, fn=lambda df: df, max_rows=10, check_name=None, **kwargs)</code>","text":"<p>Displays the value counts for a column, without modifying the DataFrame itself.</p> <p>See Pandas docs for value_counts() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>column</code> <code>str</code> <p>Column to check for value counts.</p> required <code>max_rows</code> <code>int</code> <p>Maximum number of rows to show in the value counts.</p> <code>10</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before running Pandas value_counts(). Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas value_counts() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p><code>fn</code> is applied to the dataframe before selecting <code>column</code>. If you want to select the column before modifying it, set <code>column=None</code> and start <code>fn</code> with a column selection, i.e. <code>fn=lambda df: df[\"my_column\"].stuff()</code></p>"},{"location":"API%20reference/DataFrameChecks/#pandas_checks.DataFrameChecks.DataFrameChecks.write","title":"<code>write(path, format=None, fn=lambda df: df, subset=None, verbose=False, **kwargs)</code>","text":"<p>Exports DataFrame to file, without modifying the DataFrame itself.</p> <p>Format is inferred from path extension like .csv.</p> <p>This functions uses the corresponding Pandas export function such as to_csv(). See Pandas docs for those functions for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>path</code> <code>str</code> <p>Path to write the file to.</p> required <code>format</code> <code>Union[str, None]</code> <p>Optional file format to force for the export. If None, format is inferred from the file's extension in <code>path</code>.</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the DataFrame before exporting. Example: <code>lambda df: df.shape[0]&gt;10</code>. Applied before subset.</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>An optional list of column names or a string name of one column to limit which columns are exported. Applied after fn.</p> <code>None</code> <code>verbose</code> <code>bool</code> <p>Whether to print a message when the file is written.</p> <code>False</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional keyword arguments to pass to the Pandas export function (.to_csv).</p> <code>{}</code> <p>Returns:</p> Type Description <code>DataFrame</code> <p>The original DataFrame, unchanged.</p> Note <p>Exporting to some formats such as Excel, Feather, and Parquet may require you to install additional packages.</p>"},{"location":"API%20reference/SeriesChecks/","title":"Series methods","text":""},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks._obj","title":"<code>_obj = pandas_obj</code>  <code>instance-attribute</code>","text":""},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.__init__","title":"<code>__init__(pandas_obj)</code>","text":""},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_data","title":"<code>assert_data(condition, pass_message=' \u2714\ufe0f Assertion passed ', fail_message=' \u3128 Assertion failed ', raise_exception=True, exception_to_raise=DataError, message_shows_condition=True, verbose=False)</code>","text":"<p>Tests whether Series meets condition. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>condition</code> <code>Callable</code> <p>Assertion criteria in the form of a lambda function, such as <code>lambda s: s.shape[0]&gt;10</code>.</p> required <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assertion passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assertion failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>message_shows_condition</code> <code>bool</code> <p>Whether the fail/pass message should also print the assertion criteria</p> <code>True</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_datetime","title":"<code>assert_datetime(pass_message=' \u2714\ufe0f Assert datetime passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is datetime or timestamp. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert datetime passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_float","title":"<code>assert_float(pass_message=' \u2714\ufe0f Assert float passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is floats. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert float passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_greater_than","title":"<code>assert_greater_than(min, or_equal_to=True, pass_message=' \u2714\ufe0f Assert minimum passed ', fail_message=' \u3128 Assert minimum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series is &gt; or &gt;= a value. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>min</code> <code>Any</code> <p>the minimum value to compare Series to. Accepts any type that can be used in &gt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &gt;= min (True) or &gt; min (False)</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert minimum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert minimum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_int","title":"<code>assert_int(pass_message=' \u2714\ufe0f Assert integeer passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is integers. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_less_than","title":"<code>assert_less_than(max, or_equal_to=True, pass_message=' \u2714\ufe0f Assert maximum passed ', fail_message=' \u3128 Assert maximum failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series is &lt; or &lt;= a value. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>max</code> <code>Any</code> <p>the max value to compare Series to. Accepts any type that can be used in &lt;, such as int, float, str, datetime</p> required <code>or_equal_to</code> <code>bool</code> <p>whether to test for &lt;= min (True) or &lt; max (False)</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert maximum passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert maximum failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_negative","title":"<code>assert_negative(assert_not_null=True, pass_message=' \u2714\ufe0f Assert negative passed ', fail_message=' \u3128 Assert negative failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has all negative values. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>assert_not_null</code> <code>bool</code> <p>Whether to also enforce that data has no nulls.</p> <code>True</code> <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert negative passed '</code> <code>fail_message</code> <code>str</code> <p>Message to display if the condition fails.</p> <code>' \u3128 Assert negative failed '</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>DataError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_not_null","title":"<code>assert_not_null(pass_message=' \u2714\ufe0f Assert no nulls passed ', fail_message=' \u3128 Assert no nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has no nulls. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_null","title":"<code>assert_null(pass_message=' \u2714\ufe0f Assert all nulls passed ', fail_message=' \u3128 Assert all nulls failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has all nulls. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_positive","title":"<code>assert_positive(assert_not_null=True, pass_message=' \u2714\ufe0f Assert positive passed ', fail_message=' \u3128 Assert positive failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has all positive values. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>assert_not_null: Whether to also enforce that data has no nulls.\npass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_str","title":"<code>assert_str(pass_message=' \u2714\ufe0f Assert string passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is strings. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_timedelta","title":"<code>assert_timedelta(pass_message=' \u2714\ufe0f Assert timedelta passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series is of type timedelta. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_type","title":"<code>assert_type(dtype, pass_message=' \u2714\ufe0f Assert type passed ', fail_message=None, raise_exception=True, exception_to_raise=TypeError, verbose=False)</code>","text":"<p>Tests whether Series meets type assumption. Optionally raises an exception. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>dtype</code> <code>Type[Any]</code> <p>The required variable type</p> required <code>pass_message</code> <code>str</code> <p>Message to display if the condition passes.</p> <code>' \u2714\ufe0f Assert type passed '</code> <code>fail_message</code> <code>Union[str, None]</code> <p>Message to display if the condition fails.</p> <code>None</code> <code>raise_exception</code> <code>bool</code> <p>Whether to raise an exception if the condition fails.</p> <code>True</code> <code>exception_to_raise</code> <code>Type[BaseException]</code> <p>The exception to raise if the condition fails and raise_exception is True.</p> <code>TypeError</code> <code>verbose</code> <code>bool</code> <p>Whether to display the pass message if the condition passes.</p> <code>False</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.assert_unique","title":"<code>assert_unique(pass_message=' \u2714\ufe0f Assert unique passed ', fail_message=' \u3128 Assert unique failed ', raise_exception=True, exception_to_raise=DataError, verbose=False)</code>","text":"<p>Tests whether Series has no duplicate rows. Optionally raises an exception. Does not modify the Series itself.</p> <p>Args:</p> <pre><code>pass_message: Message to display if the condition passes.\nfail_message: Message to display if the condition fails.\nraise_exception: Whether to raise an exception if the condition fails.\nexception_to_raise: The exception to raise if the condition fails and raise_exception is True.\nverbose: Whether to display the pass message if the condition passes.\n</code></pre> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.describe","title":"<code>describe(fn=lambda s: s, check_name='\ud83d\udccf Distribution', **kwargs)</code>","text":"<p>Displays descriptive statistics about a Series, without modifying the Series itself.</p> <p>See Pandas docs for describe() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas describe(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\udccf Distribution'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas describe() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.disable_checks","title":"<code>disable_checks(enable_asserts=True)</code>","text":"<p>Turns off Pandas Checks globally, such as in production mode. Calls to .check functions will not be run. Does not modify the Series itself.</p> <p>Args     enable_assert: Optionally, whether to also enable or disable assert statements</p> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.dtype","title":"<code>dtype(fn=lambda s: s, check_name='\ud83d\uddc2\ufe0f Data type')</code>","text":"<p>Displays the data type of a Series, without modifying the Series itself.</p> <p>See Pandas docs for .dtype for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas dtype. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>'\ud83d\uddc2\ufe0f Data type'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.enable_checks","title":"<code>enable_checks(enable_asserts=True)</code>","text":"<p>Globally enables Pandas Checks. Subequent calls to .check methods will be run. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Optionally, whether to globally enable or disable calls to .check.assert_data().</p> <code>True</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.function","title":"<code>function(fn=lambda s: s, check_name=None)</code>","text":"<p>Applies an arbitrary function on a Series and shows the result, without modifying the Series itself.</p> Example <p>.check.function(fn=lambda s: s.shape[0]&gt;10, check_name='Has at least 10 rows?') which will result in 'True' or 'False'</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>The lambda function to apply to the Series. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check to preface the result with.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.get_mode","title":"<code>get_mode(check_name='\u2699\ufe0f Pandas Checks mode')</code>","text":"<p>Displays the current values of Pandas Checks global options enable_checks and enable_asserts. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check. Will be used as a preface the printed result.</p> <code>'\u2699\ufe0f Pandas Checks mode'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.head","title":"<code>head(n=5, fn=lambda s: s, check_name=None)</code>","text":"<p>Displays the first n rows of a Series, without modifying the Series itself.</p> <p>See Pandas docs for head() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>The number of rows to display.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas head(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.hist","title":"<code>hist(fn=lambda s: s, check_name=None, **kwargs)</code>","text":"<p>Displays a histogram for the Series's distribution, without modifying the Series itself.</p> <p>See Pandas docs for hist() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas head(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas hist() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Plots are only displayed when code is run in IPython/Jupyter, not in terminal.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.info","title":"<code>info(fn=lambda s: s, check_name='\u2139\ufe0f Series info', **kwargs)</code>","text":"<p>Displays summary information about a Series, without modifying the Series itself.</p> <p>See Pandas docs for info() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas info(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2139\ufe0f Series info'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas info() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.memory_usage","title":"<code>memory_usage(fn=lambda s: s, check_name='\ud83d\udcbe Memory usage', **kwargs)</code>","text":"<p>Displays the memory footprint of a Series, without modifying the Series itself.</p> <p>See Pandas docs for memory_usage() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas memory_usage(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcbe Memory usage'</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas memory_usage() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Include argument <code>deep=True</code> to get further memory usage of object dtypes. See Pandas docs for memory_usage() for more info.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.ndups","title":"<code>ndups(fn=lambda s: s, check_name=None, **kwargs)</code>","text":"<p>Displays the number of duplicated rows in the Series, without modifying the Series itself.</p> <p>See Pandas docs for duplicated() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before counting the number of duplicates. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas duplicated() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.nnulls","title":"<code>nnulls(fn=lambda s: s, check_name='\ud83d\udc7b Rows with NaNs')</code>","text":"<p>Displays the number of rows with null values in the Series, without modifying the Series itself.</p> <p>See Pandas docs for isna() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before counting rows with nulls. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udc7b Rows with NaNs'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.nrows","title":"<code>nrows(fn=lambda s: s, check_name='\u2630 Rows')</code>","text":"<p>Displays the number of rows in a Series, without modifying the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before counting the number of rows. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\u2630 Rows'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.nunique","title":"<code>nunique(fn=lambda s: s, check_name=None, **kwargs)</code>","text":"<p>Displays the number of unique rows in a Series, without modifying the Series itself.</p> <p>See Pandas docs for nunique() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas nunique(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas nunique() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.plot","title":"<code>plot(fn=lambda s: s, check_name='', **kwargs)</code>","text":"<p>Displays a plot of the Series, without modifying the Series itself.</p> <p>See Pandas docs for plot() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas plot(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional title for the plot.</p> <code>''</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas plot() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Plots are only displayed when code is run in IPython/Jupyter, not in terminal.</p> <p>If you pass a 'title' kwarg, it becomes the plot title, overriding check_name</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.print","title":"<code>print(object=None, fn=lambda s: s, check_name=None, max_rows=10)</code>","text":"<p>Displays text, another object, or (by default) the current DataFrame's head. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>object</code> <code>Any</code> <p>Object to print. Can be anything printable: str, int, list, another DataFrame, etc. If None, print the Series's head (with <code>max_rows</code> rows).</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before printing <code>object</code>. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>max_rows</code> <code>int</code> <p>Maximum number of rows to print if object=None.</p> <code>10</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.print_time_elapsed","title":"<code>print_time_elapsed(start_time, lead_in='Time elapsed', units='auto')</code>","text":"<p>Displays the time elapsed since start_time.</p> <p>Args: start_time: The index time when the stopwatch started, which comes from the Pandas Checks start_timer() lead_in: Optional text to print before the elapsed time. units: The units in which to display the elapsed time. Can be \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <p>Raises:</p> Type Description <code>ValueError</code> <p>If <code>units</code> is not one of \"auto\", \"seconds\", \"minutes\", or \"hours\".</p> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.reset_format","title":"<code>reset_format()</code>","text":"<p>Globally restores all Pandas Checks formatting options to their default \"factory\" settings. Does not modify the Series itself.</p> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.set_format","title":"<code>set_format(**kwargs)</code>","text":"<p>Configures selected formatting options for Pandas Checks. Run pandas_checks.describe_options() to see a list of available options. Does not modify the Series itself</p> <p>For example, .check.set_format(check_text_tag= \"h1\", use_emojis=False`) will globally change Pandas Checks to display text results as H1 headings and remove all emojis.</p> <p>Parameters:</p> Name Type Description Default <code>**kwargs</code> <code>Any</code> <p>Pairs of setting name and its new value.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.set_mode","title":"<code>set_mode(enable_checks, enable_asserts)</code>","text":"<p>Configures the operation mode for Pandas Checks globally. Does not modify the Series itself.</p> <p>Parameters:</p> Name Type Description Default <code>enable_checks</code> <code>bool</code> <p>Whether to run any Pandas Checks methods globally. Does not affect .check.assert_data().</p> required <code>enable_asserts</code> <code>bool</code> <p>Whether to run calls to Pandas Checks .check.assert_data() globally.</p> required <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.shape","title":"<code>shape(fn=lambda s: s, check_name='\ud83d\udcd0 Shape')</code>","text":"<p>Displays the Series's dimensions, without modifying the Series itself.</p> <p>See Pandas docs for <code>shape</code> for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas <code>shape</code>. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>'\ud83d\udcd0 Shape'</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>See also .check.nrows()</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.tail","title":"<code>tail(n=5, fn=lambda s: s, check_name=None)</code>","text":"<p>Displays the last n rows of the Series, without modifying the Series itself.</p> <p>See Pandas docs for tail() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>n</code> <code>int</code> <p>Number of rows to show.</p> <code>5</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas tail(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.unique","title":"<code>unique(fn=lambda s: s, check_name=None)</code>","text":"<p>Displays the unique values in a Series, without modifying the Series itself.</p> <p>See Pandas docs for unique() for additional usage information.</p> <p>Parameters:</p> Name Type Description Default <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas unique(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.value_counts","title":"<code>value_counts(fn=lambda s: s, max_rows=10, check_name=None, **kwargs)</code>","text":"<p>Displays the value counts for a Series, without modifying the Series itself.</p> <p>See Pandas docs for value_counts() for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>max_rows</code> <code>int</code> <p>Maximum number of rows to show in the value counts.</p> <code>10</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before running Pandas value_counts(). Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>check_name</code> <code>Union[str, None]</code> <p>An optional name for the check, to be printed as preface to the result.</p> <code>None</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional arguments that are accepted by Pandas value_counts() method.</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p>"},{"location":"API%20reference/SeriesChecks/#pandas_checks.SeriesChecks.SeriesChecks.write","title":"<code>write(path, format=None, fn=lambda s: s, verbose=False, **kwargs)</code>","text":"<p>Exports Series to file, without modifying the Series itself.</p> <p>Format is inferred from path extension like .csv.</p> <p>This functions uses the corresponding Pandas export function such as to_csv(). See Pandas docs for those functions for additional usage information, including more configuration options you can pass to this Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>path</code> <code>str</code> <p>Path to write the file to.</p> required <code>format</code> <code>Union[str, None]</code> <p>Optional file format to force for the export. If None, format is inferred from the file's extension in <code>path</code>.</p> <code>None</code> <code>fn</code> <code>Callable</code> <p>An optional lambda function to apply to the Series before exporting. Example: <code>lambda s: s.dropna()</code>.</p> <code>lambda s: s</code> <code>verbose</code> <code>bool</code> <p>Whether to print a message when the file is written.</p> <code>False</code> <code>**kwargs</code> <code>Any</code> <p>Optional, additional keyword arguments to pass to the Pandas export function (.to_csv).</p> <code>{}</code> <p>Returns:</p> Type Description <code>Series</code> <p>The original Series, unchanged.</p> Note <p>Exporting to some formats such as Excel, Feather, and Parquet may require you to install additional packages.</p>"},{"location":"API%20reference/display/","title":"Display","text":"<p>Utilities for displaying text, tables, and plots in Pandas Checks in both terminal and IPython/Jupyter environments.</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_check","title":"<code>_display_check(data, name=None)</code>","text":"<p>Renders the result of a Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>The data to display.</p> required <code>name</code> <code>Union[str, None]</code> <p>The optional name of the check.</p> <code>None</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_line","title":"<code>_display_line(line, lead_in=None, colors={})</code>","text":"<p>Displays a line of text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>The optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color options for the text and lead-in text. See syntax in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_plot","title":"<code>_display_plot()</code>","text":"<p>Renders the active Pandas Checks matplotlib plot object in an IPython/Jupyter environment with an optional indent.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p> Note <p>It assumes the plot has already been drawn by another function, such as with .plot() or .hist().</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_plot_title","title":"<code>_display_plot_title(line, lead_in=None, colors={})</code>","text":"<p>Displays a plot title with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The title text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to display before the title.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color settings for the text and lead-in text. See details in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_table","title":"<code>_display_table(table)</code>","text":"<p>Renders a Pandas DataFrame or Series in an IPython/Jupyter environment with an optional indent.</p> <p>Parameters:</p> Name Type Description Default <code>table</code> <code>Union[DataFrame, Series]</code> <p>The DataFrame or Series to display.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._display_table_title","title":"<code>_display_table_title(line, lead_in=None, colors={})</code>","text":"<p>Displays a table title with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The title text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to display before the title.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optiona dictionary containing color options for the text and lead-in text. See details in docstring for _render_text()</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._filter_emojis","title":"<code>_filter_emojis(text)</code>","text":"<p>Removes emojis from text if user has globally forbidden them.</p> <p>Parameters:</p> Name Type Description Default <code>text</code> <code>str</code> <p>The text to filter emojis from.</p> required <p>Returns:</p> Type Description <code>str</code> <p>The text with emojis removed if the user's global settings do not allow emojis. Else, the original text.</p>"},{"location":"API%20reference/display/#pandas_checks.display._format_background_color","title":"<code>_format_background_color(color)</code>","text":"<p>Applies a background color to text used being displayed in the terminal.</p> <p>Parameters:</p> Name Type Description Default <code>color</code> <code>str</code> <p>The background color to format. See syntax in docstring for _render_text().</p> required <p>Returns:</p> Type Description <code>str</code> <p>The formatted background color.</p>"},{"location":"API%20reference/display/#pandas_checks.display._lead_in","title":"<code>_lead_in(lead_in, foreground, background)</code>","text":"<p>Formats a lead-in text with colors.</p> <p>Parameters:</p> Name Type Description Default <code>lead_in</code> <code>Union[str, None]</code> <p>The lead-in text to format.</p> required <code>foreground</code> <code>str</code> <p>The foreground color for the lead-in text. See syntax in docstring for _render_text().</p> required <code>background</code> <code>str</code> <p>The background color for the lead-in text. See syntax in docstring for _render_text().</p> required <p>Returns:</p> Type Description <code>str</code> <p>The formatted lead-in text.</p>"},{"location":"API%20reference/display/#pandas_checks.display._print_table_terminal","title":"<code>_print_table_terminal(table)</code>","text":"<p>Prints a Pandas table in a terminal with an optional indent.</p> <p>Parameters:</p> Name Type Description Default <code>table</code> <code>Union[DataFrame, Series]</code> <p>A DataFrame or Series.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._render_html_with_indent","title":"<code>_render_html_with_indent(object_as_html)</code>","text":"<p>Renders HTML with an optional indent.</p> <p>Parameters:</p> Name Type Description Default <code>object_as_html</code> <code>str</code> <p>The HTML to render.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._render_text","title":"<code>_render_text(text, tag, lead_in=None, colors={})</code>","text":"<p>Renders text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>text</code> <code>str</code> <p>The text to render.</p> required <code>tag</code> <code>str</code> <p>The HTML tag to use for rendering.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>Optional colors for the text and lead-in text. Keys include:     - text_color: The foreground color of the main text.     - text_background_color: The background or highlight color of the main text.     - lead_in_text_color: The foreground color of lead-in text.     - lead_in_background_color: The background color of lead-in text. Color values are phrased such as \"blue\" or \"white\". They are passed to either HTML     for Jupyter/IPython outputs and to <code>termcolor</code> when code is run in terminal.     For color options when code is run in terminal, see         https://github.com/termcolor/termcolor.</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/display/#pandas_checks.display._warning","title":"<code>_warning(message, lead_in='\ud83d\udc3c\ud83e\ude7a Pandas Checks warning', clean_type=False)</code>","text":"<p>Displays a warning message.</p> <p>Parameters:</p> Name Type Description Default <code>message</code> <code>str</code> <p>The warning message to display.</p> required <code>lead_in</code> <code>str</code> <p>Optional lead-in text to display before the warning message.</p> <code>'\ud83d\udc3c\ud83e\ude7a Pandas Checks warning'</code> <code>clean_type</code> <code>bool</code> <p>Optional flag to remove the class type from the message, when running .check.dtype().</p> <code>False</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/","title":"Options","text":"<p>Utilities for configuring Pandas Checks options.</p> <p>This module provides functions for setting and managing global options for Pandas Checks, including formatting and disabling checks and assertions.</p>"},{"location":"API%20reference/options/#pandas_checks.options._initialize_format_options","title":"<code>_initialize_format_options(options=None)</code>","text":"<p>Initializes or resets Pandas Checks formatting options.</p> <p>Parameters:</p> Name Type Description Default <code>options</code> <code>Union[List[str], None]</code> <p>A list of option names to initialize or reset. If None, all formatting options will be initialized or reset.</p> <code>None</code> <p>Returns:     None</p> Note <p>We separate this function from _initialize_options() so user can reset just formatting without changing mode</p>"},{"location":"API%20reference/options/#pandas_checks.options._initialize_options","title":"<code>_initialize_options()</code>","text":"<p>Initializes (or resets) all Pandas Checks options to their default values.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p> Note <p>We separate this function from _initialize_format_options() so user can reset just formatting if desired without changing mode</p>"},{"location":"API%20reference/options/#pandas_checks.options._register_option","title":"<code>_register_option(name, default_value, description, validator)</code>","text":"<p>Registers a Pandas Checks option in the global Pandas context manager.</p> <p>If the option has already been registered, reset its value.</p> <p>This method enables setting global formatting for Pandas Checks results and storing variables that will persist across Pandas method chains, which return newly initialized DataFrames at each method (and so reset the DataFrame's attributes).</p> <p>Parameters:</p> Name Type Description Default <code>name</code> <code>str</code> <p>The name of the option to register.</p> required <code>default_value</code> <code>Any</code> <p>The default value for the option.</p> required <code>description</code> <code>str</code> <p>A description of the option.</p> required <code>validator</code> <code>Callable</code> <p>A function to validate the option value.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p> Note <p>For more details on the arguments, see the documentation for pandas._config.config.register_option()</p>"},{"location":"API%20reference/options/#pandas_checks.options._set_option","title":"<code>_set_option(option, value)</code>","text":"<p>Updates the value of a Pandas Checks option in the global Pandas context manager.</p> <p>Parameters:</p> Name Type Description Default <code>option</code> <code>str</code> <p>The name of the option to set.</p> required <code>value</code> <code>Any</code> <p>The value to set for the option.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p> <p>Raises:</p> Type Description <code>AttributeError</code> <p>If the <code>option</code> is not a valid Pandas Checks option.</p>"},{"location":"API%20reference/options/#pandas_checks.options.describe_options","title":"<code>describe_options()</code>","text":"<p>Prints all global options for Pandas Checks, their default values, and current values.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.disable_checks","title":"<code>disable_checks(enable_asserts=True)</code>","text":"<p>Turns off all calls to Pandas Checks methods and optionally enables or disables check.assert_data(). Does not modify the DataFrame itself.</p> <p>If this function is called, subequent calls to .check functions will not be run.</p> <p>Typically used to     1) Globally switch off Pandas Checks, such as during production. or     2) Temporarily switch off Pandas Checks, such as for a stable part of a notebook.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Whether to also run calls to Pandas Checks .check.assert_data()</p> <code>True</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.enable_checks","title":"<code>enable_checks(enable_asserts=True)</code>","text":"<p>Turns on Pandas Checks globally. Subsequent calls to .check methods will be run.</p> <p>Parameters:</p> Name Type Description Default <code>enable_asserts</code> <code>bool</code> <p>Whether to also enable or disable check.assert_data().</p> <code>True</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.get_mode","title":"<code>get_mode()</code>","text":"<p>Returns whether Pandas Checks is currently running checks and assertions.</p> <p>Returns:</p> Type Description <code>Dict[str, bool]</code> <p>A dictionary containing the current settings.</p>"},{"location":"API%20reference/options/#pandas_checks.options.reset_format","title":"<code>reset_format()</code>","text":"<p>Globally restores all Pandas Checks formatting options to their default \"factory\" settings.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/options/#pandas_checks.options.set_format","title":"<code>set_format(**kwargs)</code>","text":"<p>Configures selected formatting options for Pandas Checks. Run pandas_checks.describe_options() to see a list of available options.</p> <p>For example, set_format(check_text_tag= \"h1\", use_emojis=False`) will globally change Pandas Checks to display text results as H1 headings and remove all emojis.</p> <p>Returns:</p> Type Description <code>None</code> <p>None</p> <p>Parameters:</p> Name Type Description Default <code>**kwargs</code> <code>Any</code> <p>Pairs of setting name and its new value.</p> <code>{}</code>"},{"location":"API%20reference/options/#pandas_checks.options.set_mode","title":"<code>set_mode(enable_checks, enable_asserts)</code>","text":"<p>Configures the operation mode for Pandas Checks globally.</p> <p>Parameters:</p> Name Type Description Default <code>enable_checks</code> <code>bool</code> <p>Whether to run any Pandas Checks methods globally. Does not affect .check.assert_data().</p> required <code>enable_asserts</code> <code>bool</code> <p>Whether to run calls to .check.assert_data() globally.</p> required <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/run_checks/","title":"Run checks","text":"<p>Utilities for running Pandas Checks data checks.</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks._apply_modifications","title":"<code>_apply_modifications(data, fn=lambda df: df, subset=None)</code>","text":"<p>Applies user's modifications to a data object.</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>May be any Pandas DataFrame, Series, string, or other variable</p> required <code>fn</code> <code>Callable</code> <p>An optional lambda function to modify <code>data</code></p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Columns to subset after applying modifications</p> <code>None</code> <p>Returns:</p> Type Description <code>Any</code> <p>Modified and optionally subsetted data object.  If all arguments are defaults, data is returned unchanged.</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks._check_data","title":"<code>_check_data(data, check_fn=lambda df: df, modify_fn=lambda df: df, subset=None, check_name=None)</code>","text":"<p>Runs a selected check on a data object</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>A Pandas DataFrame, Series, string, or other variable</p> required <code>check_fn</code> <code>Callable</code> <p>Function to apply to data for checking. For example if we're running .check.value_counts(), this function would appply the Pandas value_counts() method</p> <code>lambda df: df</code> <code>modify_fn</code> <code>Callable</code> <p>Optional function to modify data before checking</p> <code>lambda df: df</code> <code>subset</code> <code>Union[str, List, None]</code> <p>Optional list of columns or name of column to subset data before running check_fn</p> <code>None</code> <code>check_name</code> <code>Union[str, None]</code> <p>Name to use when displaying check result</p> <code>None</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks._display_check","title":"<code>_display_check(data, name=None)</code>","text":"<p>Renders the result of a Pandas Checks method.</p> <p>Parameters:</p> Name Type Description Default <code>data</code> <code>Any</code> <p>The data to display.</p> required <code>name</code> <code>Union[str, None]</code> <p>The optional name of the check.</p> <code>None</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/run_checks/#pandas_checks.run_checks.get_mode","title":"<code>get_mode()</code>","text":"<p>Returns whether Pandas Checks is currently running checks and assertions.</p> <p>Returns:</p> Type Description <code>Dict[str, bool]</code> <p>A dictionary containing the current settings.</p>"},{"location":"API%20reference/timer/","title":"Timer","text":"<p>Provides a timer utility for tracking the elapsed time of steps within a Pandas method chain.</p> <p>Note that these functions rely on the <code>pdchecks.enable_checks</code> option being enabled in the Pandas configuration, as it is by default.</p>"},{"location":"API%20reference/timer/#pandas_checks.timer._display_line","title":"<code>_display_line(line, lead_in=None, colors={})</code>","text":"<p>Displays a line of text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>The optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color options for the text and lead-in text. See syntax in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/timer/#pandas_checks.timer.get_mode","title":"<code>get_mode()</code>","text":"<p>Returns whether Pandas Checks is currently running checks and assertions.</p> <p>Returns:</p> Type Description <code>Dict[str, bool]</code> <p>A dictionary containing the current settings.</p>"},{"location":"API%20reference/timer/#pandas_checks.timer.print_time_elapsed","title":"<code>print_time_elapsed(start_time, lead_in='\u23f1\ufe0f Time elapsed', units='auto')</code>","text":"<p>Displays the time elapsed since start_time.</p> <p>Parameters:</p> Name Type Description Default <code>start_time</code> <code>float</code> <p>The index time when the stopwatch started, which comes from the Pandas Checks start_timer()</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>Optional text to print before the elapsed time.</p> <code>'\u23f1\ufe0f Time elapsed'</code> <code>units</code> <code>str</code> <p>The units in which to display the elapsed time. Accepted values: - \"auto\" - \"milliseconds\", \"seconds\", \"minutes\", \"hours\" - \"ms\", \"s\", \"m\", \"h\"</p> <code>'auto'</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p> <p>Raises:</p> Type Description <code>ValueError</code> <p>If <code>units</code> is not one of expected time units</p> Note <p>If you change the default values for this function's argument, change them in <code>.check.print_time_elapsed</code> too in DataFrameChecks and SeriesChecks so they're exposed to the user.</p>"},{"location":"API%20reference/timer/#pandas_checks.timer.start_timer","title":"<code>start_timer(verbose=False)</code>","text":"<p>Starts a Pandas Checks stopwatch to measure run time between operations, such as steps in a Pandas method chain. Use print_elapsed_time() to get timings.</p> <p>Parameters:</p> Name Type Description Default <code>verbose</code> <code>bool</code> <p>Whether to print a message that the timer has started.</p> <code>False</code> <p>Returns:</p> Type Description <code>float</code> <p>Timestamp as a float</p>"},{"location":"API%20reference/utils/","title":"Utils","text":"<p>Utility functions for the pandas_checks package.</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._display_line","title":"<code>_display_line(line, lead_in=None, colors={})</code>","text":"<p>Displays a line of text with optional formatting.</p> <p>Parameters:</p> Name Type Description Default <code>line</code> <code>str</code> <p>The text to display.</p> required <code>lead_in</code> <code>Union[str, None]</code> <p>The optional text to display before the main text.</p> <code>None</code> <code>colors</code> <code>Dict</code> <p>An optional dictionary containing color options for the text and lead-in text. See syntax in docstring for _render_text().</p> <code>{}</code> <p>Returns:</p> Type Description <code>None</code> <p>None</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._has_nulls","title":"<code>_has_nulls(data, fail_message, raise_exception=True, exception_to_raise=DataError)</code>","text":"<p>Utility function to check for nulls as part of a larger check</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._is_type","title":"<code>_is_type(data, dtype)</code>","text":"<p>Utility function to check if a dataframe's columns or one series has an expected type. Includes special handling for strings, since 'object' type in Pandas may not mean a string</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._lambda_to_string","title":"<code>_lambda_to_string(lambda_func)</code>","text":"<p>Create a string representation of a lambda function.</p> <p>Parameters:</p> Name Type Description Default <code>lambda_func</code> <code>Callable</code> <p>An arbitrary function in lambda form</p> required <p>Returns:</p> Type Description <code>str</code> <p>A string version of lambda_func</p> Todo <p>This still returns all arguments to the calling function.     They get entangled with the argument when it's a lambda function.     Try other ways to get just the argument we want.</p>"},{"location":"API%20reference/utils/#pandas_checks.utils._series_is_type","title":"<code>_series_is_type(s, dtype)</code>","text":"<p>Utility function to check if a series has an expected type. Includes special handling for strings, since 'object' type in Pandas may not mean a string</p>"}]}
\ No newline at end of file