[Python Client] Sensitive Data Exposure in Debug Logs - No Built-in Redaction Mechanism #1025

ganeshrvel · 2025-01-04T06:19:30Z

I confirm this is a bug with Supabase, not with my own application.
I confirm I have searched the Docs, GitHub Discussions, and Discord.

Describe the bug

The Supabase Python client exposes sensitive data (tokens, query parameters) in debug logs without providing any built-in mechanism to redact this information. This was previously reported in discussion https://github.com/orgs/supabase/discussions/31019 but remains unresolved. This is a security concern as sensitive tokens and data are being logged in plaintext, potentially exposing them in log files.

To Reproduce

1. Set up a Python application using the Supabase client.
2. Enable debug logging for the client.
3. Make any API call that includes sensitive data (such as authentication tokens).
4. Check the debug logs to see the exposed sensitive information:

import logging
import supabase

# Configure logging
logging.basicConfig(level=logging.DEBUG)

# Initialize Supabase client
client = supabase.create_client(...)

# Make any API call
result = client.from_('sensitive_table').select('*').execute()

The debug logs will show sensitive information like:

[DEBUG] [hpack.hpack] Decoded (b'content-location', b'/sensitive_table?sensitive_token=eq.abc-1234-567899888-23333-33333-333333-333333')

Expected behavior

The Supabase Python client should:

Provide built-in configuration options to redact sensitive data in debug logs
Either mask sensitive tokens and parameters by default or
Provide clear documentation on how to properly configure logging to protect sensitive data

System information

OS: Linux
Version of supabase-py: latest
Version of Python: 3.11

Additional context

Standard Python logging filters don't work effectively as the logs are generated by underlying libraries (httpx, httpcore, hpack). This is a security issue that needs proper handling at the client library level. Custom filters like:

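import logging
import re

class SensitiveDataFilter(logging.Filter):
    def filter(self, record: logging.LogRecord) -> bool:
        # Redacts only the message template; values passed via record.args are untouched
        record.msg = re.sub(r"abc-[0-9a-f\-]+", "[REDACTED-TOKEN]", str(record.msg))
        return True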

don't fully address the issue as they can't catch all instances of sensitive data exposure.
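A variation that may cover more cases is to redact the fully formatted message and to attach the filter to a handler instead of a logger. This is a minimal sketch, not a Supabase feature: the names `RedactFormattedMessage` and `SECRET_PATTERN` are illustrative, and the pattern is the example token format from this issue.

```python
import logging
import re

# Placeholder pattern for the example token in this issue; replace with your own.
SECRET_PATTERN = re.compile(r"abc-[0-9a-f\-]+")


class RedactFormattedMessage(logging.Filter):
    """Redact secrets from the fully formatted message (hypothetical name)."""

    def filter(self, record: logging.LogRecord) -> bool:
        # getMessage() merges msg and args, so bytes, tuples and URL objects
        # are already rendered as text by the time the pattern runs.
        record.msg = SECRET_PATTERN.sub("[REDACTED-TOKEN]", record.getMessage())
        record.args = None
        return True


handler = logging.StreamHandler()
# A filter on a handler sees every record that reaches that handler,
# whichever logger (httpx, httpcore, hpack, ...) emitted it.
handler.addFilter(RedactFormattedMessage())
logging.basicConfig(level=logging.DEBUG, handlers=[handler])
```

The trade-off is that the original arguments are discarded after formatting, and exception tracebacks, which the formatter renders later, are not covered by this filter.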

This issue was previously raised in discussion https://github.com/orgs/supabase/discussions/31019 without any resolution, hence filing it as a bug report given its security implications.


juancarlospaco · 2025-01-06T21:30:54Z

The DEBUG log level should not be used in production. It is meant only for debugging, where the goal is to "print as much as possible", and performance may also be poor in DEBUG mode.

Please don't use DEBUG mode for public stuff and you should be OK.

silentworks · 2025-01-08T23:42:26Z

Yes, don't use DEBUG mode in production/public stuff, as @juancarlospaco said.

If you are looking for a filter that works at the INFO log level, you can use this. A lot of this code was lifted from this PR: supabase/realtime-py#217

import logging
import re

import httpx

# Example pattern from this issue; replace it with one that matches your own secrets
TOKEN_PATTERN = re.compile(r"abc-[0-9a-f\-]+")


class SensitiveDataFilter(logging.Filter):
    def filter(self, record: logging.LogRecord) -> bool:
        record.msg = self.sanitize_line(record.msg)
        record.args = self.sanitize_args(record.args)
        return True

    @staticmethod
    def sanitize_args(d):
        # Build new containers so the caller's dict or tuple is never modified
        if isinstance(d, dict):
            return {k: SensitiveDataFilter.sanitize_line(v) for k, v in d.items()}
        if isinstance(d, tuple):
            return tuple(SensitiveDataFilter.sanitize_line(v) for v in d)
        return d

    @staticmethod
    def sanitize_line(line):
        if isinstance(line, str):
            return TOKEN_PATTERN.sub("[REDACTED-TOKEN]", line)
        if isinstance(line, httpx.URL):
            # URL arguments can arrive as httpx.URL objects rather than strings
            return httpx.URL(TOKEN_PATTERN.sub("[REDACTED-TOKEN]", str(line)))
        # Other types (ints, bytes, ...) are passed through unchanged
        return line


# Applying the filter
logging.getLogger("httpx").addFilter(SensitiveDataFilter())

# Configure logging
logging.basicConfig(level=logging.INFO)

ganeshrvel · 2025-01-09T07:35:04Z

@silentworks We don't use DEBUG mode in production, as @juancarlospaco mentioned. This was mainly in reference to dev builds.

silentworks · 2025-01-11T13:50:05Z

If this is in dev, then it is a non-issue here.

ganeshrvel · 2025-01-11T13:59:23Z

By “dev,” I didn’t mean localhost. I was referring to the staging server. Even on a dev staging server, there are secrets we absolutely do not want to expose, especially sensitive keys on the server.

Moreover, what if I’m debugging an issue on a production staging server? I definitely wouldn’t want those secrets being leaked in logs. It’s silly to assume that debug mode in any staging environment is a non-issue.

I don’t understand why this was rushed to be closed with such a silly judgment on how or where I should hide my secrets. The issue is far from resolved, and the code provided didn’t work. I’m still figuring out a proper solution, and it would be appreciated if this were treated with the seriousness it deserves instead of being dismissed prematurely.

silentworks · 2025-01-11T22:30:20Z

You do realise you are asking for application-level debugging help at the library level, right? I provided you with a solution before closing the issue. If you want to test that solution in DEBUG mode, then you should experiment a bit, as I or any other developer building an application would have to do. I just added the filter to the loggers for httpx and httpcore (this requires a bit of digging, as httpcore has multiple components).

logging.getLogger("httpx").addFilter(SensitiveDataFilter())
logging.getLogger("httpcore.http11").addFilter(SensitiveDataFilter())
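In Python's `logging` module, a filter attached to a logger is not applied to records emitted by its descendant loggers, which is why a component such as `httpcore.http11` has to be named explicitly. The "digging" can be partly automated with a sketch like the one below. It relies on `logging.root.manager.loggerDict`, a registry that exists in CPython but is not part of the documented public API, and the helper name `add_filter_by_prefix` is hypothetical.

```python
import logging


def add_filter_by_prefix(prefix: str, log_filter: logging.Filter) -> list[str]:
    """Attach log_filter to every existing logger named prefix or prefix.* (hypothetical helper)."""
    matched = []
    # loggerDict maps names to Logger objects, plus PlaceHolder entries for
    # intermediate names that were never requested directly.
    for name, obj in logging.root.manager.loggerDict.items():
        if not isinstance(obj, logging.Logger):
            continue
        if name == prefix or name.startswith(prefix + "."):
            obj.addFilter(log_filter)
            matched.append(name)
    return sorted(matched)
```

Only loggers that have already been created show up in the registry, so a call such as `add_filter_by_prefix("httpcore", SensitiveDataFilter())` would need to run after the client library has been imported. The returned names show which components were covered.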

Also, when asking questions or raising issues, the onus is on you, the poster, to provide all possible cases/scenarios from the earliest point possible.

ganeshrvel added the bug Something isn't working label Jan 4, 2025

silentworks removed the bug Something isn't working label Jan 8, 2025

silentworks closed this as completed Jan 11, 2025
