feat(weave): Implement `PresidioEntityRecognitionGuardrail` #3575

Open

soumik12345 wants to merge 3 commits into master from feat/presidio-entity-guardrail

Contributor

soumik12345 commented Feb 3, 2025

Description

Implement the `PresidioEntityRecognitionGuardrail`, based on the `PresidioEntityRecognitionGuardrail` from the safeguards library, originally contributed by @ash0ts.

Sample Trace


          add: PresidioEntityRecognitionGuardrail + tests

355edb0

soumik12345 self-assigned this

soumik12345 requested a review from a team as a code owner

February 3, 2025 11:33

socket-security bot commented Feb 3, 2025 (edited)

New dependencies detected.

| Package | New capabilities | Transitives | Size | Publisher |
| --- | --- | --- | --- | --- |
| pypi/[email protected] | Transitive: eval, filesystem, network | `+227` | 374 MB | avbalter, microsoft, omri374, ...2 more |
| pypi/[email protected] | Transitive: environment, eval, filesystem, network, shell, unsafe | `+2` | 2.23 MB | microsoft, omri374, omrimendels, ...1 more |
| pypi/[email protected] | None | `0` | 1.26 kB | thunder_007 |

circle-job-mirror bot commented Feb 3, 2025 (edited)

Preview this PR with FeatureBee: https://beta.wandb.ai/?betaVersion=344b359812816bcf5cdb671a02248483b8c5706b

soumik12345 added 2 commits

February 3, 2025 19:44


          fix: lint

c05ee8e


          update: test dependencies for scorers

8a28685

Contributor Author

soumik12345 commented Feb 3, 2025

Hi @tssweeney
Can you please review this PR?

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

		from presidio_anonymizer import AnonymizerEngine


		class PresidioEntityRecognitionResponse(BaseModel):

Collaborator

andrewtruong Feb 3, 2025

@tcapelle @soumik12345 are you going with TypedDict or BaseModel? Either is fine, but I would pick one and use it consistently across all the scorers. Maybe you should also add a test that enforces this.

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

Collaborator

andrewtruong Feb 3, 2025

nit: Prefer the new annotations syntax here

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                  _analyzer: "AnalyzerEngine"
+                  _anonymizer: "AnonymizerEngine"
+                  def __init__(

Collaborator

andrewtruong Feb 3, 2025

nit: Isn't Scorer a BaseModel? Can you use the pydantic-style init, or do you need to do it this way?

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                      deny_lists: Optional[dict[str, list[str]]] = None,
+                      regex_patterns: Optional[dict[str, list[dict[str, str]]]] = None,
+                      custom_recognizers: Optional[list[Any]] = None,
+                      show_available_entities: bool = False,

Collaborator

andrewtruong Feb 3, 2025

This seems weird. Why not just have this as a classmethod or a docs page?
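A rough sketch of the classmethod alternative, with the entity list hard-coded as a stand-in: the class name, the `_KNOWN_ENTITIES` tuple, and the validation logic are assumptions for illustration, and the real guardrail would presumably ask Presidio for its supported entities instead. The constructor also looks the list up only once, which would avoid the repeated lookup mentioned in a later comment.

```python
class EntityGuardrail:
    """Hypothetical guardrail; not the class from this PR."""

    # Stand-in for whatever the analyzer reports as supported entities.
    _KNOWN_ENTITIES = ("EMAIL_ADDRESS", "PHONE_NUMBER", "PERSON")

    def __init__(self, selected_entities=None):
        # Look up the available entities a single time and reuse the result.
        available = self.get_available_entities()
        selected = available if selected_entities is None else list(selected_entities)
        unknown = [entity for entity in selected if entity not in available]
        if unknown:
            raise ValueError(f"Unknown entities: {unknown}. Available: {available}")
        self.selected_entities = selected

    @classmethod
    def get_available_entities(cls) -> list[str]:
        """List supported entities without constructing a guardrail instance."""
        return list(cls._KNOWN_ENTITIES)
```

Callers who only want to inspect the options can then run `EntityGuardrail.get_available_entities()` directly, so no `show_available_entities` flag is needed on the constructor.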

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                          selected_entities = self.get_available_entities()
+                      # Get available entities dynamically
+                      available_entities = self.get_available_entities()

Collaborator

andrewtruong Feb 3, 2025

there's some duplication with the above available_entities

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                      self._anonymizer = anonymizer
+                  @weave.op
+                  def group_analyzer_results_by_entity_type(

Collaborator

andrewtruong Feb 3, 2025

is this something that should be an op?

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                      return "\n".join(explanation_parts)
+                  @weave.op
+                  def anonymize_text(

Collaborator

andrewtruong Feb 3, 2025

not sure if these should all be ops. They seem to be internal helpers?

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                      return anonymized_text
+                  @weave.op
+                  def score(self, output: str) -> PresidioEntityRecognitionResponse:

Collaborator

andrewtruong Feb 3, 2025

I don't think this signature is correct if you are returning the model dump of it.
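The mismatch can be illustrated with a small stand-in. To keep the sketch dependency-free, a stdlib dataclass and `dataclasses.asdict` are swapped in for the pydantic model and its `model_dump()`; the class and function names are hypothetical. The point is only that the annotation should describe the object actually returned.

```python
from dataclasses import asdict, dataclass
from typing import Any


@dataclass
class RecognitionResponse:
    """Stand-in for the response model (a pydantic BaseModel in the PR)."""

    flagged: bool
    explanation: str


def score_as_model(output: str) -> RecognitionResponse:
    # Annotation and return value agree: callers receive the model instance.
    flagged = "@" in output
    return RecognitionResponse(flagged=flagged, explanation="checked for '@'")


def score_as_dict(output: str) -> dict[str, Any]:
    # If the dumped form is returned, the annotation should say dict, not the model.
    return asdict(score_as_model(output))
```

Either variant is internally consistent; the problem the comment points at arises when the body returns the dumped dict while the signature still advertises the model type.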

andrewtruong reviewed

weave/scorers/guardrails/presidio_entity_recognition_guardrail.py

+                  @weave.op
+                  def score(self, output: str) -> PresidioEntityRecognitionResponse:
+                      analyzer_results = self._analyzer.analyze(
+                          text=str(output), entities=self.selected_entities, language=self.language

Collaborator

andrewtruong Feb 3, 2025

isn't output already str?
