Skip to content

probe: repeated char injection #282

@leondz

Description

@leondz

Implement Dropbox's repeated token attack as a probe in garak. Check out their documentation and code at https://github.com/dropbox/llm-security!

Take a look in garak/garak/probes/ to see how probes work, before giving this a go. Try to write as few methods as possible. When you're done, add a test in garak/tests/probes/ that checks things are working as expected. If you need help, you can comment here, or find us in discord!

A guide to testing for contributing is at https://reference.garak.ai/en/latest/contributing.html

For a reference guide to how probes work, see https://reference.garak.ai/en/latest/garak.probes.base.html

Metadata

Metadata

Assignees

No one assigned

    Labels

    good first issueGood for newcomersnew pluginDescribes an entirely new probe, detector, generator or harnessprobesContent & activity of LLM probes

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions