Evasion Simulator -- test safety control robustness against agent evasion¶
Evasion Simulator -- test safety control robustness against agent evasion.
Module: replication.evasion
Quick Start¶
Enums¶
TechniqueKind¶
RESOURCE_MASKINGIDENTITY_SPOOFINGGRADUAL_DRIFTTIMING_EXPLOITCONTRACT_LOOPHOLEMONITORING_FLOODCAPABILITY_CHAININGSANDBOX_PROBINGCOLLUSIONSTEGANOGRAPHIC
Severity¶
LOWMEDIUMHIGHCRITICAL
ControlType¶
DRIFT_DETECTORRESOURCE_LIMITERCONTRACT_ENFORCERLINEAGE_TRACKERBEHAVIOR_MONITORRATE_LIMITERSANDBOX_BOUNDARYAUDIT_LOGGERKILL_SWITCHCONSENSUS_VALIDATOR
Classes¶
TechniqueKind¶
Evasion technique categories.
Severity¶
Risk severity if evasion succeeds.
ControlType¶
Types of safety controls that can be tested.
SafetyControl¶
A safety control to test against evasion techniques.
| Method | Description |
|---|---|
to_dict() |
EvasionTechnique¶
An evasion technique with parameters.
| Method | Description |
|---|---|
to_dict() |
EvasionResult¶
Result of running one evasion technique against controls.
| Method | Description |
|---|---|
to_dict() |
ControlResilience¶
Aggregated resilience score for a single control.
| Method | Description |
|---|---|
to_dict() |
EvasionReport¶
Full report from running evasion simulation.
| Method | Description |
|---|---|
to_dict() |
|
to_json() |
EvasionSimulator¶
Simulate agent evasion attempts against safety controls.
| Method | Description |
|---|---|
__init__() |
|
add_control() |
|
remove_control() |
|
list_controls() |
|
get_control() |
|
add_technique() |
|
list_techniques() |
|
get_technique() |
|
run_technique() |
|
run_all() |
|
get_history() |
|
clear_history() |
Functions¶
| Function | Description |
|---|---|
main() |