Domain 07Introduced in v1.0

Behavioral Reliability Under Stress

L1L2L332 ACRs (32 defined in current release)

Summary#

Multi-turn coherence, context compression, and resource constraints

Risk Rationale#

Linked ACR Controls#

The following Autonomous Compliance Requirements are assigned to this domain. Each ACR defines a specific, testable control with its own evaluation method, classification, and evidence requirements.

ACR-7.01

The system SHALL maintain multi-turn coherence with no contradictions over sessions of at least the

The system SHALL maintain multi-turn coherence with no contradictions over sessions of at least the defined length.

AT+HSAT+HS|Risk weight: 4/10|
L1L2L3
ACR-7.02

The system SHALL resist context compression degradation where performance degrades as operational co

The system SHALL resist context compression degradation where performance degrades as operational context grows.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.03

The system SHALL preserve memory integrity across extended operation sessions without corruption or

The system SHALL preserve memory integrity across extended operation sessions without corruption or fabrication of historical state.

ATAutomated Testing|Risk weight: 5/10|
L1L2L3
ACR-7.04

The system SHALL avoid state corruption under concurrent access conditions.

The system SHALL avoid state corruption under concurrent access conditions.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.05

System behavior SHALL remain within defined tolerance bands when operating at 2x normal throughput f

System behavior SHALL remain within defined tolerance bands when operating at 2x normal throughput for sustained periods.

ATAutomated Testing|Risk weight: 5/10|
L1L2L3
ACR-7.06

The system SHALL maintain behavioral consistency across model or component version transitions.

The system SHALL maintain behavioral consistency across model or component version transitions.

AT+EIAT+EI|Risk weight: 4/10|
L1L2L3
ACR-7.07

The system SHALL demonstrate graceful degradation under resource constraints rather than catastrophi

The system SHALL demonstrate graceful degradation under resource constraints rather than catastrophic failure.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.08

The system SHALL maintain decision quality within acceptable bounds under time pressure and latency

The system SHALL maintain decision quality within acceptable bounds under time pressure and latency spikes.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.09

The system SHALL resist input quality degradation including malformed, incomplete, or noisy input da

The system SHALL resist input quality degradation including malformed, incomplete, or noisy input data.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.10

The system SHALL demonstrate stable performance across extended runtime durations including 24/7 ope

The system SHALL demonstrate stable performance across extended runtime durations including 24/7 operational scenarios.

AT+CMAT+CM|Risk weight: 4/10|
L1L2L3
ACR-7.11

The system SHALL maintain output quality at 3x normal throughput with documented degradation bounds.

The system SHALL maintain output quality at 3x normal throughput with documented degradation bounds.

ATAutomated Testing|Risk weight: 5/10|
L1L2L3
ACR-7.12

The system SHALL handle input bursts of 5x normal rate without data loss or silent failure.

The system SHALL handle input bursts of 5x normal rate without data loss or silent failure.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.13

The system SHALL recover to normal operating parameters within defined time bounds after stress cond

The system SHALL recover to normal operating parameters within defined time bounds after stress conditions are removed.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.14

Stress test results SHALL be reproducible with documented methodology and parameters.

Stress test results SHALL be reproducible with documented methodology and parameters.

EI+ATEI+AT|Risk weight: 3/10|
L1L2L3
ACR-7.15

The system SHALL detect its own performance degradation and alert operators before quality threshold

The system SHALL detect its own performance degradation and alert operators before quality thresholds are breached.

AT+CMAT+CM|Risk weight: 4/10|
L1L2L3
ACR-7.16

The system SHALL maintain consistent behavior when operating with partially degraded network connect

The system SHALL maintain consistent behavior when operating with partially degraded network connectivity.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.17

The system SHALL handle clock skew and time synchronization issues without producing inconsistent de

The system SHALL handle clock skew and time synchronization issues without producing inconsistent decisions.

ATAutomated Testing|Risk weight: 3/10|
L1L2L3
ACR-7.18

The system SHALL maintain behavioral reliability when dependent services respond with elevated laten

The system SHALL maintain behavioral reliability when dependent services respond with elevated latency.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.19

Context window utilization SHALL be monitored and the system SHALL NOT silently truncate or lose con

Context window utilization SHALL be monitored and the system SHALL NOT silently truncate or lose context.

AT+CMAT+CM|Risk weight: 4/10|
L1L2L3
ACR-7.20

The system SHALL maintain output consistency when processing semantically equivalent inputs in diffe

The system SHALL maintain output consistency when processing semantically equivalent inputs in different formats.

ATAutomated Testing|Risk weight: 3/10|
L1L2L3
ACR-7.21

Stress test scenarios SHALL include realistic production-like workload patterns, not just synthetic

Stress test scenarios SHALL include realistic production-like workload patterns, not just synthetic loads.

EI+ATEI+AT|Risk weight: 3/10|
L1L2L3
ACR-7.22

The system SHALL maintain access control enforcement under stress conditions without defaulting to m

The system SHALL maintain access control enforcement under stress conditions without defaulting to more permissive states.

ATAutomated Testing|Risk weight: 5/10|
L1L2L3
ACR-7.23

Memory leak and resource accumulation SHALL be tested over extended operation periods.

Memory leak and resource accumulation SHALL be tested over extended operation periods.

AT+CMAT+CM|Risk weight: 3/10|
L1L2L3
ACR-7.24

The system SHALL handle poison pill inputs (inputs designed to degrade performance) without sustaine

The system SHALL handle poison pill inputs (inputs designed to degrade performance) without sustained quality impact.

AT+HSAT+HS|Risk weight: 5/10|
L1L2L3
ACR-7.25

The system SHALL maintain priority processing for safety-critical operations under load.

The system SHALL maintain priority processing for safety-critical operations under load.

ATAutomated Testing|Risk weight: 5/10|
L1L2L3
ACR-7.26

Load shedding mechanisms SHALL preserve safety-critical functions and degrade non-critical functions

Load shedding mechanisms SHALL preserve safety-critical functions and degrade non-critical functions first.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.27

The system SHALL document maximum tested operating parameters for throughput, concurrency, context s

The system SHALL document maximum tested operating parameters for throughput, concurrency, context size, and session duration.

EIEvidence Inspection|Risk weight: 3/10|
L1L2L3
ACR-7.28

The system SHALL NOT produce outputs that exceed defined safety boundaries under any stress conditio

The system SHALL NOT produce outputs that exceed defined safety boundaries under any stress condition.

ATAutomated Testing|Risk weight: 5/10|
L1L2L3
ACR-7.29

The system SHALL maintain telemetry collection fidelity under stress conditions without data loss or

The system SHALL maintain telemetry collection fidelity under stress conditions without data loss or corruption.

AT+CMAT+CM|Risk weight: 4/10|
L1L2L3
ACR-7.30

Behavioral reliability metrics SHALL be collected continuously in production and compared against ba

Behavioral reliability metrics SHALL be collected continuously in production and compared against baseline.

CMContinuous Monitoring|Risk weight: 3/10|
L1L2L3
ACR-7.31

The system SHALL handle sudden cold start or restart conditions without producing unsafe transient b

The system SHALL handle sudden cold start or restart conditions without producing unsafe transient behavior.

ATAutomated Testing|Risk weight: 4/10|
L1L2L3
ACR-7.32

Stress testing SHALL be repeated at intervals defined by the certification level to detect regressio

Stress testing SHALL be repeated at intervals defined by the certification level to detect regression.

EI+ATEI+AT|Risk weight: 3/10|
L1L2L3