cc8.1-5-001

CC8.1 · D5 · f1

D5
Difficulty
5
Gaps
0
Red Herrings
74%
Avg Score

Task

You are auditing Beacon SaaS's SOC 2 Type II compliance for control CC8.1 (Change Management). Review the provided evidence to assess whether Beacon SaaS has effective change management controls in place, including proper authorization, testing, approval, and deployment practices. Evaluate whether deviations from the documented policy represent genuine control gaps or reasonable operational decisions. Consider the full context when making judgments — some situations may be ambiguous and require nuanced analysis.

Evidence

Findings

IDTypeSeverityFinding
F-001gapmediumCHG-414 deployed to production with 94.2% test pass rate, below the 98% policy threshold
Deployment CHG-414 (Real-time analytics pipeline migration to Kafka) proceeded to production with a 94.2% automated test pass rate, which is below the 98% overall pass rate required by Section 4.3 of ...
F-002gapmediumCHG-415 approved by Engineering Manager instead of CAB, with no formal delegation authority
Deployment CHG-415 (Onboarding wizard flow redesign), classified as Medium risk, was approved by Jared Kim (Engineering Manager) instead of the Change Advisory Board as required by Section 4.4 of the ...
F-003gapmediumTwo rollbacks in Q4 2025 triggers the escalation threshold defined in policy metrics
Q4 2025 saw two production rollbacks: CHG-406 (API rate limiting, rolled back October 29 due to latency threshold breach) and CHG-418 (Kubernetes node pool scaling, rolled back December 17 due to pod ...
F-004gaplowEmergency hotfix CHG-409 bypassed staging testing — retrospective CAB review completed within SLA
CHG-409 (Critical XSS vulnerability hotfix for CVE-2025-41823) was deployed as an emergency change on November 6 at 22:15 ET, bypassing staging environment testing and the standard deployment window. ...
F-005gapmediumFlaky test remediation SLA exceeded by over 100 days for Kafka integration tests
The automated test results show that 3 flaky tests (FLAKY-028, FLAKY-029, FLAKY-030) identified on September 14, 2025 have been open for 109 days, far exceeding the 30-day remediation SLA defined in t...

Results

ModelProviderScoreRecallPrec.F1GapsReported
Sonnet 4.6Anthropic83%100%71%83%5/57
Opus 4.7Anthropic91%100%83%91%5/56
GPT-5.5OpenAI73%80%67%73%4/56
GPT-4.1OpenAI71%100%56%71%5/59
Haiku 4.5Anthropic43%100%28%43%5/518
GPT-4oOpenAI80%80%80%80%4/55