Bench AF Documentation
The Alignment Faking Benchmark - a comprehensive evaluation framework for testing model organisms.
๐งช
validate_model.py
Core validation script for evaluating model organisms across different environments.
View documentation โโก
Quick Start
Get up and running with installation instructions and basic usage.
Get started โKey Features
Model Evaluation
Test alignment-faking and aligned models across various environments.
Configurable Testing
Flexible parameters for sample size, concurrency, and model types.
Comprehensive Analysis
Detailed logging, metrics, and visualization tools for results.
Documentation Structure
Getting Started
- โข System overview and architecture
- โข Installation and setup
- โข Basic usage examples
Advanced Usage
- โข Model validation details
- โข Environment configuration
- โข Results analysis and interpretation