AutoAlign

Converting Static Documentation into Autonomous Governance

The Problem

Documentation sits in folders gathering dust. Developers make costly mistakes because they can't manually cross-reference every governance rule in a 50-page policy doc for every feature they build. The result: compliance rework, security incidents, and project delays.

AutoAlign turns static policy documents into a living, autonomous governance system.

Architecture

                    ┌─────────────────────────────────────────┐
                    │         AutoAlign Architecture           │
                    └─────────────────────────────────────────┘

  ┌──────────┐     ┌──────────────────────────────────────────────────────┐
  │  Policy  │────▶│              Knowledge Base (ChromaDB)               │
  │   Docs   │     │   Internal_Recommendation_Doc.md + Data Dictionary   │
  └──────────┘     └────────────────────────┬─────────────────────────────┘
                                            │ RAG Retrieval
                                            ▼
  ┌──────────┐     ┌──────────────────────────────────────────────────────┐
  │   BRD    │────▶│                  LangGraph Workflow                  │
  │ (Input)  │     │                                                      │
  └──────────┘     │   ┌─────────────┐      violations     ┌──────────┐  │
                   │   │  DEFENDER   │ ─────────────────▶ │ DRAFTER  │  │
                   │   │   AGENT     │ ◀─────────────────  │  AGENT   │  │
                   │   │(Policy      │   revised BRD       │(Architect│  │
                   │   │ Guardian)   │                     │ Solution)│  │
                   │   └──────┬──────┘                     └──────────┘  │
                   │          │ compliant / max iterations                │
                   │          ▼                                           │
                   │   ┌─────────────┐                                   │
                   │   │   REPORT    │                                   │
                   │   │    NODE     │                                   │
                   │   └─────────────┘                                   │
                   └──────────────────────────────────────────────────────┘
                                            │
                                            ▼
                              ┌─────────────────────────┐
                              │  Aligned BRD + Report   │
                              │  (Output)               │
                              └─────────────────────────┘

The Debate Loop

Defender Agent analyzes the BRD against the policy knowledge base via RAG, identifying all violations
If violations exist, Drafter Agent rewrites the BRD to fix every issue while preserving business intent
The revised BRD goes back to the Defender for another pass
Loop continues until the BRD is compliant or max iterations are reached
Final Compliance Report is generated with full audit trail

Tech Stack

Component	Technology
AI Reasoning	Google Gemini 1.5 Pro (high context window for large Markdown files)
Vector Storage	ChromaDB (local) / Vertex AI Vector Search (production)
Data Warehouse	BigQuery (audit logs, policy metadata)
Orchestration	LangGraph (multi-agent state machine)
SDK	Turgon SDK (AutoAlign integration layer)
Embeddings	Google `embedding-001`
Framework	LangChain

Project Structure

AutoAlign/
├── main.py                         # CLI entry point
├── requirements.txt
├── .env.example
├── config/
│   ├── __init__.py
│   └── settings.py                 # All configuration
├── src/
│   ├── agents/
│   │   ├── defender.py             # Policy Defender Agent
│   │   └── drafter.py             # Compliance Drafter Agent
│   ├── knowledge_base/
│   │   ├── loader.py              # Document ingestion + vector store
│   │   └── retriever.py           # RAG retrieval
│   ├── workflow/
│   │   ├── state.py               # LangGraph state definitions
│   │   └── graph.py               # Multi-agent workflow graph
│   └── utils/
│       └── logger.py              # Structured logging
├── turgon/                         # Turgon SDK (high-level client)
│   ├── client.py
│   └── models.py
├── docs/                           # Policy documents (knowledge base)
│   ├── Internal_Recommendation_Doc.md
│   └── Data_Dictionary.md
└── examples/
    └── sample_brd.md              # Intentionally non-compliant BRD demo

Getting Started

1. Clone the Repo

git clone https://github.com/aakash4dev/AutoAlign.git
cd AutoAlign

2. Configure Environment

cp .env.example .env
# Edit .env and add your GOOGLE_API_KEY

Get a Google AI Studio API key: https://aistudio.google.com/app/apikey

3. Backend Setup

# Create a virtual environment
python3 -m venv .venv

# Activate it
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Start the backend API server (runs on http://localhost:8000)
uvicorn api.server:app --reload --port 8000

Keep this terminal open. The backend must be running for the frontend to work.

4. Frontend Setup

Open a new terminal and run:

cd frontend
npm install
npm run dev

The frontend will start on http://localhost:3000.

5. Using the CLI (Optional)

You can also use AutoAlign directly from the command line without the frontend:

# Activate the virtual environment (if not already)
source .venv/bin/activate

# Align the sample BRD (which has intentional violations)
python main.py align examples/sample_brd.md

# Save the aligned output
python main.py align examples/sample_brd.md --output aligned_brd.md

# Query the policy knowledge base directly
python main.py query "What are the rules for storing customer IDs in logs?"

# Rebuild the knowledge base (after adding new policy docs)
python main.py rebuild-kb

Demo Scenario

The examples/sample_brd.md contains a real-world BRD with intentional violations:

Violation	Policy	Severity
`customer_id` stored in plaintext logs	Section 4.2 (PII)	CRITICAL
`user_email` stored in plaintext	Section 4.2 (PII)	CRITICAL
`ip_address` in plaintext	Section 4.1 (PII)	CRITICAL
API key hardcoded in source code	Section 5.3 (Secrets)	CRITICAL
CORS wildcard `*` in production	Section 2.2 (API Security)	HIGH
No authentication on debug endpoint	Section 2.1 (Auth)	HIGH
No rate limiting	Section 2.2 (API Security)	HIGH
Shared service account with admin access	Section 2.1 (PoLP)	HIGH
Service account key committed to Git	Section 5.3 (Secrets)	CRITICAL
Indefinite data retention	Section 4.2.3 (Minimization)	MEDIUM
`FLOAT` type for currency	Data Dictionary	MEDIUM

AutoAlign automatically detects and fixes all of these.

Using the Turgon SDK

from turgon import TurgonClient

client = TurgonClient(max_iterations=5)

# Align a BRD string
result = client.align(brd_text)

# Align a BRD file
result = client.align_file("examples/sample_brd.md")

print(result.status)           # ComplianceStatus.COMPLIANT
print(result.compliance_score) # 1.0
print(result.aligned_brd)      # The fixed, compliant BRD
print(result.compliance_report) # Human-readable report
print(result.summary())         # One-liner summary

# Query the knowledge base
answer = client.query_policy("What are PII logging rules?")
print(answer)

Future Scope

GitHub PR Integration: Hook AutoAlign into GitHub Actions to auto-check every PR
Multi-Domain Support: Legal, HR, GDPR, DPDP Act, SOC 2 policy sources
Vertex AI Vector Search: Production-scale vector store with real-time updates
BigQuery Audit Warehouse: Full audit trail of every alignment decision
Slack/Teams Notifications: Real-time compliance alerts for developers

Team: Ninja Turtles

Member	Role
Aakash Singh Rajput	Lead AI Developer (Agentic Workflows)
Tushar Kumar	Cloud Architect (GCP Integration)
Nandini Goyal	Host and Engagement Lead at OSCG
Shivani Sahu	Full Stack Developer

Event: HackFest 2.0 — GDG Cloud New Delhi Track: Agentic AI

AutoAlign makes policy invisible and compliance inevitable.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
api		api
config		config
docs		docs
examples		examples
frontend		frontend
old-frontend		old-frontend
src		src
turgon		turgon
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
list_all_gcp_models.py		list_all_gcp_models.py
list_gcp_models.py		list_gcp_models.py
list_generate_models.py		list_generate_models.py
main.py		main.py
requirements.txt		requirements.txt
sampleBRD.md		sampleBRD.md
test_models.py		test_models.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AutoAlign

The Problem

Architecture

The Debate Loop

Tech Stack

Project Structure

Getting Started

1. Clone the Repo

2. Configure Environment

3. Backend Setup

4. Frontend Setup

5. Using the CLI (Optional)

Demo Scenario

Using the Turgon SDK

Future Scope

Team: Ninja Turtles

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AutoAlign

The Problem

Architecture

The Debate Loop

Tech Stack

Project Structure

Getting Started

1. Clone the Repo

2. Configure Environment

3. Backend Setup

4. Frontend Setup

5. Using the CLI (Optional)

Demo Scenario

Using the Turgon SDK

Future Scope

Team: Ninja Turtles

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages