Skip to content

Entrpi/autonomy-golf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

autonomy-golf

Autonomy Golf Badge

This repo is playing autonomy golf on itself. It uses CHANGELOG.md, a parser, and a badge to keep score as it tries to drive down toward total automation, or hole-in-one games; with clear evidence.

Canonical GitHub home: Entrpi/autonomy-golf

Autonomy golf works best with an agent-managed CHANGELOG.md that stays readable to humans while remaining structured enough for parsers, badges, and charts. This repository packages the working pieces:

Autonomy golf makes the push toward total automation fun to play and meaningful to trust by forcing clear agreement on a change's meaning, motivation, and purpose.

Autonomy golf works because the Score and its Grounding are managed with agent integration and tooling in a gamified loop that also helps clarify project purpose and change motivation:

  • Score: how autonomous the change was
  • Grounding: how well the change was validated

Grounding is intentionally unscored. It is the evidence layer that keeps autonomy golf useful for validation and real decision-making.

Scoring tiers:

Tier Meaning Score
Fully human the change was identified and authored by a human, with the agent absent or limited to review and minor revisions 6
Human-driven the human identified the change and specified it tightly enough that the agent mostly executed 5
Human-directed, AI-shaped the human set the direction, but the agent designed the concrete mechanism or validation 4
AI-identified within brief, human-shaped the agent surfaced the opportunity inside a broad human brief, and the human materially reshaped it 3
AI-identified within brief, human-approved the agent surfaced the opportunity inside a broad human brief, and the human approved it with little reshaping 2
Self-initiated, human-approved the agent initiated the change outside explicit human direction, but still got approval before landing 1
Fully autonomous the agent initiated the change without explicit human direction or approval in the thread 0

Current project snapshot from CHANGELOG.md:

Metric Value
Mean autonomy score 4.00 / 6
Mean complexity 10.17 / commit
Mean score per top-level bullet 3.86 / 6
History covered 6 commits across 3 subsystems

Document Map

Start Here

The fastest way to adopt the system in another codebase is:

  1. Point your agent at docs/autonomy-golf-agent.md.
  2. Copy or adapt the changelog structure from CHANGELOG.md.
  3. Copy or adapt the tooling under tools/.
  4. Add the badge to your README and keep it refreshed from the parser output.

If you are starting from outside this repository, the canonical source for those files is Entrpi/autonomy-golf.

Once autonomy golf is already installed in a project, the shorter maintenance instruction is: read and follow docs/autonomy-golf-checklist.md.

Core Ideas

  • the central game is to push the project toward total automation in a way that stays enjoyable, legible, and worth trusting
  • Score is the bounded autonomy signal on a 0..6 scale. Lower is better.
  • Complexity is the scope signal: top-level provenance weights plus a +1 bonus for each nested sub-bullet under provenance items scored 3 or higher, excluding Meaning:, Motivation:, and Purpose: narrative lines.
  • top-level provenance bullets are the scored units
  • good entries do more than log tasks; they build explicit consensus on a change's meaning, motivation, and purpose
  • Grounding is the separate validation signal: files changed, checks run, and measured effects. It is intentionally unscored so validation strength stays distinct from autonomy level.
  • subsystem-prefixed commit headers make the history analyzable by subsystem as well as by day

Example Entry

### March 9, 2026 — `abc1234` — parser: Add subsystem rollup output — score `3` — complexity `7`

**Human-directed, AI-shaped (4)**

- Requested subsystem-level autonomy reporting so the project can see where human direction is still concentrated.
  - Meaning: autonomy trends should be inspectable by subsystem, not only at whole-project level.
  - Motivation: aggregate project scores hide where the real human bottlenecks still are.
  - Purpose: make the score actionable enough to steer future autonomy work.
  - Added a subsystem rollup mode to the parser.

**AI-identified within brief, human-approved (2)**

- Added a matching README summary command so the new rollup is visible and easy to use.

**Grounding**

- Files:
  - `tools/changelog_scores.py`
  - `README.md`
- Validation:
  - `python3 -m py_compile tools/changelog_scores.py`
  - `python3 tools/changelog_scores.py --group-by subsystem --format csv`
- Measurements:
  - This is tooling work, so there are no runtime performance measurements.

Commands

Project-level rollup:

python3 tools/changelog_scores.py --group-by overall --format csv --include-latest

Per-entry verification:

python3 tools/changelog_scores.py --group-by entry --format csv --include-latest --verify

Refresh the badge and README snapshot:

python3 tools/render_autonomy_badge.py

Repository Layout

CHANGELOG.md                 starter changelog template and example history
docs/autonomy-golf.md        public explainer
docs/autonomy-golf-agent.md  reusable agent integration brief
docs/autonomy-golf-checklist.md
                             concise maintenance checklist
tools/changelog_scores.py    parser and rollup tool
tools/render_autonomy_badge.py
                             badge renderer

About

Autonomy Golf is a game you play with your projects: hole-in-one is total automation of a commit.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages