AI Knowledge Sync System — Development Plan

Phase 01

Foundation — Single Repo Proof of Concept

Wk 1–2

Start with one repo. Build the AST parser pipeline that extracts module maps, public APIs, and exports. Generate a static KNOWLEDGE.md on demand. Validate that Claude can answer questions about the codebase using this file alone.

AST parser (JS/TS/Python)

KNOWLEDGE.md schema design

manual generation script

Claude context test

TRD/PRD link format

Phase 02

Automation — Commit-triggered Updates

Wk 3–5

Hook into CI/CD (GitHub Actions, GitLab CI, etc.). On each push, detect changed files, re-parse only those modules, diff against previous KNOWLEDGE.md, and do an incremental LLM-assisted update. Avoid full re-reads — only process what changed.

git diff detector

incremental AST re-parse

CI/CD hook integration

LLM diff summarizer

KNOWLEDGE.md versioning

changelog generation

Phase 03

Scale — Multi-Repo + Cross-Repo Index

Wk 6–8

Extend to N repos using a manifest file. Build a GLOBAL_KNOWLEDGE.md that aggregates cross-repo API contracts, shared interfaces, and inter-service dependencies. Enable impact analysis — "if I change X in repo-a, what breaks in repo-b?"

repos.yaml manifest

global aggregator

interface contract tracker

cross-repo dep graph

breaking change detection

metadata tagging (repo/module/date)

Phase 04

Intelligence — MCP Server + Live Query

Wk 9–10

Build the MCP server layer that lets Claude query knowledge on-demand rather than relying on static context injection. Support queries by repo, module, date range, or change type. Optionally connect to Notion/Confluence for TRD/PRD enrichment.

MCP server scaffold

query API design

repo federation

date/module filters

Notion/Confluence connector

semantic search (optional)

Component	Recommended	Multi-Repo	Incremental	Notes
AST Parser	tree-sitter (multi-lang)	YES	YES	Supports JS, TS, Python, Go, etc.
Knowledge Format	Structured Markdown (KNOWLEDGE.md)	YES	YES	Human-readable + AI-injectable
CI Hook	GitHub Actions / GitLab CI	YES	YES	Per-repo, triggers on push
LLM Summarizer	Claude Haiku (fast + cheap)	YES	YES	Only process changed files
Serving Layer	MCP Server (custom)	YES	PARTIAL	Phase 4 — query on demand
Vector Search	pgvector / Chroma (optional)	MAYBE	MAYBE	Only if scale demands it
Doc enrichment	Notion / Confluence API	YES	MANUAL	Link TRD/PRD to modules

Component

Recommended

Multi-Repo

Incremental

Notes

AST Parser

tree-sitter (multi-lang)

YES

Supports JS, TS, Python, Go, etc.

Knowledge Format

Structured Markdown (KNOWLEDGE.md)

YES

Human-readable + AI-injectable

CI Hook

GitHub Actions / GitLab CI

YES

Per-repo, triggers on push

LLM Summarizer

Claude Haiku (fast + cheap)

YES

Only process changed files

Serving Layer

MCP Server (custom)

YES

PARTIAL

Phase 4 — query on demand

Vector Search

pgvector / Chroma (optional)

MAYBE

Only if scale demands it

Doc enrichment

Notion / Confluence API

YES

MANUAL

Link TRD/PRD to modules

▸ Where To Start Right Now