Contextor

Contextor converts existing documentation trees (e.g. Next.js, Tailwind CSS) into Model Context Protocol (.mdc) files optimised for LLMs.

In Phase 1, it walks a target docs directory from another repo, cleans and normalises Markdown/MDX, performs optional token-saving compression, and writes results to a separate repository named sourcedocs under {source-slug}/.
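The Phase 1 flow described above can be pictured with a minimal sketch. This is not Contextor's actual implementation: the function names (`iter_docs`, `render_mdc`, `write_if_changed`, `convert_tree`), the front-matter layout, and the choice to mirror the source tree under the output directory are all assumptions made for illustration.

```python
from __future__ import annotations

import hashlib
from pathlib import Path
from typing import Iterator

DOC_SUFFIXES = {".md", ".mdx"}


def iter_docs(src: Path) -> Iterator[Path]:
    """Yield Markdown/MDX files under src in a stable (sorted) order."""
    for path in sorted(src.rglob("*")):
        if path.is_file() and path.suffix.lower() in DOC_SUFFIXES:
            yield path


def render_mdc(body: str, repo: str, ref: str, rel_path: str) -> str:
    """Prefix the body with a small front-matter block (hypothetical layout)."""
    front_matter = "\n".join(
        ["---", f"repo: {repo}", f"ref: {ref}", f"path: {rel_path}", "---", ""]
    )
    return front_matter + "\n" + body.strip() + "\n"


def write_if_changed(target: Path, text: str) -> bool:
    """Write text only when its SHA-256 differs from the file already on disk."""
    data = text.encode("utf-8")
    if target.exists():
        if hashlib.sha256(target.read_bytes()).digest() == hashlib.sha256(data).digest():
            return False  # identical content: leave the file (and Git) untouched
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(data)
    return True


def convert_tree(src: Path, out: Path, repo: str, ref: str) -> list[Path]:
    """Convert every doc under src and return the files that were (re)written."""
    written = []
    for doc in iter_docs(src):
        rel = doc.relative_to(src)
        target = (out / rel).with_suffix(".mdc")
        text = render_mdc(doc.read_text(encoding="utf-8"), repo, ref, rel.as_posix())
        if write_if_changed(target, text):
            written.append(target)
    return written
```

Running `convert_tree` twice over an unchanged source tree returns an empty list the second time, which is the behaviour the "writes only when content changes" feature describes.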

Phase 2 adds a Python MCP server to serve from sourcedocs. Phase 3 (future) may add polite web scraping for docs that aren’t hosted in public repos.

Key Features

Repo docs → .mdc – Recurses through Markdown/MDX docs, adds rich front-matter (repo, ref, path, canonical URL), and outputs .mdc files ready for agents.
MD/MDX aware normalisation – Strips MDX imports/exports & common JSX wrappers; fixes headings, code fences, tables, and links.
Token-saving compression (optional) – Compacts large code/JSON blocks with safe elision, lowering token costs while preserving meaning.
Deterministic & idempotent – Stable slugs and content hashes; writes only when content changes. Git holds the history.
Indexing for fast lookup – Appends a manifest entry to {source-slug}/index.jsonl for each emitted .mdc.
Roadmap ready – Phase 2 MCP server that serves from sourcedocs/{source-slug}/; Phase 3 optional scraping for non-repo docs.
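Three of the features above (MDX-aware normalisation, token-saving elision, and the JSON Lines manifest) can be illustrated with a rough sketch. The helper names, the elision marker text, the thresholds, and the manifest fields are hypothetical; Contextor's real rules are likely more thorough (for example, this version only handles single-line import/export statements and would also drop a prose line that happens to start with "import ").

```python
import json
from pathlib import Path


def strip_mdx_statements(text: str) -> str:
    """Drop single-line MDX import/export statements that sit outside code fences."""
    kept = []
    in_fence = False
    for line in text.splitlines():
        if line.lstrip().startswith("```"):
            in_fence = not in_fence
        elif not in_fence and line.startswith(("import ", "export ")):
            continue  # MDX statement: not useful to an LLM reader
        kept.append(line)
    return "\n".join(kept) + "\n"


def elide_long_fences(text: str, max_lines: int = 40, keep: int = 10) -> str:
    """Shorten fenced blocks longer than max_lines, keeping `keep` head and tail lines."""
    out, block, in_fence = [], [], False
    for line in text.splitlines():
        is_fence = line.lstrip().startswith("```")
        if is_fence and not in_fence:
            in_fence = True
            out.append(line)
        elif is_fence and in_fence:
            in_fence = False
            if len(block) > max_lines and len(block) > 2 * keep:
                omitted = len(block) - 2 * keep
                marker = f"... ({omitted} lines omitted) ..."
                block = block[:keep] + [marker] + block[len(block) - keep:]
            out.extend(block)
            out.append(line)
            block = []
        elif in_fence:
            block.append(line)
        else:
            out.append(line)
    out.extend(block)  # unterminated fence: keep its lines untouched
    return "\n".join(out) + "\n"


def append_manifest_entry(index_path: Path, entry: dict) -> None:
    """Append one JSON object per line (JSON Lines) to the manifest file."""
    with index_path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry, sort_keys=True) + "\n")
```

Keeping both the head and the tail of a long block is one possible reading of "safe elision": the opening lines usually carry the setup and the closing lines the result, so the shortened block still shows its shape.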

Quick Start

Get running in under 5 minutes:

Clone the repositories

# Tooling
git clone <contextor-repository-url>
cd contextor

# Storage target (holds emitted .mdc files)
git clone <sourcedocs-repository-url> ../sourcedocs

Install dependencies

# Using Poetry
poetry install

Configure the environment

# No .env is required for Phase 1.
# (Optional) Create config/optimize.yaml for include/exclude rules and topics.

Run the conversion

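# Convert a docs directory from another repo into .mdc files inside sourcedocs/{source-slug}.
# Assumes the source repo is already checked out (here under ../vendor/nextjs);
# replace {source-slug} with the slug you want to use for this source.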
poetry run contextor optimize \
  --src ../vendor/nextjs/docs \
  --out ../sourcedocs/{source-slug} \
  --repo vercel/next.js --ref main \
  --topics "framework,nextjs,prompt-engineering"

# Commit results to sourcedocs
cd ../sourcedocs
git add {source-slug}
# Commit and push only when something is staged
git diff --cached --quiet || { git commit -m "chore(context): refresh MDC" && git push; }

You're ready to go! The .mdc files now live in sourcedocs/{source-slug}/ and can be consumed by agents (e.g. Promptman).
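On the consuming side, an agent could locate files through the manifest rather than scanning the directory. The sketch below assumes each line of `index.jsonl` is a JSON object with a `path` field and an optional `topics` list; those field names are guesses for illustration, not a documented schema.

```python
from __future__ import annotations

import json
from pathlib import Path
from typing import Iterator


def iter_manifest(index_path: Path) -> Iterator[dict]:
    """Yield one dict per non-empty line of a JSON Lines manifest."""
    with index_path.open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:
                yield json.loads(line)


def find_by_topic(index_path: Path, topic: str) -> list[dict]:
    """Return entries whose (hypothetical) 'topics' list contains the topic."""
    return [e for e in iter_manifest(index_path) if topic in e.get("topics", [])]
```

For example, `find_by_topic(Path("index.jsonl"), "nextjs")` would return only the entries tagged with that topic, and entries without a `topics` field are simply skipped.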

Optional: Run Advanced Content Intelligence

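# Install intelligence dependencies (run from the contextor checkout)
poetry install --extras intelligence

# Run intelligence analysis on the generated .mdc files.
# Stay in the contextor checkout so Poetry can locate the project,
# and point --source-dir at the sourcedocs checkout.
poetry run contextor intelligence \
  --source-dir ../sourcedocs/{source-slug} \
  --features topic-extraction,cross-linking,quality-scoring,duplicate-detection

# Switch to the storage repository
cd ../sourcedocs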

# Commit enhanced results
git add {source-slug}
git diff --cached --quiet || { git commit -m "feat: add content intelligence" && git push; }

In Phase 2 you'll run the MCP server to serve these files; in Phase 3 you may add optional scraping.

Documentation

Comprehensive documentation is available in the docs/ directory:

Getting Started – Detailed setup and first steps
Architecture – System design and technical decisions
FAQ – Common questions and answers
Troubleshooting – Solutions to common issues

Contributing

We welcome contributions! Please see CONTRIBUTING.md for:

Development workflow and guidelines
Code standards and quality requirements
How to submit changes and get them reviewed

License

This project is licensed under the MIT License. See LICENSE for details.
