MultiLevel Pattern Miner

A modular Python engine for mining repeated structural patterns from unstructured text and turning them into LLM-ready specs, validation bundles, and review dashboards.

MultiLevel Pattern Miner identifies recurring document structures across six hierarchical levels, from phrase to document, in Markdown and plain text. It mines those patterns into reusable libraries, compiles them into machine-readable specifications, and supports downstream validation and dashboard-based review of new drafts.

Key Features

Status: Active


Knowledge Platform

A local-first, modular Python desktop application for structured knowledge work, centered on typed graphs, runtime-loadable modules, and SQLite-backed workspaces.

Knowledge Platform treats the graph as the primary artifact instead of files or folders. It provides a typed graph engine, SQLite persistence, and a config-driven plugin architecture that lets modules contribute graph types, services, projections, and UI widgets. The current alpha release ships with an Outline module for building hierarchical document structures on top of the shared platform.

Key Features

Status: Alpha


Open Knowledge Systems

A technical manual and Python reference implementation for turning Markdown-based content into structured, machine-readable knowledge assets for validation, graph projection, and publication workflows.

Open Knowledge Systems combines a design-level manuscript with a working ETL slice for schema-aware content pipelines. The repository focuses on normalizing Markdown chapters with YAML front matter into stable chunk records, projecting those records into graph-friendly JSON for Neo4j-oriented workflows, and publishing the surrounding architecture and implementation guidance as a documentation site and book-style outputs.

Key Features

Status: Active


DITA Package Processor

A deterministic, modular Python pipeline for analyzing DITA packages, generating validated migration plans, and executing them safely through bounded, plugin-driven handlers.

DITA Package Processor transforms DITA package migration into an explicit, contract-based workflow: discover -> normalize -> plan -> execute. It scans package structure and relationships, normalizes findings into planning contracts, generates deterministic action plans, and applies those plans through dry-run-first execution with optional explicit writes.

Key Features

Status: Active


DITA ETL Pipeline

A composable Python ETL pipeline for converting Markdown, HTML, and DOCX source content into structured DITA 1.3 XML, with built-in assessment, format-specific extraction, typed stage contracts, and documentation-focused output assembly.

DITA ETL Pipeline processes mixed-format documentation through four modular stages: Assess, Extract, Transform, and Load. It evaluates source files up front, converts them into intermediate DocBook, classifies and transforms them into DITA topics, and assembles final output bundles including a DITA map, topics, assets, and assessment artifacts for review.

Key Features

Status: Active


Markdown Validator

A rule-based Python validator for checking Markdown documents against declarative JSON rules, with support for front-matter policies, XPath-based body checks, workflows, and batch reporting.

Markdown Validator scans Markdown files used in static-site documentation workflows and evaluates them against reusable JSON rule sets. It validates YAML front matter and rendered document structure, supports conditional workflow chains, and can be used from both a CLI and a Python API for single-file checks, directory-wide scans, and CI validation gates.

Key Features

Status: Active


Novel Testbed

A modular Python narrative compiler that segments prose, builds scene-level contracts, infers reader-state changes, and assesses whether a novel’s structure actually moves.

Novel Testbed treats fiction as a testable system. It turns raw or annotated Markdown into structured narrative modules, compiles those modules into YAML contracts, uses optional OpenAI-backed inference to populate reader-state transitions, and validates whether each scene, exposition block, or transition produces meaningful structural change.

Key Features

Status: Active (Alpha)


Let’s Talk

Interested in contributing or adapting a solution for your own content workflow?

Get in touch →