Making Discourse Graphs Indexable & Discoverable

Idea

We want to create a clear path for folks to publish discourse graphs that others can find and reference: we want to make shared Discourse Graphs indexable and discoverable.

Data Shapes

We want to specify and promulgate a set of schemas or data shapes (for example, ShEx schemas) for Discourse Graphs, so that existing graphs can be recognized in the wild and existing data can be more easily organized into a d/graph.

Rationale: if we can automate shape creation, we can create discourse graphs, and convert existing datasets to d/graphs, at scale.
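
To make the idea of a data shape concrete, here is a minimal sketch in Python rather than ShEx. The node types (question, claim, evidence, source) and the allowed relations are assumptions chosen for illustration; this proposal does not fix a vocabulary, and the names Node, Edge and shape_errors are hypothetical.

```python
from dataclasses import dataclass

# Assumed vocabularies, for illustration only; the proposal does not define them.
NODE_TYPES = {"question", "claim", "evidence", "source"}
# Allowed (subject type, relation, object type) combinations, also assumed.
ALLOWED_EDGES = {
    ("evidence", "supports", "claim"),
    ("evidence", "opposes", "claim"),
    ("claim", "informs", "question"),
}


@dataclass(frozen=True)
class Node:
    id: str
    type: str
    text: str


@dataclass(frozen=True)
class Edge:
    source: str
    relation: str
    target: str


def shape_errors(nodes, edges):
    """Return a list of shape violations; an empty list means the graph conforms."""
    errors = []
    by_id = {}
    for node in nodes:
        if node.type not in NODE_TYPES:
            errors.append(f"node {node.id}: unknown type {node.type!r}")
        if node.id in by_id:
            errors.append(f"node {node.id}: duplicate id")
        by_id[node.id] = node
    for edge in edges:
        src, dst = by_id.get(edge.source), by_id.get(edge.target)
        if src is None or dst is None:
            errors.append(f"edge {edge.source}->{edge.target}: dangling endpoint")
            continue
        if (src.type, edge.relation, dst.type) not in ALLOWED_EDGES:
            errors.append(
                f"edge {edge.source}->{edge.target}: "
                f"{src.type} {edge.relation} {dst.type} is not allowed"
            )
    return errors
```

A checker like this could be run over candidate data found in the wild: a dataset that produces no errors is recognizable as a d/graph under the assumed shape, and the error list says what a near miss would need in order to conform.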

Namespaces

We want to map out a strategy for creating discourse graph namespaces: a namespace practice that includes

a unique mapping of an identifying string to an instance of an item
a way of capturing & indicating timestamped snapshots of the referent
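
The two requirements above could be met by an identifier that pairs a unique name with an optional snapshot timestamp. The sketch below is one possible encoding, not an agreed format: the "dg:" prefix, the "namespace/name@timestamp" layout, and the names GraphRef and parse_ref are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional

STAMP_FORMAT = "%Y%m%dT%H%M%SZ"  # compact UTC timestamp, an assumed convention


@dataclass(frozen=True)
class GraphRef:
    namespace: str                       # who publishes the graph
    name: str                            # identifying string, unique per namespace
    snapshot: Optional[datetime] = None  # None refers to the latest version

    def __str__(self):
        base = f"dg:{self.namespace}/{self.name}"
        if self.snapshot is None:
            return base
        stamp = self.snapshot.astimezone(timezone.utc).strftime(STAMP_FORMAT)
        return f"{base}@{stamp}"


def parse_ref(text):
    """Parse 'dg:namespace/name' or 'dg:namespace/name@YYYYMMDDTHHMMSSZ'."""
    if not text.startswith("dg:"):
        raise ValueError(f"not a d/graph reference: {text!r}")
    path, _, stamp = text[3:].partition("@")
    namespace, sep, name = path.partition("/")
    if not sep or not namespace or not name:
        raise ValueError(f"expected namespace/name in {text!r}")
    snapshot = None
    if stamp:
        snapshot = datetime.strptime(stamp, STAMP_FORMAT).replace(tzinfo=timezone.utc)
    return GraphRef(namespace, name, snapshot)
```

A reference without a timestamp points at the current state of the referent, while a reference with one pins a specific snapshot, so a citation can stay stable even as the graph changes.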

We believe this implies developing a community of practice around namespace resolution: resolving the situation where graphs overlap and conflict, but there is broad agreement that they address the same topic. This could be as simple as appending further descriptors when there is a potential name collision, Wikipedia style. This project might involve creating or discovering a clustering rule: a way of determining or declaring the family of related d/graphs for any focal d/graph. Ideally, we'd also like to align on a practice for attaching metadata to a d/graph.
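
As a rough illustration of the simplest case, the sketch below appends a parenthetical descriptor on collision and then uses the shared base name as a naive clustering rule. Both rules are assumptions for the sake of example; a real community practice would likely need something richer than string matching, and the function names are hypothetical.

```python
import re
from collections import defaultdict

# Matches a trailing parenthetical descriptor such as " (rodent studies)".
_DESCRIPTOR = re.compile(r"\s*\([^()]*\)$")


def disambiguate(name, descriptor, taken):
    """Return name unchanged if it is free, else 'name (descriptor)'."""
    if name not in taken:
        return name
    candidate = f"{name} ({descriptor})"
    if candidate in taken:
        raise ValueError(f"{candidate!r} is also taken; choose another descriptor")
    return candidate


def base_name(name):
    """Strip one trailing descriptor and normalize case."""
    return _DESCRIPTOR.sub("", name).strip().lower()


def cluster(names):
    """Group d/graph names into families that share a base name."""
    families = defaultdict(list)
    for name in names:
        families[base_name(name)].append(name)
    return dict(families)
```

Under this rule, the family of a focal d/graph is simply every graph whose name reduces to the same base name, which mirrors how a disambiguation page collects related titles.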

Implementation

We think a good start to this process will be to

define a templateable unit for a discourse graph
write instructions for how to generate a stub
and set up a wiki for d/graphs to start working on the namespace and discovery issues
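
One way the first two steps might look in practice is a small script that emits a stub from a template. This is only a sketch: the JSON layout, the field names, the single seed question node, and the functions make_stub and write_stub are assumptions for illustration, not a proposed standard.

```python
import json
from datetime import datetime, timezone


def make_stub(name, question, created=None):
    """Build a minimal d/graph stub: metadata plus one seed question node."""
    created = created or datetime.now(timezone.utc)
    return {
        "name": name,
        "created": created.isoformat(),
        # Placeholder metadata fields, to be filled in by the publisher.
        "metadata": {"description": "", "maintainers": []},
        "nodes": [{"id": "q1", "type": "question", "text": question}],
        "edges": [],
    }


def write_stub(path, stub):
    """Write the stub as indented JSON so it is easy to edit by hand."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(stub, handle, indent=2)
        handle.write("\n")


if __name__ == "__main__":
    stub = make_stub("sleep-and-memory", "Does sleep improve memory consolidation?")
    write_stub("sleep-and-memory.json", stub)
```

A stub like this gives a new publisher a file with the right shape to fill in, and gives the wiki a uniform unit to list and link.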