Copyright © 2008 by Fundación CTIC. This work
is licensed under CreativeCommons
Attribution-Share Alike 3.0, so you are granted to use, copy, distribute, transmit and adapt
this work, but you must attribute the work to the original authors and distribute the resulting
work under similar terms.
This work is supported by Vulcano project and is partially funded by the Spanish Ministry of Industry, Tourism and Commerce under the National Plan for Scientific Research, Development and Technological Innovation 2007-2010 (grant number TSI-020301-2008-22).
The DOCOM Ontology provides the main concepts and properties required to describe compound documents made from smaller fragments that are assembled to produce the final document. This report contains a detailed description of the DOCOM Ontology.
NOTE: This section describes the status of this document at the time of its publication. Other documents may supersede this document.
This is a work in progress! This document is changing on a daily if not hourly basis. Comments are very welcome, please send them to the authors. Thank you.
The keywords "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
Namespace URIs of the general form "http://www.example.com/" represent some application-dependent or context-dependent URI as defined in RFC 2396.
The XML Namespace URIs that MUST be used by implementations of this specification is:
An alphabetical index of DOCOM terms, by class (concepts) and by property (relationships, attributes), are given below. All the terms are hyperlinked to their detailed description for quick reference.
Classes: Adapter, Annotation, Document, DocumentaryProject, DocumentaryWorkproduct, File, Format, Gap, InputAdapter, IntermediateLanguageDocument, OutputAdapter, PersonProjectRoleAssociation, ProjectRole, SourceDocument, Tag, Ticket, View, Workflow,
Properties: annotates, hasAdapter, hasDocumentaryWorkproduct, hasFile, hasFormat, hasGap, hasMainFile, hasOnlineAccount, hasProject, hasProjectRole, hasRootGap, hasTag, hasTicket, hasWorkflow, inputFormat, isFilledBy, isFormatOf, isProducedBy, managedBy, outputFormat,
The DOCOM Ontology definitions presented in this report are written using a computer language ([OWL]) that makes it easy for software to process some basic facts about the terms in the ontology, and consequently about the things described using the DOCOM vocabulary.
The following figure summarizes the concepts that are part of DOCOM and their relationships. Some auxiliary concepts are referenced by DOCOM, such as rdf:Resource (from [RDF Schema]) and foaf:Person (from [FOAF]), although they are not part of the DOCOM ontology.
What follows is a very simple description of a compound document. Note that some resources are not completely described here for the sake of simplicity.
<rdf:RDF ... >
<DocumentaryProject rdf:about="http://example.org/projects/vulcano-partners">
<hasDocumentaryWorkproduct>
<DocumentaryWorkproduct rdf:about="http://example.org.org/projects/vulcano-partners/dwps/1">
<hasFormat rdf:resource="http://example.org/formats/xdoc"/>
<hasRootGap>
<Gap rdf:about="http://example.org.org/projects/vulcano-partners/dwps/1/rootGap">
<isFilledBy>
<IntermediateLanguageDocument rdf:about="http://example.org/intermediate_documents/1">
<hasFormat rdf:resource="http://example.org/formats/xdoc"/>
<isFormatOf>
<SourceDocument rdf:about="http://example.org/fragments/1">
<hasMainFile rdf:resource="http://www.example.org/20081103.xml"/>
<hasFormat rdf:resource="http://example.org/formats/xml"/>
<hasAdapter about="http://example.org/inputadapters/fromXml"/>
</SourceDocument>
</isFormatOf>
<hasGap>
<Gap rdf:about="http://example.org.org/fragments/1/gaps/2">
<isFilledBy rdf:resource="http://example.org/intermediate_documents/2"/>
</Gap>
</hasGap>
</IntermediateLanguageDocument>
</isFilledBy>
</Gap>
</hasRootGap>
</DocumentaryWorkproduct>
</hasDocumentaryWorkproduct>
</DocumentaryProject>
</rdf:RDF>
URI: http://idi.fundacionctic.org/docom#Adapter
Adapter - a software element that converts a document from one format to another.
URI: http://idi.fundacionctic.org/docom#Annotation
Annotation - the action of annotating a resource with a tag by a user.
URI: http://idi.fundacionctic.org/docom#Document
Document - a writing item that contains information designed and presented as an individual entity.
URI: http://idi.fundacionctic.org/docom#DocumentaryProject
Documentary project - an initiative that has as objective to achieve documentary workproducts.
URI: http://idi.fundacionctic.org/docom#DocumentaryWorkproduct
Documentary workproduct - input/output documents obtained as a result of a workflow activity and potentially generated as views. (comes from SPEM 2.0 by OMG).
URI: http://idi.fundacionctic.org/docom#File
File - a block of arbitrary information, or resource for storing information.
URI: http://idi.fundacionctic.org/docom#Format
Format - the way in which data is stored or held to be worked on or just to be shown.
URI: http://idi.fundacionctic.org/docom#Gap
Gap - a space that can be fulfilled by a document.
URI: http://idi.fundacionctic.org/docom#InputAdapter
Input adapter - the adapter in charge of the input documents.
URI: http://idi.fundacionctic.org/docom#IntermediateLanguageDocument
Intermediate language document - documents formated in an intermediate language.
URI: http://idi.fundacionctic.org/docom#OutputAdapter
Output adapter - the adapter in charge of the output documents.
URI: http://idi.fundacionctic.org/docom#PersonProjectRoleAssociation
Person/project/role association - the association of three entities: person, role and project.
URI: http://idi.fundacionctic.org/docom#ProjectRole
Project role - the role of a person involved in a project.
URI: http://idi.fundacionctic.org/docom#SourceDocument
Source document - an origin document from which the new one derives or is obtained.
URI: http://idi.fundacionctic.org/docom#Tag
Tag - a keyword or term associated with or assigned to a piece of information.
URI: http://idi.fundacionctic.org/docom#Ticket
Ticket - a service request, used to track complaints or requests that an issue be handled.
URI: http://idi.fundacionctic.org/docom#View
View - a possible displayed view of a generated document.
URI: http://idi.fundacionctic.org/docom#Workflow
Workflow - a depiction of a sequence of operations.
URI: http://idi.fundacionctic.org/docom#annotates
annotates - the act of annotating some resource.
URI: http://idi.fundacionctic.org/docom#hasAdapter
has adapter - a property that brings together a document with an adapter able to transform it into a new format.
URI: http://idi.fundacionctic.org/docom#hasDocumentaryWorkproduct
has documentary workproduct - a documentary project produces, as a result of its activity, a documentary workproduct.
URI: http://idi.fundacionctic.org/docom#hasFile
has file - a document has a physical representation as one or more related files.
URI: http://idi.fundacionctic.org/docom#hasFormat
has format - a document or a file is enconded in one specific format.
URI: http://idi.fundacionctic.org/docom#hasGap
has gap - a document can have structural gaps that could be filled by other documents.
URI: http://idi.fundacionctic.org/docom#hasMainFile
has main file - a source document and a view (being both documents) have, among all the integrating resources, one considered as main file.
URI: http://idi.fundacionctic.org/docom#hasOnlineAccount
has on-line account - indicates the name (identifier) associated with this online account.
URI: http://idi.fundacionctic.org/docom#hasProject
has project - a person having a role is involved in some project.
URI: http://idi.fundacionctic.org/docom#hasProjectRole
has project role - a person involved in a project has a role.
URI: http://idi.fundacionctic.org/docom#hasRootGap
has root gap - a documentary workproduct has a single root gap.
URI: http://idi.fundacionctic.org/docom#hasTag
has tag - the act of annotating involves some tag.
URI: http://idi.fundacionctic.org/docom#hasTicket
has ticket - an annotation could refer to a ticket.
URI: http://idi.fundacionctic.org/docom#hasWorkflow
has workflow - a document has a workflow.
URI: http://idi.fundacionctic.org/docom#inputFormat
input format - the adapter has an input format (of the document to be processed).
URI: http://idi.fundacionctic.org/docom#isFilledBy
is filled by - a gap is filled by a document.
URI: http://idi.fundacionctic.org/docom#isFormatOf
is format of - Created as subproperty of dcterms:isFormatOf in order to be more specific regarding domain and range.
URI: http://idi.fundacionctic.org/docom#isProducedBy
is produced by - a document that has been transformed into a new format is produced by an adapter.
URI: http://idi.fundacionctic.org/docom#managedBy
managed by - a documentary project, an intermediate language document and its annotation are managed by an on-line account.
URI: http://idi.fundacionctic.org/docom#outputFormat
output format - the adapter has an output format (of the document to be processed).
The following namespaces are referenced by the DOCOM ontology:
| Prefix | XML Namespace | Specification |
|---|---|---|
| dcterms | http://purl.org/dc/terms/ | The Dublin Core Metadata Initiative Metadata Terms |
| foaf | http://xmlns.com/foaf/0.1/ | Friend of a Friend (FOAF) Vocabulary |
| owl | http://www.w3.org/2002/07/owl# | OWL Web Ontology Language |
| rdf | http://www.w3.org/1999/02/22-rdf-syntax-ns# | Resource Description Framework (RDF) |
| rdfs | http://www.w3.org/2000/01/rdf-schema# | RDF Vocabulary Description Language 1.0: RDF Schema |
| xsd | http://www.w3.org/2001/XMLSchema# | XML Schema (Datatypes) |
Regarding the DOCOM ontology itself, the prefix docom is suggested as an alias for the namespace http://idi.fundacionctic.org/docom#, although applications may use any other prefix of their choice.
2009-05-11: Changed the domain of the property managedBy.
2008-12-10: Initial version of the DOCOM Ontology specification.
This document was produced with the participation of the Vulcano project consortium (by alphabetical order):