5.2.3.2 Sharing Data Through Data Exchange Services    (3XMF)

Using this Supplier-to-Consumer Matrix, the DRM Team analyzed the types of data interchanges between repositories (data assets, database-to-database information sharing), focusing on the Exchange Package payload and using tangible examples of such payloads. These information exchanges vary in structure depending on the data objects being exchanged. Based on this analysis, the architect may specify several types of services to support the sharing of information between databases within a collection used by a COI. These services address the data exchange element of the DRM abstract model and fall within the following categories:    (3XMG)

Extract, Transform, Load (Structured Data to Structured Data): Extract, Transform, Load (ETL) is the process of reading structured data objects from a data source (the extract), changing the format of the data objects to match the structure required by a target database (transform), and updating the target database with the transferred data objects (load). Services that perform ETL processes range from extremely simple to extremely complex. They may also be a component of other services. The payloads for all of these exchanges are structured data. This service applies to exchanges between:    (3XMH)

http://colab.cim3.net/file/work/das/DRM_2.0/DataExchangeServicesOne.JPG    (3YQ6)
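
The three ETL steps described above can be sketched in a few lines. This is only an illustration under stated assumptions, not a service defined by the DRM: the table names (source_people, target_people), the column layouts, and the sample rows are all hypothetical, and two in-memory SQLite databases stand in for the source and target data assets.

```python
import sqlite3


def extract(conn):
    """Extract: read structured rows from the (hypothetical) source table."""
    return conn.execute(
        "SELECT id, full_name, joined FROM source_people ORDER BY id"
    ).fetchall()


def transform(rows):
    """Transform: reshape each row to the structure the target requires.

    Assumed source layout: full_name as "First Last", joined as MM/DD/YYYY.
    Assumed target layout: separate name columns and an ISO 8601 date.
    """
    out = []
    for pid, full_name, joined in rows:
        first, _, last = full_name.partition(" ")
        month, day, year = joined.split("/")
        out.append((pid, first, last, f"{year}-{month}-{day}"))
    return out


def load(conn, rows):
    """Load: update the target database with the transformed rows."""
    conn.executemany(
        "INSERT INTO target_people (id, first_name, last_name, joined_iso) "
        "VALUES (?, ?, ?, ?)",
        rows,
    )
    conn.commit()


def run_etl_demo():
    # Two in-memory databases stand in for the source and target data assets.
    source = sqlite3.connect(":memory:")
    target = sqlite3.connect(":memory:")
    source.execute(
        "CREATE TABLE source_people (id INTEGER, full_name TEXT, joined TEXT)"
    )
    source.executemany(
        "INSERT INTO source_people VALUES (?, ?, ?)",
        [(1, "Ada Lovelace", "12/10/1815"), (2, "Alan Turing", "06/23/1912")],
    )
    target.execute(
        "CREATE TABLE target_people ("
        "id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT, joined_iso TEXT)"
    )
    load(target, transform(extract(source)))
    return target.execute("SELECT * FROM target_people ORDER BY id").fetchall()


if __name__ == "__main__":
    for row in run_etl_demo():
        print(row)
```

Keeping extract, transform, and load as separate functions mirrors the point that ETL may be a component of other services: each step can be reused or replaced independently.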

Publication (Structured data or documents to aggregate documents): Publication is the process of assembling a document from its component pieces, putting it into a desired format, and disseminating it to target databases. The payload of this type of service is a document. This service applies to exchanges between:    (3XMJ)

http://colab.cim3.net/file/work/das/DRM_2.0/Figure_publication.JPG    (3YYG)
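
A publication service of this kind might be sketched as follows. The function names (assemble, publish), the plain-text output format, and the use of Python lists as stand-ins for target databases are assumptions made for illustration only.

```python
def assemble(title, sections):
    """Assemble one document from its component pieces and format it.

    `sections` is a list of (heading, body) pairs. The plain-text layout
    with underlined headings is an arbitrary choice for this sketch.
    """
    lines = [title, "=" * len(title), ""]
    for heading, body in sections:
        lines += [heading, "-" * len(heading), body, ""]
    return "\n".join(lines).rstrip() + "\n"


def publish(document, targets):
    """Disseminate the finished document to every target.

    Each target is a plain list standing in for a target database.
    """
    for target in targets:
        target.append(document)
    return len(targets)


if __name__ == "__main__":
    archive, portal = [], []
    doc = assemble("Report", [("Summary", "All good.")])
    publish(doc, [archive, portal])
    print(archive[0])
```

In both the sketch and the description above, the payload that crosses the exchange is the whole assembled document, not the individual pieces it was built from.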

Entity/Relationship Extraction (Unstructured documents to structured documents or structured data objects): Entity/Relationship Extraction is the process of identifying and pulling out specified facts from documents. Entities are nouns that designate a specific person, place, or thing. Relationships are the association or affiliation of one entity to another. Typically, the entities identified during an entity/relationship extraction process may be incorporated into the source document as metadata, inserted into a separate document (such as a metadata record used to support discovery), or incorporated into a structured database. The payloads for all of these exchanges are structured data. This service applies to exchanges between:    (3XML)

http://colab.cim3.net/file/work/das/DRM_2.0/DataExchangeServicesThree.JPG    (3YQ7)
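
As a toy illustration of turning unstructured text into structured data, the sketch below pulls out a single, hypothetical relationship type ("X works for Y") with a regular expression. Production extraction services generally rely on far more capable language-processing techniques; the pattern, the relation label works_for, and the sample sentence are assumptions.

```python
import re

# Capitalized word sequences on either side of the literal phrase
# "works for". This pattern is a deliberately narrow, hypothetical rule.
PATTERN = re.compile(
    r"([A-Z][a-z]+(?: [A-Z][a-z]+)*) works for "
    r"([A-Z][A-Za-z]+(?: [A-Z][A-Za-z]+)*)"
)


def extract_relationships(text):
    """Return each matched relationship as a structured record."""
    return [
        {"subject": m.group(1), "relation": "works_for", "object": m.group(2)}
        for m in PATTERN.finditer(text)
    ]


def extract_entities(text):
    """Return the sorted, de-duplicated entities named in the relationships."""
    names = set()
    for rel in extract_relationships(text):
        names.add(rel["subject"])
        names.add(rel["object"])
    return sorted(names)


if __name__ == "__main__":
    sample = "Jane Doe works for Acme Corp. John Smith works for Globex."
    print(extract_relationships(sample))
    print(extract_entities(sample))
```

The resulting records could then be attached to the source document as metadata, written to a separate metadata record, or inserted into a structured database, matching the three destinations described above.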

Document Translation (Document to document): Document translation is the process of transforming a document from its original format to a format required to support a target application. The transformations may be structural (e.g., transforming MS Word to PDF format), language-oriented (e.g., changing English to French), or special purpose (e.g., the development of abstracts from longer documents). The payload of this type of service is a document. This service applies to exchanges between:    (3XMM)

http://colab.cim3.net/file/work/das/DRM_2.0/DataExchangeServicesFour.JPG    (3YQ9)
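
A structural translation can be illustrated with a much simpler format pair than Word to PDF. The sketch below converts plain text to minimal HTML paragraphs; the function name text_to_html and the rule that blank lines separate paragraphs are assumptions for this example only.

```python
import html


def text_to_html(text):
    """Translate a plain-text document into HTML paragraphs.

    Assumes paragraphs are separated by blank lines. Whitespace inside a
    paragraph is collapsed, and special characters are escaped so the
    content survives the change of format unaltered.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    return "\n".join(
        f"<p>{html.escape(' '.join(p.split()))}</p>" for p in paragraphs
    )


if __name__ == "__main__":
    print(text_to_html("Hello &\nworld\n\nSecond paragraph"))
```

Both the input and the output are documents, so the payload type is unchanged; only the format required by the target application differs.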