Two of the major goals of the Federal Enterprise Architecture (FEA) are information sharing and the improved effectiveness of federal IT investments. Achieving these goals requires the ability to identify and use common data across the federal government. The Office of Management and Budget (OMB) has developed the Data Reference Model (DRM) to assist in the identification and use of common data. The DRM’s primary purpose is to promote the common identification, use, and appropriate sharing of data/information across the federal government. (2IVA)
To achieve this purpose, the DRM describes three basic standardization areas: (2IVB)
• Categorization of data (2IVC)
• Exchange of data (2IVD)
• Structure of data (2IVE)
Information sharing can be enabled through the common categorization and structure of data. By understanding the business context of data, DRM users will be able to communicate more accurately about the content and purpose of the data they require. This improved communication on the content and purpose of data will improve the ability to share information throughout the federal government. Exhibit A illustrates the basic areas of the DRM. (OI)
Exhibit A: DRM Structure (2IVF)
Categorization of Data: The DRM establishes an approach to the categorization of data through the use of a concept called Business Context. The business context represents the general business purpose of the data. The business context uses the FEA Business Reference Model (BRM) as its categorization taxonomy. (2IVG)
Exchange of Data: The exchange of data is enabled by the DRM’s standard message structure, called the Information Exchange Package. The information exchange package represents an actual set of data that is requested or produced from one unit of work to another. The information exchange package makes use of the DRM’s ability to both categorize and structure data. (2IVH)
Structure of Data: To provide a logical approach to the structure of data, the DRM uses a concept called the Data Element. The data element represents information about a particular thing, and can be presented in many formats. The data element is aligned with the business context, so that users of an agency’s data understand the data’s purpose and context. The data element is adapted from the ISO/IEC 11179 standard. (2IVI)
Potential Outcomes of the DRM: (2J07)
The DRM’s primary purpose is to promote the common identification, use, and appropriate sharing of data/information across the federal government. (2IVJ)
Effective use of the DRM’s approach may facilitate several outcomes, including: (2IVK)
• Improvement of federal agencies’ ability to share information (2IVL)
• Improvement of the effectiveness of federal IT investments (2IVM)
Information sharing is improved through the use of an integrated DRM. In order to successfully share information, users must fully understand the context or business purpose of the information that is needed or produced. Using the DRM’s business context facilitates an agency’s ability to categorize its data in a common way. Once the categorization of the information is understood, agencies can use the DRM’s structure to consistently describe the actual data element. In this way, the DRM’s information exchange package uses the common approach to the categorization and structure of data to facilitate the sharing of information. (2IVN)
Adopting the DRM’s approach to the categorization, structure, and exchange of data promotes the effectiveness of agency IT investments. Agencies that define and categorize their data using a common approach can identify IT applications that meet user requirements prior to proposing new IT investments. Agencies that exchange their data in a common structure increase the likelihood that other agencies can re-use IT investments as shared services within their own architectures. (2IVO)
In addition, use of the DRM supports legislative requirements such as the Information Quality Act (IQA) See Footnote 1. The IQA, for example, establishes guidelines that are focused on the management and quality of data generated by federal agencies. Effective data management requires that agencies fully understand the use and purpose of the data they are managing. Furthermore, agencies can only manage the quality of their data if they can validate that its content and structure are accurate. Exhibit B illustrates several additional technical and business outcomes. (2IVP)
Exhibit B: Outcomes of the DRM (2IVQ)
Use of this Document (2J05)
Volume I of the DRM (this document) establishes a high-level overview of what the DRM is in the context of its ability to support a common approach to the categorization, exchange, and structure of data. The following table illustrates the content of each section within Volume I of the DRM and the type of user to which it applies. (2IVR)
DRM Roadmap (2J0B)
This is the first of four volumes of the DRM. Future volumes of the DRM will focus on providing users with more details regarding the categorization, exchange, and standardization of data. OMB will also release a complementary Data Management Strategy. This strategy document results from collaboration between the CIO Council and the FEA PMO. It will address the governance and management of data. (2J0C)
The following table is a guide to the use of future volumes of the DRM. The table illustrates which volumes that users of the DRM will find most applicable. Detailed information regarding each volume is contained within Section 5 of this document. (2IVU)
In addition to collaborating with the CIO Council, OMB will continue to develop the DRM through a series of ongoing pilots with various federal agencies and programs. Pilots represent existing government programs that will contribute to the development and refinement of future volumes of the DRM. (2IVW)
=========================================================================================================== (2JQW)
Comments (2JQX)
- ExtendedAgencyComments_Including_MDA_AOA_SemanticWeb_Ontologies_OpenSourceInfrastructure_2004_01_22 (2LU3)
- ExtendedAgencyComments_Including_Defining_Data_Elements_Semantic_Web_and_Model_Driven_Architecture_2004_01_22 (2K7W)
- ExtendedAgencyComments_Including_DRM_Requirements_ValidationOfScope_and_Data_Attributes_2004_01_22 (2KBW)
- ExtendedAgencyComments_Including_Vol_I_II_and_ISO_11179_2004_01_22 (2KH4)
- ExtendedAgencyComments_Including_Security_and_Privacy_and_Semantic_Interoperability_2004_01_22 (2KI2)
- ExtendedAgencyComments_Including_Clarity_of_Purpose_and_Data_Management_Strategy_2004_01_22 (2KIL)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JQY)
- Comment: A primary issue with the DRM is one of intent. If the goal is to develop a standardized methodology and taxonomy for data classification and definition that is already underway in several venues and this effort should simply coordinate with those. If it is meant to provide a classification and reusability strategy, that too is being done already. There are two fundamental issues involved here. The first is standardization of metadata. This is a goal long sought after but not achieved because of the sheer magnitude of the effort involved. The second issue is one of encapsulation whereby data and information on how to interact with the data is passed as a single entity. This allows for true data driven system development, which greatly simplifies development, but again it has a very high startup cost in time. To work, XML schemas or other available technologies must first be developed and standardized throughout the government AS WELL AS anyone that interchanges data with the Government. Once done, software can be quickly developed because of the predictable nature of the data it will handle. The bottom line is that if the DRM does not seek to accomplish one or the other of these goals, what is its purpose and how does it advance the interaction of the Government and all its partners, stakeholders and end-users. (agency comment / compiled 2004.12.06) (2JQZ)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2K18)
- Comment: The model should become a "National" model rather than a "Federal" model. That means getting state and local reps involved in the process before rolling it out. (agency comment / compiled 2004.12.06) (2K19)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JY9)
- Comment: The stated purpose in the FEA DRM is to promote data and info sharing across the federal government. It highlights the information exchange requirements within communities of practice. The actual scope of information sharing is across federal, state, and local governments since a community of practice can include members beyond just federal agencies. The actual scope of the FEA DRM is inconsistent with what is stated in the final draft. (agency comment / compiled 2004.12.06) (2JYA)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2K51)
- Comment: The idea of sharing data is of course not new. There have been many efforts to share data within work groups, companies, government agencies, whole industries, and some efforts to share data globally. Progress in sharing data has been uneven, often following the pattern of two steps forward and one step backward. (2K52)
- One of the issues that has hindered progress is the difficulty of obtaining broad agreement on data definitions. As a sharing effort expands from a small local work group to a wider scope, the human, organizational, and technical barriers to agreement mount. (2K53)
- This is not so say that we should not undertake sharing efforts in the federal government. However, based on past experience, it would be wise to distinguish the most modest and hence most achievable objectives, while laying the groundwork for more ambitious objectives. (agency comment / compiled 2004.12.06) (2K54)
- Comment: The idea of sharing data is of course not new. There have been many efforts to share data within work groups, companies, government agencies, whole industries, and some efforts to share data globally. Progress in sharing data has been uneven, often following the pattern of two steps forward and one step backward. (2K52)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2K55)
- Comment:It is necessary to distinguish between three forms of sharing in the data world: (2K56)
- Having a common means for defining data elements (2K57)
- Having common data element definitions (2K58)
- Having common data, i.e. common instances of data elements (2K59)
- For example, two healthcare organizations might conceivably share the same means for defining data elements, but that does not necessarily mean that they share specific data element definitions. On the other hand, suppose they do in fact share the definitions of some data elements that describe an immunization schedule; that does not necessarily mean that they share actual immunization schedules that realize the data element definitions. (2K5A)
- It is important to clearly differentiate among these three levels of commonality in order to separate concerns properly in a data architecture, and in order to allow organizations to develop an organized approach to gradually progressing from relatively easy to more difficult tasks. In any of its discussions, the DRM must make it crystal clear which of these concerns it is talking about. (agency comment / compiled 2004.12.06) (2K5B)
- Comment:It is necessary to distinguish between three forms of sharing in the data world: (2K56)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JWI)
- Comment: The structure of associations between data is required for data interoperability at a semantic level. The DRM needs to be extended to address associations between data. (agency comment / compiled 2004.12.06) (2JWJ)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JRA)
- Comment: The initial release is a good description of a framework for data, but how it can be used to assist agencies with collaboration processes could be further explained. With multiple communities of practice functioning throughout the federal government, the FEA DRM document does not seem enough to manage the existing multiple taxonomies or assist agencies in the engagement of collaboration work in progress within each community of practice. How are communities of practice identified and recognized? How are collaboration efforts discovered? Will data taxonomies of different communities of practice be made visible and accessible in a mechanism such as a registry? How will conflicting taxonomies be resolved, as required? What role do data stewards have, if any, in the FEA DRM particularly with reference to management of data objects/elements that are common across agencies? A business strategy associated with the implementation of the FEA DRM would help clarify these and other questions. (agency comment / compiled 2004.12.06) (2JRB)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2K10)
- Comment: It would be helpful if the DRM recommended the process for implementing a Data Stewardship Program. (agency comment / compiled 2004.12.06) (2K11)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JVF)
- Comment: It is difficult to identify the true technical (IT) and program (Business) impact from this "high-level" DRM document. As other volumes become available more detailed comments should be possible. (agency comment / compiled 2004.12.06) (2JVG)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JRC)
- Comment: It is very difficult to comment technically on the DRM Volume I document, which is deliberately very high-level and essentially devoid of technical content. Volume I contains very little explanation of how to implement. In fact, there are many ways to implement the DRM from Volume I. The only real technical requirement mentioned appeared to be the use of ISO 11179 for data element naming. Many of the comments below question the technical meaning of Volume I. We assume that future DRM volumes will provide more technical guidance for implementation. (agency comment / compiled 2004.12.06) (2JRE)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2KG8)
- Comment: Make DRM-I assumptions explicit. The assumptions underlying the DRM-I need to be better surfaced and discussed. For example, the document appears to place the burden of judgment on the information owner to make the correct choices. Is that a preferred assumption? Is it an assumption at all? Who decides what data is to be published, made interoperable or made accessible? Is interoperability automatic as a goal, or is the purpose of the DRM to provide a structured environment in which data existence and availability are made public and the provider and prospective user then negotiate and arrange off-line for access, including interoperability? (agency comment / compiled 2004.12.06) (2KG9)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2JV6)
- Comment:The Data Reference Model (DRM) is too confusing and too simplistic. The topic is far more complex than Volume 1 leads one to think. Mr. Michael C. Daconta of Department of Homeland Security (DHS) in his presentation to the Architecture and Infrastructure on November 18 succinctly delineated the problems with the current draft of the DRM. These problems are: (2JVD)
- The DRM needs to be turned over to an agency such as DHS that has the requisite expertise to develop the DRM and to be the steward. (2JVC)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2K04)
- Comment: There is important information in both the DRM Version 1 (Sept. 2004) and the presentation that accompanied the DRM (Overview and Roadmap). However, you have to refer to both documents for a full understanding of the DRM. (2K05)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVA (2K3R)
- Comment: The DRM’s input on the “Categorization of Data” calls for the mapping of the BRM constructs of functions, activities, and processes to agency data assets. As a result, the DRM contends that data will be more readily identified, and therefore shared. The following comment expresses concern with this approach. Categorizing data based upon function (the BRM) is a categorization or arrangement of data dependent on how the data is used, viewed, and/or reported (the business context). This can negate the purpose stated for the DRM, in this paragraph: “The DRM’s primary purpose is to promote the common identification, use, and appropriate sharing of data/information across the federal government”. (agency comment / compiled 2004.12.06) (2K3S)
- Suggested Action: Incorporate additional information from the presentation into the DRM Volume 1 for clarity. In addition, please give enough detailed guidance in Volume 1 to ensure that agencies will not have to rework models when Volumes 2-4 are published. (agency comment / compiled 2004.12.06){nid 2K06} (2K07)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVG (2JR4)
- Comment: Overall this version of the Data Reference Model (DRM) has some very strong concepts and some that are difficult to grasp. The notion of "Categorization of Data" or "Business Context" as it is also called is very good. This is the area in which the DRM can deliver the greatest value to agencies in terms of standardizing data and promoting information sharing. (agency comment / compiled 2004.12.06) (2JR8)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVG (2JWK)
- Comment:Apart from mentioning that Volume IV of the DRM will define "metadata requirements" for specific communities of interest or lines of business, the DRM does not provide adequate guidance for the management and use of metadata and metadata repositories (MDR). Metadata and MDRs play an essential role in data definition and interoperability. (2JWM)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVG (2K2R)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVO (2JR9)
- Comment: What is not represented in this release, nor does it indicate a plan for, is the actual building of a set of common business objects. (agency comment / compiled 2004.12.06) (2JR5)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVR (2JR6)
- The layout of the current DRM is more of a framework, defining data; as opposed to a reference model in which government-wide, fundamental data objects are categorized and defined, similar to the way common government-wide business activities are defined in the BRM. (agency comment / compiled 2004.12.06) (2JR7)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2J0C (2JRF)
- Comment: There is important information in both the DRM Version I (Sept. 2004) and the presentation that accompanied the DRM (Overview and Roadmap). However, you have to refer to both documents for a full understanding of the DRM. (2JRG)
- Suggested Action: Incorporate additional information from the presentation into the DRM Volume I for clarity. In addition, please give enough detailed guidance in Volume I to ensure that agencies will not have to rework models when Volumes 2-4 are published. (agency comment / compiled 2004.12.06) (2JRH)
- Comment: There is important information in both the DRM Version I (Sept. 2004) and the presentation that accompanied the DRM (Overview and Roadmap). However, you have to refer to both documents for a full understanding of the DRM. (2JRG)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2J05 (2JU5)
- Comment: Reference to Other FEA Reference Models and DRM documents: It would be helpful to include the diagram of the Federal Enterprise Architecture with an explanation of the intended use of the DRM and its relationship with the other reference models (FEA Reference Model Integration diagram). The current DRM does call out the connection of categorizing the government’s information along general content areas specific to sub-functions in the Business Reference Model (BRM). However, there is no mention of the Service Reference Model (SRM) in the DRM. The FEA SRM states that the DRM will be produced on a business line by business line basis with attributes of these data components being based upon specifications as identified in the FEA TRM and accessed by a specific Service Component in the SRM. Is there a connection between the Business Components identified in the SRM and the Information Exchange Packages in the DRM? In addition, it would be helpful to include a reference to other related DRM documents, such as the FEA DRM Data Management Strategy document. (agency comment / compiled 2004.12.06) (2JU7)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVP (2JVM)
- Comment: Data Quality -Even though the DRM data standardization will allow agencies to have a common understanding of data formats and contents, the data quality policies are not adequately specified. In order for the DRM to enable access to the right data, at the right time, from the right source, agencies that exchange their data must not only have a common structure and content, but they also need to understand and agree upon the characteristics of the quality of the data. These characteristics include: Accuracy, Completeness, Consistency, Timeliness, Uniqueness, and Validity. Since a cornerstone to successful data exchange is the quality of the content, it is imperative that data quality in the form of data certification be included in the DRM. In addition, details on standard data quality measures need to be established within the future DRM Volumes that are similar to other content listed in the DRM Roadmap. The DRM states that it supports the legislative requirements of the Information Quality Act (IQA). However, the DRM actually has the potential to complicate the correction of information maintained and disseminated by agencies. The DRM needs to include the concept of shared data stewardship that will define the conditions for sharing data along with procedures for handling correction of information when identified by an affected person. (agency comment / compiled 2004.12.06) (2JVN)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVI (2JXO)
- Comment: Last sentence in this paragraph could be clarified: "This conceptual view of the data element is adapted from...." (agency comment / compiled 2004.12.06) (2JXP)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVI (2K08)
- Comment: There are many standards and regulations cited within the document, some needing clarification in the form of how to implement the standards and their acronyms spelled out. The audience for this document may not be aware of every standard identified. (2K09)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVI (2K40)
- Comment: Since 11179 includes classification of data, is that what is meant by “categorization” of data? If so, then perhaps the term “classification” should be used to ensure have consistency with 11179. At a minimum, the DRM could indicate the use of the interchangeable terms. ISO/IEC 11179 allows for multiple classifications of data. This relates to “categorization” of data as described in the DRM. (agency comment / compiled 2004.12.06) (2K41)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVI (2K4E)
- Comment: The DRM references an 'adaptation' of ISO/IEC 11179 – This implies taking bits of a standard and may not support the intent of the standard. (agency comment / compiled 2004.12.06) (2K4F)
- Suggested Action: We would like to see these standards included as an appendix (perhaps on CD) for reference or available on the OMB FEAPMO website. (agency comment / compiled 2004.12.06) (2K0A)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2J0C (2K0E)
- Comment: Revisit the strategy for the volume sequencing, including the complementary Data Management Strategy. (2K0F)
- Suggested Action: It would be helpful to standardize the DRM implementation across agencies, as it addresses governance and management of data. The Data Management Strategy should be released before other volumes are delivered due to its importance. (agency comment / compiled 2004.12.06) (2K0G)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVN (2JXQ)
- Comment: This paragraph is vague and essentially duplicates the next paragraph; it could be deleted. (agency comment / compiled 2004.12.06) (2JXR)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVH (2K4C)
- Comment: Web Services (WS) standards should be used (specifically WS-security) for exchange of Data. (agency comment / compiled 2004.12.06) (2K4D)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2IVH (2LVM)
- Comment: The Volume I document does not reference XML. However, XML is extensively mentioned in the DRM briefing. A key part of XML potential benefits to the FEA, are that XML’s capabilities go well beyond data exchange. As two examples, XML can be used to support many forms of presentation (web-forms, print, etc.) and it can be used as a storage format within database and other applications for processing. The briefing also correctly links establishment of the DRM to use of XML Schema (per the above it would also be style sheets and transformations). It should be noted that if a core-component approach to the DRM is adopted, then the DRM will create standard data components that are more than a single data elements but less than a full Schema (Information Exchange Package). (agency comment / compiled 2004.12.06) (2LVN)
- Ref: http://colab.cim3.net/cgi-bin/wiki.pl?DataReferenceModel_09_2004/ExecutiveSummary_DRM_VolIv1#nid2J0B (2LVO)
- Comment: This section, the DRM Roadmap, contains the only noted reference to a DRM governance process. The reference indicates that the Governance and management of data will be identified in the Data Management Strategy Document which may be the last volume released. We suggest that the governance structure be established very early in the process and help direct the evolution of the forthcoming volumes and the actual work product. (agency comment / compiled 2004.12.06) (2LVP)