All Posts

What You Need to Know to End Information Chaos (Part 1)

Clients I work with struggle with many issues.  Among these, two often rise to the top.

  1. How do we show the impact of metadata on our business?
  2. Do we need our own metadata standard? Alternatively, should we use industry standards?

This post covers my approach to answering these questions.

To begin with, I ask them how they define metadata.  And most often, they give me the usual tautology: metadata is data about data.   You can do better than that!   A more meaningful alternative is to say that metadata is what allows data to be searched, understood, and consistently used within a company.

Metadata provides data with a context that enables users to think about and share data in useful ways.  In short, metadata transforms data into information.  It enables a complex organization to make informed decisions and take appropriate actions because we can look at our collective experience through a common framework of understanding.

The intellectual endeavor of managing "metadata" is at the core of all information and knowledge related work.  Taxonomy is dedicated to the practice of producing logical categorization models, and therefore at the very heart of creating metadata systems.  Human-understandable taxonomies provide the words and relationships needed to access and use information.

It is important to document and get buy-in on metadata standards.  As a standard for organizing information as it is shared by many people and devices, the metadata system must be agreed to by all key stakeholders.  Metadata standards are an artifact of group decision making in response to the dynamic organizational environments and the need to capture information needed to succeed. 

When metadata is understood as critical to sharing information,  we can then point to where better management of metadata can help the organization avoid information chaos.  The process of developing and maintaining metadata as an organizational standard is called governance. 

To evaluate how well an organization is managing its data and data standards, IBM Data Governance Council produced a data maturity framework and maturity model.  This model provides a heuristic framework for evaluating organizational needs for governing data, revolves around five central concepts, starting with an Initial/Reactive phase and ending with an Optimizing/Continuous phase.   I recommend the model as a structure for thinking through where we can affect the organization by improving metadata management.


The use of standards has become something of a mantra for the developer community and for organizations in generally, as they are the essential framework for data exchange, interoperability, and technology migration.  Standards contribute to these objectives by providing a conceptual framework and detailing requirements for data structure and quality assurance. 

The desire to have common, interoperable standards  has resulted in a proliferation of generic standards documents by standards organizations, such NOIS, OASIS, ISO, W3C, and by industry platform leaders, such as Adobe and Microsoft.   Though based on deep insights into the topics, platforms, and technologies addressed, the plethora of standards has led to the complaint that there is a metadata chaos.  Not only are there many choices, but the standards can create implementation burdens and issues when an enterprises business model differs from that used to develop the standard.  From a business perspective, standards are standards only to the extent they have pragmatic value for organization. 

Pragmatism in determining what metadata is required and how the resulting standard is applied is the quintessential business of a governance process.  In the governance process, as with any other management task, an organization establishes roles, accountabilities, communication pathways, and informed stakeholder buy-in that leads to a pragmatic approach.

In part 2, I will discuss further the role of taxonomy and information modeling in reducing information chaos.

Earley Information Science Team
Earley Information Science Team
We're passionate about enterprise data and love discussing industry knowledge, best practices, and insights. We look forward to hearing from you! Comment below to join the conversation.

Recent Posts

[Earley AI Podcast] Episode 26: Daniel Faggella

Human Cognitive Science Guest: Daniel Faggella

[RECORDED] Master Data Management & Personalization: Building the Data Infrastructure to Support Orchestration

The Increasing Criticality of MDM for Personalization for Customers and Employees Master data management seems to be one of those perennial, evergreen programs that organizations continue to struggle with. Every couple of years people say, “we're going to get a handle on our master data” and then spend hundreds of thousands to millions and tens of millions of dollars working toward a solution. The challenge is that many of these solutions are not really getting to the root cause of the problem.  They start with technology and begin by looking at specific data elements rather than looking at the business concepts that are important to the organization. MDM programs are also difficult to anchor on a specific business value proposition such as improving the top line. Many initiatives are so deep in the weeds and so far upstream that executives lose interest and they lose faith in the business value that the project promises. Meanwhile frustrated data analysts, data architects and technology organizations feel cut off at the knees because they can't get the funding, support and attention that they need to be successful. We've seen this time after time and until senior executives recognize the value and envision where the organization can go with control over its data across domains, this will continue to happen over and over again. Executives all nod their heads and say “Yes! Data is important, really important!” But when they see the price tag they say, “Whoa hold on there, it's not that important”. Well, actually, it is that important. We can't forget that under all of the systems, processes and shiny new technologies such as artificial intelligence and machine learning lies data. And that data is more important than the algorithm. If you have bad data your AI is not going to be able to fix it. Yes there are data remediation applications and there are mechanisms to harmonize or normalize certain data elements. But looking at this holistically requires human judgment: understanding business processes, understanding data flows, understanding dependencies and understanding of the entire customer experience ecosystem and the role of upstream tools, technologies and processes that enable that customer experience. Until we take that holistic approach and connect it to business value these things are not going to get the time, attention and resources that they need. In our next webinar on March 15th, we're going to take another look at helping organizations connect master data to the Holy Grail of personalized experience. This is an opportunity to bring your executives to a webinar that will show them how these dots are connected and how to achieve significant and measurable business value. We will show the connection between the data, the process that the data supports, business outcomes and the and the organizational strategy. We will show how each of the domains that need to be managed and organized to enable large scale orchestration of the customer and the employee experience. Please join us on March 15th and share with your colleagues - especially with your leadership. This is critically important to the future of the organization and getting on the right track has to begin today.

[Earley AI Podcast] Episode 25: Michelle Zhou

Data Tells the Story Guest: Michelle Zhou