Metadata
What Metadata Is
Metadata is descriptive information about data, documents, systems, or other resources.
It helps people and software understand what something is, where it came from, how it should be used, and how it should be managed.
Typical metadata answers questions such as:
- What is this resource called?
- Who created it?
- When was it created or updated?
- What format does it use?
- What topic or category does it belong to?
- What policies or permissions apply to it?
Common Types of Metadata
Metadata is often grouped into a few practical categories.
Descriptive metadata: title, author, tags, keywords, summaryStructural metadata: how parts are arranged, such as chapters, fields, or sectionsAdministrative metadata: ownership, version, retention, permissions, lineageTechnical metadata: format, encoding, schema version, file type, size
What Metadata Is Good At
Metadata is useful because it makes resources easier to find, manage, and govern.
Discovery: supports search, indexing, and filteringGovernance: captures ownership, quality, lineage, and policy informationInteroperability: helps systems exchange context around raw dataOperations: improves cataloging, auditing, and lifecycle management
Many platforms depend on metadata even when users do not notice it directly.
Metadata Versus Schema And Ontology
Metadata, schema, and ontology often overlap in conversation, but their roles are different.
| Concept | Main concern |
|---|---|
| Metadata | Description and context around a resource |
| Schema | Structural shape and validation rules |
| Ontology | Shared semantic meaning, relations, and constraints |
For example:
- metadata may say a dataset is owned by
Team Aand was updated yesterday - schema may say the dataset has fields
customer_id,region, andrevenue - ontology may say
customeris a type of business entity andregionrefers to a geographic area with defined relations
This is why metadata is often necessary but not sufficient. It adds context, but it does not always define precise semantics on its own.
Practical Example
Imagine a document repository.
Each document may include metadata such as:
- title
- author
- created date
- document type
- confidentiality level
- tags
That metadata helps users search and manage documents. But it still may not fully explain whether a policy, standard, procedure, and guideline are semantically different classes with distinct relationships. That deeper meaning is closer to ontology.
Common Mistakes
- Treating any label set as complete semantic modeling
- Mixing business meaning into loose free-text tags
- Capturing metadata without ownership or governance
- Using inconsistent names for the same descriptive field
- Assuming metadata quality will improve without standards
Summary
Metadata is contextual information about resources.
It is essential for discovery, management, and governance. Its strength is practical description. Its limitation is that it often describes resources without fully defining their semantic meaning. That is why metadata works best alongside schema and ontology rather than replacing them.