Global Knowledge for AI

A Database-First Approach

By G. Sawatzky, embedded-commerce.com
August 27, 2025


Introduction

The Semantic Web envisioned intelligent machines understanding globally interconnected data. Two decades later, while this vision remains compelling, its web-document-centric foundations have faced significant limitations for modern AI needs. This article explores why that paradigm can create fundamental problems for structured data and proposes a database-first approach. This method aims to maintain clean architectural separation, potentially delivering better performance, and providing the logical rigor that may be necessary for reliable, knowledge-aware AI by leveraging modern public APIs like GraphQL.

The Problem: Web Documents Are Not Databases

Within the dynamic landscape of business, commerce, and enterprise AI, the need for robust and scalable knowledge management systems is essential. Tim Berners-Lee's vision for the Semantic Web grew naturally from his success with the World Wide Web. Technologies like HTTP, URIs, and hyperlinks solved the problem of linking documents across a distributed network. However, extending this document-centric paradigm to structured data with RDF and triple stores appears to have introduced architectural problems that persist today, particularly for the demands of enterprise-scale data.

A Flawed Data Model

The web's document model treats everything as markup, mixing structure and content. While this is effective for human-readable pages, it can lead to significant issues for structured data, potentially resulting in a loss of proven architectural discipline.

The Higher-Arity Problem

Real-world relationships are not always simple pairs. Conceptual modeling methods like Object-Role Modeling (ORM) naturally handle n-ary fact types involving multiple entities. Triple stores, by contrast, are often implemented as fundamentally binary. Modeling a relationship like "Person ordered Product from Supplier on Date" can become an awkward collection of triples with an artificial "Order" node. This reification process may obscure the original semantics and can lead to a loss of the constraints that ORM models naturally express.

The Solution: A Database-First Approach

Instead of starting with web architecture and adapting it for structured data, this article proposes building on the proven principles of industrial-strength database management systems and adding intelligent public interfaces on top. This approach aims to leverage the principles of foundational database theory to build a robust framework for knowledge-aware AI.

Object-Role Modeling: A Practical Semantic Layer

For practical use in today's neuro-symbolic AI, an ontology might be seen as more than a theoretical concept. It could be viewed as a structured, interpretable specification of a domain expressed through logic-governed constraints and formal semantics. An Object-Role Model, when developed rigorously, may serve as a powerful, machine-interpretable ontology that fully meets this definition, effectively forming the semantic layer of the knowledge architecture.

Object-Role Modeling is a conceptual methodology that uses a role-based approach to prioritize constraints and conceptual abstraction. It focuses on defining the world independent of any specific implementation and provides a precise semantic blueprint that various systems can implement. Its utility for explicit business rule modeling and robust enterprise information architecture is particularly noteworthy. This approach is an effective tool for building a practical semantic framework, which is explored in more detail in my other articles.

GraphQL as a Public-Facing Knowledge Interface

Once a solid, logical foundation, precisely defined by an ORM model and forming the semantic layer, is established, one can then consider exposing that knowledge through a flexible, modern API. GraphQL presents itself as a highly suitable public interface for this purpose, acting as the gateway to the underlying semantics.

Making Knowledge Discoverable

For a knowledge architecture to be truly valuable, its underlying semantics must be easily discoverable and consumable at scale, especially by autonomous agents. This goes beyond simple schema introspection and requires a dedicated strategy for discovery. Several practical approaches, much lighter and more effective than traditional Semantic Web crawling, are now possible:

The New Context: The Role of AI

The rise of Large Language Models (LLMs) has undeniably changed the knowledge representation landscape. Instead of machines reading semantic markup, systems now appear able to understand and reason over natural language at an unprecedented scale.

The future may lie in combining LLM's natural language capabilities with the formal logical reasoning of a clean knowledge representation. This hybrid intelligence could leverage the strengths of both neural and symbolic systems. A well-structured, ORM-based database might become the ideal foundation for knowledge-grounded AI by:

Conclusion

The Semantic Web's vision of globally accessible knowledge is still worth pursuing, but its architectural foundation appears to have been flawed. The article suggests that the solution may not lie in extending the web's document model to data, but rather in starting with proven database principles. By building intelligent public interfaces like GraphQL on top of a solid, logical foundation provided by industrial-strength database management systems and Object-Role Modeling (acting as the semantic layer), this approach aims to deliver the performance, reliability, and logical rigor that both enterprise systems and public knowledge require. The Object-Role Model itself provides the precise conceptual blueprint, independent of any specific implementation, potentially ensuring that the underlying semantics are clear and consistent, regardless of the chosen database or reasoning engine. This is a challenging and complex area to solve, and while this article does not seek to address every possible challenge, it is posited that this direction is worthy of more research and exploration.

The future of knowledge-aware AI could involve building proper data architectures with intelligent interfaces, potentially fulfilling the original vision more completely and making knowledge truly accessible to humans, AI systems, and automated agents.

Other Sources for Further Research: