Knowledge Engineering
and the 'Shortcomings' of SQL

August 14, 2025

View Slides: KE + SQL (Executive Summary)

In the world of software development, those of us deeply involved in database management systems (DBMS) built on SQL might sometimes feel a disconnect from the discussions within the Knowledge Engineering (KE) community. We often hear claims that SQL falls short in formal knowledge representation due to perceived limitations in its expressiveness and computational reasoning. For those of us who have effectively built complex systems with SQL, these claims can be puzzling; our immediate reaction might be, "that's not true, and here's how it can be done in SQL." This article aims to bridge that gap by exploring this nuanced perspective.

We will provide precise definitions, examine where SQL's practical implementations might indeed present challenges for robust knowledge representation, and, importantly, distinguish these points from the inherent strengths of the formal Relational Model. Finally, we will propose how an ORM (Object Relational Modeling) based toolkit can integrate various technologies to create a comprehensive knowledge engineering solution. These suggestions are general guidelines; their practical implementation should always be based on specific context, use cases, and volumes of data. Furthermore, these considerations do not exhaustively cover the use of LNNs and LTNs to handle uncertainty, which is another key aspect of modern knowledge representation.

As a single author, this article presents my individual interpretation and synthesis of knowledge engineering principles and their relationship to data management technologies. It does not claim to represent the views of the KE community. My perspective may be seen as coming from an individual operating at the intersection of these fields.

Defining Knowledge Representation and Reasoning

To understand the KE perspective, we first need a clear definition of what "knowledge representation" (KR) truly means in this context.

Knowledge Representation (KR) is the formalization and organization of information, along with the explicit rules, relationships, and constraints, in a way that enables computational reasoning, inference, and automated problem-solving. It goes beyond simply storing facts to encoding the logic by which new, implicit facts can be derived or verified by machines.

Reasoning is the process of manipulating a knowledge representation to derive conclusions that are implicitly contained within the represented knowledge. This includes deduction (guaranteed conclusions), induction (generalizing from specific examples), and abduction (inferring causes from effects).

The core distinction from mere "information" is that KR builds structures not just for retrieval, but for active, logical manipulation. For example, if we have information that "Mary is 30 years old" and "All persons over 18 are adults," a knowledge representation would encode the rule that allows us to infer "Mary is an adult" without explicitly stating it. This is a key capability KR aims to achieve.

Use Cases: Where Knowledge Engineers Operate

Knowledge engineers work to build systems that capture and operationalize domain expertise.

Interacting with the Model

Knowledge engineers primarily interact with a declarative model. They define:

Concepts: The types of entities in a domain (e.g., Person, Book, Department).
Relationships: How these concepts relate to each other (e.g., Person works for Department, Author wrote Book).
Attributes: Properties of concepts (e.g., Person has birthDate).
Constraints/Rules: The logical conditions that must always hold true in the domain (e.g., "every Book must have at least one Author," "a Person can only work for one Department at a time").
Inference Patterns: Rules that allow new facts to be derived (e.g., "If X is a kind of Mammal and Mammal has Hair, then X has Hair").

This interaction is typically through high-level, human-readable (and machine-interpretable) languages that focus on the semantics and logic of the domain, rather than low-level storage or procedural steps.

Beyond defining the domain's facts and rules, Knowledge Engineers also require the ability to query and reason with the model itself. This means asking questions not just about the data instances (e.g., "Who wrote '1984'?"), but about the schema, concepts, and relationships defined in the knowledge representation (e.g., "What types of relationships can exist between a Person and a Department?", "What are the superclasses of Mammal?").

Practical Outcomes and Value Proposition

The job of a knowledge engineer is done when the formalized knowledge can be effectively used by computational systems for its intended purpose. The value proposition includes:

Automated Inference: Deriving new facts or conclusions without explicit programming.
Semantic Search: Finding information based on its meaning, not just keywords.
Data Validation: Ensuring data integrity beyond basic structural checks, based on complex business rules.
Decision Support: Providing explicit reasoning steps for automated or human decisions.
System Integration: Creating a shared understanding (ontology) that allows disparate systems to communicate effectively.
Explainable AI: Providing auditable chains of reasoning for AI conclusions, crucial for trust and compliance.
Machine-interpretable models: The formalized knowledge models themselves serve as direct deliverables, providing a structured and machine-readable representation of the domain that can be consumed by various systems.

Knowledge engineers often hand off their deliverables to AI developers, data scientists, software architects, and domain experts who consume the formalized knowledge in various forms, such as:

Expert systems.
Reasoning services.
Knowledge graph databases and APIs.
Rules engines.
Intelligent agents.

Defining Pass/Fail Criteria and Tests

Just like any engineering discipline, KR requires rigorous testing. Pass/fail criteria include:

Consistency: The knowledge base should not contain logical contradictions.
Completeness (within scope): The knowledge base should be sufficient to answer expected queries and derive intended inferences.
Accuracy: The represented knowledge should align with domain expert understanding.
Performance: Reasoning and query answering should meet system requirements.
Verifiability: The ability to trace conclusions back to their originating facts and rules.

This involves unit tests for individual rules, integration tests for complex inferences, and validation against real-world data and expert judgments.

Possible Perceived Weaknesses in SQL for Knowledge Representation

For knowledge engineers, certain characteristics of SQL's typical implementation can be seen as less ideal for formal KR, leading to the perception of "shortcomings."

Composability: While SQL provides views and, in some implementations, functions, building these as truly reusable components that can be easily re-assembled into new rules and combined to form higher-level logical inferences can be cumbersome. It's not straightforward to compose new rules on top of existing rules directly within the declarative SQL paradigm, making it challenging to build layered knowledge structures.
Procedural vs. Declarative: SQL is often pitched as declarative, and for basic queries, it is ("what" to retrieve, not "how"). However, for enforcing complex "business rules" or implementing sophisticated inference, developers frequently resort to stored procedures, triggers, or application-level code, which are inherently procedural. This shifts the logical definition away from the database's declarative schema into dispersed, less transparent code.
Integrity Constraints vs. Business Rules: SQL's built-in integrity constraints (PRIMARY KEY, FOREIGN KEY, CHECK) are powerful for ensuring data validity. However, many complex "business rules" (e.g., "a customer must have a valid credit score to place an order over $1000") are often not directly implementable as simple declarative SQL constraints. Instead, they require procedural logic or are enforced at the application layer, leading to scattered rule definitions.
Burdensome DB Management and DDL: For rapidly evolving knowledge domains or agile development, the overhead of Data Definition Language (DDL) for schema changes in large, traditional SQL DBMSs can be perceived as slow. Modifying the conceptual schema often requires significant changes to the logical and physical schema, impacting agility.
Why must one define a schema a priori instead of just facts and relationships on the fly?: Traditional SQL databases are schema-on-write, meaning the structure must be defined before data can be inserted. This rigidity can clash with the exploratory and evolving nature of knowledge discovery, where facts and relationships might emerge iteratively, and a schema-less or schema-on-read approach (common in some graph or document databases) might seem more appealing.
Other: SQL's handling of NULL values and its three-valued logic (true, false, unknown) can introduce complexities that deviate from standard two-valued logic, which is the foundation for most formal knowledge representation and reasoning.

Conceptual, Logical, and Physical Layers

A significant point of contention can arise from the conflation of different modeling layers in SQL.

Conceptual Model: This is the highest level, representing the domain's meaning and business rules, independent of technology.
Logical Model: This maps the conceptual model to a specific data model (e.g., relational, graph, object) but still abstracts from physical storage.
Physical Model: This describes the actual storage details, indexing, and performance optimizations.

Traditional SQL database design often forces a blending of these layers. For instance, choosing a VARCHAR size (physical) can inadvertently impact how a "name" (conceptual) is perceived or constrained. Knowledge engineers prefer tools where the conceptual model can be defined purely, with clear, loosely coupled mappings to logical and physical implementations, allowing changes at one layer without spiraling through all others.

CWA vs. OWA: A Fundamental Difference

As discussed, this is a key philosophical and practical distinction:

Closed World Assumption (CWA): Predominant in traditional SQL databases. If a fact is not explicitly present in the database, it is assumed to be false. This simplifies querying but struggles with incomplete knowledge.
Open World Assumption (OWA): Common in many knowledge representation formalisms (like OWL for ontologies). If a fact is not explicitly known, it is considered unknown, not necessarily false. This is more suited for dynamic, incomplete real-world knowledge.

While SQL can simulate OWA behavior with constructs like LEFT JOIN or NOT EXISTS, its native behavior operates under CWA. KR languages often default to OWA or provide explicit mechanisms to define which parts of the knowledge base operate under which assumption, offering greater flexibility.

Does this apply to the Relational Model (RM) as originally conceived by Codd and extended by Date and Darwen?

This is a crucial distinction. Many criticisms aimed at "SQL" are not necessarily criticisms of the Relational Model (RM) itself, as conceived by E.F. Codd[1] and rigorously extended by C.J. Date and Hugh Darwen[2].

The formal Relational Model is a powerful logical model. It defines relations, attributes, and, critically, a rich set of integrity constraints (e.g., entity integrity, referential integrity, and general assertions/business rules) that are declarative and logically pure. In its formal sense, the RM provides a robust framework for representing domain knowledge through these declarative constraints. A "relvar" (relation variable) is not just a table; it's a predicate stating a truth about the world.

So, while SQL (the language and its common implementations) often has perceived weaknesses, the underlying Relational Model is a much more capable and logically consistent system for knowledge representation than its practical SQL manifestations sometimes allow.

Weaknesses in SQL from Date and Darwen that May Align with the Knowledge Engineering Community

Date and Darwen, while champions of the RM, are famously critical of SQL's deviations from the formal model. Many of their criticisms align with the concerns of knowledge engineers regarding logical precision and expressiveness for KR:

Null Values: SQL's concept of NULL introduces a problematic three-valued logic (true, false, unknown) that complicates logical reasoning and deviates from the two-valued logic foundational to most KR. Date argues NULLs are a fundamental flaw.
Lack of True Assertions: While SQL has CHECK constraints, its support for complex, cross-table "assertions" (general integrity constraints that evaluate to true or false across the entire database state) has historically been weak or absent. The formal RM allows for arbitrary assertions, which are crucial for declarative business rules.
Tuple Ordering/Duplicate Rows: The formal RM does not permit duplicate rows or tuple ordering, as relations are mathematical sets. SQL allows duplicates and relies on ordering, which can undermine the logical purity desired by KR.
Imperfect Type System: SQL's type system can be seen as less sophisticated than desired for robust KR, where rich conceptual types and their hierarchical relationships are paramount.

These points highlight that even within the database community's most rigorous thinkers, there's recognition that SQL is an imperfect vehicle for purely logical, declarative knowledge representation.

Exploring Alternatives and SQL Remedies for KR Tasks

Given the limitations of SQL's practical implementations, let's explore some alternatives and specific SQL remedies that may be more aligned with KR tasks. Some of these approaches aim to mitigate the perceived shortcomings of SQL DBMSs, often by more closely adhering to the formal Relational Model or by integrating its strengths with other paradigms.

Datalog: A declarative logic programming language, closely related to Prolog but typically focused on deductive databases. Datalog offers a highly declarative syntax for expressing facts and recursive rules, making it an excellent fit for complex inference and knowledge base querying. Its mathematical purity and clear semantics appeal directly to knowledge engineers.
Relational Calculus: While not a practical query language for users, it serves as a foundational theoretical benchmark for declarative data manipulation. Languages like Datalog are heavily influenced by Relational Calculus, providing a logical purity that appeals to KR.
DuckDB: An in-process SQL OLAP database. While still SQL, its embedded nature, analytical focus, and strong performance for complex queries (especially on derived facts) can make it an excellent target for efficient execution of materialized views or complex derivations from a KR system, addressing performance aspects of KR within an SQL environment.
SQL Extensions within the Relational Model Sphere: Modern SQL continues to evolve, incorporating features that address some of the long-standing criticisms and align better with KR needs. As Michael Stonebraker notes in "What Goes Around Comes Around"[4], SQL has a history of absorbing good ideas from alternative data models, extending its capabilities within the relational paradigm:
- Property Graph Queries (SQL/PGQ): A significant extension, SQL/PGQ introduces capabilities to query graph structures directly within SQL, allowing for explicit representation and traversal of relationships. This enables SQL to handle networked data patterns more naturally, which is crucial for many KR applications.
- JSON/XML/Array Data Types: SQL has integrated robust support for semi-structured data types like JSON and XML, along with array types. This allows for more flexible data modeling within relational tables, reducing the need for rigid a priori schema definitions for all data, and accommodating hierarchical information.
- User-Defined Functions and Aggregates: Enhanced support for user-defined functions and aggregate functions allows for more powerful, custom logic to be encapsulated and applied within SQL queries, extending its declarative power for specific domain computations.
Why not a new declarative language for the RM?: Efforts like Logica are precisely that: new declarative languages built on the principles of relational logic that can then translate to SQL for execution. They aim to provide the declarative power desired by KR practitioners while retaining the benefits of RDBMS infrastructure. Logica's declarative syntax, inspired by relational calculus and Datalog, makes it intuitive to define complex rules that can then be optimized and executed by SQL engines. This approach provides the composability and expressiveness often sought in KR, allowing complex derivations to be built from simpler, clearly defined logical predicates.

Proposed Solutions for the ORM (Object-Role Modeling) Toolkit

For the ORM (Object-Role Modeling)-based toolkit, the optimal path forward is not to replace, but to orchestrate these different strengths. The toolkit's ability to map ORM models to various targets is a significant advantage.

SQL (especially DuckDB) for what it's good at: Leverage SQL's strengths for robust data storage, transactional integrity, and efficient retrieval of foundational facts. DuckDB, as an embedded analytical database, could be ideal for fast, on-device processing of derived knowledge.
Prolog: For complex inference, rule-based reasoning, logical deduction, and scenarios requiring clear explainability of reasoning paths. The ORM model's constraints and fact types can directly generate Prolog rules.
LNNs/LTNs for Uncertainty and Unknowns: For neuro-symbolic integration, handling probabilistic reasoning, fuzziness, and situations where knowledge is inherently uncertain or incomplete. The ORM model provides the symbolic structure that LNNs/LTNs can then ground their learning and inference within.
Python as a possible (and strong) candidate for orchestration: Python's rich ecosystem, its libraries for data science, AI, and its ability to interact with databases, logic engines, and machine learning frameworks, makes it an excellent "glue language" for orchestrating these diverse components.

Conclusion

The perceived "shortcomings" of SQL for knowledge representation are often limitations in its practical implementation and common usage, rather than inherent flaws in the underlying Relational Model itself. The RM, in its formal sense, offers powerful declarative capabilities for knowledge representation.

Our path forward involves a more nuanced approach: recognizing SQL's strengths for data management and integrity, while simultaneously embracing other declarative languages and reasoning paradigms for their superior expressiveness in defining and inferring complex logical knowledge. By orchestrating robust RDBMS (like SQL, especially with tools like DuckDB), symbolic AI (like Datalog), and neuro-symbolic AI (like LNNs/LTNs) through a powerful conceptual modeling approach like ORM, we can build truly comprehensive and intelligent systems that meet the rigorous demands of modern knowledge engineering.

Appendix A: Knowledge Engineering Toolkit Evaluation Checklist

This checklist proposes a set of criteria for evaluating how well a language, tool, or technology aligns with the needs and principles of a Knowledge Engineer, prioritizing features that facilitate robust knowledge representation and reasoning. The aim is to remain agnostic to specific technologies while highlighting desirable characteristics.

Criteria	Description	Weight
1. Declarative Expression of Domain Logic	How easily can domain rules, relationships, and facts be expressed as direct logical statements (what is true), rather than procedural instructions (how to compute)?	(e.g., 20%)
2. Native Support for Automated Inference	Does the system natively support automated derivation of new, implicit facts from existing knowledge and defined rules? (e.g., transitive closure, property inheritance).	(e.g., 15%)
3. Semantic Richness & Expressiveness	Can the system capture complex semantic nuances like classification hierarchies, part-whole relationships, or N-ary associations directly and intuitively?	(e.g., 15%)
4. Handling of Incomplete Knowledge (OWA)	Does the system provide robust mechanisms or a default assumption for dealing with missing or unknown information, distinguishing it from explicit falsehood?	(e.g., 10%)
5. Schema Flexibility & Evolution	How easy is it to evolve the conceptual model and its associated schema to accommodate new knowledge or changing domain understanding without significant overhead or disruption?	(e.g., 10%)
6. Clear Conceptual-Logical Mapping	Does the tool/language clearly separate the abstract domain meaning from its logical and physical implementation details, allowing for independent evolution of layers?	(e.g., 10%)
7. Explainability of Reasoning	Can the system provide transparent explanations or traces for how it arrived at a particular conclusion or derived a new fact?	(e.g., 5%)
8. Robust Constraint/Rule Definition	Does it natively support the definition and enforcement of complex, domain-specific integrity constraints that go beyond basic data type checks?	(e.g., 5%)
9. Support for Complex Type Systems	Can it represent rich type hierarchies, abstract data types, and other sophisticated conceptual structures directly within the model?	(e.g., 5%)
10. Composability of Knowledge Modules	How easily can distinct, smaller modules of represented knowledge and rules be combined or reused to build larger, more complex systems?	(e.g., 5%)
11. Facilities for Knowledge Delivery	Does the technology offer built-in or easily integrable facilities for exposing and operationalizing the represented knowledge via common interfaces (e.g., Reasoning services, Knowledge Graph APIs, Rules engines, Intelligent Agent interfaces)?	(e.g., 5%)
12. Querying and Reasoning Over the Model Itself	Does the system provide facilities to directly query and reason about the defined schema, concepts, and relationships of the knowledge model, independent of data instances?	(e.g., 5%)
Total Score	(Sum of Weighted Scores)	100%

Appendix B: On the Composability of SQL

The perceived lack of composability in SQL, especially when building complex logical inferences from reusable components, is a point of contention that aligns the views of some database theorists with knowledge engineering principles. While SQL offers constructs like views and Common Table Expressions (CTEs) for organizing queries, their limitations often necessitate workarounds that compromise true logical composability.

From C.J. Date and Hugh Darwen

C.J. Date and Hugh Darwen, in their extensive critiques of SQL (often found throughout their works, particularly in books like SQL and Relational Theory and The Relational Database Dictionary), frequently highlight how SQL deviates from the logical purity and mathematical elegance of the formal Relational Model. This deviation directly impacts composability:

Lack of True Closure: In relational algebra, operations are closed, meaning the output of one operation (a relation) can always be the input to another operation. SQL's SELECT statement does not always produce a truly relational result (e.g., allowing duplicate rows, NULL semantics), which can break this closure and make it harder to reliably compose complex queries.
Imperfect Semantics: SQL's inconsistencies, such as its handling of NULL values and its multi-valued logic, mean that combining queries or views doesn't always yield logically predictable results. This makes it difficult to reason about the composition of queries, unlike the straightforward logical composition in relational calculus or Datalog.
Procedural Escapes: When SQL's declarative features fall short for complex business rules or inferences, developers often resort to stored procedures, triggers, or application-level code, which are inherently procedural. This shifts the logical definition away from the database's declarative schema into dispersed, less transparent code.

From Language Integrated Query (LINQ) and Reactive Extensions (Rx)

Language Integrated Query (LINQ), co-created by Anders Hejlsberg and Erik Meijer, and Reactive Extensions (Rx), also co-created by Erik Meijer, implicitly address SQL's composability challenges by offering alternative approaches to data manipulation. While these technologies originate from programming language integration, the underlying criticisms of SQL's composability apply more broadly to how logical rules are defined and combined:

Bridging the Impedance Mismatch: A core motivation behind LINQ was to overcome the "impedance mismatch" between traditional programming languages and relational databases. This mismatch arises because SQL queries, often written as string literals, exist outside the host language's type system and lack its native mechanisms for modularity and composition. This makes it cumbersome to build and refactor complex, reusable logical operations directly within application code, forcing developers to manage query strings manually.
Composition of Operations: Technologies like LINQ and Rx demonstrate how data operations can be made highly composable through method chaining and first-class query constructs. This contrasts with the more rigid composition patterns often found in SQL, where complex rule building can necessitate intricate nested queries or views that are less modular and harder to reason about in a layered, declarative fashion. The emphasis in LINQ and Rx on a "query algebra" that can apply to various data sources highlights a desire for a more universal and composable approach to data transformation and logical derivation, which resonates with knowledge engineering needs for building layered knowledge structures.

Appendix C: Logica Examples

A few examples of Logica have been provided below so we can see that it is a composable logic programming language which happens to compile to Sql. It is a part of the Datalog family.

For more information on Logica, see https://github.com/EvgSkv/logica

Example of Assigning a Rule to a Class

#For these examples, duckdb is used
@Engine("duckdb");

# Assume you have a predicate identifying graduate students,
# perhaps like this (using a name or ID):
GraduateStudent(person_id: 123);
GraduateStudent(person_id: 456);

# Rule: If someone is a GraduateStudent, then they are a Student.
Student(person_id:) :- GraduateStudent(person_id:);

# You might have other rules defining Students too:
Undergraduate(person_id: 789);
Student(person_id:) :- Undergraduate(person_id:);

# Rule: If someone is a GraduateStudent, then they have library access.
HasLibraryAccess(person_id:) :- GraduateStudent(person_id:);

logica students.l run Student
+-----------+
| person_id |
+-----------+
| 123       |
| 456       |
| 789       |
+-----------+

logica students.l run HasLibraryAccess
+-----------+
| person_id |
+-----------+
| 123       |
| 456       |
+-----------+

Simulating Open World Assumption

@Engine("duckdb");

IsSweet("orange");
IsSweet("apple");
IsNotSweet("lemon");
IsNotSweet("lime");

IsFruit("orange");
IsFruit("kiwi");
IsFruit("lemon");
IsFruit("apple");
IsFruit("lime");

UnknownSweetness(fruit:) :- IsFruit(fruit), ~IsSweet(fruit), ~IsNotSweet(fruit);

logica simulate_owa.l run UnknownSweetness
+-------+
| fruit |
+-------+
| kiwi  |
+-------+

Examples of Existential Quantification and Composition

@Engine("duckdb");

# Return all rules where all conditions are met.
Rule("rule1");
Rule("rule2");

RuleCondition("rule1", "condition1");
RuleCondition("rule1", "condition2");
RuleCondition("rule1", "condition3");
RuleCondition("rule2", "condition2");
RuleCondition("rule2", "condition4");
RuleCondition("rule2", "condition5");

State("condition2");
State("condition4");
State("condition5");
State("condition6");

MatchingCondition(rule, condition) :- RuleCondition(rule, condition), State(condition);
UnmetCondition(rule, condition) :- RuleCondition(rule, condition), ~State(condition);
NoUnmetConditions(rule) :- Rule(rule), ~UnmetCondition(rule, condition);

# Different flavors of the same Rules (aka Queries)
RuleSatisfied(rule) :- Rule(rule), NoUnmetConditions(rule);
RuleSatisfied2(rule) :- Rule(rule), ~(RuleCondition(rule, condition), ~State(condition));
RuleSatisfied3(rule) :- Rule(rule), ~UnmetCondition(rule);

# With a twist
ExtraCondition(condition) :- State(condition), ~RuleCondition(rule, condition);
RuleAndAllConditionsSatisfied(rule) :- Rule(rule), ~ExtraCondition(condition), ~(RuleCondition(rule, condition), ~State(condition));
RuleAndAllConditionsSatisfied2(rule) :- Rule(rule), ~(State(condition), ~RuleCondition(rule, condition)), ~(RuleCondition(rule, condition), ~State(condition));

logica quantification.l run RuleSatisfied
+-------+
| col0  |
+-------+
| rule2 |
+-------+

logica quantification.l run RuleAndAllConditionsSatisfied
+------+
| col0 |
+------+
+------+

Taxonomy Example

@Engine("duckdb");

#This is an example of a Taxonomy with Properties
SubclassOf("foods", "entity");
SubclassOf("animals", "entity");
SubclassOf("vehicles", "entity");

SubclassOf("fruits", "foods");
SubclassOf("vegetables", "foods");
SubclassOf("mammals", "animals");
SubclassOf("fish", "animals");
SubclassOf("land_vehicles", "vehicles");
SubclassOf("air_vehicles", "vehicles");

SubclassOf("citrus", "fruits");
SubclassOf("leafy_greens", "vegetables");
SubclassOf("canines", "mammals");
SubclassOf("saltwater_fish", "fish");
SubclassOf("cars", "land_vehicles");

HasProperty("foods", "has_origin");
HasProperty("foods", "is_perishable");
HasProperty("animals", "has_habitat");
HasProperty("animals", "is_domesticated");
HasProperty("vehicles", "has_manufacturer");
HasProperty("vehicles", "has_power_source");
HasProperty("fruits", "is_sweet");
HasProperty("fruits", "has_seed");
HasProperty("vegetables", "is_savory");
HasProperty("vegetables", "grows_underground");
HasProperty("mammals", "has_fur");
HasProperty("fish", "has_gills");
HasProperty("land_vehicles", "has_wheels");
HasProperty("air_vehicles", "can_fly");

HasProperty("citrus", "is_zesty");
HasProperty("leafy_greens", "is_nutritious");
HasProperty("canines", "has_tail");
HasProperty("cars", "is_fuel_efficient");
HasProperty("cars", "has_seats");
HasProperty("cars", "has_wheels");

HasProperty("vehicles", "has_capacity");
HasProperty("vehicles", "is_electric");

# Direct subclass relationship
TransitiveSubclass(x,y) :- SubclassOf(x, y);

# Indirect subclass relationship
TransitiveSubclass(x, y) :- TransitiveSubclass(x, z), TransitiveSubclass(z, y);

#This returns 1 because citrus is a fruit which is a food.
IsCitrusFood(is_food? += 1) distinct:- TransitiveSubclass("citrus", "foods");

#This returns X = mammals, X = fish, and X = canines.
Animals(animal:) :- TransitiveSubclass(animal, "animals");

#This returns all properties of vehicles
HasAllProperties(class, property) :- HasProperty(class, property), class == "vehicles";

# Find all properties inherited from a superclass
HasAllProperties(class, property) :-
    SubclassOf(class, superclass),
    HasAllProperties(superclass, property);

logica taxonomy.l run TransitiveSubclass
+----------------+---------------+
| col0           | col1          |
+----------------+---------------+
| foods          | entity        |
| animals        | entity        |
| vehicles       | entity        |
| fruits         | foods         |
| vegetables     | foods         |
| mammals        | animals       |
| fish           | animals       |
| land_vehicles  | vehicles      |
| air_vehicles   | vehicles      |
| citrus         | fruits        |
| leafy_greens   | vegetables    |
| canines        | mammals       |
| saltwater_fish | fish          |
| cars           | land_vehicles |
| fruits         | entity        |
| vegetables     | entity        |
| mammals         | entity        |
| fish           | entity        |
| land_vehicles  | entity        |
| air_vehicles   | entity        |
| citrus         | foods         |
| leafy_greens   | foods         |
| canines        | animals       |
| saltwater_fish | animals       |
| cars           | vehicles      |
+----------------+---------------+

logica taxonomy.l run IsCitrusFood
+---------+
| is_food |
+---------+
| 1       |
+---------+

logica taxonomy.l run Animals
+----------------+
| animal         |
+----------------+
| mammals        |
| fish           |
| canines        |
| saltwater_fish |
+----------------+

logica taxonomy.l run HasAllProperties
+---------------+------------------+
| col0          | col1             |
+---------------+------------------+
| vehicles      | has_manufacturer |
| vehicles      | has_power_source |
| vehicles      | has_capacity     |
| vehicles      | is_electric      |
| air_vehicles  | has_manufacturer |
| air_vehicles  | has_power_source |
| air_vehicles  | has_capacity     |
| air_vehicles  | is_electric      |
| cars          | has_manufacturer |
| cars          | has_power_source |
| cars          | has_capacity     |
| cars          | is_fuel_efficient|
| land_vehicles | has_manufacturer |
| land_vehicles | has_power_source |
| land_vehicles | has_capacity     |
| land_vehicles | is_electric      |
| land_vehicles | has_wheels       |
+---------------+------------------+

References:

[1] Codd, E. F. (1970). A Relational Model of Data for Large Shared Data Banks. Communications of the ACM, 13(6), 377–387.
[2] Darwen, H., & Date, C. J. (1998). Foundation for Future Database Systems: The Third Manifesto. Addison-Wesley.

Knowledge Engineeringand the 'Shortcomings' of SQL