Knowledge Engineering

The practice of capturing human knowledge and organising it into structures that machines can read, reason about, and act on.

What is it?

Every organisation, every field, every person carries knowledge — facts, relationships, rules, exceptions. Most of that knowledge lives in people’s heads, in unstructured documents, or in conversations. Knowledge engineering is the discipline of taking that messy human understanding and translating it into formal, structured representations that software can process.¹

The gap it bridges is fundamental: humans think in concepts, stories, and context. Machines think in data, schemas, and logic. Knowledge engineering sits in the middle — it asks “how do we encode what we know so a computer can use it reliably?”²

This isn’t a new field. Expert systems in the 1970s and 1980s were early knowledge engineering — human experts would sit with engineers to codify decision rules into software.¹ What has changed is scale and urgency. Modern AI systems (especially large language models) can process language, but they still struggle with structured reasoning. Knowledge engineering provides the scaffolding that makes AI systems more accurate, consistent, and trustworthy.³

A knowledge engineer’s work involves three core activities: acquiring knowledge from experts or sources, representing it in a formal structure (ontologies, taxonomies, knowledge graphs), and validating that the structure faithfully captures the original meaning.¹

In plain terms

Knowledge engineering is like creating a detailed map of everything an expert knows about a subject — not just the facts, but how they connect to each other. Without the map, a computer can store the facts but can’t navigate between them. With the map, it can answer questions, find patterns, and reason about new situations.

At a glance

The knowledge engineering pipeline (click to expand)
graph LR
    A[Human Knowledge] -->|Acquire| B[Raw Information]
    B -->|Structure| C[Formal Representation]
    C -->|Validate| D[Verified Knowledge Base]
    D -->|Apply| E[Machine Reasoning]
    E -->|Feedback| A
Key: Knowledge starts unstructured in human minds and documents. Knowledge engineering acquires it, structures it into formal representations (ontologies, graphs, schemas), validates it for accuracy, and makes it available for machine reasoning. Feedback from use reveals gaps, prompting further acquisition.

How does it work?

1. Knowledge acquisition

The first step is getting knowledge out of people’s heads and scattered documents into a usable form. This might involve interviewing domain experts, reading technical literature, analysing existing data, or observing workflows.¹

The challenge is that experts often can’t articulate what they know — they operate on intuition built over years. A knowledge engineer’s job is to make the implicit explicit.

Think of it like...

A journalist interviewing a master chef. The chef “just knows” when a sauce is ready. The journalist’s job is to turn that intuition into a recipe that anyone can follow — specifying temperatures, times, visual cues, and decision points the chef takes for granted.

Example: acquiring medical knowledge (click to expand)

Consider building a system to help doctors diagnose skin conditions. A knowledge engineer would:

Interview dermatologists about how they classify lesions

Review medical literature for diagnostic criteria

Analyse existing patient records for patterns

Document the decision tree: “If the lesion has irregular borders AND is asymmetric AND has changed colour, consider melanoma”

The output isn’t the AI system itself — it’s the structured knowledge that the system will use.

2. Knowledge representation

Once acquired, knowledge needs a formal structure. This is where ontologies, taxonomies, and knowledge graphs come in.²

A taxonomy is a hierarchy — categories within categories. For example: Animal > Mammal > Dog > Golden Retriever.
An ontology goes further — it defines not just categories but relationships between concepts. For example: “A Golden Retriever is a Dog, Dogs are kept by Owners, Owners live at Addresses.”²
A knowledge graph is the implementation — a network of nodes (concepts) and edges (relationships) that a machine can traverse and query.³

Think of it like...

A taxonomy is a filing cabinet (things in folders). An ontology is a filing cabinet with strings connecting related folders (this folder is related to that one, in this specific way). A knowledge graph is the entire room — filing cabinets, strings, labels, and a librarian who can follow any string to find what you need.

Concept to explore

See knowledge-graphs for how graph structures power modern search engines, recommendation systems, and AI reasoning.

3. Validation and maintenance

A knowledge base is only useful if it’s accurate. Validation ensures that the formal representation faithfully captures the original knowledge — no missing relationships, no contradictions, no stale information.¹

This is ongoing work. Knowledge changes: medical guidelines are updated, regulations evolve, new products are launched. A knowledge system without maintenance becomes a liability — it will confidently give outdated answers.

Key distinction

Data tells you what happened (the patient’s temperature was 38.5C). Knowledge tells you what it means (a temperature above 38C in a post-surgical patient warrants investigation). Knowledge engineering is about encoding the second kind — the interpretive layer that turns raw data into actionable understanding.

4. Machine-readable formats

For machines to use structured knowledge, it must be stored in formats they can parse: JSON-LD, RDF, OWL, or even well-structured YAML frontmatter.² The choice of format depends on the use case — web search engines prefer JSON-LD, enterprise systems often use RDF, and smaller systems may use simpler structured formats.

Concept to explore

See machine-readable-formats for the specific formats and standards used to store structured knowledge.

Yiuno knowledge system (click to expand)

This very knowledge system is an example of knowledge engineering in practice. Each concept card has structured YAML frontmatter (title, level, parent, children, prerequisites, related) that defines the concept’s position in a taxonomy. The body contains human-readable explanations, while the frontmatter provides machine-readable structure.

The generate-graph.py script reads this frontmatter and produces a graph.json file — a knowledge graph that maps every concept and its relationships. This is knowledge engineering at a small scale: human knowledge captured in structured form so that software can traverse and query the connections.

Why do we use it?

Key reasons

1. Making AI reliable. Large language models can generate fluent text, but they hallucinate. Grounding AI systems in structured knowledge bases dramatically reduces errors and provides verifiable sources for every answer.³

2. Preserving expertise. When a senior expert retires, their knowledge walks out the door — unless it has been captured in a structured, reusable form. Knowledge engineering makes institutional knowledge persistent and transferable.¹

3. Enabling interoperability. When two systems use the same ontology, they can exchange information seamlessly. Without shared structure, integration becomes fragile translation between incompatible formats.²

4. Powering discovery. Structured knowledge lets machines find connections humans miss. A knowledge graph can reveal that two research papers address the same problem from different angles, even if they use completely different terminology.

When do we use it?

When building an AI system that needs to reason about a specific domain (medicine, law, engineering)
When an organisation wants to capture and preserve the expertise of its people
When integrating multiple systems that need to share a common understanding of concepts
When creating a search or recommendation system that needs to understand relationships, not just keywords
When structuring a learning or documentation system where concepts have meaningful connections

Rule of thumb

If your problem involves “the computer needs to understand the relationships between things, not just store data about them,” you need knowledge engineering.

How can I think about it?

The cartographer analogy

A cartographer doesn’t create the territory — the mountains, rivers, and cities already exist. The cartographer’s job is to create a map that accurately represents the territory in a form that others can use to navigate.

A knowledge engineer is a cartographer of expertise. The knowledge already exists in experts’ heads and documents. The engineer’s job is to create a structured map — with clear labels, consistent scales, and explicit connections — so that machines (and other people) can navigate that knowledge reliably.

The territory = human knowledge and expertise

The survey = knowledge acquisition (interviews, research, analysis)

The map = the formal representation (ontology, knowledge graph)

Map legend and scale = the schema and validation rules

Using the map = machine reasoning over the knowledge base

The IKEA instruction manual analogy

You buy a bookshelf. The knowledge of how to build it exists in the designer’s head. But you can’t call the designer every time someone buys the product. So the company creates an instruction manual — a structured representation of the assembly knowledge.

The designer’s expertise = unstructured human knowledge

The instruction manual = the knowledge base

Numbered steps and diagrams = the formal structure (ontology)

Standard icons (screwdriver, hammer) = shared vocabulary (controlled terms)

Someone following the instructions = a machine reasoning over the knowledge

A bad manual (ambiguous, missing steps) leads to a wobbly bookshelf. A bad knowledge base leads to unreliable AI. The quality of the structure determines the quality of the outcome.

Concepts to explore next

Concept	What it covers	Status
knowledge-graphs	Graph structures that represent concepts and their relationships	complete
machine-readable-formats	JSON-LD, RDF, OWL, and other standards for encoding knowledge	stub
dikw-hierarchy	The Data-Information-Knowledge-Wisdom pyramid and what transforms raw data into meaning	complete
ontology	Formal specifications of shared conceptualisations --- classes, properties, individuals, axioms	complete

Some cards don't exist yet

A broken link is a placeholder for future learning, not an error.

Check your understanding

Test yourself (click to expand)

Explain what a knowledge engineer does, in terms a non-technical person would understand.

Distinguish between a taxonomy, an ontology, and a knowledge graph. How does each build on the previous one?

Name the three core activities in knowledge engineering and describe what each involves.

Interpret this scenario: a hospital has 20 years of patient records in free-text notes. A new AI diagnostic tool keeps giving inaccurate results. What role might knowledge engineering play in improving accuracy?

Connect knowledge engineering to AI reliability: why does structured knowledge reduce hallucination in language models?

Where this concept fits

Position in the knowledge graph
graph TD
    AIML[AI and Machine Learning] --> KE[Knowledge Engineering]
    AIML --> AS[Agentic Systems]
    KE --> KG[Knowledge Graphs]
    KE --> MRF[Machine-Readable Formats]
    KE --> DIKW[DIKW Hierarchy]
    KE --> O[Ontology]
    style KE fill:#4a9ede,color:#fff
Related concepts:

software-architecture — knowledge systems require architectural decisions about storage, access, and scalability

databases — knowledge bases are specialised databases optimised for relationships and reasoning

data-governance — structured knowledge must be governed: who can edit it, how changes are validated, how quality is maintained

Explorer

Knowledge Engineering

Knowledge Engineering

What is it?

At a glance

How does it work?

1. Knowledge acquisition

2. Knowledge representation

3. Validation and maintenance

4. Machine-readable formats

Why do we use it?

When do we use it?

How can I think about it?

Concepts to explore next

Check your understanding

Where this concept fits

Sources

Further reading

Graph View

Table of Contents

Backlinks

Explorer

Knowledge Engineering

Knowledge Engineering

What is it?

At a glance

How does it work?

1. Knowledge acquisition

2. Knowledge representation

3. Validation and maintenance

4. Machine-readable formats

Why do we use it?

When do we use it?

How can I think about it?

Concepts to explore next

Check your understanding

Where this concept fits

Sources

Further reading

Footnotes

Graph View

Table of Contents

Backlinks