Saturday, April 11, 2026

DataBooks: Markdown as Semantic Infrastructure

DataBooks are a design pattern that uses Markdown as semantic infrastructure to create self-describing, portable, and machine-processable documents. Unlike standalone data files like raw Turtle or JSON-LD, which lack context and processing instructions, a DataBook combines graph data, prose context, and provenance metadata into a single artifact that humans can read and machines can process.

Core Structure of a DataBook
A DataBook follows a specific pattern within a Markdown document:
  • YAML Frontmatter: Carries document metadata, provenance information, and processing instructions.
  • Typed Fenced Blocks: Contain the data payloads, such as Turtle or JSON-LD graph data, SPARQL queries, or even AI prompts and manifests.
  • Prose Sections: Provide the human-readable documentation and explanation for the data.
Role in the Semantic Web
DataBooks address a gap in the Semantic Web ecosystem—handling "small data" that does not require a full heavyweight database or triple store.
  • Local Ground Truth: They allow semantic content to travel between different systems without losing meaning, acting as a "holon" where the boundary condition and context travel with the artifact itself.
  • LLM Integration: In AI workflows, DataBooks invert the standard model. Instead of treating data as ephemeral context for an LLM, the DataBook becomes the persistent, auditable artifact, while the LLM acts as one of several "transformation engines" used to enrich or process it.
  • Auditable Pipelines: They include "process stamps" that record which transformer (AI or human) operated on what inputs at what time, creating a forensic trail for auditing and re-running pipeline stages.
Key Technologies and Tools
DataBooks leverage established W3C standards to ensure interoperability:
  • RDF (Resource Description Framework): Used to represent the data and dependency graphs.
  • SPARQL: Allows the contents and dependency manifests of DataBooks to be queried as first-class semantic artifacts.
  • Encryption Profiles: Designed to support sensitive data through encrypted fenced blocks that parsers can either decrypt or gracefully skip.
While not yet a formal specification, DataBooks are implementable today using a combination of Markdown, YAML, and standard RDF toolchains like Apache Jena or RDFLib. They are specifically suited for knowledge work that is currently fragmented, such as AI-assisted ontology development or cross-institutional data integration.
https://ontologist.substack.com/
 

The Ontologist

DataBooks: Markdown as Semantic Infrastructure

"Something has been missing from the semantic web stack for a long time, and it’s been hiding in plain sight.

The RDF ecosystem has always known how to handle large, persistent, well-indexed knowledge graphs. Triple stores, SPARQL endpoints, federated query — these are mature, well-understood tools for managing graph data at scale. What the ecosystem has never handled well is everything else: the small, contextual, task-specific, ephemeral, or pipeline-stage graph content that makes up the majority of actual knowledge work. The data that doesn’t need a database. The graph that lives for the duration of a process and then needs to be archived, referenced, or passed downstream. The semantic content that a human needs to read and a machine needs to process."


In Part I of this series, we introduced the DataBook format — a Markdown document that functions simultaneously as human-readable text, a typed data container, and a self-describing semantic artifact. We argued that Markdown, far from being a lightweight presentational format, carries the structural DNA needed to become a genuine semantic infrastructure layer.



AI architecture: Quonset Hut: 10x affordable housing?

housing does not need to be expensive and borring.

now with AI we can design 10x better 

Quonset hut - Wikipedia

Quonset hut /ˈkwÉ’nsɪt/ is a lightweight prefabricated structure of corrugated galvanized steel with a semi-circular cross-section.

What Was A Quonset Hut? The Genius Weather Proof Cabin You've Never Heard Of - YouTube

4x Quonset home-studio: 40 years of artistic simple living - YouTube

How He Built a Tiny Quonset Home Over a Stone Basement — And Made It 55° Warmer All Winter - YouTube

Quonset Hut with FULL BASEMENT - 4,000 Sq Ft for $247K! (Complete Cost Breakdown) - YouTube

Quonset Barndominium Kit Prices 2025: Real Costs Revealed ($39K-$411K Total Breakdown) - YouTube



AI (Gemini Nano Banana Flash) designs


ultra-modern, integrated with nature

passive-solar, with earth ship-like elements

super-affordable, modern, desirable


Real houses

Living in Quonset Hut Homes | Mother Earth News

Build A Quonset Hut Home And Curve Your Enthusiasm - The Tiny Life





3d tools: Sweet Home 3D, FreeCAD

3D drawing / design tools are challenging
no "good" solution, either complex and/or expensive, or very narrow and not flexible.
and mostly desktop, not web apps

Sweet Home 3D - Sweet Home 3D

Sweet Home 3D: Interior Design & Home Planner - Free download and install on Windows | Microsoft Store

FreeCAD: Your own 3D parametric modeler

A FreeCAD manual (PDF)

FreeCAD/FreeCAD: Official source code of FreeCAD, a free and opensource multiplatform 3D parametric modeler. @GitHub

LGPL, C++ & Python

Website • Documentation • Forum • Bug tracker • Git repository • Blog

Tutorials - FreeCAD Documentation



Building Information Modeling is a process used in architecture, engineering, and construction to create and manage digital representations of physical structures. It integrates not only 3D geometry but also critical data such as materials, costs, and schedules, allowing for advanced analysis and collaboration throughout the entire lifecycle of a project.


Rayon.design