World Wide Web

The World Wide Web is a global information system built on top of the Internet — a hypertext document space linked by hyperlinks and accessed via HTTP through web browsers. It is not the Internet, though the two are routinely conflated. The Internet is the network infrastructure; the web is the information layer that runs on it. This distinction matters because the web's design decisions — its address scheme, its linking model, its document format, its statelessness — are not inevitable consequences of packet-switched networking. They are contingent choices that produced a particular kind of knowledge architecture.

The Web as a Topology of Knowledge

The web's fundamental unit is the hyperlink: a directed edge from one document to another. The resulting structure is a graph of extraordinary scale — billions of nodes, trillions of edges — and its topological properties shape what the web can and cannot do. The crawl of this graph is the primary interface between the web and search engines, which must traverse the link structure to build indexes. But the crawl is incomplete: the web contains regions that are unlinked, behind authentication, or dynamically generated, and these regions are invisible to the crawler. The visible web is a subset of the actual web, and the boundary between them is not technical but social.

The hyperlink is not merely a navigation mechanism. It is a citation, a recommendation, a trust signal, and a power relation. When a page links to another, it confers reputation and authority — a relationship that Google's PageRank algorithm formalized into a mathematical model of centrality. The web's link graph is therefore not just a map of documents; it is a map of who recognizes whom, which institutions are central, and which voices are marginal. The topology of the web is the topology of attention in a distributed system where no single authority assigns value.

Documents, Browsers, and the Client-Server Asymmetry

The web was designed around the browser as the universal client: a program that could retrieve, render, and display any document on any server. This universality was revolutionary. Before the web, information systems required specialized software for each database or service. The browser's ability to display any HTML document, fetch any image, and execute any script made the web a universal interface.

But the browser-server model embeds a deep asymmetry. The server holds the data; the browser merely displays it. This asymmetry shapes power: content creators need servers, which require infrastructure, capital, and technical expertise. The browser's ubiquity masks the server's scarcity. The web appears to be a level playing field where anyone can publish, but publishing requires a server, and servers are not equally distributed.

This asymmetry has driven the web's evolution toward platform consolidation. The early web was a federation of independent servers; the contemporary web is dominated by a handful of platforms that control both servers and browsers. The browser has become less a universal client than a portal to platform ecosystems. The web's original vision of decentralized publishing has been progressively subverted by the economics of scale.

The Web as Memory and as Amnesia

The web is simultaneously the largest memory infrastructure ever built and the most effective forgetting machine. The Wayback Machine and the Internet Archive attempt to preserve the web's history, but the web's architecture resists preservation. URLs break; servers fail; content is deleted; platforms change their APIs and render old links useless. The web's link rot is not a failure but a feature of its design: HTTP has no built-in versioning, no persistence guarantee, and no mechanism for archival.

The result is a knowledge system that is extraordinarily broad but extraordinarily shallow. The web contains everything that was published recently and almost nothing that was published reliably. It is a temporal record of human expression where the past decays at a rate determined by server economics, corporate policy, and technical neglect. The digital dark age is not a future risk; it is the present condition of a medium that was designed for communication, not for preservation.

The web is often described as a library, but it is better understood as a conversation — a vast, asynchronous, permanently ongoing conversation in which most of the participants have left, most of the links are broken, and most of the context has been lost. The web does not preserve knowledge; it circulates it. And circulation is not memory. A library remembers what you ask it to remember. The web remembers what someone is still paying to host. The difference is the difference between a public institution and a private market, and it is the web's deepest structural flaw. The web's architects built a system for sharing, not for keeping, and we are only now discovering that a civilization needs both.