{"id":1115,"date":"2025-11-30T23:02:46","date_gmt":"2025-11-30T14:02:46","guid":{"rendered":"https:\/\/www.mindware-jp.com\/en\/?p=1115"},"modified":"2025-12-01T09:40:01","modified_gmt":"2025-12-01T00:40:01","slug":"data-science-and-buddhism-from-the-ugly-duckling-theorem-to-emptiness-provisionality-and-the-middle-way","status":"publish","type":"post","link":"https:\/\/www.mindware-jp.com\/en\/2025\/11\/30\/data-science-and-buddhism-from-the-ugly-duckling-theorem-to-emptiness-provisionality-and-the-middle-way\/","title":{"rendered":"Data Science and Buddhism: From the \u201cUgly Duckling Theorem\u201d to Emptiness, Provisionality, and the Middle Way"},"content":{"rendered":"\n<p>The pioneering research that shaped today\u2019s \u201cdata science\u201d can be traced back to pattern recognition studies of the 1960s. In Japan, that era saw the development of machines capable of reading handwritten postal codes to sort mail at high speed. One of the intellectual leaders of that formative period, <strong>Satoshi Watanabe<\/strong>, proposed the remarkably evocative concept known as the <strong>\u201cUgly Duckling Theorem.\u201d<\/strong><\/p>\n\n\n\n<p>Despite its importance, this theorem is seldom taught in modern data science programs, and many data scientists\u2014especially those trained within frameworks dominated by machine learning engineering\u2014have never seriously engaged with its philosophical implications. Yet, once examined carefully, the theorem reveals profound connections to Buddhist philosophy, Kantian epistemology, Husserl\u2019s phenomenology, and Jakob von Uexk\u00fcll\u2019s concept of Umwelt (environmental world).<\/p>\n\n\n\n<p>This essay explores those connections and argues that a proper understanding of clustering requires the same cognitive attitude Buddhism describes as <strong>the Middle Way<\/strong> between emptiness and conventional appearance.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\u25a0 What Is the Ugly Duckling Theorem?<\/h2>\n\n\n\n<p>Watanabe famously stated:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>If we measure similarity by the number of predicates they share, then <strong>any two objects possess the same degree of similarity.<\/strong><\/p>\n<\/blockquote>\n\n\n\n<p>This sounds counterintuitive at first, but the logic is straightforward:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The world contains an unlimited number of possible attributes.<\/li>\n\n\n\n<li>Humans select only a tiny subset when comparing objects.<\/li>\n\n\n\n<li>Any two objects share some attributes and differ in others.<\/li>\n\n\n\n<li>Without assigning <strong>importance<\/strong> (weights) to attributes, all pairs of objects are equally similar.<\/li>\n<\/ul>\n\n\n\n<p>Therefore, <strong>similarity is not an inherent property of the data<\/strong>.<br>It is a consequence of <strong>which attributes we choose to value<\/strong>.<\/p>\n\n\n\n<p>Watanabe concluded:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>\u201cTo escape the Ugly Duckling Theorem, one must admit that some attributes are more important than others.\u201d<\/strong><br><strong>\u201cThe goal of clustering is not to discover \u2018objective\u2019 classes but to create <em>useful<\/em> new classifications.\u201d<\/strong><\/p>\n<\/blockquote>\n\n\n\n<p>However, many practitioners overlook this foundational point. They assume, often unconsciously, that:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A \u201ctrue\u201d structure exists inside the data.<\/li>\n\n\n\n<li>A clustering algorithm can reveal it.<\/li>\n\n\n\n<li>The result should coincide with everyday categories.<\/li>\n\n\n\n<li>The clustering with the highest quality metric (e.g., silhouette score) must be the \u201ccorrect\u201d one.<\/li>\n<\/ul>\n\n\n\n<p>Clustering metrics are indeed helpful, but treating them as arbiters of objective reality is na\u00efve and even dangerous. They are evaluative tools\u2014<strong>not detectors of ontological truth<\/strong>.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\u25a0 Philosophical Depth: Emptiness, the Thing-in-Itself, Phenomenology, and Umwelt<\/h2>\n\n\n\n<p>The Ugly Duckling Theorem resonates with several major philosophical traditions.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">\u25cf Buddhism\u2019s Doctrine of Emptiness (\u015a\u016bnyat\u0101)<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>No object possesses inherent essence or fixed identity.<\/li>\n\n\n\n<li>Categories, meanings, and distinctions arise only through human conceptual activity.<\/li>\n\n\n\n<li>Classification is therefore <em>empty<\/em>: it does not mirror an objective essence.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">\u25cf Kant\u2019s \u201cThing-in-Itself\u201d<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Humans can only perceive the world through the structure of human sensibility (space, time, categories).<\/li>\n\n\n\n<li>The world \u201cas it is\u201d has no color, smell, value, or meaning independent of perception.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">\u25cf Husserl\u2019s Phenomenology and \u201cEpoch\u00e9\u201d<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Suspend judgments and preconceived categories.<\/li>\n\n\n\n<li>Attend to phenomena as they appear, without the distortion of habitual interpretation.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">\u25cf Uexk\u00fcll\u2019s Umwelt<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Every organism inhabits a world structured by its sensory and bodily constitution.<\/li>\n\n\n\n<li>Each species lives in its own \u201cmeaningful world.\u201d<\/li>\n<\/ul>\n\n\n\n<p>Humans, too, live within a human-shaped Umwelt.<br>Classification cannot be objective because <strong>the world we see is already filtered through human embodiment and cognition.<\/strong><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\u25a0 The Buddhist Framework of \u201cThree Truths\u201d: Emptiness, Provisionality, and the Middle<\/h2>\n\n\n\n<p>The Tiantai (Tendai) philosopher <strong>Zhiyi<\/strong> articulated a threefold truth that maps surprisingly well onto the epistemology of data analysis:<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">1. <strong>Emptiness (k\u016b-tai):<\/strong><\/h3>\n\n\n\n<p>All phenomena lack inherent essence; classifications do not exist \u201cin themselves.\u201d<\/p>\n\n\n\n<p>In data terms:<br>The raw data contains <strong>no labels, no clusters, no \u201cnatural kinds.\u201d<\/strong><\/p>\n\n\n\n<h3 class=\"wp-block-heading\">2. <strong>Provisional Existence (ke-tai):<\/strong><\/h3>\n\n\n\n<p>Phenomena appear as meaningful, labeled, and structured <strong>from a human standpoint<\/strong>.<\/p>\n\n\n\n<p>In data terms:<br>Human goals, tasks, and interpretations create <strong>useful classifications<\/strong> for reasoning, communication, and decision-making.<\/p>\n\n\n\n<p>Clustering belongs entirely to this realm of <em>provisionality<\/em>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">3. <strong>The Middle (ch\u016b-tai):<\/strong><\/h3>\n\n\n\n<p>A non-dual perspective that embraces both emptiness and provisionality without clinging to either.<\/p>\n\n\n\n<p>This corresponds to the phenomenological <strong>epoch\u00e9<\/strong>\u2014neither assuming inherent structure (emptiness), nor denying the usefulness of structure (provisionality).<\/p>\n\n\n\n<p>Applied to data science, this is the attitude we must cultivate when working with clustering.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\u25a0 The Practical Analogy: Drawing, Pattern Perception, and the Artist\u2019s Eye<\/h2>\n\n\n\n<p>Epoch\u00e9 is notoriously difficult to practice.<br>A helpful analogy comes from the art of drawing.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Novice artists draw \u201ca nose\u201d or \u201can apple,\u201d guided by conceptual preconceptions.<\/li>\n\n\n\n<li>Skilled artists temporarily set aside the object\u2019s identity and observe <strong>pure shapes, values, proportions, edges<\/strong>.<\/li>\n<\/ul>\n\n\n\n<p>This is an act of suspending conceptual thought\u2014an epoch\u00e9.<br>In drawing, <strong>auxiliary lines<\/strong> help reveal structure:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>These lines do not exist in reality.<\/li>\n\n\n\n<li>Yet drawing them aids understanding and skillful depiction.<\/li>\n<\/ul>\n\n\n\n<p>Clustering serves the same function:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>A cluster is not a \u201creal boundary\u201d in the data.<br>It is an auxiliary line that helps us understand patterns.<\/strong><\/p>\n<\/blockquote>\n\n\n\n<p>Mistaking auxiliary lines for objective boundaries is the central error the Ugly Duckling Theorem warns against.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h2 class=\"wp-block-heading\">\u25a0 Conclusion: The Middle Way Is the Secret to Using Clustering Wisely<\/h2>\n\n\n\n<p>Clustering is powerful precisely because:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>It does <strong>not<\/strong> discover immutable categories.<\/li>\n\n\n\n<li>It <strong>creates<\/strong> classifications that are useful for a purpose.<\/li>\n\n\n\n<li>It reflects the analyst\u2019s decisions, values, and chosen attributes.<\/li>\n<\/ul>\n\n\n\n<p>Thus the true mastery of clustering lies in:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Emptiness:<\/strong> Recognizing that classifications are not inherently real.<\/li>\n\n\n\n<li><strong>Provisionality:<\/strong> Using classifications as tools for understanding and action.<\/li>\n\n\n\n<li><strong>The Middle:<\/strong> Maintaining flexibility, detachment, and methodological humility.<\/li>\n<\/ul>\n\n\n\n<p>This Middle Way is not only a Buddhist teaching but also the cognitive stance required for sophisticated data analysis.<\/p>\n\n\n\n<p>The Ugly Duckling Theorem invites us to see the limits of our models.<br>The Buddhist three truths teach us how to work productively within those limits.<br>Together, they form a philosophical foundation for a wiser, more reflective data science.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The pioneering research that shaped today\u2019s \u201cdata science\u201d can be traced back to pattern recognition studies of the 1960s. In Japan, that era saw the development of machines capable of reading handwritten postal codes to sort mail at high speed. One of the intellectual leaders of that formative period, Satoshi Watanabe, proposed the remarkably evocative [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5],"tags":[],"class_list":["post-1115","post","type-post","status-publish","format-standard","hentry","category-data-science"],"_links":{"self":[{"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/posts\/1115","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/comments?post=1115"}],"version-history":[{"count":2,"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/posts\/1115\/revisions"}],"predecessor-version":[{"id":1117,"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/posts\/1115\/revisions\/1117"}],"wp:attachment":[{"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/media?parent=1115"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/categories?post=1115"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.mindware-jp.com\/en\/wp-json\/wp\/v2\/tags?post=1115"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}