Mindware Research Institute

Concept Research – AI powered Creative Information Analysis

How to use Clustering Quality Measures

October 4, 2023
By Kunihiro TADA In Data Science

In recent years it has become common to calculate clustering quality measures in cluster analysis, and many academic societies now require papers to report such measures as justification for adopting a particular clustering. To be honest, I have mixed feelings about this, because I feel that the purpose of cluster analysis has been lost. Cluster analysis is an exploratory method for finding “useful” clusters within the problem the user is working on, not a method for determining an objective classification.

Computationally, cluster analysis extracts clusters of data points based on their mutual distances or their density distribution in a multidimensional space. It is like looking at clouds floating in the sky: like clouds, data clusters are very fuzzy. Just as there is little point in asking how many clouds there are, there is little point in trying to determine a single correct number of clusters.

Sequentially merging smaller clusters increases the within-cluster variance. A clustering quality measure may, for example, look for the merge at which this variance grows sharply and assign a higher quality score to the clustering just before that merge. This helps find a more “natural” clustering. However, even if a measure singles out a specific number of clusters, it is not a mistake to adopt a different number.
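The merging idea described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the measure used by any particular product: it assumes Ward-style merging (always merge the pair of clusters that increases the within-cluster sum of squares the least) and scores each merge by how much its cost grows relative to the previous merge. The function names, the `max_k` cut-off, and the toy data are all hypothetical.

```python
from itertools import combinations


def sse(points):
    """Sum of squared distances from each point to the cluster centroid."""
    n = len(points)
    dims = range(len(points[0]))
    centroid = [sum(p[d] for p in points) / n for d in dims]
    return sum((p[d] - centroid[d]) ** 2 for p in points for d in dims)


def merge_history(points):
    """Greedy Ward-style agglomeration (assumed merging rule).

    Returns one (clusters_before_merge, sse_increase) tuple per merge,
    from n singleton clusters down to a single cluster.
    """
    clusters = [[p] for p in points]
    history = []
    while len(clusters) > 1:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            # Growth in within-cluster sum of squares if i and j are merged.
            cost = sse(clusters[i] + clusters[j]) - sse(clusters[i]) - sse(clusters[j])
            if best is None or cost < best[0]:
                best = (cost, i, j)
        cost, i, j = best
        history.append((len(clusters), cost))
        merged = clusters[i] + clusters[j]
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)]
        clusters.append(merged)
    return history


def suggest_k(history, max_k=8, eps=1e-12):
    """Cluster count just before the merge with the largest growth ratio.

    Only candidates with at most max_k clusters are considered
    (a hypothetical cut-off to ignore the noisy earliest merges).
    """
    best_k, best_ratio = None, 0.0
    for (_, prev_cost), (k, cost) in zip(history, history[1:]):
        if k > max_k:
            continue
        ratio = cost / max(prev_cost, eps)
        if ratio > best_ratio:
            best_k, best_ratio = k, ratio
    return best_k


# Hypothetical toy data: three tight groups of five points each.
offsets = [(0.0, 0.0), (0.3, 0.0), (0.0, 0.3), (-0.3, 0.0), (0.0, -0.3)]
centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
points = [(cx + dx, cy + dy) for cx, cy in centers for dx, dy in offsets]

history = merge_history(points)
print(suggest_k(history))
```

On this toy data the sketch suggests three clusters, because merging any two of the three groups makes the within-cluster sum of squares jump. The suggestion is only a default: nothing prevents the analyst from reading `history` and choosing a different number of clusters that is more useful for the problem at hand.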

For example, if you are developing a new product and want to establish target consumer personas for that product, cluster analysis can be helpful. If there are many competing products in the market and very tight differentiation is required, it makes sense to adopt a finer-grained clustering than the default one (that is, one with more, smaller clusters). In most cases, the default clustering only indicates broad categories of products.

The characteristics of each cluster must be analyzed to determine which consumers to target and what products to offer. This is called profile analysis. Cluster analysis cannot be completed with data clustering alone; it must be integrated with profile analysis. In other words, it can be called a “concept” analysis. In traditional philosophy, concepts are defined by “intension” and “extension”. Intension refers to the common properties of a concept, and extension refers to the examples the concept includes. There is no contradiction between the way we “develop product concepts” in business and the way concepts are understood in philosophy.
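As a rough illustration of profile analysis, the sketch below takes records that have already been assigned to clusters and reports, for each cluster, how far its attribute means deviate from the overall means (a simple stand-in for the “intension”), together with the list of members (the “extension”). The function name, the attribute names, and the consumer data are hypothetical, and mean deviation is only one of many possible ways to profile a cluster.

```python
from collections import defaultdict


def cluster_profiles(records, labels, feature_names):
    """Return (profiles, members) for records already assigned to clusters.

    profiles[label][feature] is the cluster mean minus the overall mean.
    members[label] is the list of records that belong to the cluster.
    """
    n = len(records)
    overall = {
        name: sum(rec[i] for rec in records) / n
        for i, name in enumerate(feature_names)
    }

    members = defaultdict(list)
    for rec, label in zip(records, labels):
        members[label].append(rec)

    profiles = {}
    for label, rows in members.items():
        profiles[label] = {
            name: sum(r[i] for r in rows) / len(rows) - overall[name]
            for i, name in enumerate(feature_names)
        }
    return profiles, dict(members)


# Hypothetical consumer data: (age, monthly spend), with given cluster labels.
feature_names = ["age", "monthly_spend"]
records = [(20, 100), (30, 200), (60, 500), (50, 400)]
labels = [0, 0, 1, 1]

profiles, members = cluster_profiles(records, labels, feature_names)
for label in sorted(profiles):
    print(label, profiles[label], members[label])
```

In this toy example cluster 0 is younger and spends less than average, while cluster 1 is older and spends more. Reading such deviations side by side is what turns a bare partition of the data into a description of who is in each cluster.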

It is no exaggeration to say that a self-organizing map (SOM) is a tool for expressing the concepts inherent in data. The combination of SOM and cluster analysis is a very powerful tool for conceptual analysis.

In order to delve deeper into the essence of cluster analysis, next time I will discuss the “ugly duckling theorem.”

Written by:

Kunihiro TADA

He has been an observer of industrial booms from the early 1980s to the present day. From 1982 he was a planner of high-tech seminars at the Japan Technology and Economy Centre, and of seminars and research projects at JMA Consulting. In 1986 he organised AI chip seminars on fuzzy inference and other topics, which triggered the fuzzy boom. After working as a freelance writer on CG and multimedia, he founded the Mindware Research Institute, which has sold the Japanese version of Viscovery SOMine since 2000, and Hugin and XLSTAT in Japan since 2003.

Daiichi Central Bldg. 6-36, Honmachi, Okayama Kita-ku, 700-0901, Japan
info@mindware-jp.com
+81-86-226-0028
