24 June 2025

Thing vs Concept

The distinction between a "thing" and a "concept" lies at the heart of how we understand and categorize the world. A "thing" typically refers to a concrete, tangible entity that exists in reality, possessing specific properties and occupying space and time. A tree, a car, a human being – these are things. A "concept," on the other hand, is an abstract idea, a mental construct, or a generalization derived from observed things. "Forest," "transportation," "humanity" – these are concepts. The philosophy underpinning this difference is crucial when designing taxonomies and ontologies, which are structured systems for organizing knowledge.

In the realm of knowledge representation, particularly in domains like data science, artificial intelligence, and information management, deciding when to represent something as a concrete "thing" versus an abstract "concept" is not merely an academic exercise; it has profound practical implications. Taxonomies, which are hierarchical classifications, often start with concrete things and group them under broader concepts. For instance, a "Golden Retriever" (a thing, a specific breed) is classified under "Dog" (a more general concept), which falls under "Canine" (an even broader concept).

Ontologies, which provide a richer representation of knowledge by defining classes, properties, and relationships, demand an even more nuanced approach. Here, the interplay between "things" and "concepts" becomes vital. When constructing an ontology, one must determine whether an entity should be modeled as an individual instance (a "thing") or a class/category (a "concept"). For example, "my car" is a specific instance of a "Car," which is a class. The class "Car" is a concept, while "my car" is a thing.

It makes sense to use abstractions (concepts) when:

  1. Generalization is needed: To group similar things, allowing for easier reasoning and querying across diverse instances. For example, treating "Sedan," "SUV," and "Hatchback" as specific types under the abstract concept of "Car."
  2. Focus is on properties and relationships common to a group: If you want to define that all "Books" have "Authors" and "Titles," you define these properties on the concept "Book," not on every individual book.
  3. Scalability is a concern: Storing properties for every individual thing can be inefficient. Abstractions allow for a more compact and manageable knowledge base.
  4. Semantic clarity is paramount: Concepts provide the vocabulary and framework for understanding a domain, ensuring consistency in meaning.

Conversely, it is right to use concrete "things" (instances) when:

  1. Specificity is essential: When you need to refer to a particular entity with unique attributes, like "the Eiffel Tower" or "the specific transaction ID 12345."
  2. Tracking individual states or histories: If "my car" needs to track its mileage, service history, or current location, it must be represented as a distinct thing.
  3. Events or actions involving specific entities: "John bought a book" involves specific individuals ("John," a type of "person") and a specific item ("a book," an instance of the concept "Book").

The "rightness" of using an abstraction versus a concrete instance depends on the granularity required by the system and the questions it needs to answer. Over-abstracting can lead to a loss of valuable detail, making it impossible to query specific instances. Under-abstracting can lead to a bloated, unmanageable knowledge base that struggles with generalization. The challenge in taxonomy and ontology is to find the optimal balance, building robust models that allow for both generalized reasoning and detailed instance tracking, ensuring the structured knowledge reflects the complex interplay between the abstract and the tangible in our world.