What Is Entropy? A Measure of Just How Little We Really Know.

time8machine
Home
Gallery
About
FiFlow
Dark Energy & Dark Matter
Super Intelligence Agency
Research Proposal
Reflective Model
Triadic AGI Prototype
ⰆThe HiveⰆ 🐝
AGI
Basic Research
Uncertainty Principle
HYPOTHESIX
Theory Simulation
Particle Physics
Emergent “Sixth Sense”
DJ 〽️
Algorithmic Sixth Sense
Metacognition Algorithm
Metacognition Assistant
KAELUS
HNCA-AGI
Making Sense
Aurora Singularity
EVE
Sixth Sense
Modeling Reality
PHYSICIST
TASK EXTRACTOR
Research Paper
March 21, 2013
TruthSeeker
Core Concept
Dream
AGI-OPTICS v.4
SEE
time8machine
Home
Gallery
About
FiFlow
Dark Energy & Dark Matter
Super Intelligence Agency
Research Proposal
Reflective Model
Triadic AGI Prototype
ⰆThe HiveⰆ 🐝
AGI
Basic Research
Uncertainty Principle
HYPOTHESIX
Theory Simulation
Particle Physics
Emergent “Sixth Sense”
DJ 〽️
Algorithmic Sixth Sense
Metacognition Algorithm
Metacognition Assistant
KAELUS
HNCA-AGI
Making Sense
Aurora Singularity
EVE
Sixth Sense
Modeling Reality
PHYSICIST
TASK EXTRACTOR
Research Paper
March 21, 2013
TruthSeeker
Core Concept
Dream
AGI-OPTICS v.4
SEE
More
  • Home
  • Gallery
  • About
  • FiFlow
  • Dark Energy & Dark Matter
  • Super Intelligence Agency
  • Research Proposal
  • Reflective Model
  • Triadic AGI Prototype
  • ⰆThe HiveⰆ 🐝
  • AGI
  • Basic Research
  • Uncertainty Principle
  • HYPOTHESIX
  • Theory Simulation
  • Particle Physics
  • Emergent “Sixth Sense”
  • DJ 〽️
  • Algorithmic Sixth Sense
  • Metacognition Algorithm
  • Metacognition Assistant
  • KAELUS
  • HNCA-AGI
  • Making Sense
  • Aurora Singularity
  • EVE
  • Sixth Sense
  • Modeling Reality
  • PHYSICIST
  • TASK EXTRACTOR
  • Research Paper
  • March 21, 2013
  • TruthSeeker
  • Core Concept
  • Dream
  • AGI-OPTICS v.4
  • SEE
  • Sign In
  • Create Account

  • My Account
  • Signed in as:

  • filler@godaddy.com


  • My Account
  • Sign out

Signed in as:

filler@godaddy.com

  • Home
  • Gallery
  • About
  • FiFlow
  • Dark Energy & Dark Matter
  • Super Intelligence Agency
  • Research Proposal
  • Reflective Model
  • Triadic AGI Prototype
  • ⰆThe HiveⰆ 🐝
  • AGI
  • Basic Research
  • Uncertainty Principle
  • HYPOTHESIX
  • Theory Simulation
  • Particle Physics
  • Emergent “Sixth Sense”
  • DJ 〽️
  • Algorithmic Sixth Sense
  • Metacognition Algorithm
  • Metacognition Assistant
  • KAELUS
  • HNCA-AGI
  • Making Sense
  • Aurora Singularity
  • EVE
  • Sixth Sense
  • Modeling Reality
  • PHYSICIST
  • TASK EXTRACTOR
  • Research Paper
  • March 21, 2013
  • TruthSeeker
  • Core Concept
  • Dream
  • AGI-OPTICS v.4
  • SEE

Account

  • My Account
  • Sign out

  • Sign In
  • My Account

Why Web-Trained Multimodal Models Do Not See


Abstract

Contemporary multimodal artificial intelligence systems, particularly large language models augmented with vision and audio capabilities, are frequently described as systems that "see." This dissertation argues that such descriptions are category errors. Despite impressive performance on perceptual benchmarks, web-trained multimodal models lack the necessary architectural, dynamical, and epistemic properties required for genuine seeing. Drawing on computational neuroscience, statistical physics, cognitive science, and philosophy of perception, this work demonstrates that current systems perform context-conditioned inference rather than perception grounded in persistent world-models. A formal distinction is developed between retrodictive plausibility engines and predictive seeing systems. The dissertation concludes by proposing a principled research program for machine perception based on dynamical world-models, temporal coherence, and falsifiable commitments to reality.


Chapter 1: Introduction — The Misuse of the Term "Seeing"

1.1 Motivation


The term "seeing" has been widely applied to modern AI systems that classify images, describe scenes, and answer questions about visual inputs. This chapter argues that such usage conflates behavioral success with perceptual ontology. The motivation of this work is to clarify what seeing entails, why current systems do not meet that standard, and why this distinction is essential for progress toward artificial general intelligence.


1.2 Research Questions


  • What constitutes seeing as a computational process?
  • What architectural properties are necessary for perception rather than classification?
  • Why does web-scale multimodal training fail to produce these properties?
  • How can a provable distinction be drawn between seeing and non-seeing systems?


1.3 Central Thesis


Web-trained multimodal models do not see because they lack persistent world-states, predictive commitments, and mechanisms for belief revision under uncertainty. They model correlations in appearance, not the dynamics of reality.

Copyright © 2025 time8machine - All Rights Reserved.

Powered by

  • Super Intelligence Agency

This website uses cookies.

We use cookies to analyze website traffic and optimize your website experience. By accepting our use of cookies, your data will be aggregated with all other user data.

Accept