I'm not sure when the phrase 'world model' came into my lexicon, but by 3ix25 I noted that
...MYKeywords are a collection of concepts that seem to be at the core of MY enterprise, that I'm working to understand the subtleties and interconnections of ==> my World Model, and the guidebook for curation of my Collections...and on 4x25:
...the piles of books thatb embody the tenuous links I'm attempting to pull into a coherent Narrative of my lifelong and continuing Development of my World Model, the Anschauung of my Welt, my exploration of the resources around me that compose who I am and how I got here, for not much reason beyond the exploraion itself. It's somewhat like assembling a complex and multi-dimensional jigsaw puzzle, around the armature of language.That woukld be grandiose if I was doing it for anything beyond myself, and grandiosity is simply unnecessary, and embarrassing to be caught at.
...The thing about the jigsaw analogy is that what I'm navigating has no straight edges, and continues to grow as one pieces it, and in more than two dimensions. And there's no finishing it, no taking it apart and returning it to its box. Macramé is a more accurate analogy to what I aim to be constructing... zto be seen to be doing: exploring, finding, assembling pieces. If only I'd had that perspective when I started teaching in 1973...
Today I googled 'world model' and was surprised to discover that Google sees it as a term in the AI lexicon:
What Is a World Model? nvidia.com...World models are generative AI models that understand the dynamics of the real world, including physics and spatial properties. They use input data, including text, image, video, and movement, to generate videos. They understand the physical qualities of real-world environments by learning to represent and predict dynamics like motion, force, and spatial relationships from sensory data.World Models: Can agents learn inside of their own dreams? github.io
We explore building generative neural network models of popular reinforcement learning environments. Our world model can be trained quickly in an unsupervised manner to learn a compressed spatial and temporal representation of the environment. By using features extracted from the world model as inputs to an agent, we can train a very compact and simple policy that can solve the required task. We can even train our agent entirely inside of its own dream environment generated by its world model, and transfer this policy back into the actual environment.LLMs, Make Room For World Models Brian Hopkins at Forrester.com
...World Models Are Emerging And ImportantAt the frontier of AI research lives a potentially huge development: world models. Technically, a world model is a neural network architecture for learning through observation and prediction. But don't confuse it with predictive analytics. The ambition for world models is no less than approximating human observation, learning, reasoning, planning, and acting ... in other words, thinking. For those who like to read the literature, world models were first named in this research paper from David Ha in 2018. Yann LeCun from Meta is the most prominent AI researcher working on an entire cognitive architecture based on world models.
'World Models,' an Old Idea in AI, Mount a Comeback John Pavlus at Quanta Magazine
You're carrying around in your head a model of how the world works. Will AI systems need to do the same?...a world model: a representation of the environment that an AI carries around inside itself like a computational snow globe. The AI system can use this simplified representation to evaluate predictions and decisions before applying them to its real-world tasks.
...does this mean that AI researchers have finally found a core concept whose meaning everyone can agree upon? As a famous physicist once wrote (opens a new tab): Surely you're joking. A world model may sound straightforward — but as usual, no one can agree on the details (opens a new tab). What gets represented in the model, and to what level of fidelity? Is it innate or learned, or some combination of both? And how do you detect that it's even there at all?
It helps to know where the whole idea started. In 1943, a dozen years before the term "artificial intelligence" was coined, a 29-year-old Scottish psychologist named Kenneth Craik published an influential monograph in which he mused that "if the organism carries a 'small-scale model' of external reality ...within its head, it is able to try out various alternatives, conclude which is the best of them ... and in every way to react in a much fuller, safer, and more competent manner." Craik's notion of a mental model or simulation presaged the "cognitive revolution" [George Miller 2003] that transformed psychology in the 1950s and still rules the cognitive sciences today. What's more, it directly linked cognition with computation: Craik considered the "power to parallel or model external events" to be "the fundamental feature" of both "neural machinery" and "calculating machines."
...In the past few years, as the large language models behind chatbots like ChatGPT began to demonstrate emergent capabilities that they weren't explicitly trained for — like inferring movie titles from strings of emojis, or playing the board game Othello (opens a new tab) — world models provided a convenient explanation for the mystery. To prominent AI experts such as Geoffrey Hinton, Ilya Sutskever and Chris Olah, it was obvious: Buried somewhere deep within an LLM's thicket of virtual neurons must lie "a small-scale model of external reality," just as Craik imagined.
The truth, at least so far as we know, is less impressive. Instead of world models, today's generative AIs appear to learn "bags of heuristics": scores of disconnected rules of thumb that can approximate responses to specific scenarios, but don't cohere into a consistent whole. (Some may actually contradict each other.) It's a lot like the parable of the blind men and the elephant, where each man only touches one part of the animal at a time and fails to apprehend its full form. One man feels the trunk and assumes the entire elephant is snakelike; another touches a leg and guesses it's more like a tree; a third grasps the elephant's tail and says it's a rope. When researchers attempt (opens a new tab) to recover evidence of a world model from within an LLM — for example, a coherent computational representation of an Othello game board — they're looking for the whole elephant. What they find instead is a bit of snake here, a chunk of tree there, and some rope.
...Given the benefits that even simple world models can confer, it's easy to understand why every large AI lab is desperate to develop them — and why academic researchers are increasingly interested in scrutinizing them (opens a new tab), too. Robust and verifiable world models could uncover, if not the El Dorado of AGI, then at least a scientifically plausible tool for extinguishing AI hallucinations, enabling reliable reasoning, and increasing the interpretability of AI systems.
LLMs and World Models, Part 1 How do Large Language Models Make Sense of Their "Worlds"? Melanie Mitchell and Part 2
...there's a fiery debate in the AI community on how these systems achieve their high performance. Have they basically memorized their training data and then retrieve it (in some "approximate" way) to solve new problems? Have they learned much more numerous and detailed, yet still brittle, heuristic shortcuts? Or do they have something more like the robust "world models" that humans seem to use to understand and act in the world?OpenAI co-founder Ilya Sutskever asserts that these systems have learned robust world models:
"When we train a large neural network to accurately predict the next word in lots of different texts....it is learning a world model.... This text is actually a projection of the world.... What the neural network is learning is more and more aspects of the world, of people, of the human conditions, their hopes, dreams, and motivations...the neural network learns a compressed, abstract, usable representation of that."
...The term "world model" has become a buzzword in AI circles, but it doesn't have a single, agreed-upon, definition. Here are a few definitions of a world model from the AI literature.
"[I]nternal representations that simulate aspects of the external world."
"[R]epresentations which preserve the causal structure of the environment as far as is necessitated by the tasks an agent needs to perform."
"[S]tructure-preserving, behaviorally efficacious representations of the entities, relations, and processes in the real world. These representations capture, at an abstract level, their counterpart real-world processes (which typically involve causal relations), in algorithmically efficient forms, to support relevant behaviors."
These informal definitions emphasize that world models exist in an organism's brain or in, say, a LLM's neural network, that they capture something about the world that is causal and abstract (or compressed) rather than simply based on large sets of statistical associations; they don't require too much work for the agent to use ("algorithmically efficient") and are relevant to tasks the agent performs.
...It's important to note that our world models don't just exist for the real world; they can also be formed and used to reason about imaginary worlds, such as those created in science fiction or fantasy literature.
(links to Language Models, World Models, and Human Model-Building Jacob Andreas, which offers 3 models:
...The map, the orrery, and the simulator are all models of the same underlying system. Where they differ is in their affordances—the set of questions they enable a user of the model to answer, and the actions the user needs to take in order to obtain those answers. The map lets us answer static, timeless information that can be obtained by from some prior snapshot of system state. The orrery lets us answer conditional questions about the past and future states of the system, by additionally giving us a crank that moves it forward or backward in time. And the simulator lets us answer counterfactual questions of the system by representing something close to its true underlying dynamics (but requires us to do substantially more work to specify the initial conditions for these counterfactuals).With these differences in affordances come differences in the complexity required to implement each model. You can make a map with stone-age technology, and build the orrery in a 17th-century goldsmith's shop, but can really only produce the simulator with 20th-century technology (from chip fabs to FORTRAN compilers).
and he cites Evaluating the World Model Implicit in a Generative Model Keyon Vafa et al (2024)
A world model: On the political logics of generative AI Louise Amoore et al. Political Geography (2024)
...generative AI is shaping and delimiting the political parameters of what can be known and actioned in the world. Contra the promise of a generalizable "world model" in computer science, the article addresses how and why generative AI gives rise to a model of the world, and with it a set of political logics and governing rationalities that have profound and enduring effects on how we live today....This orientation of AI towards general discovery of "how the world works" and adaptation to new domains and tasks is captured by what LeCun calls a "world model", which would supply "an internal model of how the world works" so that AI becomes "configurable" to each new situation it encounters (2022:2-3). The powerful claim that a flexible, reconfigurable world model could deal with all potential future unencountered situations defines much of the politics of contemporary generative AI. It is a claim that promises a general resolution of difficult problems across technical computational and political paradigms: an AI model that draws upon a structure of "how the world works" in order to respond to an input it had never encountered in training; and a political model that is always capable of action in the face of the unencountered situation. Though the concept of a world model is present in the ambitions of AI designers for a better and more adaptive "fit" to the world, it is also present in the critical concerns of the humanities and social sciences, where it is said, for example, that "GPT3 does not have a model of the world" whereas "every human grows up with a model of the world" (Hayles, 2023, p. 258). In these formulations — spanning the AI proponents' desires and the critics' disquiet —
a model of the world defines something that AI lacks, whether due to its inefficiencies, its absence of embodiment, or its need to incorporate and learn from prior experiences.
World dynamics revisited: a realistic world model simulation Chaweng Chagchit and Joe H. Mize Socio-Economic Planning Sciences 1990
This paper investigates and reviews the world models introduced by J.W. Forrester of MIT. A survey of responses to the model and current trends in the global modeling approach are also provided.From the review, it is noted that the assumptions used in the MIT models tend to reflect a generally pessimistic point of view. On the other hand, several critics of the models tend to be overly optimistic. This paper presents a modified world model in which two new variables (technology and pollution abatement) are introduced into (he original model, adding to the existing variables (natural resources, population, pollution, capital investment and agriculture).
Three-world model Wikipedia
Data check on the world model that forecast global collapse Gaya Herrington The Club of Rome (2021)
In the 1972 bestselling book Limits to Growth (LtG), the authors (Meadows, Meadows, Randers & Behrens) concluded that if humanity kept pursuing economic growth without regard for environmental costs, global society would experience sharp declines in available food, standards of living, and ultimately the human population, within the 21st century.The LtG authors used a dynamic systems model, World3, to study key interactions between global variables for population, fertility, mortality, industrial output per capita (p.c.), food p.c., services p.c., nonrenewable resources, and pollution. World3 is based on the work of Forrester (e.g, 1971; 1975), at the time a professor at MIT and the founder of system dynamics: a modeling approach for the interactions between parts of a system, which often produce non-linear behavior like delays, feedback loops, and exponential growth or decline.,
Do we have an internal model of the outside world? Michael F Land Philos Trans R Soc Lond B Biol Sci (2014)
...Space appears as continuum, independent of the objects that from time to time populate it. It is the continuum rather than the particular contents that appears to remain still when, for example, we look around a room. On this view, our consciously perceived phenomenal world is a hybrid: the machinery of the precuneus provides a temporarily stable and sparsely populated world model, which we can use as an index for finding the sources information we need for actionPrinciples of Modeling: Real World - Model World Tony Starfield(pdf)
...the four basic elements of model construction:
(1) the real world, which we attempt to model
(2) the model world, which is a simplified version of the real world
(3) the model, containing the working parts to run the model
(4) the data, which is required to run the modelFundamental concepts of a world model? Interactive Fiction Community Forum
All that gives us some idea how 'world model' is being used as a concept in several realms, but it doesn't connect very well with what I thought I meant when I started to consider my 'world model' and how it draws upon MYKeywords, or help much with a Narrative of how my 'world model' has developed.
18x25
On 15vii25 I noted:
What Education should be aimed at is the learner's Development (a Process) of "a world model". It's really not that one "world model" is CORRECT, but that there might be many that exist, all workingtoward 'best' for the individual learner's own ...needs? purposes? And the point is to be continuing to BUILD one's own.and on 17vii25:
MY world model has its feet in the physical and the biotic, in geography and ecology and temporal processes in which Homo sapiens is engaged...MY world model inv olves Systems in which life is susceptible to human control, with the caveat of discounts for hubris, error/omission, and short-sightedness... [This was] a product of Whole Earth involvement/influence, during the Stanford years, and via CoEvolution Quarterly and Whole Earth Review through the 1970s.
It occurred to me to begin to collect a World Model bibliography of books on my shelves that could have a productive conversation if they were gathered together on a cart:
Field Guide to the Patchy Anthropocene: The New Nature Anna Lowenhaupt Tsing et al. (2024)Man's Role in Changing the Face of the Earth William Thomas (1956)
Something New Under the Sun: An Environmental History of the Twentieth-Century World JR McNeill (2001)
What is Environmental History? J Donald Hughes (2015)
An Environmental History of the World: Humankind's Changing Role in the Community of Life j Donal Hughes (2010)
Ecological Imperialism: The Biological Expansion of Europe, 900-1900 Alfred W Crosby (1993)
Europe and the People Without History Eric Wolf (1982)
World-Systems Analysis: An Introduction Immanuel Wallerstein (2004)
A World-Systems Reader: New Perspectives on Gender, Urbanism, Cultures, Indigenous Peoples, and Ecology Thomas D Hall (2000)
Friction: An Ethnography of Global Connection Anna Lowenhaupt Tsing (2024)
The Great Acceleration: An Environmental History of the Anthropocene since 1945 JR McNeill (2016)
Environmental Anthropology: A Historical Reader Michael Dove (2007)
Geographies of Global Change: Remapping the World RJ Johnston (2002)
Changing the Face of the Earth: Culture, Environment, History Ian G Simmons (1996)
The Human Impact on the Natural Environment Andrew Goudie (2000)
Technology: A World History Daniel R Headrick (2009)
The Atlas of a Changing Climate: Our Evolving Planet Visualized with More Than 100 Maps, Charts, and Infographics Brian Buma (2021)
The Systems View of Life: A Unifying Vision Fritjof Capra and Pier Luigi Luisi (2016)
The Great Derangement: Climate Change and the Unthinkable Amitav Ghosh (2017)
Thinking in Systems Donella H Meadows (2008)
Limits to Growth: The 30-Year Update Donella Meadows (2004)
Spatial organization;: The geographer's view of the world Ronald F Abler et al. 1971
(there will be more)