Between world models and model worlds: on generality, agency, and worlding in machine learning

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Standard

Between world models and model worlds: on generality, agency, and worlding in machine learning. / Mitrokhov, Konstantin.
in: AI and Society, 07.10.2024.

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Harvard

APA

Vancouver

Mitrokhov K. Between world models and model worlds: on generality, agency, and worlding in machine learning. AI and Society. 2024 Okt 7. Epub 2024 Okt 7. doi: 10.1007/s00146-024-02086-9

Bibtex

@article{8c755b71bdda444b95b77242c374341a,
title = "Between world models and model worlds: on generality, agency, and worlding in machine learning",
abstract = "The article offers a discursive account of what generality in machine learning research means and how it is constructed in the development of general artificial intelligence from the perspectives of cultural and media studies. I discuss several technical papers that outline novel architectures in machine learning and how they conceive of the “world”. The agency to learn and the learning curriculum are modulated through worlding (in the sense of setting up and unfolding of the world for artificial agents) in machine learning engineering. In recent computer science articles, large models trained on Internet-scale datasets are framed as general world simulators—despite their partiality, historicity, finite nature, and cultural specificity. I introduce the notion of “model worlds” to refer to composable interactive environments designed for the purpose of machine learning that partake in legitimising that claim. I discuss how large models are grounded through interaction in model worlds, arguing that model worlds mediate between the sheer scale of language models and their hypothetical capacity to generalise to new tasks and domains, rehashing the empiricist logic of “big data”. Further, I show that the emerging capacity of artificial agents to generalise redraws the epistemic boundary between artificial agents and their learning environments. Consequently, superficial statistics of language models and abstract action are made meaningful in distilled model worlds, giving rise to synthetic agency.",
keywords = "Agency, AGI, Artificial cognition, Data, Worlding",
author = "Konstantin Mitrokhov",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
month = oct,
day = "7",
doi = "10.1007/s00146-024-02086-9",
language = "English",
journal = "AI and Society",
issn = "0951-5666",
publisher = "Springer",

}

RIS

TY - JOUR

T1 - Between world models and model worlds

T2 - on generality, agency, and worlding in machine learning

AU - Mitrokhov, Konstantin

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024/10/7

Y1 - 2024/10/7

N2 - The article offers a discursive account of what generality in machine learning research means and how it is constructed in the development of general artificial intelligence from the perspectives of cultural and media studies. I discuss several technical papers that outline novel architectures in machine learning and how they conceive of the “world”. The agency to learn and the learning curriculum are modulated through worlding (in the sense of setting up and unfolding of the world for artificial agents) in machine learning engineering. In recent computer science articles, large models trained on Internet-scale datasets are framed as general world simulators—despite their partiality, historicity, finite nature, and cultural specificity. I introduce the notion of “model worlds” to refer to composable interactive environments designed for the purpose of machine learning that partake in legitimising that claim. I discuss how large models are grounded through interaction in model worlds, arguing that model worlds mediate between the sheer scale of language models and their hypothetical capacity to generalise to new tasks and domains, rehashing the empiricist logic of “big data”. Further, I show that the emerging capacity of artificial agents to generalise redraws the epistemic boundary between artificial agents and their learning environments. Consequently, superficial statistics of language models and abstract action are made meaningful in distilled model worlds, giving rise to synthetic agency.

AB - The article offers a discursive account of what generality in machine learning research means and how it is constructed in the development of general artificial intelligence from the perspectives of cultural and media studies. I discuss several technical papers that outline novel architectures in machine learning and how they conceive of the “world”. The agency to learn and the learning curriculum are modulated through worlding (in the sense of setting up and unfolding of the world for artificial agents) in machine learning engineering. In recent computer science articles, large models trained on Internet-scale datasets are framed as general world simulators—despite their partiality, historicity, finite nature, and cultural specificity. I introduce the notion of “model worlds” to refer to composable interactive environments designed for the purpose of machine learning that partake in legitimising that claim. I discuss how large models are grounded through interaction in model worlds, arguing that model worlds mediate between the sheer scale of language models and their hypothetical capacity to generalise to new tasks and domains, rehashing the empiricist logic of “big data”. Further, I show that the emerging capacity of artificial agents to generalise redraws the epistemic boundary between artificial agents and their learning environments. Consequently, superficial statistics of language models and abstract action are made meaningful in distilled model worlds, giving rise to synthetic agency.

KW - Agency

KW - AGI

KW - Artificial cognition

KW - Data

KW - Worlding

UR - http://www.scopus.com/inward/record.url?scp=85205803491&partnerID=8YFLogxK

U2 - 10.1007/s00146-024-02086-9

DO - 10.1007/s00146-024-02086-9

M3 - Journal articles

AN - SCOPUS:85205803491

JO - AI and Society

JF - AI and Society

SN - 0951-5666

ER -

DOI