Between world models and model worlds: on generality, agency, and worlding in machine learning

Konstantin Mitrokhov

doi:10.1007/s00146-024-02086-9

Between world models and model worlds: on generality, agency, and worlding in machine learning

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Standard

Between world models and model worlds: on generality, agency, and worlding in machine learning. / Mitrokhov, Konstantin.
in: AI and Society, 07.10.2024.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Bibtex

@article{8c755b71bdda444b95b77242c374341a,

title = "Between world models and model worlds: on generality, agency, and worlding in machine learning",

abstract = "The article offers a discursive account of what generality in machine learning research means and how it is constructed in the development of general artificial intelligence from the perspectives of cultural and media studies. I discuss several technical papers that outline novel architectures in machine learning and how they conceive of the “world”. The agency to learn and the learning curriculum are modulated through worlding (in the sense of setting up and unfolding of the world for artificial agents) in machine learning engineering. In recent computer science articles, large models trained on Internet-scale datasets are framed as general world simulators—despite their partiality, historicity, finite nature, and cultural specificity. I introduce the notion of “model worlds” to refer to composable interactive environments designed for the purpose of machine learning that partake in legitimising that claim. I discuss how large models are grounded through interaction in model worlds, arguing that model worlds mediate between the sheer scale of language models and their hypothetical capacity to generalise to new tasks and domains, rehashing the empiricist logic of “big data”. Further, I show that the emerging capacity of artificial agents to generalise redraws the epistemic boundary between artificial agents and their learning environments. Consequently, superficial statistics of language models and abstract action are made meaningful in distilled model worlds, giving rise to synthetic agency.",

keywords = "Agency, AGI, Artificial cognition, Data, Worlding, Informatics, Media and communication studies",

author = "Konstantin Mitrokhov",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",

year = "2024",

month = oct,

day = "7",

doi = "10.1007/s00146-024-02086-9",

language = "English",

journal = "AI and Society",

issn = "0951-5666",

publisher = "Springer London",

}

RIS

TY - JOUR

T1 - Between world models and model worlds

T2 - on generality, agency, and worlding in machine learning

AU - Mitrokhov, Konstantin

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024/10/7

Y1 - 2024/10/7

N2 - The article offers a discursive account of what generality in machine learning research means and how it is constructed in the development of general artificial intelligence from the perspectives of cultural and media studies. I discuss several technical papers that outline novel architectures in machine learning and how they conceive of the “world”. The agency to learn and the learning curriculum are modulated through worlding (in the sense of setting up and unfolding of the world for artificial agents) in machine learning engineering. In recent computer science articles, large models trained on Internet-scale datasets are framed as general world simulators—despite their partiality, historicity, finite nature, and cultural specificity. I introduce the notion of “model worlds” to refer to composable interactive environments designed for the purpose of machine learning that partake in legitimising that claim. I discuss how large models are grounded through interaction in model worlds, arguing that model worlds mediate between the sheer scale of language models and their hypothetical capacity to generalise to new tasks and domains, rehashing the empiricist logic of “big data”. Further, I show that the emerging capacity of artificial agents to generalise redraws the epistemic boundary between artificial agents and their learning environments. Consequently, superficial statistics of language models and abstract action are made meaningful in distilled model worlds, giving rise to synthetic agency.

AB - The article offers a discursive account of what generality in machine learning research means and how it is constructed in the development of general artificial intelligence from the perspectives of cultural and media studies. I discuss several technical papers that outline novel architectures in machine learning and how they conceive of the “world”. The agency to learn and the learning curriculum are modulated through worlding (in the sense of setting up and unfolding of the world for artificial agents) in machine learning engineering. In recent computer science articles, large models trained on Internet-scale datasets are framed as general world simulators—despite their partiality, historicity, finite nature, and cultural specificity. I introduce the notion of “model worlds” to refer to composable interactive environments designed for the purpose of machine learning that partake in legitimising that claim. I discuss how large models are grounded through interaction in model worlds, arguing that model worlds mediate between the sheer scale of language models and their hypothetical capacity to generalise to new tasks and domains, rehashing the empiricist logic of “big data”. Further, I show that the emerging capacity of artificial agents to generalise redraws the epistemic boundary between artificial agents and their learning environments. Consequently, superficial statistics of language models and abstract action are made meaningful in distilled model worlds, giving rise to synthetic agency.

KW - Agency

KW - AGI

KW - Artificial cognition

KW - Data

KW - Worlding

KW - Informatics

KW - Media and communication studies

UR - http://www.scopus.com/inward/record.url?scp=85205803491&partnerID=8YFLogxK

U2 - 10.1007/s00146-024-02086-9

DO - 10.1007/s00146-024-02086-9

M3 - Journal articles

AN - SCOPUS:85205803491

JO - AI and Society

JF - AI and Society

SN - 0951-5666

ER -

In der gleichen Zeitschrift

Correction to: Operative communication: project Cybersyn and the intersection of information design, interface design, and interaction design (AI & SOCIETY, (2022), 10.1007/s00146-021-01346-2)

Vehlken, S., 06.2024, in: AI & Society. 39, 3, S. 1533-1534 2 S.

Publikation: Beiträge in Zeitschriften › Kommentare / Debatten / Berichte › Forschung

Operative communication: project Cybersyn and the intersection of information design, interface design, and interaction design

Vehlken, S., 09.2022, in: AI and Society. 37, 3, S. 1131-1152 22 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Francesca Ferrando (2019): Philosophical Posthumanism. Theory in the New Humanities, Series Editor: Rosi Braidotti, Preface by Rosi Braidotti). Bloomsbury Academic (27 June, 2019), 296 pages, ISBN:1350059501, ISBN: 9781350059504

Foerster, Y., 01.12.2020, in: AI & Society. 35, 4, S. 1079-1081 3 S.

Publikation: Beiträge in Zeitschriften › Rezensionen › Forschung

Nice-Looking Obstacles: Parkour as urban practice of deterritorialization

Brunner, C., 05.2011, in: AI & Society. 26, 2, S. 143-152 10 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Culturally adapted mathematics education with ActiveMath

Melis, E., Goguadze, G., Libbrecht, P. & Ullrich, C., 10.2009, in: AI & Society. 24, 3, S. 251-265 15 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

DOI

https://doi.org/10.1007/s00146-024-02086-9
Endgültige, publizierte Fassung

Between world models and model worlds: on generality, agency, and worlding in machine learning

Standard

Harvard

APA

Vancouver

Bibtex

RIS

In der gleichen Zeitschrift

Correction to: Operative communication: project Cybersyn and the intersection of information design, interface design, and interaction design (AI & SOCIETY, (2022), 10.1007/s00146-021-01346-2)

Operative communication: project Cybersyn and the intersection of information design, interface design, and interaction design

Francesca Ferrando (2019): Philosophical Posthumanism. Theory in the New Humanities, Series Editor: Rosi Braidotti, Preface by Rosi Braidotti). Bloomsbury Academic (27 June, 2019), 296 pages, ISBN:1350059501, ISBN: 9781350059504

Nice-Looking Obstacles: Parkour as urban practice of deterritorialization

Culturally adapted mathematics education with ActiveMath

DOI

Zuletzt angesehen

Projekte

Publikationen