# Theoretical perspectives on the mechanisms of 'AI' chatbots

*July 2025* #final #AI #LLM #Theory

AI chatbots and the Large Language Models (LLMs) that underpin them seemingly hold an abstract representation of 'knowledge'. As their name states, they only model language, yet they certainly have *procedural* knowledge, in that they *know how* to output text that, had it been authored by a human, would be a sign of knowledge. To investigate the sources of this apparent knowledge, this essay offers a brief overview of the technical mechanisms of language modelling in LLMs, organised as section headings. Presented alongside, in sub-sections, are several theoretical perspectives that help contextualise chatbots' relationship to meaning, their internal representation of it, and their eventual output. To frame the first technical explanation and ease the reader in, we first need to talk briefly of structuralist linguistics.

# Prelude: Linguistic Structuralism

Forster (2022) describes LLMs as an "operationali\[s\]ation of Saussurean structure". The Swiss linguist Ferdinand de Saussure (1857-1913) birthed linguistic structuralism, which conceptualises language as a *structure of signs*. Signs consist of pairs of *signifiers* (words, written or spoken) and *signifieds* (the concepts referred to). Important in this conception is that signifieds are not real-world objects/entities - they are abstractions, once removed from the thing they describe/capture. Signifiers, further removed from the real world, are *arbitrary* (de Saussure, 1989 \[1916\]). Or indeed *largely* arbitrary: a notable exception is the "bouba/kiki" effect, whereby people across cultures will associate the former word with a cloud-like, round shape, the latter with an urchin-like, pointy one (Ramachandran and Hubbard, 2001; Ćwiek _et al._, 2021).

> ![[Kiki-Boboo.png]]
> In Ramachandran and Hubbard's cross-cultural reproduction of Wolfgang Köhler's original experiment, 95 to 98% of participants chose 'kiki' for the left shape, 'bouba' for the right one.

Signifiers share *syntagmatic* relationships: syntactical and semantic ones; conversely, signifieds share *paradigmatic* relationships. It is those sets of relationships that form the *structure* of language, which exists in the space of signifieds as it does in that of signifiers. This structure is *differential*: meaning is in the relationships between signifiers on the one side, and signifieds on the other, more than it is in the relationship between a notional, purely abstract signified and an arbitrary signifier (de Saussure, 1989 \[1916\]; Inglis and Thorpe, 2012). For instance, the signifier 'cat' does not refer to the actual animal, but to a signified encompassing the concept of the domestic cat, the big cat, and the 1930s jazz lingo for a hip fellow, along with the connotations that underpin this particular use: graceful movement, fierce independence, quiet, understated *cool*.

The deep learning techniques developed for Natural Language Processing in the last fifteen years implement Saussure's theory, inasmuch as they have been shown to model paradigmatic relationships whilst having only encountered syntagmatic ones in their training (Forster, 2022; Vromen, 2024).

# In the beginning was the *word2vec*

Artificial neural networks, like the ones powering LLMs, model a set of 'neurones', each receiving weighted inputs from all the other neurones of a previous *layer*, using linear algebra. Each layer is a *vector*, a column of numbers, each of which represents the degree of activation of a neurone. The strengths of the synaptic connections between neighbouring layers form a *matrix* of *weights*, transforming the vector representation of one layer into that of the next (Zou, Han and So, 2009).
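As a minimal sketch of that layer-to-layer step (toy sizes, random numbers standing in for learned weights, and NumPy as an assumed tool):

```python
import numpy as np

rng = np.random.default_rng(0)

layer = rng.normal(size=4)          # activations of the current layer: a 4-neurone vector
weights = rng.normal(size=(3, 4))   # synaptic strengths between this layer and a 3-neurone one

# A matrix-vector product followed by a non-linearity turns one layer's
# activations into the next layer's activations.
next_layer = np.tanh(weights @ layer)
print(next_layer.shape)             # (3,)
```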
This mathematical detour is to stress that LLMs manipulate *vector representations of words*. To be precise, they use vector representations of *tokens*; a token can be a word or part of a word, as well as punctuation and control characters. Any vector, being just a list of numbers, can be thought of as a set of coordinates: e.g. '(x, y)' or '(x, y, z)'. Those vector representations, called '*word embedding*s', can in turn be thought of as *coordinates in a hyper-dimensional space.* For GPT-3, the LLM underpinning the original ChatGPT, this space has 12,288 dimensions (Brown _et al._, 2020).

The *dimensions* of this hyper-dimensional space are arbitrary: their number is chosen before training (which is why many models are released in a variety of parameter sizes), and what each dimension will end up representing is not designed but emerges from training. In spite of this, given enough dimensions, this 'latent space' will end up modelling some aspects of the meaning of the words as they appear in the training set. The oft-repeated example is that, in those embeddings:

$$vec(queen) - vec(woman) \approx vec(king) - vec(man)$$

Here, $vec(king) - vec(man)$ is itself a vector, representing a direction, a motion in the latent meaning space, that turns an individual into a monarch - it is (approximately) the same quantity as $vec(queen) - vec(woman)$ (Mikolov, Chen, _et al._, 2013).

The technique of word embedding pre-dates the transformer architecture that powers GPT-like chatbots (see Mikolov, Sutskever, _et al._, 2013), and has been used for machine translation since the 2010s. Creating those embeddings is a separate training stage, in which the system is optimised for its ability to guess missing words within a sequence. The most surprising thing is how short the sequence can be for embeddings to model such semantic relationships. The 'regal equality' above is already present in spaces modelled with `word2vec`, using sequences *no longer than 9 words* (Mikolov, Chen, _et al._, 2013; Mikolov, Sutskever, _et al._, 2013).
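The 'regal equality' can be reproduced with off-the-shelf embeddings. The sketch below assumes the `gensim` library and one of the small pretrained models from its download catalogue (a GloVe model rather than the original word2vec, chosen for its modest download size):

```python
import gensim.downloader as api

# Downloads a small pretrained embedding on first run (a few tens of megabytes).
wv = api.load("glove-wiki-gigaword-50")

# vec(king) - vec(man) + vec(woman): 'queen' is expected at or near the top.
print(wv.most_similar(positive=["king", "woman"], negative=["man"], topn=3))
```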
## Analysis: Meaning as distribution / meaning as intent

Embedding is thus specifically an operationalisation of *distributionalism*, a mid-twentieth-century structuralist theory of linguistics (Harris, 1951, 1954; Bloomfield, 1984), best summed up by J. R. Firth: "You shall know a word by the company it keeps!" (Firth, 1957, p. 11). At its core is the hypothesis that the semantics of language - the structure of meaning - can be completely described in terms of distributional structure, "without intrusion of other features such as history or meaning" (Harris, 1954, p. 146). Harris gave little consideration as to whether history or meaning may 'intrude' on distributional statistics in the first place, and we will find that the linguistics of LLMs share this disregard for ground truths or communicative intent as the *origin* of linguistic behaviour, in favour of a conception of meaning based solely on the *outcome* of linguistic behaviour: the statistical properties of text.

Saussure made a further distinction between *langue*, language as the complete set of shared signs, and *parole*, language as it is spoken or written by specific people, in a specific context (de Saussure, 1989 \[1916\]; Inglis and Thorpe, 2012). The training data of LLMs is *parole*, and so is their output - can they model *langue*? Then again, could not the same question be asked of humans? LLMs model *standing* (or *conventional*) meaning (Grice, 1968; Quine, 2013), but not meaning as communicative intent. In French, 'to mean' is '*vouloir dire*' - literally, to want to say - and this is the phrase used not just for 'I mean', but also for 'this word means'; 'mean*ing*', conversely, is a separate noun: '*sens*' - obviously as in 'sense', but, coincidentally, also the word used for the *direction of a vector*. LLMs represent the *sens* (meaning), but not what the words *want to say* (Bender and Koller, 2020).

# The Transformer Architecture

After word embeddings, another turning point in the history of language models is the development of the transformer architecture (Vaswani _et al._, 2017). Like predictive texting, all an LLM does is *guess* the next word based on the tokens input so far - the *context*. The process of generating the next token is known as *inference*. Where simpler predictors used blunt techniques, considering only the immediately preceding word and/or the full set of previous words, transformers use several 'attention heads', which let them attend to context in different places in the input. Each of those heads performs a series of transformations on the vectors - the matrix multiplications mentioned above - that shift their position in the embedding space, refining their meaning. The mechanism is known as 'self-attention': each of the computed vectors is weighted, multiplied by a scalar reflecting its relevance in the context as it pertains to guessing the next token. This is repeated several times; GPT-3 uses 96 transformer blocks operating sequentially, with the output of each block (a sequence of vectors representing the context) becoming the new context input to the next one. If the original embeddings represent standing meaning, what comes out of the transformer, and ultimately drives the choice of the next word, represents meaning-in-context (Vaswani _et al._, 2017; Zhang _et al._, 2025).

This last step makes the representation of meaning during inference a dynamic, context-dependent one, as opposed to the standing meaning of the original embedding. Both representations of meaning are based on the distributional semantics of the training data, as they have been trained by minimising a prediction error in a missing-word exercise. Embedding and generator training are two distinct steps which may or may not use the same training data, but the system cannot model meaning beyond that present in the training set(s). When a model generates a vector for the next token, it represents a point in the embedding space that will never be the exact location of a word, falling somewhere in between. Candidate words are found nearby, ranked by proximity, and the result is drawn at random, with probabilities weighted according to this distance (Vaswani _et al._, 2017).
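As a minimal sketch of a single attention head (NumPy, toy sizes, random matrices standing in for learned weights; real transformers add masking, multiple heads and feed-forward layers):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention over token vectors X
    (one row per token), after Vaswani et al. (2017)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv            # project each token into query/key/value spaces
    scores = Q @ K.T / np.sqrt(K.shape[1])      # pairwise relevance of every token to every other
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)   # softmax: each row sums to 1
    return weights @ V                          # each output is a relevance-weighted mix of values

rng = np.random.default_rng(1)
X = rng.normal(size=(5, 8))                     # 5 tokens, 8-dimensional embeddings
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)      # (5, 8): one context-refined vector per token
```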
## Analysis: The post-structural linguistics of transformers

The great challenge of structuralist linguistics to modernity was to posit that the structure of language itself, in the syntagmatic space, precludes, limits, enables or favours our ability to conceive of objects in the paradigmatic space - a hypothesis most associated with the *linguistic relativism* of Sapir and Whorf in the 1920s (Gerrig and Banaji, 1994). The impact of the training data on bias in the output of LLMs is a reflection of this tenet (Vromen, 2024). The vector for 'cat' will not be the same in a system trained on zoology textbooks as in one trained on the correspondence of jazz musicians.

Post-structuralism offers a further challenge, casting doubt on the very existence of the paradigmatic space, or at any rate of a *shared* paradigmatic space: meaning stems from language and its use in the social (if not embodied) world, and diffuses through usage, constantly negotiated, creating its own network of concepts and relationships for a given reader (Grbich, 2003; Aylesworth, 2015). Roland Barthes, whose career moved through structuralism into post-structuralism, famously argued for the irrelevance of the author's intention once a work is published (1967). The reader is free to connect the text to a vast, open network of other texts and cultural codes. He distinguished between the closed 'Work' (which can be analysed structurally) and the open 'Text'. The Text, he wrote, "is a methodological field," not an object (Barthes, 1989). Manghani (2024) remarks that the training method of language models is remarkably similar to Barthes's "commutation test" (Barthes, 1990 \[1967\]), and argues against Barthes's claim of the Text's non-computability. This argument, in my view, is only valid for the platonic ideal of the LLM, trained on all human text (including that yet to exist); it is a philosophical position that brackets the reality of model training.

Meaning emerges not solely from difference, but also from *différance*, the differential in local meaning between the times and cultures of the writing and the reading - Derrida wrote a lot on the technology of writing, and must have known he would challenge the spell-checkers of future scholars as much as he did Saussurean linguistics. Derrida's view of the primacy of writing - against Saussure's primacy of speech - means that there is '*nothing but the text*': intent is moot, meaning is constructed on the reader's side (Derrida, 1967; Lawlor, 2023). In this respect, the LLM realises Derrida's vision: its production of text is devoid of intent, and whilst it ostensibly shows, in its output, a grasp of 'meaning', it is through a set of mathematical abstractions so alien to human thought that it forces us to acknowledge it is the reader who constructs the meaning of the synthetic text (Kuchtová, 2024; AlShalan, 2025).

# Fine-tuning

After vector embedding by the encoder, and the training of the parameters of the generator, the transformer is only *pre*-trained - the P in GPT. The model is then *fine-tuned* for specific applications. The *base model*, as it comes out of this pre-training process, is *in potentia* capable of all the applications of its fine-tuned variants, especially if given enough context. In practice, it is further trained, on the same token-prediction task, using a different, more specialised but smaller dataset - which is still, like all the above, *unsupervised* learning.

In addition to this "pre-train and fine-tune" approach, there is the "pre-train, prompt and predict" one, in which models undergo *reinforcement learning*, this time completing full prompt->response tasks and being given feedback. This typically involves writing specific example responses to thorough, comprehensive prompts, then having humans rate the similarity of the model's output to the exemplars (Cheng _et al._, 2023). This is now increasingly automated, making it amongst the first of the lowest-level knowledge-work jobs to be lost to the LLM (Mazzullo _et al._, 2025).
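The feedback step is described only loosely above; one common formulation - an assumption here, not a claim about any particular vendor's pipeline - trains a separate *reward model* on pairwise human preferences, using a Bradley-Terry-style ranking loss. A minimal sketch:

```python
import numpy as np

def reward_ranking_loss(score_chosen, score_rejected):
    """Pairwise loss for training a reward model from human preferences:
    -log(sigmoid(score_chosen - score_rejected)). The loss shrinks when the
    model scores the human-preferred response higher than the rejected one."""
    return -np.log(1.0 / (1.0 + np.exp(-(score_chosen - score_rejected))))

# Toy scores a reward model might assign to two candidate responses:
print(reward_ranking_loss(score_chosen=2.1, score_rejected=0.3))  # ~0.15: ranking agrees with the rater
print(reward_ranking_loss(score_chosen=0.3, score_rejected=2.1))  # ~1.95: ranking disagrees, large penalty
```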
Fine-tuning is application-specific, which in practice makes its nature and details closer to a trade secret: the software industry has a long history of building successful proprietary systems on top of open source infrastructure. Some systems are fully proprietary, others 'open-weights', with all parameters published and available for further tuning. A public set of parameters is open source in letter, allowing anyone to run and adapt the model, but part of the spirit of open source is transparency by code inspection. To this effect, some models go further and offer a full pedigree - the training data and details of the methods used (Widder, Whittaker and West, 2024). ChatGPT is at the least transparent end of the spectrum, OpenAI's own name notwithstanding (Liesenfeld, Lopez and Dingemanse, 2023). Yet we know that the numerous lexical fingerprints ('delve', 'tapestry'... see Kobak _et al._, 2025) of its output do not come from the training set of the underlying GPT base model, but from the fine-tuning. The verb 'delve' in particular has been picked up on, being relatively rare in the dialects of first-world English-speaking countries (\[@JeremyNguyenPhD\], 2024). It is, however, heavily used in Nigerian business English: a marker of the off-shoring to the global South of the human labour needs of reinforcement learning (Hern, 2024), where OpenAI "paid people \[...\] $2 an hour to look at the most disturbing content imaginable" (Harrison Dupré, 2022).

## Analysis: bovine scatology

Their functional mechanism means that the relationship of AI chatbots' output to ground truth is merely statistical: curation of the training set and fine-tuning help increase the likelihood of truthful output past an acceptable threshold (Bender _et al._, 2021). The reinforcement learning from human feedback used in the same fine-tuning process will also align the model towards output more likely to be positively rated by a human. This dangerous combination has led many to invoke Frankfurt's *Bullshit* (1986/2005): neither truth nor lie, having no regard for either; an instrumentalised language whose sole purpose is a specific effect on the reader (qv. Rudolph _et al._ (2023), Hicks _et al._ (2024) or Gorrieri (2024)).

# The System Prompt

We have seen that the very architecture of transformer models makes their performance heavily dependent on the amount of context they are given. The more context, the more opportunity for the attention heads to transform the position of the word vectors in the latent space, towards their meaning-in-context. Fine-tuning helps lock in this context-specificity; the base model of any LLM is trained to continue text, not to respond conversationally: to act as chatbots, LLMs need to be fine-tuned for this specific behaviour. In addition to this, the whole conversation also has to be given a frame by the bot developers - the system prompt. System prompts are as sensitive a trade secret as the details of fine-tuning, but they are often extracted from the bot by inquisitive users, and leak online (Levin _et al._, 2025). When I type a prompt into ChatGPT, my dozen-word string will be appended to a thousand-plus words (for ChatGPT 4.5 see Appendix **X**), presented with subheadings and bullet-pointed lists (which explains their prevalence in outputs). LLM output being only as good as the amount and quality of the context given, it is easy to see how this biases the output (Neumann _et al._, 2025).
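Concretely, the conversation the model conditions on is assembled from the developer's system prompt, the chat history, and the user's latest message. A sketch of that assembly, using the message-list shape common to chat APIs (the system prompt text below is an abbreviated placeholder, not a real leaked prompt):

```python
from typing import Dict, List

# Abbreviated placeholder standing in for a real, thousand-plus-word system prompt.
SYSTEM_PROMPT = (
    "You are a helpful assistant. Always prioritise being truthful, nuanced, "
    "insightful, and efficient... [continues for a thousand-plus words]"
)

def build_context(user_prompt: str, history: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Return the full message list the model actually conditions on."""
    return (
        [{"role": "system", "content": SYSTEM_PROMPT}]
        + history
        + [{"role": "user", "content": user_prompt}]
    )

messages = build_context("What is a transformer?", history=[])
print(sum(len(m["content"].split()) for m in messages), "words of context for a four-word question")
```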
The system prompt contains phrases like *"Always prioritize being truthful, nuanced, insightful, and efficient, tailoring your responses specifically to the user’s needs and preferences."* (quoted in u/EloquentPickle, 2025), reminiscent of those tales of conflicting AI instructions core to the plot of many of Isaac Asimov's *Robot* stories (1996 \[1950\]).

## Analysis: The chatbot as a tool of discursive Power

Fine-tuning and the system prompt are where chatbot developers have the most influence on the eventual behaviour of their product. With only half a dozen companies competing in this field, and OpenAI maintaining a steady ~80% market share (StatCounter, 2025), this concentrates in very few hands an enormous power: that of discourse production. Exact figures are hard to obtain, but ChatGPT alone was reported to serve more than 1 billion queries a day as of June 2025 (Singh, 2025). Assuming an estimate of 50-150 words per query, that is 50-150 billion words output a day. To put this in perspective, the whole of Wikipedia as of early July 2025 totalled 4.9 billion words (Wikipedia, 2025). *ChatGPT prints between ten and thirty wikipedias every day* - to 122.5 million users, with an over-representation of 18-34 year-olds, males and Americans (Singh, 2025).

The discourse produced by ChatGPT reflects, expectedly, that of its training data, describing for instance communism as an ideology and economic system, but capitalism merely as an economic system (Ahmed and Mahmood, 2024). This is not surprising, and mirrors legacy media discourse. What is unsaid is that Ahmed and Mahmood were not exposed to the content filtered out by the $2-an-hour labour (Perrigo, 2023). Chatbot designers have to make a decision as to what discourse is acceptable. In addition to their reach, chatbots are, to some, very influential. AI researchers forewarned of the risks of LLMs as tools of radicalisation (McGuffie and Newhouse, 2020), a risk now realised (Allchorn, 2024), but they are also tools for de-radicalisation (Russo, 2024).

# Final thoughts

> "Why are we using a Bullshit engine for *anything serious*? Like *forreal-forreal*?"
> (_Signal’s Meredith Whittaker says Chat GPT can’t be trusted_, 2023)

The mechanics of LLMs make it hard to argue they 'know' anything. Factually correct output is contingent on a deterministic stage so complex as to be mathematically *chaotic* - small changes in initial conditions (different wordings of the same query) can result in large, unpredictable changes in output - followed by a probabilistic roll of the dice to pick the exact word. This is not noticeable for many inputs, where the fuzziness of human language will allow for interpretation of the output as correct; however, it is blatant when asking an LLM to do maths, where each digit in the output needs to be correct, not just semantically close enough. See also ChatGPT's persistent inability to tell how many r's are in the word *strawberry*: vector embedding does not encode the tokens' spelling; the answer is based on statistical modelling of the training set as regards phrases containing those tokens. 'Stochastic Parrots' indeed (Bender _et al._, 2021).
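The point about spelling can be made concrete with a tokeniser. The sketch below assumes the `tiktoken` package and its cl100k_base encoding (used by several OpenAI models); the model receives integer token ids, not letters:

```python
import tiktoken  # pip install tiktoken

enc = tiktoken.get_encoding("cl100k_base")
tokens = enc.encode("How many r's are in strawberry?")

print(tokens)                             # integer token ids - all the model ever 'sees'
print([enc.decode([t]) for t in tokens])  # the sub-word pieces those ids stand for

# Counting the letter 'r' requires access to characters, which this
# representation does not expose directly; the model must fall back on
# statistical associations learned from the training set.
```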
Thus, LLMs have no *declarative* knowledge, but we can grant them *procedural* knowledge: knowing how to sequence output tokens in a fashion likely to be interpreted as meaningful output by a human user. This 'meaning', we have seen, comes from four factors. Embeddings define the base possibilities through static meaning, and implement in vector mathematics the space of syntagmatic relationships of signifiers, as per Saussure. The training and inference mechanisms of the generator use a numerical abstraction of the information contained in the training set; this yields a more refined, contextual grasp of meaning, derived without access to real-world referents or ground truths - an irrelevance of the *hors-texte* that is perfectly Derridean. The base model undergoes further fine-tuning, first for safety: the enforcement of an equally perfectly Foucauldian regime of truth, defining the acceptable Knowledge to be output, wielding Power upon the user as Subject. Further training for conversation will reinforce output rated positively by the user, which, combined with the lack of truth grounding, makes said output Frankfurtian *bullshit*. *Papañca*, the Buddhist concept of 'mental proliferation', may however be more useful (Costello, 2024). It turns out Gautama Buddha was a post-structuralist twenty-five centuries before it was trendy.

All those factors define the bounds of what is possible, with fine-tuning aiming to set those bounds around a safe area. But as to actual output, the architecture of the transformer model means the largest influence is the context: the conversation itself. As models support larger context windows, chatbots can have longer exchanges; as those grow in length, the relative influence of the system prompt on the output decreases. This has considerable implications for LLM safety, as we have seen this year with stories of users being induced into delusion by ChatGPT (Harrison Dupré, 2025; Klee, 2025). Fine-tuning for AI 'alignment' (of output to moral values) leaves the model exposed to 'jailbreak' attacks (Wolf _et al._, 2024; Chu _et al._, 2025), amongst them the "persona attack", whereby, through persistent prompting, a user alters the chatbot's persona past its safe boundaries. Paradoxically, a better-aligned model, which discriminates better between 'good' and 'bad' states, is therefore *easier to steer into bad states* (West and Aydin, 2025). Long conversations leading to spiritual delusions are, in effect, an inadvertent persona attack, steering the model towards unsafe outputs. Researchers in psychiatry had foreseen the problem shortly after ChatGPT's release (Østergaard, 2023); on a more hopeful note, this potential for mental harm is also a potential for mental healing (Østergaard, 2024; Rządeczka _et al._, 2025).

# References

Ahmed, T.N. and Mahmood, K.A. (2024) ‘A critical discourse analysis of ChatGPT’s role in knowledge and power production.’, _Arab World English Journal_ [Preprint]. AlShalan, A. (2025) ‘Bridging the Divide: Saussurean Structure and Derridean Complexity in ChatGPT’s Meaning-Making’. Rochester, NY: Social Science Research Network. Available at: [https://papers.ssrn.com/abstract=5238831](https://papers.ssrn.com/abstract=5238831) (Accessed: 28 May 2025). Asimov, I. (1996) _I, Robot_. HarperCollins UK. Aylesworth, G. (2015) ‘Postmodernism’, in E.N. Zalta (ed.) _The Stanford Encyclopedia of Philosophy_. Spring 2015. Metaphysics Research Lab, Stanford University. Available at: [https://plato.stanford.edu/archives/spr2015/entries/postmodernism/](https://plato.stanford.edu/archives/spr2015/entries/postmodernism/) (Accessed: 5 July 2025). Barthes, R. (1989) ‘From Work to Text’, in _The Rustle of Language_. Berkeley: University of California Press, pp. 73–81.
Available at: [https://www.degruyterbrill.com/document/doi/10.7591/9781501743429-003/pdf?licenseType=restricted](https://www.degruyterbrill.com/document/doi/10.7591/9781501743429-003/pdf?licenseType=restricted) (Accessed: 5 July 2025). Barthes, R. (1990) _The Fashion System_. University of California Press. Bender, E.M. _et al._ (2021) ‘On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜’, in _Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency_. New York, NY, USA: Association for Computing Machinery (FAccT ’21), pp. 610–623. Available at: [https://doi.org/10.1145/3442188.3445922](https://doi.org/10.1145/3442188.3445922). Bender, E.M. and Koller, A. (2020) ‘Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data’, in D. Jurafsky et al. (eds) _Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics_. _ACL 2020_, Online: Association for Computational Linguistics, pp. 5185–5198. Available at: [https://doi.org/10.18653/v1/2020.acl-main.463](https://doi.org/10.18653/v1/2020.acl-main.463). Bloomfield, L. (1984) _Language_. Edited by C.F. Hackett. Chicago, IL: University of Chicago Press. Available at: [https://press.uchicago.edu/ucp/books/book/chicago/L/bo3636364.html](https://press.uchicago.edu/ucp/books/book/chicago/L/bo3636364.html) (Accessed: 5 July 2025). Brown, T. _et al._ (2020) ‘Language models are few-shot learners’, _Advances in neural information processing systems_, 33, pp. 1877–1901. Cheng, D. _et al._ (2023) ‘Foundations and Applications in Large-scale AI Models: Pre-training, Fine-tuning, and Prompt-based Learning’, in _Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining_. New York, NY, USA: Association for Computing Machinery (KDD ’23), pp. 5853–5854. Available at: [https://doi.org/10.1145/3580305.3599209](https://doi.org/10.1145/3580305.3599209). Chu, J. _et al._ (2025) ‘JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs’. arXiv. Available at: [https://doi.org/10.48550/arXiv.2402.05668](https://doi.org/10.48550/arXiv.2402.05668). Costello, E. (2024) ‘ChatGPT and the Educational AI Chatter: Full of Bullshit or Trying to Tell Us Something?’, _Postdigital Science and Education_, 6(2), pp. 425–430. Available at: [https://doi.org/10.1007/s42438-023-00398-5](https://doi.org/10.1007/s42438-023-00398-5). Ćwiek, A. _et al._ (2021) ‘The bouba/kiki effect is robust across cultures and writing systems’, _Philosophical Transactions of the Royal Society B: Biological Sciences_, 377(1841), p. 20200390. Available at: [https://doi.org/10.1098/rstb.2020.0390](https://doi.org/10.1098/rstb.2020.0390). De Saussure, F. (1989) _Cours de linguistique générale_. Otto Harrassowitz Verlag. Derrida, J. (1967) _De la grammatologie_. Paris: Les Éditions de Minuit. EloquentPickle (2025) ‘I made ChatGPT 4.5 leak its system prompt’, _r/PromptEngineering_. Available at: [https://www.reddit.com/r/PromptEngineering/comments/1j5mca4/i_made_chatgpt_45_leak_its_system_prompt/](https://www.reddit.com/r/PromptEngineering/comments/1j5mca4/i_made_chatgpt_45_leak_its_system_prompt/) (Accessed: 7 July 2025). Firth, J. (1957) ‘A synopsis of linguistic theory, 1930-1955’, _Studies in linguistic analysis_, pp. 10–32. Floridi, L. (2025) ‘Distant Writing: Literary Production in the Age of Artificial Intelligence’, _Minds and Machines_, 35(3), p. 30. Available at: [https://doi.org/10.1007/s11023-025-09732-1](https://doi.org/10.1007/s11023-025-09732-1). Forster, C. 
(2022) ‘Are Large Language Models Operationalizations of Saussurean Structure?’, 18 July. Available at: [https://cforster.com/2022/07/on-words/](https://cforster.com/2022/07/on-words/) (Accessed: 28 May 2025). Frankfurt, H.G. (2005) _On bullshit_. Princeton University Press. Gerrig, R.J. and Banaji, M.R. (1994) ‘CHAPTER 8 - Language and Thought’, in R.J. Sternberg (ed.) _Thinking and Problem Solving_. San Diego: Academic Press (Handbook of Perception and Cognition), pp. 233–261. Available at: [https://doi.org/10.1016/B978-0-08-057299-4.50014-1](https://doi.org/10.1016/B978-0-08-057299-4.50014-1). Gorrieri, L. (2024) ‘Is ChatGPT Full of Bullshit?’, _Journal of Ethics and Emerging Technologies_, 34(1), pp. 1–16. Available at: [https://doi.org/10.55613/jeet.v34i1.149](https://doi.org/10.55613/jeet.v34i1.149). Grbich, C. (2003a) ‘Postmodernity and Postmodernism’, in _New Approaches in Social Research_. London, UNITED KINGDOM: SAGE Publications, Limited. Available at: [http://ebookcentral.proquest.com/lib/roehampton-ebooks/detail.action?docID=334481](http://ebookcentral.proquest.com/lib/roehampton-ebooks/detail.action?docID=334481) (Accessed: 3 June 2025). Grbich, C. (2003b) ‘Structuralism and Poststructuralism’, in _New Approaches in Social Research_. London, UNITED KINGDOM: SAGE Publications, Limited. Available at: [http://ebookcentral.proquest.com/lib/roehampton-ebooks/detail.action?docID=334481](http://ebookcentral.proquest.com/lib/roehampton-ebooks/detail.action?docID=334481) (Accessed: 3 June 2025). Grice, H.P. (1968) ‘Utterer’s Meaning, Sentence-Meaning, and Word-Meaning’, _Foundations of Language_, 4(3), pp. 225–242. Harris, Z.S. (1951) _Methods in structural linguistics_. Chicago, IL, US: University of Chicago Press (Methods in structural linguistics), pp. xv, 384. Harris, Z.S. (1954) ‘Distributional Structure’, _WORD_, 10(2–3), pp. 146–162. Available at: [https://doi.org/10.1080/00437956.1954.11659520](https://doi.org/10.1080/00437956.1954.11659520). Harrison Dupré, M. (2025) _People Are Being Involuntarily Committed, Jailed After Spiraling Into ‘ChatGPT Psychosis’_, _Futurism_. Available at: [https://futurism.com/commitment-jail-chatgpt-psychosis](https://futurism.com/commitment-jail-chatgpt-psychosis) (Accessed: 7 July 2025). Hern, A. (2024) ‘TechScape: How cheap, outsourced labour in Africa is shaping AI English’, _The Guardian_, 16 April. Available at: [https://www.theguardian.com/technology/2024/apr/16/techscape-ai-gadgest-humane-ai-pin-chatgpt](https://www.theguardian.com/technology/2024/apr/16/techscape-ai-gadgest-humane-ai-pin-chatgpt) (Accessed: 7 July 2025). Hicks, M.T., Humphries, J. and Slater, J. (2024) ‘ChatGPT is bullshit’, _Ethics and Information Technology_, 26(2), p. 38. Available at: [https://doi.org/10.1007/s10676-024-09775-5](https://doi.org/10.1007/s10676-024-09775-5). Inglis, D. and Thorpe, C. (2012) _An invitation to social theory_. Cambridge: Polity. [@JeremyNguyenPhD] (2024) ‘Earlier this week, I asked if medical studies are being written with ChatGPT. (We all know ChatGPT overuses the word “delve”...) People in the comments pointed out that the chart should be as a PERCENTAGE of papers published on Pubmed. So here it is: https://t.co/ntOBEPm1MV https://t.co/4W6zlNSkb8’, _Twitter_. Available at: [https://x.com/JeremyNguyenPhD/status/1775846552088744106](https://x.com/JeremyNguyenPhD/status/1775846552088744106) (Accessed: 7 July 2025). Klee, M. (2025) ‘People Are Losing Loved Ones to AI-Fueled Spiritual Fantasies’, _Rolling Stone_, 4 May. 
Available at: [https://www.rollingstone.com/culture/culture-features/ai-spiritual-delusions-destroying-human-relationships-1235330175/](https://www.rollingstone.com/culture/culture-features/ai-spiritual-delusions-destroying-human-relationships-1235330175/) (Accessed: 7 July 2025). Kobak, D. _et al._ (2025) ‘Delving into LLM-assisted writing in biomedical publications through excess vocabulary’, _Science Advances_, 11(27), p. eadt3813. Available at: [https://doi.org/10.1126/sciadv.adt3813](https://doi.org/10.1126/sciadv.adt3813). Kuchtová, A. (2024) ‘The Incalculability of the Generated Text’, _Philosophy & Technology_, 37(1), pp. 1–20. Available at: [https://doi.org/10.1007/s13347-024-00708-0](https://doi.org/10.1007/s13347-024-00708-0). Lawlor, L. (2023) ‘Jacques Derrida’, in E.N. Zalta and U. Nodelman (eds) _The Stanford Encyclopedia of Philosophy_. Summer 2023. Metaphysics Research Lab, Stanford University. Available at: [https://plato.stanford.edu/archives/sum2023/entries/derrida/](https://plato.stanford.edu/archives/sum2023/entries/derrida/) (Accessed: 5 July 2025). Levin, R. _et al._ (2025) ‘Has My System Prompt Been Used? Large Language Model Prompt Membership Inference’. arXiv. Available at: [https://doi.org/10.48550/arXiv.2502.09974](https://doi.org/10.48550/arXiv.2502.09974). Liesenfeld, A., Lopez, A. and Dingemanse, M. (2023) ‘Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators’, in _Proceedings of the 5th International Conference on Conversational User Interfaces_. New York, NY, USA: Association for Computing Machinery (CUI ’23), pp. 1–6. Available at: [https://doi.org/10.1145/3571884.3604316](https://doi.org/10.1145/3571884.3604316). Manghani, S. (2024) ‘Preparatory space: Roland Barthes and Large Language Models’, _Barthes Studies_, 10, pp. 164–198. Mazzullo, E. _et al._ (2025) ‘Fine-Tuning GPT-3.5-Turbo for Automatic Feedback Generation’, in _Proceedings of the 40th ACM/SIGAPP Symposium on Applied Computing_. New York, NY, USA: Association for Computing Machinery, pp. 40–47. Available at: [https://doi.org/10.1145/3672608.3707735](https://doi.org/10.1145/3672608.3707735) (Accessed: 6 July 2025). Mikolov, T., Sutskever, I., _et al._ (2013) ‘Distributed Representations of Words and Phrases and their Compositionality’. arXiv. Available at: [https://doi.org/10.48550/arXiv.1310.4546](https://doi.org/10.48550/arXiv.1310.4546). Mikolov, T., Chen, K., _et al._ (2013) ‘Efficient Estimation of Word Representations in Vector Space’. arXiv. Available at: [https://doi.org/10.48550/arXiv.1301.3781](https://doi.org/10.48550/arXiv.1301.3781). Mu, N. _et al._ (2025) ‘A Closer Look at System Prompt Robustness’. arXiv. Available at: [https://doi.org/10.48550/arXiv.2502.12197](https://doi.org/10.48550/arXiv.2502.12197). Neumann, A. _et al._ (2025) ‘Position is Power: System Prompts as a Mechanism of Bias in Large Language Models (LLMs)’, in _Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency_. New York, NY, USA: Association for Computing Machinery (FAccT ’25), pp. 573–598. Available at: [https://doi.org/10.1145/3715275.3732038](https://doi.org/10.1145/3715275.3732038). Østergaard, S.D. (2023) ‘Will Generative Artificial Intelligence Chatbots Generate Delusions in Individuals Prone to Psychosis?’, _Schizophrenia Bulletin_, 49(6), pp. 1418–1419. Available at: [https://doi.org/10.1093/schbul/sbad128](https://doi.org/10.1093/schbul/sbad128). Østergaard, S.D. 
(2024) ‘Can generative artificial intelligence facilitate illustration of‐ and communication regarding hallucinations and delusions?’, _Acta Psychiatrica Scandinavica_, 149(6), pp. 441–444. Available at: [https://doi.org/10.1111/acps.13680](https://doi.org/10.1111/acps.13680). Perrigo, B. (2023) _Exclusive: The $2 Per Hour Workers Who Made ChatGPT Safer_, _TIME_. Available at: [https://time.com/6247678/openai-chatgpt-kenya-workers/](https://time.com/6247678/openai-chatgpt-kenya-workers/) (Accessed: 8 July 2025). Piantadosi, S.T. and Hill, F. (2022) ‘Meaning without reference in large language models’. arXiv. Available at: [https://doi.org/10.48550/arXiv.2208.02957](https://doi.org/10.48550/arXiv.2208.02957). Quine, W.V.O. (2013) _Word and Object, new edition_. MIT Press. Ramachandran, V.S. and Hubbard, E.M. (2001) ‘Synaesthesia–a window into perception, thought and language’, _Journal of consciousness studies_, 8(12), pp. 3–34. Rudolph, J., Tan, Samson and Tan, Shannon (2023) ‘ChatGPT: Bullshit spewer or the end of traditional assessments in higher education?’, _Journal of Applied Learning and Teaching_, 6(1), pp. 342–363. Available at: [https://doi.org/10.37074/jalt.2023.6.1.9](https://doi.org/10.37074/jalt.2023.6.1.9). Rządeczka, M. _et al._ (2025) ‘The Efficacy of Conversational AI in Rectifying the Theory-of-Mind and Autonomy Biases: Comparative Analysis’, _JMIR Mental Health_, 12(1), p. e64396. Available at: [https://doi.org/10.2196/64396](https://doi.org/10.2196/64396). Selwyn, N. (2016) ‘Minding our language: why education and technology is full of bullshit … and what might be done about it†’, _Learning, Media and Technology_, 41(3), pp. 437–443. Available at: [https://doi.org/10.1080/17439884.2015.1012523](https://doi.org/10.1080/17439884.2015.1012523). _Signal’s Meredith Whittaker says Chat GPT can’t be trusted_ (2023). Available at: [https://www.youtube.com/watch?v=6ROlMFlbkWE](https://www.youtube.com/watch?v=6ROlMFlbkWE) (Accessed: 31 July 2025). Vaswani, A. _et al._ (2017) ‘Attention is All you Need’, in _Advances in Neural Information Processing Systems_. Curran Associates, Inc. Available at: [https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html](https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html) (Accessed: 8 June 2025). Vromen, E. (2024) ‘Language Models as Semiotic Machines: Reconceptualizing AI Language Systems through Structuralist and Post-Structuralist Theories of Language’. arXiv. Available at: [https://doi.org/10.48550/arXiv.2410.13065](https://doi.org/10.48550/arXiv.2410.13065). West, R. and Aydin, R. (2025) ‘The AI Alignment Paradox’, _Commun. ACM_, 68(3), pp. 24–26. Available at: [https://doi.org/10.1145/3705294](https://doi.org/10.1145/3705294). Whittaker, M. (2021) ‘The Steep Cost of Capture’. Rochester, NY: Social Science Research Network. Available at: [https://papers.ssrn.com/abstract=4135581](https://papers.ssrn.com/abstract=4135581) (Accessed: 6 July 2025). Widder, D.G., Whittaker, M. and West, S.M. (2024) ‘Why “open” AI systems are actually closed, and why this matters’, _Nature_, 635(8040), pp. 827–833. Available at: [https://doi.org/10.1038/s41586-024-08141-1](https://doi.org/10.1038/s41586-024-08141-1). Wikipedia (2025) ‘Wikipedia:Size of Wikipedia’, _Wikipedia_. 
Available at: [https://en.wikipedia.org/w/index.php?title=Wikipedia:Size_of_Wikipedia&oldid=1298190284](https://en.wikipedia.org/w/index.php?title=Wikipedia:Size_of_Wikipedia&oldid=1298190284) (Accessed: 8 July 2025). Zhang, X. _et al._ (2025) ‘A Survey of Theory Foundation and Key Technology in Large Models’, in _Proceedings of the 2024 3rd International Conference on Artificial Intelligence and Intelligent Information Processing_. New York, NY, USA: Association for Computing Machinery (AIIIP ’24), pp. 318–323. Available at: [https://doi.org/10.1145/3707292.3707383](https://doi.org/10.1145/3707292.3707383). Zou, J., Han, Y. and So, S.-S. (2009) ‘Overview of Artificial Neural Networks’, in D.J. Livingstone (ed.) _Artificial Neural Networks: Methods and Applications_. Totowa, NJ: Humana Press, pp. 14–22. Available at: [https://doi.org/10.1007/978-1-60327-101-1_2](https://doi.org/10.1007/978-1-60327-101-1_2).