{"id":1427,"date":"2024-08-15T10:00:23","date_gmt":"2024-08-15T13:00:23","guid":{"rendered":"https:\/\/humanas.blog.scielo.org\/en\/?p=1427"},"modified":"2024-08-15T11:10:24","modified_gmt":"2024-08-15T14:10:24","slug":"epistemological-foundations-of-machine-learning","status":"publish","type":"post","link":"https:\/\/humanas.blog.scielo.org\/en\/2024\/08\/15\/epistemological-foundations-of-machine-learning\/","title":{"rendered":"An Analysis of the Epistemological Foundations of Machine Learning"},"content":{"rendered":"<p><b>Garibaldi Sarmento, Associate Professor in the Department of Philosophy at the Federal University of Para\u00edba (UFPB), Jo\u00e3o Pessoa, PB \u2013 Brazil.<\/b><\/p>\n<p><a href=\"https:\/\/humanas.blog.scielo.org\/wp-content\/uploads\/2021\/10\/trans_glogo.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-8494 size-full\" title=\"Logo of the journal Trans\/form\/a\u00e7\u00e3o.\" src=\"https:\/\/humanas.blog.scielo.org\/wp-content\/uploads\/2021\/10\/trans_glogo.gif\" alt=\"Logo of the journal Trans\/form\/a\u00e7\u00e3o.\" width=\"300\" height=\"63\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Ar\u00e3o in his article titled <\/span><a href=\"https:\/\/doi.org\/10.1590\/0101-3173.2024.v47.n3.e02400163\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Behind Artificial Intelligence: An Analysis of the Epistemological Foundations of Machine Learning<\/span><\/a><span style=\"font-weight: 400;\"> (2024), states that &#8220;[&#8230;] it will be <\/span><i><span style=\"font-weight: 400;\">demonstrated <\/span><\/i><span style=\"font-weight: 400;\">how <\/span><i><span style=\"font-weight: 400;\">induction and mathematization <\/span><\/i><span style=\"font-weight: 400;\">work as the epistemological basis of artificial intelligence and how some of the <\/span><i><span style=\"font-weight: 400;\">limitations <\/span><\/i><span style=\"font-weight: 400;\">of this technology can be explained through the weaknesses of the <\/span><i><span style=\"font-weight: 400;\">methods <\/span><\/i><span style=\"font-weight: 400;\">that support it&#8221; (Ar\u00e3o, 2024, p. 01, italics mine).\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Nevertheless, the author is <\/span><i><span style=\"font-weight: 400;\">not<\/span><\/i><span style=\"font-weight: 400;\">, <\/span><i><span style=\"font-weight: 400;\">strictly speaking<\/span><\/i><span style=\"font-weight: 400;\">, presenting a <\/span><i><span style=\"font-weight: 400;\">demonstration <\/span><\/i><span style=\"font-weight: 400;\">in the said article, as we should, for this purpose, consider both a <\/span><i><span style=\"font-weight: 400;\">logical-mathematical <\/span><\/i><span style=\"font-weight: 400;\">analysis and a methodological <\/span><i><span style=\"font-weight: 400;\">refutation <\/span><\/i><span style=\"font-weight: 400;\">from the perspective of computer science, as emphasized in Sarmento (2004). In principle, the noted &#8220;weaknesses&#8221; are subject to correction if we <\/span><i><span style=\"font-weight: 400;\">a priori <\/span><\/i><span style=\"font-weight: 400;\">admit the construction of a maximally consistent system as the base of knowledge for the <\/span><i><span style=\"font-weight: 400;\">learning machines <\/span><\/i><span style=\"font-weight: 400;\">model.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">These observations are presented in the <\/span><a href=\"https:\/\/doi.org\/10.1590\/0101-3173.2024.v47.n3.e02400221\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Commentary on &#8220;Behind Artificial Intelligence: An Analysis of the Epistemological Foundations of Machine Learning&#8221;<\/span><\/a><span style=\"font-weight: 400;\">, recently published in the journal <\/span><i><span style=\"font-weight: 400;\">Trans\/Form\/A\u00e7\u00e3o<\/span><\/i><span style=\"font-weight: 400;\">, whose reflections will be summarized here.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In an article titled &#8220;Artificial Intelligence, Undecidability, and Trans-computability&#8221; (see Sarmento, 2004) I proposed a critical approach to the <\/span><i><span style=\"font-weight: 400;\">logical-epistemic<\/span><\/i><span style=\"font-weight: 400;\"> foundations of Artificial Intelligence, in which some formal and technical aspects \u2013 that instantiate and delineate the field of investigations encompassing Artificial Intelligence \u2013 were addressed.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Without proposing a technical and formal discussion in this essay, I present, in general terms, some points directed towards the issue of using an inductive-probabilistic logic for the &#8220;Large Language Models&#8221; (LLMs) built based on &#8220;neural networks,&#8221; which, in turn, enable the machine learning process.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Firstly, some of the areas of computer science directly related to the development of LLMs are <\/span><i><span style=\"font-weight: 400;\">computational linguistics<\/span><\/i><span style=\"font-weight: 400;\">, <\/span><i><span style=\"font-weight: 400;\">computational neuroscience<\/span><\/i><span style=\"font-weight: 400;\"> and <\/span><i><span style=\"font-weight: 400;\">quantum computing<\/span><\/i><span style=\"font-weight: 400;\"> and, especially, the <\/span><i><span style=\"font-weight: 400;\">algorithmic theory<\/span><\/i><span style=\"font-weight: 400;\"> of information. In all the highlighted areas, algorithmic methods of statistical inference are widely used.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">One of the subfields of the algorithmic theory of information is called Kolmogorov <\/span><i><span style=\"font-weight: 400;\">complexity<\/span><\/i><span style=\"font-weight: 400;\">, which deals with essential concepts for statistical-probabilistic methods of inference. This field of study involves a rigorous treatment of the concept of randomness, utilizing algorithmic methods such as, for example, the &#8220;Solomonoff probability&#8221; algorithm.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Another specific method for machine learning systems, via neural networks, is the use of the so-called Bayesian Logic. In general, there are various algorithmic methodological techniques that allow for a rigorous (and subject to correction) computational treatment of probabilistic-statistical approaches applicable to the development of AI systems.<\/span><\/p>\n<div id=\"attachment_1429\" style=\"width: 577px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/humanas.blog.scielo.org\/en\/wp-content\/uploads\/sites\/2\/2024\/08\/Img.jpg\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1429\" class=\"wp-image-1429 size-full\" title=\"Photograph of a robotic hand and a human hand reaching towards each other against a plain background, nearly touching fingertips.\" src=\"https:\/\/humanas.blog.scielo.org\/en\/wp-content\/uploads\/sites\/2\/2024\/08\/Img.jpg\" alt=\"Photograph of a robotic hand and a human hand reaching towards each other against a plain background, nearly touching fingertips.\" width=\"567\" height=\"378\" srcset=\"https:\/\/humanas.blog.scielo.org\/en\/wp-content\/uploads\/sites\/2\/2024\/08\/Img.jpg 567w, https:\/\/humanas.blog.scielo.org\/en\/wp-content\/uploads\/sites\/2\/2024\/08\/Img-300x200.jpg 300w\" sizes=\"auto, (max-width: 567px) 100vw, 567px\" \/><\/a><p id=\"caption-attachment-1429\" class=\"wp-caption-text\">Imagem: Freepik.<\/p><\/div>\n<p><span style=\"font-weight: 400;\">From a philosophical perspective, the theory of knowledge compatible with the development of these AI systems is the <\/span><i><span style=\"font-weight: 400;\">reductionist epistemic<\/span><\/i><span style=\"font-weight: 400;\"> approach proposed by Lehrer (1990), in which the concept of knowledge is strictly reduced to the concept of information. Lehrer admits that the explanation of knowledge, in the sense of <\/span><i><span style=\"font-weight: 400;\">information<\/span><\/i><span style=\"font-weight: 400;\">, assumes that:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">\u201cP\u201d accepts the information that \u201cs\u201d.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">The information that \u201cs\u201d is correct<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">The acceptance of the information that \u201cs\u201d is justified.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\">Here, &#8220;P&#8221; and &#8220;s&#8221; are variables, &#8220;P&#8221; designates any individual, and &#8220;s&#8221; denotes any information that can be obtained. Thus, &#8220;P knows that s&#8221; expresses the &#8220;recognition of the correctness of the information.&#8221; The justification (or corroboration) of the information is the central aspect of the epistemic analysis, given that this procedure should, in principle, be understood as a &#8220;measure&#8221; between &#8220;strong reasonableness and not complete certainty,&#8221; as Lehrer argues.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This means that justification, from the perspective of inductive-probabilistic logic, is bounded between reasonableness and complete certainty (the concept of &#8220;certainty&#8221; here means strong evidence of a probative nature, given the sufficient conditions at a certain time \u201ct\u201d, for the confirmation of the correctness of the information). In probabilistic terms, the closer the correctness of the information is to 1, the higher the degree of certainty of its confirmation.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The algorithmic theory of information (a subfield of information theory), which lies at the intersection of Turing&#8217;s computability theory and Shannon&#8217;s classical information theory, could serve as a <\/span><i><span style=\"font-weight: 400;\">computational foundation<\/span><\/i><span style=\"font-weight: 400;\"> for the approach advocated by Lehrer, when it comes to the concept of information and, consequently, justify that LLMs process (and generate) knowledge about the &#8220;physical world&#8221; in a manner analogous to humans.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The main philosophical issue, in this case, shifts to the level of autonomy that AI cybernetic systems could have to act independently of human intervention. This is a question that undoubtedly requires deeper reflection.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Up to the present moment, however, these cybernetic systems do not effectively exhibit a capacity for self-correction and decision-making in an interactive and autonomous manner.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Those who advocate for such an approach, i.e., that AI cybernetic systems can generate knowledge, cannot ignore that AI will always be limited \u2014 with respect to its cognitive capacity \u2014 due to <\/span><i><span style=\"font-weight: 400;\">Chaitin&#8217;s incompleteness<\/span><\/i><span style=\"font-weight: 400;\"> theorem (a version of G\u00f6del&#8217;s incompleteness metatheorem). And, consequently, AI cannot surpass the cognitive capacity of humans.<\/span><\/p>\n<h3>About Garibaldi Sarmento<\/h3>\n<p><span style=\"font-weight: 400;\">He has a doctorate in philosophy from Unicamp, SP, and is an Associate Professor in the Department of Philosophy at the Federal University of Para\u00edba (UFPB), Jo\u00e3o Pessoa, PB \u2013 Brazil. Research Areas: Mathematical Logic and Philosophy of Mathematics.<\/span><\/p>\n<h3>References<\/h3>\n<p><span style=\"font-weight: 400;\">AR\u00c3O, C. Behind artificial intelligence: an analysis of the epistemological bases of machine learning. <\/span><i><span style=\"font-weight: 400;\">Trans\/Form\/A\u00e7\u00e3o<\/span><\/i><span style=\"font-weight: 400;\"> [online]. 2024, vol. 47, n. 3, e02400163 [viewed 15 August 2024]. <\/span><a href=\"https:\/\/doi.org\/10.1590\/0101-3173.2024.v47.n3.e02400163\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">https:\/\/doi.org\/10.1590\/0101-3173.2024.v47.n3.e02400163<\/span><\/a><span style=\"font-weight: 400;\">. Available from: <\/span><a title=\"https:\/\/www.scielo.br\/j\/trans\/a\/wKP3thTz35fmhG9pnmXNdMj\/\" href=\"https:\/\/www.scielo.br\/j\/trans\/a\/wKP3thTz35fmhG9pnmXNdMj\/\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">https:\/\/www.scielo.br\/j\/trans\/a\/wKP3thTz35fmhG9pnmXNdMj\/<\/span><\/a><\/p>\n<p><span style=\"font-weight: 400;\">LEHRER, K. <\/span><i><span style=\"font-weight: 400;\">Theory of Knowledge<\/span><\/i><span style=\"font-weight: 400;\">. New York. Westview, 1990.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">SARMENTO, G. Intelig\u00eancia artificial, indecidibilidade e transcomputabilidade. <\/span><i><span style=\"font-weight: 400;\">Perspectiva Filos\u00f3fica<\/span><\/i><span style=\"font-weight: 400;\">. 2004, vol. I, no. 21, pp. 179-209.<\/span><\/p>\n<h3>To read the article, access<\/h3>\n<p><span style=\"font-weight: 400;\">SARMENTO, G. Coment\u00e1rio a \u201cPor tr\u00e1s da intelig\u00eancia artificial: uma an\u00e1lise das bases epistemol\u00f3gicas do aprendizado de m\u00e1quina\u201d. <\/span><i><span style=\"font-weight: 400;\">Trans\/Form\/A\u00e7\u00e3o<\/span><\/i><span style=\"font-weight: 400;\"> [online]. 2024, vol. 47, no. 3, e02400221 [viewed 15 August 2024]. <\/span><a href=\"https:\/\/doi.org\/10.1590\/0101-3173.2024.v47.n3.e02400221\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">https:\/\/doi.org\/10.1590\/0101-3173.2024.v47.n3.e02400221<\/span><\/a><span style=\"font-weight: 400;\">. Available from: <\/span><a href=\"https:\/\/www.scielo.br\/j\/trans\/a\/Zk3GNmcnCHVmCkDRdWvXcRj\/\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">https:\/\/www.scielo.br\/j\/trans\/a\/Zk3GNmcnCHVmCkDRdWvXcRj\/<\/span><\/a><span style=\"font-weight: 400;\">\u00a0<\/span><\/p>\n<h3>External links<\/h3>\n<p>Trans\/Form\/A\u00e7\u00e3o \u2013 TRANS: <a href=\"https:\/\/www.scielo.br\/trans\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.scielo.br\/trans\/<\/a><\/p>\n<p>Trans\/Form\/A\u00e7\u00e3o \u2013 Journal: <a href=\"https:\/\/www.instagram.com\/revista.transformacao\/\" target=\"_blank\" rel=\"noopener\">Instagram<\/a> | <a href=\"https:\/\/www.facebook.com\/RevistaTransFormAcao\" target=\"_blank\" rel=\"noopener\">Facebook<\/a> | <a href=\"https:\/\/unep.academia.edu\/TrevistadefilosofiadaUnesp?from_navbar=true\" target=\"_blank\" rel=\"noopener\">Academia.edu<\/a><\/p>\n<p><span style=\"font-weight: 400;\">Garibaldi Sarmento \u2013 ORCID: <\/span><a title=\"https:\/\/orcid.org\/0000-0002-9242-3945\" href=\"https:\/\/orcid.org\/0000-0002-9242-3945\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">https:\/\/orcid.org\/0000-0002-9242-3945<\/span><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Outlined here is a critical review of the logical-epistemic foundations of machine learning, focusing on the limitation of AI systems&#8217; autonomy in generating knowledge. It contrasts this possibility with the theoretical constraints posed by Chaitin&#8217;s incompleteness theorem, which argues that AI cannot surpass human cognitive capacity. <span class=\"ellipsis\">&hellip;<\/span> <span class=\"more-link-wrap\"><a href=\"https:\/\/humanas.blog.scielo.org\/en\/2024\/08\/15\/epistemological-foundations-of-machine-learning\/\" class=\"more-link\"><span>Read More &rarr;<\/span><\/a><\/span><\/p>\n","protected":false},"author":664,"featured_media":1428,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":"","_links_to":"","_links_to_target":""},"categories":[7,110],"tags":[146,111],"class_list":["post-1427","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-press-release","category-trans","tag-philosophy","tag-trans-form-acao"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/posts\/1427","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/users\/664"}],"replies":[{"embeddable":true,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/comments?post=1427"}],"version-history":[{"count":4,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/posts\/1427\/revisions"}],"predecessor-version":[{"id":1434,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/posts\/1427\/revisions\/1434"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/media\/1428"}],"wp:attachment":[{"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/media?parent=1427"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/categories?post=1427"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/humanas.blog.scielo.org\/en\/wp-json\/wp\/v2\/tags?post=1427"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}