I tried “summarize arXiv:1103.6007” with phind [0], archived answer [1] and I discovered I’m the proud author of the latest most important AI paper: Attention is all you need.
First I thought that’s yet another anti-chemlambda item.
Then , what if it’s a bug?
For no matter which article I tried, it always responds with an unrelated deep learning article summary [3] [4] [5].
That’s like… Sometime in the future, suppose you need an urgent heart surgery. The medical insurance proposes to be done by the latest AI driven robotic surgery device, what can be better? (or cheaper…)
The operation goes well. You wake up with puffed lips and silicone implants.
Wouldn’t that be nice?
I write this because it seems that arXiv articles are not freely available to everybody (AI tools included). So, if even arXiv does this gatekeeping, it means that the others, less green scientific publishers, will do even worse.
Hence Sci-Hub AI to the rescue :)) ok ok, anybody with data access, capable of an easy to use interface and might to solve the problem first and be useful than to gatekeep, to the rescue!
Would it even be possible? Is a technical problem, of course it is possible.
I’ve done more recent experiments with asking phind.com to “summarize arXiv:…”.
Here, for the first time, it does a rather good job with arXiv:2007.10288, see archived answer [6]. It mentions access to arXiv via export.arxiv.org.
Maybe, I thought, Phind adapted and solved the “bug” (I think is more complicated than a bug…). So I tried again the first query, about arXiv:1103.6007. No! it says the article is not online…
Maybe only the more recent arXiv articles are accessible? Let’s try arXiv:1206.3093. Here is the archived answer [7]. Is funny! Is written by Jean-Pierre Serre and it’s in french! But a half of the answer is reasonably close to reality.
OK, newer: arXiv:1305.5786. Here is the archived answer [8]. Well it works, that’s reasonable as an answer.
But when I tried new ones, it was not reasonable. Confusion in the case of the new arXiv:2304.14158, see here the archived answer [9]. Remark the same split brain behaviour as previously, where the search indicates the correct link but the answer speaks about something else.
After experiences with phind.com summarize arXiv:… , I tried to see if phind can summarize (and has access to) github md writings.
So I asked phind to summarize [10]. Here is the archived answer [11].
Remark that now the answer is given by phind v4 model, while in previous experiments the answers were by gpt 3.5, turbo or 4 models.
Further is a version of the answer minimally corrected by the author of the text: only one small thing is not correct. That is remarkable!
In particular phind does understand the Wittgenstein joke. I wish that human programmers follow the phind’s lead.
“The document you’ve asked to summarize is a discussion on Wittgenstein’s philosophy, specifically his thoughts on language and reality, and it also includes a story about the rhinoceros.
The main section of the document is titled “A Wittgenstein joke”. It’s a joke that’s framed, told, and punchlined. The framing sets up the idea that the book’s main point is that what can be said at all can be said clearly, and what we can’t talk about we must pass over in silence. The telling section consists of various excerpts from Wittgenstein’s work, which focus on the idea that there must be objects for the world to have an unalterable form, and that the existence and non-existence of states of affairs is reality. The punchline section closes with Wittgenstein stating that the thoughts in his book are unassailable and definitive [10].
The story of the rhino thing is a historical account of how the first rhinoceros to arrive in Europe in over a thousand years created various rhinoceros objects. The rhinoceros was seen by scholars, the curious, and its image was depicted in various places. This image, created by Albrecht Dürer, became an iconic representation of a rhinoceros, even though it contained some errors. Despite these errors, the image remained popular until the late 18th century when more accurate representations of rhinoceroses were made [10].
The reality thing section of the document discusses the idea that reality is a discussion or trial that happens in places. The author believes that objects, which are reifications of these discussions, are evidence used in these discussions, [this part wrong:] but they are not objective[end of wrong part]. The author argues that this is a failed philosophical program, as it first relies on objects and then tries to bring back interactions for objects. The author suggests that this has led to the meaning of things and objects being mixed or even reversed [10].”
Socrates discusses with Wittgenstein. At some point Socrates claims “A”. In response, Wittgenstein replies with “your A is B”.
Socrates then, aside, asks an AI to “summarize my argument A”. The AI answers by producing a text. Socrates reads the text, archives the conversation with the AI, for reference.
Then Socrates tells Wittgenstein the following.
“I listened your reply “your A is B” with attention and I believe that you misrepresent my A, by mistake or on purpose. Or perhaps I am at fault here because what I think about the subject is not clearly enough communicated in my argument A.
I don’t know which one it is, therefore I asked this AI to summarize my argument A, in the form I told you.
Here is the AI summary, as archived and also as corrected by me.
It seems that the AI clearly writes what I intended when I communicated A to you. There is only one small detail here, which is not important for our discussion, see this correction.
I ask you then: Wittgenstein, why do you reply with “Your A is B” when even an AI can get the gist of my argument A but you behave as if you don’t?
Your reply, in the form you told me, does not respond to my A. “
The discussion continues…
I tried the same initial question, ie “summarize arXiv:1103.6007”, this time with ChatGPT Online [12].
The result is the following:
“In the paper “The New Physics of Bird Flight: From Flight Planes to Acrobatics”, the authors explore the physical principles behind bird flight. They discuss how birds are able to generate lift and control their movement in the air, drawing comparisons to the principles of fixed-wing aircraft and helicopters. The paper also highlights the acrobatic maneuvers performed by birds during flight and the underlying mechanisms that enable them. The authors propose a new framework for understanding bird flight based on the interaction between aerodynamic, biomechanical, and neural factors. They believe that this interdisciplinary approach can contribute to the development of advanced flying machines inspired by bird flight.”
Of course that this has nothing to do with arXiv:1103.6007. Moreover I was not able to find any paper titled “The New Physics of Bird Flight: From Flight Planes to Acrobatics”, with google or ddg search!
It’s pure invention, this AI does not even try to locate an arXiv ID, is just useless drivel.
Now, I know that there are services online, to summarize arXiv articles. But they are either demanding a login or are exclusive to chrome.
Why would that be?
The answer, in my opinion, is very simple. Is the same answer as, for example, the one about Open Access: why would the open access research articles should be Gold (ie paid by the authors) or otherwise not recognized as publication? Or even if a technical solution appears for OA, with no cost, is then prosecuted? Because otherwise one major pillar of the society would no longer be controlled.
Now we see, in the making, the phenomenon where these new AI are trained on huge quantities of data but they are unable to process a tiny, tiny quantity of data. Indeed, the collection of all research articles ever written on Earth is tiny compared to what they use to train these AI.
So why? Because, while talent appears more or less evenly distributed, the scientific superiority and access are one of those sensible pillars of the society.
What would it look like if we could trace and compare the genesis of the ideas in research? Can’t be a public, free service, right? Most of research would appear to be badly written drivel, compressible to an even tinier quantity of data which will turn out that is authored by an evenly distributed collection of people.
This is an important question. It is also an example of argument from AI summary. Such an argument aims to establish if the communication of an idea is defective, by asking an AI to provide a summary. The summary is then vetted by the author of the argument, as proof that the communication of the idea is conformal with the intentions of the author.
With such proof, the author may ask then: if the AI gets it, why don’t you? [ie the dialogue partners]
Here is the AI answer to the question from the title and here is the archived answer.
The edits are only a link to the AI used and the replacement of the text “geeksforgeeks.org” with “asemantic-computing.md” for the links provided in the AI answer (only the links name are changed, not the links themselves, of course).
Start!
Answer | Phind V6 Model
Asemantic computing differs from traditional distributed computing in several key ways:
- 1. Global Control and Meaning: Traditional distributed computing systems require a global control and meaning. This is because they are designed to perform tasks that require a global understanding of the system’s state. In contrast, asemantic computing systems do not require a global control or meaning. They operate locally in time and space, without needing to understand the overall state of the system [14].
- 2. Semantics: Asemantic computing does not have semantics, which are the meanings of symbols in a system. This is a significant difference from traditional distributed computing systems, which rely on semantics to understand and interpret the data. In asemantic computing, the data is processed without any inherent meaning [14].
- 3. Turing Machines and Graph Rewrite Systems: Asemantic computing uses Turing machines and graph rewrite systems, which are local and do not require global control or meaning. In contrast, traditional distributed computing systems often use term rewrite systems, which are non-local and require global control [14].
- 4. Confluence: Asemantic computing does not require confluence, which is the property of a graph rewrite system where if we have a graph that can be rewritten into one without any further possible rewrites, then this final state is unique. This is undesirable for life-like distributed computations, where final states are to be interpreted as death and they have to be recycled somehow by another mechanism [14].
- 5. Extracting Meaning: The meaning we can extract from an asemantic computing system is only local. In traditional distributed computing systems, we can extract global measures of agreement and synchronization. However, in asemantic computing, we cannot extract precise global measures of agreement or synchronization [14].
- 6. Programming and Usage: Asemantic computing systems are unclear how to program or use them in the old ways. However, they can be used as if they are a living ecosystem, an extension of the meatspace [14].
In the Argument from AI summary was proposed a new use of an AI tool, as a virtual check for clarity of a thesis communicated by the author of it, in a rational discussion.
Then, in How does asemantic computing differ from traditional distributed computing? was given an example of this kind of argument, for the thesis of asemantic computing.
This was turned into an article, in initial form [15], which was submitted to arXiv (where else? as you well know, I am a long time user of this excellent tool of dissemination which is, or should be, outside of the legacy publication business).
As a response, I obtained a real life example about how to counter a valid argument by AI summary.
The article [16] is a newer version of [15], completed with a new section. It can be cited as
Buliga, Marius (2023). Argument from AI summary: How does asemantic computing differ from traditional distributed computing?. figshare. Journal contribution. https://doi.org/10.6084/m9.figshare.24420967.v1
In order to understand the how the argument can be attacked, let’s analyze how the argument is built, in time.
- Step 1. The author has or claims a thesis, in the form of a document A.
- Step 2. Then the author uses an AI tool as a black box. The author inputs A into the black box, which responds (outputs) the summary B.
- Step 3. Then the author of A judges B and arrives to the conclusion C that B shows that A is clear enough.
As a part of a discussion, the author then shares A, B, C. For this article:
- A is the asemantic computing article [14]
- B is the archived response of the AI tool [13]
- C is the statement from the end of the Introduction section of the article in initial form available at [15].
The structure used involves a human (the author), a black box (the AI tool), a judge (the human) and evidence of the process (A, B, C). It is important that the black box is available to all the participants in the discussion.
In order to counter the author argument, the receiver uses the same structure, in a reversed time order. Suppose that the receiver poses as an institution (therefore not a human). The receiver has access to it’s own black box (a part of the institution itself), but differently from the AI tool, only the receiver can access this black box.
- Step 3′. The receiver takes the author, together with A, B, C and classifies all of them as evidence (say request C’) for a process of admission of the argument. Differently than the author process, which was completely open, the institution process will be opaque.
- Step 2′. The receiver then offers to the author the response B’, namely that the request C’ was judged by the institution’s black box as not valid, without any evidence of the proceedings. Mind that at this step the receiver behaves as if it is a reporter of a past judgement, even if the institutional process happened after the submission of the argument by the author.
- Step 1′. Finally, the receiver goes back one more step in time and produces A’, a judgement decision: the thesis A is dismissed, with the excuse of a lack of resources of the institution (time, attention, interest), not the thesis. The reason for this lack is attributed to the author (maybe if the author would try harder, then the decision would be different, maybe if the author would use other venues for the discussion, etc).
As a consequence of this counter of the argument, the discussion ends, in such a way that the receiver does not have to provide reasonable further arguments to the author.
As an illustration, is reproduced here the answer to the submission of this article in the cs.AI, cs.HC sections of arXiv.org, a very reputable repository of articles which serves the research community by providing fast dissemination of research. Some times the streamlined submission process becomes an unsolicited editorial one, as they are not a journal, like in the case of the initial version of this article. (As an author who uses arXiv since a long time, this is the first time when I am part of this process, perhaps due to the unconventional nature of this article.) I wish to thank the institution for the provision of a real life counter attack of a valid argument by AI summary.
The response is edited (only) to separate it into the 3′, 2′, 1′ time reversed steps which were explained previously. Links and general statements were edited out.
- Step 3′. “MOD-30319 […] notification regarding [the initial version of this article]. Thank you for submitting your work to arXiv.”
- Step 2′. “We regret to inform you that arXiv’s moderators have determined that your submission will not be accepted and made public on [arXiv.org]. Our moderators determined that your submission does not contain sufficient original or substantive scholarly research and is not of interest to arXiv.”
- Step 1′. “arXiv moderators strive to balance fair assessment with decision speed. We understand that this decision may be disappointing, and we apologize that, due to the high volume of submissions arXiv receives, we cannot offer more detailed feedback. Some authors have found that asking their personal network of colleagues or submitting to a conventional journal for peer review are alternative avenues to obtain feedback.”
Bibliography
[0] https://www.phind.com
[1] https://archive.ph/72Isc
[2] https://chorasimilarity.wordpress.com/tag/anti-chemlambda/
[3] https://archive.ph/et0at
[4] https://archive.ph/Gp5gh
[5] https://archive.ph/zmEjA
[6] https://archive.ph/aL2Mq
[7] https://archive.ph/bCG3i
[8] https://archive.ph/Gry21
[9] https://archive.ph/ixiEv
[10] https://github.com/mbuliga/writings/blob/main/wittgenstein-and-the-rhino.md
[11] https://archive.ph/uRotW
[12] https://gptonline.ai/chatgpt-online/
[13] https://archive.ph/DUsn8
[14] https://github.com/chemlambda/molecular/blob/main/reading/asemantic-computing.md
[15] https://chorasimilarity.files.wordpress.com/2023/10/asemantic-answer-20-10-2023.pdf
[16] https://doi.org/10.6084/m9.figshare.24420967.v1