Tag Archives: arXiv

An example of “Official EU Agencies Falsely Report More Than 550 Archive.org URLs as Terrorist Content”

Today I read Official EU Agencies Falsely Report More Than 550 Archive.org URLs as Terrorist Content.  Two comments on this.

1. It happened to me in Feb 2019. I archived one of my stories from the chemical sneakernet universe. The original story is posted on telegra.ph. Here is the message which appeared when I checked the archived link:

GVFoyIh

What? I contacted archive.org and got an answer from the webmaster, pretty fast. The problem was with telegra.ph, not with my link in particular. Now the archived link is available.

After I sent the message to archive but before I received the answer, I searched for a way to contact EU IRU, to ask what the problem might be.  I was unable to identify any such way. However there was a way to send a message to EU officials, who might redirect my message to whom it may concern. It worked, but it took longer than the time needed by archive webmaster to respond and unblock the link. I was not contacted since.

2. As you see in the post from archive, it was not EU IRU the institution which sent the blocking orders. But nevermind, how can one try to block arXiv articles? This reminded me of a very recent story: Google Scholar lost my Molecular computers arXiv article. As the article is on the same subject as the story from point 1, I wonder if by any (mis)chance Google Scholar received a blocking order.

What I do according to ADS search

ArXiv  links to the Astrophysics  Data System, which got a new fancy look. It may be a bit heavy, as a supporter of the wonderful arXiv I would rather applaud if they would allow me to put articles with animations inside, be them only animated gifs. But is nevertheless interesting.

So if I go to my arXiv articles, choose an article and then click on NASA ADS link on the right panel, then I get this page.  Funny that they don’t use the Journal Reference from the arXiv to decide which article is “refereed”, i.e. peer reviewed, even if peer review is less than validation.

I am very pleased though   about the visual representation of what I do, as seen from the arXiv articles.

no_papers

This is the image which tells how many articles I have on certain keywords, as well as links between keywords which are proportional with the number of the articles which fit a pair of keywords.

TBH this is the first time a neutral bibliometric system  shows an accurate image of my work.

The darker blue sector, which has no words on it is related to variational methods in fracture, Mumford-Shah and convexity articles.

The same picture, but according to the downloads in the last 90 days, is this one.

no_downloads

This is also very satisfying because the hamiltonian/information/… has a big future. For the moment it looks unrelated to the other sectors, but wait for the kaleidos project 🙂

The em-convex rewrite system, where I guess I found the equivalent of the Church numbers for space, is in the dilatation structures/…/selfsimilar sector. In my opinion, important subject.

Scientific publishers take their money from the academic managers, blame them too

Wonderful thread  at HN: https://news.ycombinator.com/item?id=19114786

Starting with “All this is an excellent ad for sci-hub, which avoids most of the serious drawbacks of publishers like Elsevier. It was interesting how that was relegated to a veiled comment at the end, “or finding access in other channels”. But basically if the mainstream publishers can’t meet the need, we do need other channels, and right now sci-hub is the only one that actually works at scale.

Then the discussion goes to “Blame the academic administrators who demand publications in top tier journals – the same ones who charge a ton for access.

Or “ in market terms the clients (researchers) manifest a strong preference for other products than those offered by the publishers. Why do they still exist? Does not make any sense, except if we recognize also that the market is perturbed

Enjoy the thread!  It shows that people think better than, you choose:  pirates who fight  only for the media corporation rights,  gold OA diggers who ask for more money than legacy publishers, etc…

UPDATE: for those who don’t know me, I’m for OA and Open Science. I do what I support. I am not for legacy publishers. I don’t believe in the artificial distinction between green OA, which is said to be for archiving, and gold OA which is said to be for publishing. I’m for arXiv and other really needed services for research communication.

What’s new around Open Access and Open Science? [updated]

In the last year I was not very much interested into Open Access and Open Science. There are several reasons, I shall explain them. But before: what’s new?

My reasons were that:

  • I’m a supporter of OA, but not under the banner of gold OA. You know that I have a very bad impression about the whole BOAI thing, which introduced the false distinction between gold which is publication and green which is archival. They succeeded to delay the adoption of what researchers need (i.e. basically older than BOAI inventions, like arXiv) and the recognition that the whole academic publication system is working actively against the researchers interests. Academic managers are the first to be blamed about this, because they don’t have the excuse that they work for a private entity which has to make money no matter the price. Publishers are greedy, OK, but who gives them the money?
  • Practically, for the working researcher, we can now publish in any place, no matter how close or anachronically managed, because we can find anything on Sci-Hub, if we want. So there is no reason to fight for more OA than this. Except for those who make money from gold OA…
  • I was very wrong with my efforts and attempts to use corporate social media for scientific communication.
  • Bu still, I believe strongly in the superiority of validation over peer-review. Open Science is the future.

I was also interested in the implications for OA and OS of the new EU Copyright Directive. I expressed my concern that again it seems that nobody cares about the needs of researchers (as opposed to publishers and corporations in general) and I asked some questions which interest me and nobody else seems to ask: will the new EU Copyright Directive affect arXiv or Figshare?  The problem I see is related to automatic filters, or to real ways the researchers may use these repositories.  See for example here for a discussion.  In   Sept 2018 I filed requests for answers to arXiv and to Figshare. For me at least the answers will be very interesting and I hope them to be as bland as possible, in the sense that there is nothing to worry about.

So from my side, that’s about all, not much. I feel like except the gold OA money sucking there’s nothing new happening. Please tell me I’m very wrong and also what can I do with my research output, in 2019.

UPDATE: I submitted two days ago a comment at Julia Reda post Article 13 is back on – and it got worse, not better. About the implications for the research articles repositories, the big ones, I mean, the ones which are used millions of times by many researchers. I waited patiently, either for the appearance of the comment or for a reaction. Any reaction. For me this is a clear answer: pirates fight for the freedom of the corporation to share in its walled garden the product of a publisher. The rest is immaterial for them. They pirates not explorers.

UPDATE 2: This draft of Article 13 contains the following definition: “‘online content sharing service provider’ means a provider of an information society service whose main or one of the main purposes is to store and give the public access to a large amount of copyright protected works or other protected subject-matter uploaded by its users which it organises and promotes for profit-making purposes. Providers of services such as not-for profit online encyclopedias, not-for profit educational and scientific repositories, open source software developing and sharing platforms, electronic communication service providers as defined in Directive 2018/1972 establishing the European Communications Code, online marketplaces and business-to business cloud services and cloud services which allow users to upload content for their own use shall not be considered online content sharing service providers within the meaning of this Directive.

If this is part of the final version of Article 13 then there is nothing to worry as concerns arXiv, for example.

Maybe a separate push should be on upload filters and their legal side (who is responsible for the output of this algorithm? surely not the algorithm!), perhaps by asking for complete, reproducible, transparent information about those: source code and all the dependencies source code, reproducible behavior.

 

The price of publishing with GitHub, Figshare, G+, etc

Three years ago I posted The price of publishing with arXiv. If you look at my arXiv articles then you’ll notice that I barely posted on arXiv.org since then. Instead I went into territory which is even less recognized as serious by a big part of academia. I used:

The effects of this choice are put in front of my homepage, so go there to read them. (Besides, it is a good exercise to remember how to click on links and use them, that lost art from the age when internet was free.)

In this post I want to explain what is the price I paid for these choices and what I think now about them.

First, it is a very stressful way of living. I am not joking, as you know stress comes from realizing that there are many choices and one has to choose. Random reward from the social media is addictive. The discovery that there is a way to get out from the situation which keeps us locked into the legacy publishing system (validation). The realization that the problem is not technical but social. A much more cynical view of the undercurrents of the social life of researchers.

The feeling that I can really change the world with my research. The worries that some possible changes might be very dangerous.

The debt I owe concerning the scarcity of my explanations. The effort to show only the aspects I think are relevant, putting aside those who are not. (Btw, if you look at my About page then you’ll read “This blog contains ideas from the future”. It is true because I already pruned the 99% of the paths leading nowhere interesting.)

The desire to go much deeper, the desire to explain once again what and why, to people who seem either lacking long term attention capability or having shallow pet theories.

Is like fishing for Moby Dick.

ArXiv is 3 times bigger than all megajournals taken together

 How big are the “megajournals” compared to arXiv?
I use data from the article

[1] Have the “mega-journals” reached the limits to growth? by Bo-Christer Björk ​https://dx.doi.org/10.7717/peerj.981 , table 3

and the arXiv monthly submission rates

[2] http://arxiv.org/stats/monthly_submissions

To have a clear comparison I shall look at the window 2010-2014.

Before showing the numbers, there are some things to add.

1.  I saw the article [1] via the post by +Mike Taylor

[3] Have we reached Peak Megajournal? http://svpow.com/2015/05/29/have-we-reached-peak-megajournal/

I invite you to read it, it is interesting as usual.

2. Usually, the activity of counting articles is that dumb thing which is used by managers to hide behind, in order to not be accountable for their decisions.
Counting  articles is a very lossy compression technique, which associates to an article a very small number of bits.
I indulged into this activity because of the discussions from the G+ post

[4] https://plus.google.com/+MariusBuliga/posts/efzia2KxVzo

and its clone

[4′] Eisen’ “parasitic green OA” is the apt name for Harnad’ flawed definition of green OA, but all that is old timers disputes, the future is here and different than both green and gold OA https://chorasimilarity.wordpress.com/2015/05/28/eisen-parasitic-green-oa-is-the-apt-name-for-harnad-flawed-definition-of-green-oa-but-all-that-is-old-timers-disputes-the-future-is-here-and-different-than-both-green-and-gold-oa/

These discussions made me realize that the arXiv model is carefully edited out from reality by the creators and core supporters of green OA and gold OA.

[see more about in the G+ variant of the post https://plus.google.com/+MariusBuliga/posts/RY8wSk3wA3c ]
Now, let’s see those numbers. Just how big is that arXiv thing compared to “megajournals”?

From [1]  the total number of articles per year for “megajournals” is

2010:  6,913
2011:  14,521
2012:   25,923
2013:  37,525
2014:  37,794
2015:  33,872

(for 2015 the number represents  “the articles published in the first quarter of the year multiplied by four” [1])

ArXiv: (based on counting the monthly submissions listed in [2])

2010: 70,131
2011: 76,578
2012: 84,603
2013: 92,641
2014:  97,517
2015:  100,628  (by the same procedure as in [1])

This shows that arXiv is 3 times bigger than all the megajournals at once, despite that:
– it is not a publisher
– does not ask for APC
– it covers fields far less attractive and prolific than the megajournals.

And that is because:
– arxiv answers to a real demand from researchers, to communicate fast and reliable their work to their fellows, in a way which respects their authorship
– also a reaction of support for what most of them think is “green OA”, namely to put their work there where is away from the publishers locks.

_____________________________________

Eisen’ “parasitic green OA” is the apt name for Harnad’ flawed definition of green OA, but all that is old timers disputes, the future is here and different than both green and gold OA

See this post and the replies on G+ at [archived post].

My short description of the situation: the future is here, and it is not gold OA (nor the flawed green OA definition which ignores arXiv). So, visually:

imageedit_34_6157098125

It has never occurred to me that putting an article in a visible place (like arXiv.org) is parasitic green OA+Michael B. Eisen  calls it parasitic because he supposes that this has to come along with the real publication. But what if not?

[Added: Eisen writes in the body of the post that he uses the definition given by Harnad to green OA, which ignores the reality. It is very conveniently for gold OA to have a definition of green OA which does not apply to the oldest (1991) and fully functional example of a research communication experiment which is OA and green: the arXiv.org.]
Then, compared to that, gold OA appears as a progress.
http://www.michaeleisen.org/blog/?p=1710

I think gold OA, in the best of cases, is a waste of money for nothing.

A more future oriented reply has +Mike Taylor
http://svpow.com/2015/05/26/green-and-gold-the-possible-futures-of-open-access/
who sees two possible futures, green (without the assumption from Eisen post) and gold.

I think that the future comes faster. It is already here.

Relax. Try validation instead peer review. Is more scientific.

Definition. Peer-reviewed article: published by the man who saw the man who claims to have read it, but does not back the claim with his name.

The reviewers are not supermen. They use the information from the traditional article. The only thing they are supposed to do is that they read it. This is what they use to give their approval stamp.

Validation means that the article provides enough means so that the readers can reproduce the research by themselves. This is almost impossible with  an article in the format inherited from the time when it was printed on paper. But when the article is replaced by a program which runs in the browser, which uses databases, simulations, whatever means which facilitate the validation, then the reader can, if he so wishes, make a scientific motivated opinion about this.

Practically the future has come already and we see it on Github. Today. Non-exclusively. Tomorrow? Who knows?

Going back to the green-gold OA dispute, and Elsevier recent change of sharing and hosting articles (which of course should have been the real subject of discussions, instead of waxing poetic about OA, only a straw man).

This is not even interesting. The discussion about OA revolves around who has the copyright and who pays (for nothing).

I would be curious to see discussions about DRM, who cares who has the copyright?

But then I realised that, as I wrote at the beginning of the post, the future is here.

Here to invent it. Open for everybody.

I took the image from this post by +Ivan Pierre and modified the text.
https://plus.google.com/+IvanPierreKilroySoft/posts/BiPbePuHxiH

_____________

Don’t forget to read the replies from the G+ post. I archived this G+ post because the platform went down. Read here why I deleted the chemlambda collection from G+.

____________________________________________________