15 April 2011

"404 not found": An update for "bioinformatics/cabios"

Yesterday, I blogged about the persistence of the URLs present in the abstract of NAR. Today , I've updated my tool and used it to scan the abstracts of the following pubmed query: "Bioinformatics"[JOUR] or "Comput Appl Biosci"[JOUR].

Here is the result:

YearTotalAlive%
1815
1995100
19969333
199713323
1998861922
1999701724
2000832530
20011106458
20021217864
200328417059
200440225763
200549535972
200637429779
200744838185
200846641589
200950746291
201060556693
201128326894


Again, even if we can reach a web site, it doesn't mean that the service described in an article is still available or maintained.

As suggested by Egon Willighagen, I've uploaded the RDF output of my program on figshare: http://figshare.com/figures/index.php/Bioinformatics.404_20110415.rdf.

That's it,

Pierre

6 comments:

James Thompson said...

Could you correlate the probability of a resource giving a 404 with citation count?

Egon Willighagen said...

Pierre, please check the figure on FigShare... it gives a Zoho 404. Oh, the irony :)

caseybergman said...

Nice work, Pierre, it looks like the Chief Executive of the BBSRC is paying attention to the analysis on your blog: http://blogs.bbsrc.ac.uk/index.php/2011/04/biofuels-bioenergy-neuroscience/

caseybergman said...
This comment has been removed by the author.
Phil Lord said...

I wonder how many of the URLs are recoverable from the Internet Archive or Momento though.

We don't assume that because a journal no longer exists, the papers are not in the library any more. We seem to be making this mistake here.

Joerg Kurt Wegner said...

I think we could need a virtual service landscape simply ensuring that services in their state are being hold on that level, even if all technology around them is moving on.

Food for thought ...