<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.plos.org/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Gobbledygook</title>
	
	<link>http://blogs.plos.org/mfenner</link>
	<description>Diverse Perspectives on Science and Medicine</description>
	<lastBuildDate>Tue, 21 May 2013 14:09:56 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.plos.org/plos/blogs/mfenner" /><feedburner:info uri="plos/blogs/mfenner" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Metrics and attribution: my thoughts for the panel at the ORCID-Dryad symposium on research attribution</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/4x1beliDrLs/</link>
		<comments>http://blogs.plos.org/mfenner/2013/05/21/metrics-and-attribution-my-thoughts-for-the-panel-at-the-orcid-dryad-symposium-on-research-attribution/#comments</comments>
		<pubDate>Tue, 21 May 2013 14:09:56 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Thoughts]]></category>
		<category><![CDATA[alm]]></category>
		<category><![CDATA[altmetrics]]></category>
		<category><![CDATA[orcid]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3821</guid>
		<description><![CDATA[This Thursday I take part in a panel discussion at the Joint ORCID &#8211; Dryad Symposium on Research Attribution. Together with Trish Groves (BMJ) and Christine Borgman (UCL <a href="http://blogs.plos.org/mfenner/2013/05/21/metrics-and-attribution-my-thoughts-for-the-panel-at-the-orcid-dryad-symposium-on-research-attribution/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>This Thursday I take part in a panel discussion at the <a href="http://orcid.org/orcid-outreach-meeting-symposium-and-codefest-may-2013">Joint ORCID &#8211; Dryad Symposium on Research Attribution</a>. Together with <a href="http://www.bmj.com/about-bmj/editorial-staff/trish-groves">Trish Groves</a> (BMJ) and <a href="http://polaris.gseis.ucla.edu/cborgman/Chriss_Site/Welcome.html">Christine Borgman</a> (UCLA) I will discuss several aspects of attribution. Trish will speak about ethics of attribution, Christine will highlight problems with attribution, and I will add my perspective on metrics and attribution. This blog post summarizes the main points I want to make.</p>
<p><span id="more-3821"></span></p>
<p><a href="http://commons.wikimedia.org/wiki/File:A_Bicycle_in_Oxford.JPG"><img class="size-full wp-image-3833 alignnone" alt="A_Bicycle_in_Oxford" src="http://blogs.plos.org/mfenner/files/2013/05/A_Bicycle_in_Oxford.jpg" width="500" height="375" /></a></p>
<p><em>Oxford. Source: <a href="http://commons.wikimedia.org/wiki/File:A_Bicycle_in_Oxford.JPG">Wikimedia Commons</a></em></p>
<p>Scholarly metrics can be used in discovery tools, as business intelligence for funders, research organizations or publishers, and for research assessment. For all these scenarios – and in particular for research assessment – it is important to not only collect metrics for a particular journal publication, dataset or other research output, but to also link these metrics to the creators of that research output.  That is why unique identifiers for researchers, and ORCID in particular, are so important for scholarly metrics, and this is also reflected in the ORCID membership of organizations such as Thomson Reuters, Elsevier/Scopus, Altmetric or F1000Prime that provide metrics in a variety of ways.</p>
<h2>DORA</h2>
<p>A good starting point for any discussion on metrics for research assessment is the San Francisco Declaration on Research Assessment (<a href="http://am.ascb.org/dora/">DORA</a>) that was published, together with a set of editorials in several journals, last week. The first three recommendations are a good starting point for the panel discussion:</p>
<ol>
<li><em>Do not use journal-based metrics, such as Journal Impact Factors, as a surrogate measure of the quality of individual research articles, to assess an individual scientist’s contributions, or in hiring, promotion, or funding decisions.</em></li>
<li><em>Be explicit about the criteria used in evaluating the scientific productivity of grant applicants and clearly highlight, especially for early-stage investigators, that the scientific content of a paper is much more important than publication metrics or the identity of the journal in which it was published.</em></li>
<li><em>For the purposes of research assessment, consider the value and impact of all research outputs (including datasets and software) in addition to research publications, and consider a broad range of impact measures including qualitative indicators of research impact, such as influence on policy and practice. </em></li>
</ol>
<h2>Persistent Identifiers</h2>
<p>Before we can collect any metrics, we need persistent identifiers for research outputs. Most journal articles now come with a DOI, but we should make it easier for smaller publishers to use DOIs, as cost unfortunately is still an issue.</p>
<p>Persistent identifiers for data are a much more complex issue, but here DataCite DOIs are probably the first choice, as DataCite focusses on attribution. There are many other use cases for identifiers for data (e.g. to identify temporary datasets in an ongoing experiment), and is of course possible to use several identifiers for the same dataset. Particular challenges with persistent identifiers for research data include different versions of a dataset, and aggregation of datasets (e.g. whether we want to cite the aggregate dataset, or a particular subset).</p>
<p>In addition to persistent identifiers for research outputs, we also need persistent identifiers for researchers. ORCID is obviously a good candidate, but it is clear that there are also other persistent identifiers for researchers, and that we need to link these identifiers, e.g. <a href="http://orcid.org/blog/2013/04/22/orcid-and-isni-issue-joint-statement-interoperation-april-2013">ORCID and ISNI</a>.</p>
<p>Depending on the metrics we are interested in, we might also need persistent identifiers for institutions, for funding agencies and their grant IDs, and for resources such as particle accelerators or research vessels. Unfortunately much more work is needed in these areas.</p>
<h2>Attribution</h2>
<p>Attribution is then the next step, linking persistent identifiers for research outputs to their creators. The <a href="http://www.force11.org/AmsterdamManifesto">Amsterdam Manifesto on Data Citation Principles</a> that came out of the Beyond the PDF 2 workshop in March are an excellent document, but are unfortunately missing the important step of linking persistent identifiers for data to the persistent identifiers of their creators.</p>
<p>One important issue related to attribution is the provenance of the claims. Has a researcher claimed authorship for a particular paper, is a data center linking creators to research data, or is a funder doing this? The ORCID registry is built around the concept of self-claims by authors, but will allow the other stakeholders to confirm these claims.</p>
<h2>Metrics</h2>
<p>Metrics for scholarly content fall into one of three categories:</p>
<ul>
<li>Citations</li>
<li>Usage stats</li>
<li>Altmetrics</li>
</ul>
<p>Altmetrics is a mixed bag of many different things, from sharing on social media such as Twitter or Facebook to more scholarly activities such as Mendeley bookmarks or F1000Prime reviews. I therefore expect the altmetrics category to over time further evolve into 2-3 sub-categories.</p>
<p>We are all familiar with citation-based metrics for journal articles. We currently see the long-overdue shift from journal-based citation metrics to article-level metrics (see #1 from the DORA statement above for the reasoning), and as the technical lead for the PLOS Article-Level Metrics project I of course welcome this shift in focus. We also see a trend towards opening up reference lists that will make citation-based metrics much more accessible, and the <a href="http://opencitations.net">JISC Open Citations</a> project by David Shotton and others is an important driver in this, as is the <a href="http://openbiblio.net">Open Bibliographic Data</a> project by OKFN. Until open bibliographic data become the norm, we have to deal with different citation counts from different sources. PLOS is collecting citations from Web of Science, Scopus, CrossRef and PubMed Central, and the citation counts are highly correlated overall (e.g. R2= 0.87 for CrossRef and Scopus citations for 2009 PLOS Biology papers), but for some papers differ substantially.</p>
<p>The data citation community has adopted the journal article citation model, and we are starting to see more citations to datasets. Even though data citations look similar to citations of journal articles, many essential tools and services still don&#8217;t properly handle datasets. The <a href="http://wokinfo.com/products_tools/multidisciplinary/dci/">Web of Knowledge Data Citation Index</a> is an important step in the right direction, as is the<a href="http://odin-project.eu/2013/05/13/new-orcid-integrated-data-citation-tool/"> new DataCite import tool for ORCID</a>. Something that we should pay closer attention to is the citation counts of the paper(s) associated with a dataset. Maybe the major scientific impact is in the data, but scientific practice still dictates to the cite the corresponding paper and not the dataset itself. The DataCite metadata can contain the persistent identifier of the corresponding journal article, thus making it possible to associate the citation count of the corresponding paper with the dataset. This approach is particularly important for datasets that are always part of a paper, as is the case for Dryad.</p>
<p>Another problem with data citation is that citation counts might not be the best way to reflect the scientific impact of a dataset. We are increasingly seeing usage stats for datasets, and DataCite for example has <a href="http://www.datacite.org/node/76">started in January</a> to publish monthly stats for the most popular datasets by number DOI resolutions. The #1 dataset in March was the raw data to a figure in a F1000Research article, <a href="http://figshare.com/articles/Figure_7_raw_data_Effect_of_variable_exposure_to_PTHrP_1_36_on_bone_nodules_and_AP_activity_in_high_plating_density_cultures_/154685">hosted on figshare</a>.</p>
<p>Similar to citations we see a strong trend for usage stats to move from aggregate numbers for journals to article-level metrics. COUNTER has <a href="http://www.projectcounter.org/pirus.html">released</a> a draft code of practice for their PIRUS (Publisher and Institutional Repository Usage Statistics) standard in February, and increasing numbers of publishers and repository infrastructure providers such as <a href="http://www.irus.mimas.ac.uk">IRUS-UK</a> and <a href="http://www.dini.de/projekte/oa-statistik/english">OA-Statistics</a> are providing usage stats for individual articles.</p>
<p>One challenge with usage stats, in particular with Open Access content, is that an article or other research output might be available in more than one place, e.g. publisher (or data center), disciplinary repository and institutional repository. For PLOS articles we don&#8217;t know the aggregated usage stats from institutional repositories, but we know that 17% of HTML pageviews and 33% of PDF downloads happen not at the PLOS website, but at PubMed Central.</p>
<p>Altmetrics provide new challenges, but they are also a more recent development compared to usage stats and citations. Similar to usage stats they are fairly easy to game, and for some altmetrics sources (e.g. Twitter) standardization is still difficult. Altmetrics not necessarily measure impact, but sometimes rather reflect attention or self-promotion. We have just started to look into altmetrics beyond the numbers, as it makes a huge difference who is tweeting, bookmarking or discussing a paper or dataset. Altmetrics provide the opportunity to show the broader social impact (as Mike Taylor from Elsevier explains it) of research, e.g. changing clinical practice or policies.</p>
<h2>Contributions</h2>
<p>Related to attribution is contribution, i.e. what is the specific contribution by a researcher to a paper or other research output. An <a href="http://projects.iq.harvard.edu/attribution_workshop">International Workshop on Contributorship and Scholarly Attribution</a> was held together with the May 2012 ORCID Outreach Meeting to discuss this topic. Authorship position (e.g. first author, last author) is used in some metrics, but overall the contributor role is still poorly appreciated in most metrics. David Shotton has proposed a <a href="http://purl.org/spar/scoro/Shotton_SCoRO_and_SCoRF_Contributions-Workshop_Harvard_16May2012.pdf">Scholarly Contributions and Roles Ontology</a> (ScoRO), and is suggesting to split authorship credit in percentage points based on relative contributions, but I haven&#8217;t seen these numbers used in the context of metrics.</p>
<h2>Conclusions</h2>
<p>Persistent identifiers for people, attribution and metrics are closely interrelated and we have seen a lot of exciting developments in this area in the last two years. But with all the excitement we should never forget that a) there will never be a single metric that can be used for research assessment, and b) that scientific content will always be more important than any metric. I look forward to a great panel discussions on Thursday, and welcome any feedback via comments, Twitter or email.</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/4x1beliDrLs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/05/21/metrics-and-attribution-my-thoughts-for-the-panel-at-the-orcid-dryad-symposium-on-research-attribution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/05/21/metrics-and-attribution-my-thoughts-for-the-panel-at-the-orcid-dryad-symposium-on-research-attribution/</feedburner:origLink></item>
		<item>
		<title>New DataCite / ORCID Integration Tool</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/YSA2Fh3mKPQ/</link>
		<comments>http://blogs.plos.org/mfenner/2013/05/18/new-datacite-orcid-integration-tool/#comments</comments>
		<pubDate>Sat, 18 May 2013 12:05:34 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Thoughts]]></category>
		<category><![CDATA[datacite]]></category>
		<category><![CDATA[orcid]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3811</guid>
		<description><![CDATA[A new service allows researchers to add research datasets &#8211; and other content with DataCite DOIs, including all figshare content &#8211; to their ORCID profile by integra <a href="http://blogs.plos.org/mfenner/2013/05/18/new-datacite-orcid-integration-tool/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>A <a href="http://datacite.labs.orcid-eu.org">new service</a> allows researchers to add research datasets &#8211; and other content with DataCite DOIs, including all <a href="http://figshare.com">figshare</a> content &#8211; to their <a href="http://about.orcid.org">ORCID</a> profile by integrating with the <a href="http://search.datacite.org/ui">DataCite Metadata Store</a>. The tool is an adaption (or fork) of the <a href="http://search.crossref.org">CrossRef Metadata Search</a> developed by <a href="https://twitter.com/karlward">Karl Ward</a>, and was developed by <a href="https://twitter.com/gthorisson">Gudmundur Thorisson</a> and myself as part of work in the EU-funded <a href="http://odin-project.eu">ODIN project</a>. More details can be found <a href="http://odin-project.eu/2013/05/13/new-orcid-integrated-data-citation-tool/">here</a>.</p>
<p><span id="more-3811"></span></p>
<p>There are many things I like about this new DataCite/ORCID integration tool:</p>
<ul>
<li><span style="line-height: 16px">it makes it easier for researchers to get credit for their research outputs.</span></li>
<li>it shows the value of persistent identifiers for data, publications and people, and linking them together</li>
<li>it shows the Creative Commons licenses for DataCite content where this info is available, facilitating reuse of content</li>
<li>it demonstrates the power of open source (thanks CrossRef!), open collaboration, standard REST APIs, and lightweight programming (Sinatra/Ruby) and deployment (Vagrant, Amazon EC2, Rackspace) tools</li>
<li>it shows that we don&#8217;t need a single &#8211; often closed &#8211; system, but open services that build on top of each other using accepted community standards. Tools using the ORCID API can immediately reuse the new DataCite content, altmetrics provided by <a href="http://impactstory.org">ImpactStory</a> are a good example</li>
</ul>
<p>I want to explore some of these ideas in the panel <strong>Attribution: Managing Provenance, Ethics, and Metrics</strong> at the combined <a href="http://orcid.org/orcid-outreach-meeting-symposium-and-codefest-may-2013">ORCID/Dryad Meeting</a> in Oxford next Thursday.</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/YSA2Fh3mKPQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/05/18/new-datacite-orcid-integration-tool/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/05/18/new-datacite-orcid-integration-tool/</feedburner:origLink></item>
		<item>
		<title>Announcing Markdown for Science Workshop on June 8th</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/LTGCnWXcf78/</link>
		<comments>http://blogs.plos.org/mfenner/2013/05/08/announcing-markdown-for-science-workshop-on-june-8th/#comments</comments>
		<pubDate>Wed, 08 May 2013 22:53:00 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Snippets]]></category>
		<category><![CDATA[markdown]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3803</guid>
		<description><![CDATA[On Saturday June 8th &#8211; exactly a month from today &#8211; the PLOS San Francisco offices will host a workshop/hackathon about using markdown for science. A lot of people are <a href="http://blogs.plos.org/mfenner/2013/05/08/announcing-markdown-for-science-workshop-on-june-8th/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>On Saturday June 8th &#8211; exactly a month from today &#8211; the PLOS San Francisco offices will host a workshop/hackathon about using markdown for science. A lot of people are experimenting with markdown for authoring scientific articles – see blog posts <a href="http://blog.yoavram.com/markx/">here</a>, <a href="http://inundata.org/2012/06/01/markdown-and-the-future-of-collaborative-manuscript-writing/">here</a> or my post <a href="http://blogs.plos.org/mfenner/2012/12/13/a-call-for-scholarly-markdown/">here</a>, and the scientific manuscript <a href="https://github.com/weecology/data-sharing-paper/">here</a>.</p>
<p>Markdown is a simple markup language for text, and is primarily used for HTML content on the web, but can also be converted to PDF, LaTeX and others. One challenge with markdown is that there are a number of slightly different &#8220;flavors&#8221; out there, from the original markdown to multimarkdown, github-flavored markdown and pandoc. Some of the advanced formatting of scientific documents &#8211; tables, citations, math &#8211; is still a challenge for markdown.</p>
<p>Will markdown become our next authoring format for scientific content? Will there be yet another flavor, scholarly markdown? How will markdown writing tools be different from LaTeX tools or Microsoft Word? If you care about any of these questions and are in or near San Francisco, join us on for all full day on June 8th. Free registration is open at <a href="http://mdsci13.eventbrite.com">http://mdsci13.eventbrite.com</a>. We are collecting workshop ideas at <a href="https://github.com/karthikram/markdown_science/wiki/workshop">https://github.com/karthikram/markdown_science/wiki/workshop</a>, the Twitter hashtag is #mdsci13.</p>
<p>This event is organized by <a title="Stian Håklev" href="http://twitter.com/houshuang">Stian Haklev</a> and myself, with generous support by a <a href="http://www.force11.org/node/4358">1K Challenge prize from Force11</a>, and hosting provided by PLOS.</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/LTGCnWXcf78" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/05/08/announcing-markdown-for-science-workshop-on-june-8th/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/05/08/announcing-markdown-for-science-workshop-on-june-8th/</feedburner:origLink></item>
		<item>
		<title>Baby steps toward better metrics</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/EvPKQl1Ey5A/</link>
		<comments>http://blogs.plos.org/mfenner/2013/04/16/baby-steps-toward-better-metrics/#comments</comments>
		<pubDate>Tue, 16 Apr 2013 08:22:18 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Snippets]]></category>
		<category><![CDATA[article-level metrics]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3791</guid>
		<description><![CDATA[Article-Level Metrics provide new ways to look at the impact of scholarly research. Two important concepts are a) to track metrics for individual scholarly articles instead o <a href="http://blogs.plos.org/mfenner/2013/04/16/baby-steps-toward-better-metrics/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://article-level-metrics.plos.org">Article-Level Metrics</a> provide new ways to look at the impact of scholarly research. Two important concepts are a) to track metrics for individual scholarly articles instead of using numbers aggregated by journal, and b) to go beyond citations and also include usage stats and altmetrics.</p>
<p>Article-Level Metrics is also doing something else: instead of tracking impact by year, it looks at usage, altmetrics and citations in real-time. There might have been technical reasons to do so 20 years ago, but there really is no longer any reason why scholarly impact should be tracked on a yearly basis in 2013. Unfortunately there is one big stumbling block:</p>
<blockquote><p>The publication date of a scholarly article is often difficult or impossible to obtain. Publication year may be the only available information.</p></blockquote>
<p>A good example is CrossRef. They provide a lot of interesting metadata about an article and make this information available in <a href="http://search.crossref.org">a very nice search interface</a>. But they only require the publisher to provide the publication year, information about the publication month and day is optional. There are many other examples of journals and services that just can&#8217;t tell you when exactly an article was published. This might have made sense when periodicals were printed on paper, but doesn&#8217;t work for digital content.</p>
<p>&nbsp;</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/EvPKQl1Ey5A" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/04/16/baby-steps-toward-better-metrics/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/04/16/baby-steps-toward-better-metrics/</feedburner:origLink></item>
		<item>
		<title>You should be able to install my software in less than one hour – or why DevOps is important</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/04IzMQO3BVo/</link>
		<comments>http://blogs.plos.org/mfenner/2013/04/14/you-should-be-able-to-install-my-software-in-less-than-one-hour-or-why-devops-is-important/#comments</comments>
		<pubDate>Sun, 14 Apr 2013 10:45:48 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Thoughts]]></category>
		<category><![CDATA[alm]]></category>
		<category><![CDATA[altmetrics]]></category>
		<category><![CDATA[amazon aws]]></category>
		<category><![CDATA[devops]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3769</guid>
		<description><![CDATA[Cameron Neylon yesterday wrote a great blog post about appropriate business models for shared scholarly communications infrastructure. This is an area I have also been thinkin <a href="http://blogs.plos.org/mfenner/2013/04/14/you-should-be-able-to-install-my-software-in-less-than-one-hour-or-why-devops-is-important/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>Cameron Neylon yesterday wrote a<a href="http://cameronneylon.net/blog/whats-the-right-model-for-shared-scholarly-communications-infrastructure/"> great blog post</a> about appropriate business models for shared scholarly communications infrastructure. This is an area I have also been <a href="http://blogs.plos.org/mfenner/2013/03/20/the-price-of-innovation-my-thoughts-for-beyond-the-pdf/">thinking about a lot recently</a>, and in this post I want to add a technical perspective (and an announcement) to the discussion.</p>
<p><span id="more-3769"></span></p>
<p><a href="http://en.wikipedia.org/wiki/DevOps">DevOps</a> is an important trend that brings software development and administration of IT infrastructure closer together. Agile software development, server virtualization, cloud infrastructure and software automation tools such as <a href="http://www.opscode.com/chef/">Chef</a>, <a href="https://puppetlabs.com">Puppet</a> or <a href="http://cfengine.com">CFEngine</a> are an important pars of DevOps, but it is really the collaborative aspect of IT administrators working much closer with software developers what defines DevOps. The end result is often <a href="http://readwrite.com/2013/03/27/devops-booms-in-the-enterprise">faster and more stable software releases</a>, and that is what is users and customers care about.</p>
<p>This makes DevOps particularly relevant for all areas where innovation is important, and that of course includes <a href="http://science.okfn.org/tools-for-open-science/">tools and services for Open Science</a>. We not only need infrastructure that facilitates software development (with services like Github, among many others), but we also have to streamline IT administration. The question is not whether you do your development in Java, Python, Ruby, PHP or Javascript, but how well you integrate your software development and IT administration. The shift towards web-based tools has centralized software installation and updates, but these web-based services are becoming increasingly complex and difficult to set up and administer. Running an institutional respository, research information system or a journal is a complex task. The software may be freely available as open source (e.g. <a href="http://www.dspace.org">Dspace</a>, <a href="http://vivoweb.org">VIVO</a> or <a href="http://pkp.sfu.ca/?q=ojs">Open Journal Systems</a>), but the resources required to run such a service still make this a big investment.</p>
<p>Two solutions to this dilemma are to pay either a vendor for installation and maintenance, or to use the software as a service (SaaS) that is hosted somewhere else. Why these two options are popular, they may not always be the best choices because they mean that you are locked in to a particular vendor or service provider, and that you may give expertise and direct access to your data away. I believe that these are helpful approaches for auxillary services, but that ideally the core services of a library, publisher or other provider of scientific infrastructure should not be outsourced. Developing software for scientific infrastructure that you want organizations to install locally should therefore always include work on integration with IT infrastructure, and just providing manual installation instructions isn&#8217;t good enough anymore.</p>
<p><a href="http://blogs.plos.org/mfenner/tag/article-level-metrics/">Article-Level Metrics</a> (ALM) and the related altmetrics are becoming increasingly popular. The collection and display of this information is a complex process, as it requires the integration of information from several upstream APIs which may be temporarily unavailable, have changed their data format, or put up restrictions on how you can use the data. In turn this information has to be processed and aggregated, and then reliably be provided to downstream users. This kind of information gathering fits perfectly with a service provider model, and organizations such as <a href="http://www.altmetric.com">Altmetric</a>, <a href="http://impactstory.org">ImpactStory</a> and <a href="http://www.plumanalytics.com">Plum Analytics</a>. PLOS is collecting and displaying this information with <a href="https://github.com/articlemetrics/alm">its own tool</a>. The simple reason is that PLOS started doing this several years before the services above became available, and none of them currently provide the same comprehensive set of information about citations, usage stats and altmetrics (although there are of course a lot of things they do better than the PLOS ALM application).</p>
<p>But there is also the question of whether Article-Level Metrics are a core service for every publisher and are best collected in-house. This not only makes it easier to collect information from some sources (e.g. usage stats or CrossRef citations), but also gives unrestricted access to the data in real-time. When I took over as technical lead for the PLOS Article-Level Metrics project last May, I therefore not only worked on improving the ALM application for PLOS, but we are also working hard on making it easier for other publishers to install and use the application. We want to provide an attractive alternative for organizations for which the service provider model is not the best option.</p>
<p>To that end I want to announce the latest feature which allows the automated installation of the PLOS ALM application on an Amazon Web Services (AWS) EC2 instance. This option is great not only for setting up an ALM production service, but because of the EC2 pricing model by hour (about $1 a day for a small EC2 instance) without setup costs is a great way to test-drive the application for a publisher, to analyze a particular set of papers from different publishers for a research project, or to set up a PLOS ALM server for a hackathon or workshop.</p>
<p><a href="http://blogs.plos.org/mfenner/files/2013/04/vagrant_aws.png"><img class="size-full wp-image-3781 alignnone" alt="vagrant_aws" src="http://blogs.plos.org/mfenner/files/2013/04/vagrant_aws.png" width="500" height="316" /></a></p>
<p>There are of course many options to automate software deployment on a production server, including the PaaS (platform as a service) providers <a href="https://www.heroku.com">Heroku</a>, <a href="http://www.cloudfoundry.com">CloudFoundry</a> and <a href="https://www.openshift.com">OpenShift</a>, and the recently announced <a href="http://aws.amazon.com/de/opsworks/">Amazon OpsWorks</a>. I am a big fan of the <a href="http://www.vagrantup.com">Vagrant</a> software development tool in combination with Chef for automation, and in March Vagrant added <a href="http://www.hashicorp.com/blog/preview-vagrant-aws.html">support for Amazon AWS</a>. This makes deployment of the PLOS ALM application to AWS really simple:</p>
<ol>
<li>Install Vagrant and the <a href="https://github.com/mitchellh/vagrant-aws">vagrant-aws plugin</a></li>
<li>Setup an Amazon Web Services Account</li>
<li>Check out the <a href="https://github.com/articlemetrics/alm">PLOS ALM source code</a> from Github</li>
<li>run the command <strong>vagrant up &#8211;provider aws</strong></li>
</ol>
<p>Step #4 took 898 sec or about 15 min on my computer (see screenshot), and at the end I had a PLOS ALM server where I could access the admin dashboard via the web interface. If you are familiar with Amazon Web Services – you have to think about  the right size for the EC2 instance, an appropriate AMI, security groups, elastic IPs, and DNS service – then the whole process should be done in well under an hour. I will use this instance to load and analyze some articles from a publisher for a presentation next week. When I&#8217;m done, another command (<strong>vagrant destroy</strong>) will destroy this server and Amazon will stop billing me. During testing I have created and destroyed many servers, and the <a href="http://www.hashicorp.com/blog/preview-vagrant-aws.html">vagrant-aws video</a> shows you how easy this process is.</p>
<p>At this stage the installation process is working (and has been working for a local Virtualbox install for many months), but needs testing and documentation. I therefore invite everyone interested in testing this out to contact me so that we can make this well-documented and working reliably.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/04IzMQO3BVo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/04/14/you-should-be-able-to-install-my-software-in-less-than-one-hour-or-why-devops-is-important/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/04/14/you-should-be-able-to-install-my-software-in-less-than-one-hour-or-why-devops-is-important/</feedburner:origLink></item>
		<item>
		<title>Mendeley and Elsevier</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/vSOyhfZBGhU/</link>
		<comments>http://blogs.plos.org/mfenner/2013/04/11/mendeley-and-elsevier/#comments</comments>
		<pubDate>Thu, 11 Apr 2013 05:50:36 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Thoughts]]></category>
		<category><![CDATA[elsevier]]></category>
		<category><![CDATA[mendeley]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3763</guid>
		<description><![CDATA[Earlier this week the rumors that started in January became official: Elsevier is buying Mendeley (see also here). A lot has been written about this announcement, in particula <a href="http://blogs.plos.org/mfenner/2013/04/11/mendeley-and-elsevier/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p style="text-align: justify">Earlier this week the rumors that started in January became official: <a href="http://blog.mendeley.com/start-up-life/team-mendeley-is-joining-elsevier/">Elsevier is buying Mendeley</a> (see also <a href="http://elsevierconnect.com/elsevier-welcomes-mendeley/">here</a>). A <a href="http://enjoythedisruption.com/post/47527556151/my-thoughts-on-mendeley-elsevier-why-i-left-to-start">lot has been written</a> about this announcement, in particular about the fear that Mendeley as a product and organization will turn into something not as open and collaborative as before.</p>
<p style="text-align: justify">I first met Victor and Jan from Mendeley in 2008 and did an <a href="http://blogs.plos.org/mfenner/2008/09/05/interview_with_victor_henning_from_mendeley/">interview with Victor</a> in September 2008. We worked together in the organization of two <a href="http://www.scienceonlinelondon.org">Science Online London conferences</a> (2009 and 2010, together with Nature.com and others), and my current job started with an entry for an <a href="http://blog.mendeley.com/design-research-tools/winners-of-the-first-binary-battle-apps-for-science-contest/">API programming contest</a> co-organized by PLOS and Mendeley, with the <a href="http://blogs.plos.org/mfenner/2011/09/28/announcing-sciencecard/">first lines of code written</a> in the Mendeley offices during the Science Online London 2011 hackathon. I wish Mendeley all the best with their new parent.</p>
<p style="text-align: justify">What this acquisition signals to me is that commercial publishers are now moving into the software tools for scientists business at full speed. They have always done this, but with <a href="http://www.readcube.com">ReadCube</a> by Digital Science (a Nature Publishing Group sister company) in 2011, the acquisition of <a href="http://www.papersapp.com">Papers</a> by Springer last year and now Mendeley, reference management now often means using a tool owned by a publisher &#8211; this market used to be dominated academic software such as <a href="http://www.zotero.org">Zotero</a> and commercial software vendors such as Thomson Reuters (<a href="http://endnote.com">Endnote</a>) or ProQuest (<a href="http://www.refworks.com">RefWorks</a>).</p>
<p style="text-align: justify">For me this trend signals that publishers have realized that we are moving into an Open Access publishing model, which in contrast to subscription publishing is not about owning the content, but about providing valuable services around content that is free to read and reuse.</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/vSOyhfZBGhU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/04/11/mendeley-and-elsevier/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/04/11/mendeley-and-elsevier/</feedburner:origLink></item>
		<item>
		<title>Comment: the case for open preprints in biology</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/-UApNoBzulE/</link>
		<comments>http://blogs.plos.org/mfenner/2013/03/30/comment-the-case-for-open-preprints-in-biology/#comments</comments>
		<pubDate>Sat, 30 Mar 2013 09:56:05 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Reviews]]></category>
		<category><![CDATA[arxiv]]></category>
		<category><![CDATA[preprint archive]]></category>
		<category><![CDATA[presentation]]></category>
		<category><![CDATA[pubmed]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3743</guid>
		<description><![CDATA[Last week Philippe Desjardins-Prouly et al. published the article The case for open preprints in biology – naturally as a preprint on figshare. The article sees preprint serve <a href="http://blogs.plos.org/mfenner/2013/03/30/comment-the-case-for-open-preprints-in-biology/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>Last week Philippe Desjardins-Prouly et al. published the article <a href="http://dx.doi.org/10.6084/M9.FIGSHARE.655710">The case for open preprints in biology</a> – naturally as a preprint on figshare. The article sees preprint servers as a great opportunity for open science, and discusses the status of preprints in the biological sciences. In this blog post I want to add some comments to the text.</p>
<p><span id="more-3743"></span></p>
<h2>E-BIOMED</h2>
<p>What is now PubMed Central <a href="http://www.nih.gov/about/director/pubmedcentral/ebiomedarch.htm">started out as E-BIOMED in 1999</a> and initially was envisioned to include a repository for preprints. It is important to look back at what happended then, and why the preprint repository was dropped from what then became PubMed Central. Harold Varmus talks a bit about this in this <a href="http://poynder.blogspot.de/2006/06/interview-with-harold-varmus.html">interview</a> from 2006.</p>
<h2>Nature Precedings</h2>
<p>The article talks about why biologists have not developed a culture of sharing preprints. It would be good to mention <a href="http://precedings.nature.com">Nature Precedings</a>, a preprint server for the life sciences started in 2007 that stopped taking new submissions in 2012. <a href="http://retractionwatch.wordpress.com/2012/03/30/nature-precedings-to-stop-accepting-submissions-next-week-after-finding-model-unsustainable/">This blog post</a> on RetractionWatch cites the announcement by Nature Publishing Group (which doesn&#8217;t explain why the service was shut down), and there are a good number of interesting comments.</p>
<h2>SSRN</h2>
<p>Preprints in other disciplines are mentioned in the text, in particular ArXiv, but also RePEc. I would also include <strong>SSRN</strong> (<a href="http://www.ssrn.com">Social Science Research Network</a>), which uses a different model, but is as important for the working paper and preprint culture in the social sciences as ArXiV is in physics/mathematics.</p>
<h2>Google Scholar Metrics</h2>
<p>In April 2012 Google launched <a href="http://scholar.google.com/citations?view_op=top_venues&amp;hl=en&amp;vq=en">Google Scholar Metrics</a>, listing the top 100 publications (according to their h5-index) in several disciplines. Six out of the top 10 publications in physics/mathematics are ArXiV sections (arXiv Astrophysics (astro-ph) is #2), the <a href="http://www.iza.org/en/webcontent/publications/papers">IZA Discussion Papers</a> are #1 in Social Sciences, and the <a href="http://www.nber.org/papers.html">NBER Working Papers</a> are #1 in Economics, and arXiv Astrophysics (astro-ph) is #12 on the top 100 list for all disciplines (#1-5 are journals in biology and medicine: <em>Nature</em>, <em>New England Journal of Medicine</em>, <em>Science</em>, <em>Lancet</em>, <em>Cell</em>). All these metrics are a strong indicator that preprints can be highly cited.</p>
<h2>Citation Advantage of Preprints</h2>
<div title="Page 1">
<p>Anne Gentil-Beccot et al. have written a nice paper (of course <a href="http://arxiv.org/abs/0906.5418">available as preprint</a>) that shows that publication as preprint now only increases the citation rate for the corresponding peer-reviewed article published later, but also leads to much faster citations, with a peak immediately after publication.</p>
<p><a href="http://blogs.plos.org/mfenner/files/2013/03/gentil-beccot.png"><img class="size-full wp-image-3749 alignnone" alt="gentil-beccot" src="http://blogs.plos.org/mfenner/files/2013/03/gentil-beccot.png" width="500" height="319" /></a></p>
<div title="Page 6">
<p><em>Average number of citations per article per month as a function of the time of the citation relative to the time of publication. From <a href="http://arxiv.org/abs/0906.5418">http://arxiv.org/abs/0906.5418</a></em></p>
<h2>SCOAP3</h2>
<p>The Sponsoring Consortium for Open Access Publications in High Energy Physics (<a href="http://scoap3.org">SCOAP3</a>) is working on turning the majority of peer-reviewed publications in high energy physics into gold open access. It is important to understand that the high energy physics community feels that they need peer-reviewed journal articles in addition to ArXiV.</p>
<h2>Preprint Culture in Clinical Medicine</h2>
<p>It is a little known fact that there is a strong preprint culture in clinical medicine. I have written about this topic in <a href="http://blogs.plos.org/mfenner/2010/10/16/in-which-i-suggest-a-preprint-archive-for-clinical-trials/">October 2010</a>. Clinical trials have to be registered before starting the trial, and information about the trial is publicly available in <a href="http://clinicaltrials.gov">clinicaltrials.gov</a> and other registries. Results are presented in conferences (as poster or oral presentation), at which stage it becomes public information. The peer-reviewed paper – with a few exceptions – follows much later, sometimes even after drug approval by the FDA (in the blog post I used the <a href="http://dx.doi.org/10.1016/S0140-6736(10)61389-X">TROPIC</a> trial as example). The problem is of course that information in oral presentations and posters is incomplete and difficult to find. But publication of a clinical trial in a peer-reviewed journal is more about giving credit to the researchers involved (similar to SCOAP3 in high energy physics) than about spreading the knowledge. Peer review is not an appropriate filter for whether or not a new drug or drug combination should be used to treat patients – the approval process by regulatory authorities is much more extensive than any peer review can ever be.</p>
<h2>Preprint culture in biology</h2>
<p>The paper mentions several reasons why the field of biology has essentially no preprint culture. One argument against preprints is that it would be easier to steal ideas. Although I agree with the authors that preprints are a great way to establish precedence, there is a big difference between research based on years of work using expensive equipment (as is often the case in high energy physics but also some other fields), and research that can be reproduced in a few weeks. In the latter case it is possible that someone else is faster in publishing the peer-reviewed paper. Another difference is the community: &#8220;stealing&#8221; ideas from someone else is probably more difficult in smaller scientific communities, and some scientific communities are more competitive and less collaborative than others.</p>
<p>Another concern about preprints raised in the paper is the Ingelfinger rule, i.e. the uncertainty that a journal would accept a manuscript if already published as a preprint. This concern is fortunately unfounded regarding most publishers, and the paper includes a table listing the preprint policies of important publishers in biology.</p>
<p>I would like to add two other reasons why the preprint culture is probably not established in biology. Preprints are competition for the peer-reviewed journal article and scholarly publishers might not be particularly interested in encouraging a preprint culture. A lot has fortunately changed since E-BIOMED in 1999.</p>
<p>Finally, whereas some disciplines use preprints and working papers to communicate, in biology the preferred way to communicate research findings before publication of a peer-reviewed paper is the oral presentation. What we may need is a service that makes it easy to upload and share scientific presentations. We for example already have Slideshare, Speaker Deck as generic tools, and SciVee, fishare aimed at scientists. Speaker Deck <a href="http://blogs.plos.org/mfenner/2012/05/29/speaker-deck-for-sharing-presentations/">is currently my favorite tool</a> and is a Github product (Github has been mentioned in the manuscript as an option for hosting preprints). Maybe what is missing is a killer combination of features in a new or existing service – persistent identifiers, uploading of background material (text, data, software, video) in addition to the slides, non-textual search, cooperation with conference organizers, etc. – for presentation sharing to take off as a way to establish a preprint culture in biology.</p>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_id=info%3Adoi%2F10.6084%2FM9.FIGSHARE.655710&amp;rfr_id=info%3Asid%2Fdatacite.org%3Asearch&amp;rft.atitle=The+case+for+open+preprints+in+biology&amp;rft.date=2013&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rft.genre=unknown&amp;rft.au=Philippe+Desjardins-Proulx&amp;rft.au=Ethan+P.+White&amp;rft.au=Joel+Adamson&amp;rft.au=Karthik+Ram&amp;rft.au=Timoth%C3%A9e+Poisot&amp;rft.au=Dominique+Gravel"><strong>Philippe Desjardins-Proulx, Ethan P. White, Joel Adamson, Karthik Ram, Timothée Poisot, Dominique Gravel</strong>. The case for open preprints in biology. Figshare; 2013. Available from: <a href="http://dx.doi.org/10.6084/M9.FIGSHARE.655710">http://dx.doi.org/10.6084/M9.FIGSHARE.655710</a></span></p>
<p><em>Update 4/4/13: Yesterday PeerJ launched a new preprint service for life sciences research. Read <a href="http://blogs.scientificamerican.com/guest-blog/2013/04/03/who-killed-the-preprint-and-could-it-make-a-return/">this blog post</a> for details, and this <a href="http://blog.mendeley.com/open-access/is-the-time-right-for-a-preprint-server-for-life-science/">post</a> on the Mendeley blog.</em></p>
</div>
</div>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/-UApNoBzulE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/03/30/comment-the-case-for-open-preprints-in-biology/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/03/30/comment-the-case-for-open-preprints-in-biology/</feedburner:origLink></item>
		<item>
		<title>Some Thoughts on Beyond the Paper</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/SdqHqtrJ7TE/</link>
		<comments>http://blogs.plos.org/mfenner/2013/03/28/some-thoughts-on-beyond-the-paper/#comments</comments>
		<pubDate>Thu, 28 Mar 2013 08:41:57 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Thoughts]]></category>
		<category><![CDATA[altmetrics]]></category>
		<category><![CDATA[beyondthepdf]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3721</guid>
		<description><![CDATA[Today the journal Nature has released a special on the Future of Publishing. It includes a lot of interesting reading, but I want to focus on the comment Beyond the Paper by Jason P <a href="http://blogs.plos.org/mfenner/2013/03/28/some-thoughts-on-beyond-the-paper/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>Today the journal <em>Nature</em> has released a special on the <a href="http://www.nature.com/news/specials/scipublishing/index.html">Future of Publishing</a>. It includes a lot of interesting reading, but I want to focus on the comment <a href="http://dx.doi.org/10.1038/495437a">Beyond the Paper</a> by Jason Priem. In the comment Jason describes his vision of the future of scholarly communication, a future where many of today&#8217;s roles for articles and journals will be replaced by the <em>decoupled journal</em> and online tools taking the lead in dissemination and filtering of scholarly content.</p>
<p>Jason makes a strong case for this vision, and takes his time to also discuss the concerns and challenges. He doesn&#8217;t have the space to discuss in more detail how we get to that future, and in particular what the role of researchers, publishers, libraries and funders be in that transition.</p>
<p>Jason&#8217;s vision will probably be overwhelming for many researchers, and might not directly address what is probably the biggest issue for most researchers: funding for grants and jobs is limited, and the processes we use to select for good science and good scientists are inefficient and often arbitrary. Most students entering graduate school will not be able to have a career in academia, and most academics will say that they spend far too much time with evaluations &#8211; of their own work and the work of others. It is unclear to me how we can get from the current system &#8211; where one misstep such as denied grant or submission to the wrong journal can mean the end of a career &#8211; to the system that Jason envisions. The current climate doesn&#8217;t really foster experimentation by researchers and I am interested to understand how researchers can take part in this process of change.</p>
<p>The vision of the decoupled journal is very threatening for some of the stakeholders of the current scholarly communication ecosystem, in particular publishers and libraries. Every journal publisher and library knows that it has to reinvent itself to survive the digital transformation, but a vision that is build around a new ecosystem of service providers needs to be clear how publishers and libraries can be part of the transformation process.</p>
<p>Lastly, I disagree with the notion that <em>today&#8217;s publication silos will be replaced by a set of decentralized, interoperable services that are built on a core infrastructure of open data and evolving standards — like the Web itself.</em> I would argue that both scholarly communication and the web in general have a tendency for centralization, and that scientific infrastructure needs to be interoperable first and decentralized second. Without a focus on interoperability the future of scholarly communication will not be open and in the hands of many, but will be a race to become one of the dominant players in this new ecosystem, and we might end up with not 1000s of libraries and publishers but just a handful of technology companies holding the keys to our scientific infrastructure.</p>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Nature&amp;rft_id=info%3Adoi%2F10.1038%2F495437a&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Scholarship%3A+Beyond+the+paper&amp;rft.issn=0028-0836&amp;rft.date=2013&amp;rft.volume=495&amp;rft.issue=7442&amp;rft.spage=437&amp;rft.epage=440&amp;rft.artnum=http%3A%2F%2Fwww.nature.com%2Fdoifinder%2F10.1038%2F495437a&amp;rft.au=Priem%2C+J.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Research+%2F+Scholarship">Priem, J. (2013). Scholarship: Beyond the paper <span style="font-style: italic">Nature, 495</span> (7442), 437-440 DOI: <a rev="review" href="http://dx.doi.org/10.1038/495437a">10.1038/495437a</a></span></p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/SdqHqtrJ7TE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/03/28/some-thoughts-on-beyond-the-paper/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/03/28/some-thoughts-on-beyond-the-paper/</feedburner:origLink></item>
		<item>
		<title>Using d3.js to visualize Article-Level Metrics over time</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/PLy3kbfeJK8/</link>
		<comments>http://blogs.plos.org/mfenner/2013/03/26/using-d3-js-to-visualize-article-level-metrics-over-time/#comments</comments>
		<pubDate>Tue, 26 Mar 2013 23:42:52 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Thoughts]]></category>
		<category><![CDATA[article-level metrics]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3679</guid>
		<description><![CDATA[PLOS Article-Level Metrics (ALM) are a great set of data (available via API and as monthly data dump) for some nice data visualizations. I have recently become a big fan of th <a href="http://blogs.plos.org/mfenner/2013/03/26/using-d3-js-to-visualize-article-level-metrics-over-time/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>PLOS Article-Level Metrics (ALM) are a great set of data (available via API and as <a href="http://article-level-metrics.plos.org/plos-alm-data/">monthly data dump</a>) for some nice data visualizations. I have recently become a big fan of the <a href="http://d3js.org">d3.js</a> javascript library, and have now used d3 to look at some ALM data over time.</p>
<p><span id="more-3679"></span></p>
<p>I like simple visualizations without too many labels or axes, and wanted to do a visualization inspired by <a href="http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001OR">sparklines</a> ever since we discussed this idea in our <em>altviz</em> breakout group at the <a href="http://article-level-metrics.plos.org/alm-workshop-2012/hackathon/#altviz">ALM workshop hackathon</a> in November 2012 (kudos in particular to Juan Alperin, Karthik Ram and Carl Boettiger). In the chart below every column represents the numbers for a given month, with alternating colors for the years (the article was published November 2009).</p>
<p><a href="http://blogs.plos.org/mfenner/files/2013/03/sparklines4-e1364338936408.png"><img class="size-full wp-image-3693 alignnone" alt="sparklines4" src="http://blogs.plos.org/mfenner/files/2013/03/sparklines4-e1364338936408.png" width="300" height="263" /></a></p>
<p><em>CiteULike bookmarks, usage stats from PLOS website and blog posts for article <strong>Article-Level Metrics and the Evolution of Scientific Impact</strong> by month, available at <a href="http://dx.doi.org/10.1371/journal.pbio.1000242">http://dx.doi.org/10.1371/journal.pbio.1000242</a>.</em></p>
<p>You can see a pattern that is probably typical for many articles independent of the absolute numbers: most pageviews and downloads happen in the weeks after publication, as does academic bookmarking and science blogging.</p>
<p>The second example shows a very different pattern. This is not only the <a href="http://alm.plos.org/sources/counter">most-downloaded</a> PLOS article, but  the distribution of downloads over time is very different, with the number of monthly downloads actually higher the last two years (this article was published in August 2005). We also see a few spikes in the usage stats, probably indicating events that triggered usage. Academic bookmarking was most active from 2009 to 2011 and not right after publication, although that might also have to do with the relative popularity of CiteULike over time.</p>
<p><a href="http://blogs.plos.org/mfenner/files/2013/03/sparklines5.png"><img class="size-medium wp-image-3697 alignnone" alt="sparklines5" src="http://blogs.plos.org/mfenner/files/2013/03/sparklines5-500x262.png" width="500" height="262" /></a></p>
<p><em>CiteULike bookmarks, usage stats from PLOS website and blog posts for article <strong>Why Most Published Research Findings Are False</strong> by month, available at <a href="http://dx.doi.org/10.1371/journal.pmed.0020124">http://dx.doi.org/10.1371/journal.pmed.0020124</a>.</em></p>
<p>Citation data are unfortunately more difficult to get with exact publication dates (why is that so difficult?), but we can at least look at CrossRef numbers by year for the same article.</p>
<p><a href="http://blogs.plos.org/mfenner/files/2013/03/sparklines31-e1364340861413.png"><img class="size-full wp-image-3689 alignnone" alt="sparklines3" src="http://blogs.plos.org/mfenner/files/2013/03/sparklines31-e1364340861413.png" width="200" height="169" /></a></p>
<p><em>CiteULike bookmarks, usage stats from PLOS website and blog posts for article <strong>Why Most Published Research Findings Are False</strong> by year, available at <a href="http://dx.doi.org/10.1371/journal.pmed.0020124">http://dx.doi.org/10.1371/journal.pmed.0020124</a>.</em></p>
<p>The citation numbers by year are still increasing (the last bar is for 2013), indicating that this article is still of general interest 8 years after publication. This would probably be unusual for a life sciences research article, but the article is an essay looking at common pitfalls in the statistical analysis of research data.</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/PLy3kbfeJK8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/03/26/using-d3-js-to-visualize-article-level-metrics-over-time/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/03/26/using-d3-js-to-visualize-article-level-metrics-over-time/</feedburner:origLink></item>
		<item>
		<title>New version of Article-Level Metrics app released</title>
		<link>http://feeds.plos.org/~r/plos/blogs/mfenner/~3/HQaElkb_HnA/</link>
		<comments>http://blogs.plos.org/mfenner/2013/03/21/new-version-of-article-level-metrics-app-released/#comments</comments>
		<pubDate>Thu, 21 Mar 2013 17:11:45 +0000</pubDate>
		<dc:creator>Martin Fenner</dc:creator>
				<category><![CDATA[Snippets]]></category>
		<category><![CDATA[altmetrics]]></category>
		<category><![CDATA[article-level metrics]]></category>

		<guid isPermaLink="false">http://blogs.plos.org/mfenner/?p=3671</guid>
		<description><![CDATA[On Tuesday we released the latest version of the PLOS Article-Level Metrics application. As always, the source code is available at Github. The changes in this version focus on im <a href="http://blogs.plos.org/mfenner/2013/03/21/new-version-of-article-level-metrics-app-released/" class="read_more">Read more ...</a>]]></description>
				<content:encoded><![CDATA[<p>On Tuesday we released the latest version of the <a href="http://article-level-metrics.plos.org">PLOS Article-Level Metrics application</a>. As always, the source code is available at <a href="https://github.com/articlemetrics">Github</a>. The changes in this version focus on improving API perfomance, making it easier to install the application, and RSS feeds for the most popular articles by source and publication date (e.g. <a href="http://alm.plos.org/sources/twitter.rss?days=7">the most tweeted papers published in the last 7 days</a>). See the <a href="https://github.com/articlemetrics/alm/wiki/2.6">Github Wiki page</a> for more details, in the Wiki you also find the development <a href="https://github.com/articlemetrics/alm/wiki/Roadmap">roadmap</a> and the <a href="https://github.com/articlemetrics/alm/issues">issue tracker</a> for feature suggestions.</p>
<img src="http://feeds.feedburner.com/~r/plos/blogs/mfenner/~4/HQaElkb_HnA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.plos.org/mfenner/2013/03/21/new-version-of-article-level-metrics-app-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
	<feedburner:origLink>http://blogs.plos.org/mfenner/2013/03/21/new-version-of-article-level-metrics-app-released/</feedburner:origLink></item>
	</channel>
</rss>
