Academia will eat itself*: the awkward love triangle of scholarly publishing

6 Jun

*With apologies to Pop Will Eat Itself, and thanks to my colleague Becky for the idea

The problem

There are three major stakeholder groups in scholarly publishing: publishers, academics, and libraries. The awkward love triangle arises because each of these groups needs one of the others, but this relationship is not symmetrical between any two parties. This reminded me of the German chemist Kekulé, who had a dream about a snake eating its own tail, which inspired him to propose a cyclic (ring) structure for benzene. Because of the asymmetrical love triangle, it’s difficult to reach agreement through negotiation, as none of the three stakeholder groups can bargain fairly with each other.  One party needs something from another, but it’s always the other who holds all the cards.

Consequences

Publishers need academics, as without their research output, there would be no market for scholarly publications. Librarians need publishers, as this is the only way they can purchase access to academic materials for library collections (both print and online). Academics need to read, and to get their research published (both to advance their field and because of the prestige, specifically that which comes from publishing in journals with high impact factors).  They expect books to be on the shelves/articles to be available online, and are not usually interested in the financial/collection managements aspects of how they came to be there. Researchers need librarians to manage the library collection; and make it discoverable (cataloguing and metadata) and accessible (arrangement in physical buildings, technology behind online access, inter-library loan agreements).  However, as the balance of library collections shifts towards digital and away from physical media, this role is becoming less visible. Publishers argue that despite declining print publications and associated overheads, the cost of e-resources publication justifies their continued high prices and indeed annual price rises, in a time of budget cuts for many library services.  These cuts often mean that not only can we not afford any new subscriptions, but we often have to cut some existing titles just to be able to maintain core subscriptions, which are growing more expensive every year.  For acquisition teams purchasing materials in currencies other than British pounds sterling, this problem is exacerbated by fluctuating exchange rates.  And don’t get me started on VAT…

Where is the added value in academic publishing?

However, the key ingredient that makes scholarly publishing valuable is the peer review process.  The details and timeline vary between disciplines, but this is an outline of how the process work. Academics sit on editorial boards and review papers submitted for publication in a journal.  During this process, errors will be identified and suggestions made for clarification or extension of the work, so that by the time an academic article has been published, it will have undergone a rigorous review process. It is important to note that the publishers do not usually pay peer reviewers for their work.  So the core principle which makes this type of publishing “scholarly” is in fact not part of the financial mechanism.  And yet, where are all the profits going?

Access to knowledge

In order to protect their revenues, academic publishers host their content on platforms which are protected by a paywall, so you can only (legally) access the content if you are an authorised user – usually a current student or staff member at a university with a current subscription to the content you’re trying to access. Access to scholarly content is very difficult for those without a university affiliation, and is becoming harder as more content is available online rather than in print at university libraries, because the licencing restrictions for e-resources are generally less favourable to walk-in users than the rules for the same content in print format.  For example: switching from print books to ebooks limits access to content for walk-in users, who would otherwise have been entitled to come in and consult the print copy. What about access for researchers whose university library has a subscription to a particular platform, but not the specific content they wish to read?  Niche subjects are particularly vulnerable as pressures on budgets oblige libraries to focus on core, high-use subscriptions. Some researchers do not have university affiliations; for example, they may work on projects funded by charities or funding councils; and many will not have access to digital or print collections except as a walk-in user. Not everyone who is capable of making a valid contribution to our sum of knowledge is working as an academic, losing us the scholarly potential of all those who are in other forms of employment, or unemployed, or retired, or studying outside the HE system (e.g. at school, or independently). Questions:

  • As a society, what value do we place on universal access to knowledge, and how much money should we make available to pay for it?
  • Should the results of publicly-funded research should be freely available for all to read?
  • Does the current purchasing system in which scholarly articles can only be bought from one publisher amount to a monopoly on legal access to that knowledge?
  • What is an appropriate pricing strategy for access to knowledge (especially that which was not created or moderated by the seller)?
  • How will digital media affect our laws about copyright and copying?  These laws were originally developed for print media and are increasingly at odds with behaviour in a digital environment (e.g. format shifting).

Value and price

Despite the best efforts of a presenter at the UKSG conference last month, I am not convinced that publishers and librarians share the same values. For example, consider our beliefs about what happens to the value of knowledge once it is shared: for many information professionals, sharing knowledge increases its value; whereas to publishers, if this happens without money changing hands, it represents a lost revenue stream. Does knowledge depreciate?  This matters when it comes to the issue of embargoes in the Green OA model.  For example, a 12-month embargo on articles in literature or history will be much less of an obstacle in that field than the same embargo in medicine or financial mathematics.  The stats on article downloads show that the half-life of papers in the sciences is much shorter than in the humanities.

Open Access?

The problem with Open Access is that neither the gold nor the green options challenge the underlying publishers’ pricing model.  The Gold OA model, an article processing charge (APC) is payable when an article is submitted for publication, effectively shifting the cost from pay-to-access to pay-to-publish. We are currently living in the second information revolution – like the printing press before it, the availability of information via the internet is having wide-ranging effects on the way we live.  In both cases, the sudden change in how much it cost to make information available held great promise for a more democratic and inclusive culture, as well as a scramble for political and financial control of the new medium. The additional layer of social media, which allows us to connect with each other in a way that overcomes barriers of geography, time zones, and social hierarchy; allows us a second chance at a period of Enlightenment.  Perhaps our Twitter connections and special interest groups on LinkedIn are a modern version of the 18th-century coffeehouses where people met to discuss new ideas

Possible solutions

  • Organise a No Access Day to raise awareness of subscription resources and their costs (like a boycott)

However, this is rather negative and seems rather too much like going on strike. I think it’s key to get academics involved, as they are the only stakeholders who can put pressure on the publishers.  How about:

  • Encourage academics get involved in boycotts e.g. signing up to The Cost of Knowledge, a site where academics publicly sign up to a boycott of Elsevier
  • Encourage academics to make different choices about where they publish e.g. switching to true* open access journals such as Public Library of Science (PLOS)

*By “true” open access journals, I mean those of not-for-profit publishers rather than those from commercial publishers whose open access is funded by APCs.

  • Educate academics about how publishing with the Big Publishers often involves signing away their intellectual property rights – if this makes them angry, perhaps they will mobilise and act with us?
  • Appeal to academics’ sense of the value of universal education and the reduction of cost barriers to education for everyone

What do you think?  Could this be an opportunity for a wholesale review of the economics of scholarly publishing?

JISC Collections Roadshow

16 May

My summary of yesterday’s JISC Collections Roadshow in Edinburgh/Musselburgh

Report on JISC Collections satisfaction survey (Vicky Legge)

  • High priority from respondents that licences should be 100% compliant with the NESLi2 model licence
  • Open Access main priority is gaining discounts on APCs

I can’t help wondering about value of satisfaction survey about a product you’re virtually obliged to use… it’s not like there’s much choice!

Update on upcoming renewals (Richard Savory)

  • “We have no control whatsoever about how much they increase the access fee…”
  • “Keeping price increases as low as possible” but this still means we’re having to make cuts as our budgets shrink…
  • Possible movement between JISC bands to reflect budget situation? Not going to help if we all shift down one
  • Seeking clarity over post-cancellation access – perpetual access to previously paid-for content
  • Negotiations are seeking to ensure that licences take APCs paid into account to avoid double-dipping
  • Usage trends: usage is up 2012 vs 2011 on the whole. Exceptions: IoP (-1%), Project MUSE (-0.14%). Maybe to do with Arxiv/Open Access?
  • Nature Publishing Group goes from strength to strength re: usage, making it hard to keep price increases down
  • ACS usage flat; RSC up on last year
  • Small publishers update: 21 new offers this year; new deals for BMC, ICE, NRC, RSM, Zeta Books; several more under consideration for 2014

JISC bands (Carolyn Alderson)

  • JISC bands changing from A-E to 1-6 to introduce slight increase for band 1 & decrease for band 6 (and some of 5) – spread costs a bit more
  • One band E institution is changing to band 5, with 28.72% increase
  • Several mentions of increasing charges tied to increasing usage… but increased usage doesn’t change the publishers’ overheads, so why charge more?

Update from Thomson Reuters

  • “It’s the year of the WoK renewal!”
  • He is going to try to explain the difference between WoK and WoS (good luck with that) [@richperkinslib: My attempt at a stir-fry in a wos was a miserable failure]
  • “It’s more difficult for you to manage than it is for us to administer” and an acknowledgement of difficulties accommodating walk-in users!
  • New search architecture for WoK
  • Transforming WoS into WoK: expansion of citations, extension of regional content, a new design philosophy

Pondering: publishers – which of you would like to be first to be well-known for a good reason rather than a bad one?

  • WoS (product) = past, WoK (platform) = future. So that’s that, then.  New pricing model will be more like a Big Deal bundle
  • Medline… available through variety of vendors. But if the product isn’t the same, why keep same name? Very hard for users to grasp difference
  • Thomson Reuters are looking to introduce a common metadata standard across repositories
  • “We need to break free from the shackles of…” Sadly the rep paused to think and then steered sentence in new direction
  • Someone asks about how to reconcile availability of more info with shrinking budgets…
  • … Rep: “these are the cards we’ve been dealt… we just have to work with it”

Seems like the elephant in the room is The Cost Of Your Subs Is Just Too High

  • JISC employee suggests that new bands may help – no they won’t, all they do is redistribute costs, not challenge prices!
  • From ‏@moananddrone (MT): Indeed. This is why Gold OA with corporate publishers is a fallacy as their APCs are defined by brand, not measured by cost.

Demo of Knowledge Base+

I wrote about KB+ in my Highlights from UKSG post.

@orangeaurochs I’m going to write a murder mystery which is solved using the knowledge that no two lists of ebooks are ever identical.

- seems especially appropriate during presentation about combining and managing data sources in KB+!

JUSP (Vicky Legge)

  • JUSP [the Journal Usage Statistics Portal] provides a single point of access to ejournal usage data – can use federated login
  • 152 UK HE and research council libraries in JUSP, and over 50 publishers. Can use it for SCONUL returns
  • JUSP can help you identify usage of core titles in a deal, and compare with unsubscribed titles in the deal
  • JUSP is a free service!

Janet update (Robert Prabucki)

  • Janet is the UK’s National Research & Education Network (NREN) – private network provider
  • Janet is not-for-profit, and is part of the new company called “JISC Collections & Janet”
  • Janet national backbone and regional hubs being upgraded - Janet6 will be switched on this month

JISC eCollections update (Richard Savory)

  • JISC MediaHub has improved advanced search, new MyMediaHub section, new collections including IET.tv and Courtald

And a bit of chat about licences…

Decision tool for determining if partner college students count as HE authorised users… Survey results re: decision tool suggest that it didn’t help to clarify the situation about whether partner college students counted

Le sigh. RT @daveyp: @laurajwilkinson Think we were once told by a vendor “just don’t ask” when we asked about partner colleges :-D

I asked a question about whether JISC Collections have the appetite for renegotiating some of the T&Cs of the licences.  The wording of parts of the NESLi2 licences is out of step with the variety of modes of study that are available at many UK HE institutions, and above all, the wording has evolved from the regulations for print materials, which don’t translate well into a digital environment.

One of the JISC Collections employees said that various documents were available to help with clarifying what you can and can’t do (e.g. exactly who our authorised users are) but I explained that this creates uncertainty and that senior library staff may not be willing to take a risk on interpretation – it is better for the original rules to be straightforward.

My suggested rewriting of the Ts&Cs:

I license this resource for educational, non-commercial use only; and I trust my professional library colleagues to do what is necessary to enable such use to any and all users who wish it.  Love, [publisher] xx

Highlights from #UKSGlive 2013

25 Apr

My learning round-up from the 2013 UKSG Annual Conference

Phil Sykes – University of Liverpool – Open Access Gets Tough

Video of talkSlides from talk

SykesIs the triumph of OA is now inevitable? Maybe not – the current situation results from a fortuitous combination of circumstances, such as the political involvement of strongly pro-OA individuals.  We have to provide strong support for OA through our professional bodies and via skilled advocacy on campus.

“Librarians insulate departments and academics too well from the true costs of their journal subscriptions.”  We now have the opportunity to be star actors in the transformation.  Nothing is inevitable, it’s time to get to work.

Jill Emery – Portland State University – Mining for gold: identifying the librarians’ toolkit for managing hybrid OA

Video of talkSlides from talk

EmeryWe need to influence the change in academia not just within scholarly publishing, but also by getting academics involved.  Make recommendations to them on where to publish.  Don’t wait for the invitation – start the conversation on campus.  We need to re-evaluate our budgets and allocate resource for APCs [article processing charges].  Get into marketing and promotion game.

Research and researchers: identity and evaluation Jenny Delasalle – Uni of Warwick – Research evaluation: why is it relevant to librarians?

Video of talkSlides from talk

DelasalleSnowball metrics project – recipe book now available - sets out best practice for how data can be used to support institutional decision-making

Alt metrics [alternative metrics, linked to social web] – open to manipulation, but so are citation measurements.  Opportunities for librarians – the article-level economy is coming, availability of alt metrics will support interest in other kinds of inputs than journal articles.  Researchers want someone with technical expertise who can provide answers and reassurance – librarians can do this.

Laurel L. Haak – ORCID – Connecting research and researchers: ORCID ORCID mission: connecting research with researchers

Video of talkSlides from talk

HaakORCID is an open, non-profit, community-based effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers”

ORCID is to individuals what DOIs are to articles.

Register for your ORCID iD here.  Can embed ORCID IDs in workflows e.g. University CRISs, manuscript submission, grant applications, links with repositories, linkage with other IDs like Scopus Switchboard.

The new digital students, or, “I don’t think I have ever picked up a book out of library to do any research – all I have used is my computer” Lynn Silipigni Connaway, OCLC

Video of talkSlides from talk

Connaway

“A diamond is a chunk of coal that is made good under pressure.” – Henry Kissinger.

Users don’t think that e-resources are library resources.  Users are confident in their information seeking and evaluation skills,and they believe that the same info in multiple places means it’s true…

The learning black market – covert online study habits e.g. they use Wikipedia, they don’t cite it, they feel guilty about it. Some admit to citing the references at the end of Wikipedia articles, even when they haven’t read them.  ”One size fits no-one.”

Idea: Create personas from special collections and use them to make social media accounts more personal.

The student-information relationship: a perspective of its evolution – Joshua James Harding, Warwick Medical School

Video of talkSlides from talk

HardingDigital consumption was Joshua’s solution to the problem of many heavy textbooks.  He was an early adopter of IT and is now a paperless student.

Having everything on his iPad means he can do clinics and be able to check details, notes, and carry on with confidence.   Inkling – interactive ebooks, also called smart or multitouch ebooks, including the option to buy chapters. While he studies a textbook, it studies him – maps his progress in terms of what’s been covered, how long spent on each chapter, sets alerts for him to go back and revise specific areas after set times.  Librarians must improve pathways for making this info available to students.

Joshua asks why all paper textbooks aren’t available in e format.  Other problems: fragmentation, reduced variety, variable quality, different platforms and formats…  ”Epubs are horrible as textbooks and I urge you, don’t make them any more!”

“I want to be able to annotate PDFs using third-party apps… I think we all want to see the end of proprietary formats.”

Maximising the Knowledge base – the community-driven initiatives KB+ and GOKb – Liam Earney, JISC Collections

Video of talkSlides from talk

Earney

Knowledge Base+ (KB+) in the UK; Global Open KnowledgeBase (GOKb) in the USA.

Both projects aim to capture info that libraries need for managing their eresources portfolios and make this information available to other stakeholders too.  KB+ and GOKb share interests in licensing, but GOKb has a greater focus on sharing a higher level of information across many institutions.

Issues surrounding quality and availability of data in the supply chain:

  • Accuracy (many publishers don’t seem to understand what they publish) e.g. They often can’t make lists of what they publish, or sales/back office have different lists… Laughter in the room as this problem is widely recognised!
  • Availability – not all parts of supply chain have access to all info they need; despite huge duplication of effort with many people involved in maintaining various different databases, but which contain broadly the same info
  • Interoperability: spreadsheets, library staff, link resolver, publisher rep, JISC Collections – all have different silos of info about an institution’s subscription.

Open data delivers practical benefits e.g sharing and collaboration, improved accuracy, reduced burden on any one element in supply chain.

“The nice thing about standards is that you have so many to choose from” – Andrew Tanenbaum

… But do please choose one!  Let your data be promiscuous… Set your data free (but tidy it up first!).  Liam noted that titles have longer relationships with institutions than with publishers.

Electronic resources and ILL – a self-contradiction? – Helle Brink, Aalborg University

Helle gave us an overview of inter-library loans in Denmark, and neatly summarised the current situation in which we can often supply digitised print items but have to send print copies of electronic resources (owing to licencing restrictions).

Possible new models for partial access include:

  • Updating the definition of “walk-in user” to include electronic walk-in?
  • Pay-per-view
  • Voucher solutions e.g. 10 articles per year
  • Read-only, no download or print
  • ILL after embargo e.g. 3 months
  • Public access after embargo
  • New ideas?

It seems to me that most problems involving e-resources and ILL arise from the nature of the licences for electronic resources  the terms and conditions evolved from contracts regulating the use of print resources, and they don’t translate well to the new medium.  Maybe it’s time to design e-resources licences specifically for digital media, rather than simply adapting contracts designed for a print environment?

The twenty-year butterflies: which web cookies have stuck in the internet’s pan? – Jason Scott, Archive Team

Video of talk

Archiveteam

“There is no rare – there is only expensive” – Juke Joint Johnnie.  Jason says, “There is no gone, there is only forgotten.  If we take the smallest amount of effort to set up things to be remembered, they will never be forgotten.”

Archive Team recognise three virtues: rage, paranoia, and kleptomania.

“Instead of the cloud, I call it the clown.  It’s more descriptive!”

“Tiny URLs are one-time crytographic keypads.  We have discovered link shortening services than re-use short URLs.  I’m not a Luddite, but too many people are putting too much trust in storing things on the Internet.”

Publishers and librarians: we share the same values – why are we fighting? – T. Scott Plutchak, University of Alabama at Birmingham

Video of talkSlides from talk

Scott Plutchak“We are some of the luckiest people alive because we get to be part of this.”

“The challenges are technological, cultural, and social.”

“Librarians and publishers – communities that are two key players, but have diametrically opposed views about how to achieve the same goal of making info widely available.”

I disagree that publishers share this goal with librarians.  Are publishers really focused on access to information? Would they still be interested if it were not for profit?  And the routes into the two careers could hardly be more different – how many publishers do vocational training, or volunteer in the information sector as part of their career development?

I think there is a fundamental difference between librarians and publishers in their perception of the value of knowledge and what happens to it when it is shared – to librarians, sharing knowledge increases its value (especially in research); but to a publisher, knowledge sharing without payment represents lost revenue.

The difference between the price and the value of knowledge, eh…

Other

  • A group of us went for a run on Tuesday evening – the inaugural UKSG 5K?  Maybe it should be part of the official programme from 2014 onwards!
  • A knitting breakout session – another idea for the programme
  • Meeting lots of people from other countries who were talking about their work in English has prompted me to acquire some professional vocabulary in other languages.  I’m starting with ALA’s Multilingual Glossary, which includes French and Spanish
  • Referring to a discovery interface with different resource elements as a “bento box approach” – nice description!
  • I think many people still think of UKSG as a serials organisation, but their remit has evolved in recent years and their current mission is to “connect the knowledge community and encourage the exchange of ideas on scholarly communication” – so if you think UKSG isn’t relevant to your role, perhaps it’s worth taking another look?

Perspectives on the Elsevier takeover of Mendeley

22 Apr

Quotations are cherry-picked to represent the points I found most interesting – please follow the links to each post to see each in its original context.

Elsevier’s press release

“Mendeley is an innovative company with great culture, talent and collaborative spirit, and we will keep it that way,” said Olivier Dumon, Managing Director of Academic and Government Research Markets at Elsevier. “Not only that, but together we intend to scale and evolve Mendeley in ways that benefit the entire research community. We will provide greater access to content, data, and analytics tools to Mendeley’s users and its flourishing third-party app ecosystem, all of which will enable us to increase both Elsevier’s and Mendeley’s engagement with researchers.”

Announcement on the Mendeley blog

Of course, we are aware that – especially in the past year – the academic community has criticized Elsevier for some of its policies and positions. Our own relationship with Elsevier has been conflicted at times. Elsevier is a multi-faceted company with over 7000 employees, so it is impossible to put them into a single box. We were being challenged by some parts of the organization over whether we intended to undermine journal publishers (which was never the case), while other parts of the organization were building successful working relationships with us and even helped to promote Mendeley.

See also this follow-up post: Mendeley and Elsevier – here’s more info

A Matter of Perspective — Elsevier Acquires Mendeley . . . or, Mendeley Sells Itself to Elsevier from Scholarly Kitchen

Mendeley’s future at Elsevier seems to be a mix of “remain a standalone platform, building on what people know and like” and “integrated more deeply with Elsevier’s sales and technology strategies.” There seem to be some obvious paths for Mendeley at Elsevier, as a discovery tool for readers in Science Direct or as an almetrics tool built into Scopus. In a blog post published last night, Mendeley has this cryptic description of its role at Elsevier:

“Mendeley will become Elsevier’s central workflow, collaboration, and networking platform, while we continue on our mission of making science more open and collaborative.”

So perhaps Scopus and Science Direct are about to be subsumed by Mendeley. 

Comment on this post by Rick Anderson:

Personally, I’d like to see our community stop using the word “sharing” when what we actually mean is “copying.” The word “sharing” invokes a halo effect (just as the word “piracy” does the opposite), whereas “copying” has the twin virtues of accuracy and political neutrality. It seems to me that the more we call this behavior what it is, the more likely it is that we’ll be able to have rational conversations about it.

The Story Behind the Elsevier Purchase of Mendeley – Interview with Co-founder Victor Henning from mmit blog

The truth is that no one really knows what impact this will have on the open access and altmetrics movements just yet. In addition how it may impact digital copyright, as many saw Mendeley as not only the iTunes of Reference Management, but also the Napster.

A fiery response to the news: why I’m quitting Mendeley (and why my employer has nothing to do with it)

Elsevier’s practices make me deeply deeply angry. While academic publishing as a whole is pretty flawed, Elsevier takes the most insidious practices further at each and every turn, always at the expense of those of us who are trying to produce, publish, and distribute research. Their prices are astronomical, bankrupting libraries and siloing knowledge for private profit off of free labor.

Read the comments too!

On Twitter, some users signalled their intention to delete their Mendeley accounts using the hashtag #mendelete.  Jason B. Colditz sums it up nicely: Word of the day: “mendelete”.

All in all, yet another a tale about a small independent being bought out by a big multinational – but with so many additional layers about data, privacy, piracy, access, Open Access, subscriptions, licences… It will be interesting to see whether the initial angry response from many Mendeley fans has any effect in the longer term.

RAPTOR workshop – an adventure 65 million sessions in the making?*

21 Mar

*with apologies to Jurassic Park

What is RAPTOR?

RAPTORRAPTOR is a JISC-funded kit for looking at e-resources statistics.  RAPTOR stands for Retrieval, Analysis, and Presentation Toolkit for usage of Online Resources.  It was a JISC-funded project led by the University of Cardiff - read more about the project here.  This post summarises my notes from the RAPTOR workshop in Birmingham earlier this week, delivered by Dr Rhys Smith and Dr Phil Smart of the University of Cardiff.  The first version of RAPTOR was released in 2011. Institutions have multiple authentications systems (e.g. Shibboleth, IP), and each logs usage by username.  However, each of these logs are on different systems and in different formats, and some info is missing (e.g. usernames, departments).  Federation operators have a need for stats to demonstrate value for money to their funders.  RAPTOR is a piece of software which allows these usage logs to be collated.

RAPTOR’s goals

  • easy to install & configure
  • not intrusive
  • web front-end for non-tech users
  • scalable
  • standards-based where possible
  • free to use
  • open source
  • community-driven

RAPTOR components

Local deployment options diagram

Local deployment options – image credit RAPTOR wiki

Client (ICA – information collector agents) sends info to the server (MUA – multi-unit aggregator; web):

This picture isn’t as good but it captures Phil doing RAPTOR hands:

Components

RAPTOR is a set of Java programs.  Each competent runs on its own Jetty instance.  Public/private keys, SSL handshakes.  Working on exposing MUAs to SAML metadata instead of keys.

Supported authentication systems

  • Shibboleth IdP
  • EZproxy
  • freeRADIUS

And soon to include OA LA (OpenAthens), OA LA proxy, simpleSAMLphp, Radiator – plus anything you can manually configure.  You can configure RAPTOR to parse any log file you like, you just need to be brave.

If you love xml

Application of RAPTOR

More information about usage, enriched with identity info, gives more business intelligence.  RAPTOR can currently pull out department and affiliation from the IdP [identity provider].  This could be extended in future to include other attributes – let the RAPTOR team know your requirements.
Can use the data to show usage of e-resources by department, system use by affiliation (e.g. UG/PG/staff) e.g. PC cluster room usage.  Could map e-resources usage to attainment info – caveat of correlation not causation.  SWITCH is the Swiss version of JANET - SWITCH AMAAIS [Accounting and Monitoring of AAI Services] project is doing similar things to RAPTOR.

RAPTOR-JUse project

The RAPTOR-JUse project aims to integrate stats from people and platforms by combining data from RAPTOR about the activity of individuals (via the IdP) and data from JUSP [Journal Usage Statistics Portal] about journal usage stats from the SP [service provider] end.

RAPTOR and JUSP have different reporting periods – RAPTOR is per event; JUSP uses defined reporting periods.  This is just one example of the issues to be overcome in this project.

Demo of RAPTOR

The RAPTOR login page is comfortingly simple – though you can’t use federated login (for now).  The irony was acknowledged :)  After logging in,  you will see something like this:

Example stats

Can you spot the summer holidays trough on this graph?

Summer holiday trough
You can add postprocessors to sort rows, extract top 10 only etc.  It’s possible to format the entity IDs with SAML organisation name.  The team hope to develop a layer in RAPTOR to represent stats by affiliation as a proportion of the total users, not just raw number.

Can’t do Boolean ‘AND’s in the filter

RAPTOR data can be downloaded in .xlsx .csv and .pdf formats.  It’s not (yet?) possible to see total combined stats for different authentication mechanisms through the web interface – the problem is caused by the different host names being owned by different publishers.  If unique IDs are brought in for publishers in future, this would then be possible.  For any users who’ve dropped out of the directory, no values will be recorded.

Data will be a lot more correct from the moment you install it and run it correctly

Installation options

  • simple - good for test deployment but won’t scale well
  • normal (one ICA on each service to monitor, MUA & web on a Raptor-server server (sic)) – good for large deployment, production use
  • completely separate (ICA, MUA, web elements all on different servers) - probably overkill for most situations

RAPTOR local deployment options in diagrammatic form:

Components

For different components to talk to each other, they need to know each other’s host name, and have encryption keys to swap.  Could have Shib/IP info going to different MUAs.

What do you want from RAPTOR?

Ease of config, supported systems, look & feel, dashboard, reporting vs graphing…?  Let the team know what enhancements you would like to see!  Tell them via the RAPTOR wiki.

WUGEN and WAYFless URLs

To explain what a WAYFless URL is, it’s best to begin with explaining what a WAYF URL is.  WAYF stands for Where Are You From, and it’s a type of URL that allows you access to a service provider via single sign-on by including a step where you have to choose your institution/organisation – hence “where are you from?”.  Therefore, a WAYFless URL is one which does not ask you for your institutional affiliation, and bypassing this step makes it easier and quicker for your users to access platforms.

Setting up service providers to work with your identity provider often involves building WAYFless URLs that are specific to your organisation.  However, they can be brittle and prone to breaking if the target platform changes domain name structure.

A WAYFless URL is one that takes you to an error page

And that’s where WUGEN comes in.  WUGEN [WAYFless URL Generator] is a tool for building robust WAYFless URLs.  The site leads you through a few steps and builds the URL for you.

Click on “Explain my WAYFless URL” to see a rating of the URL on the reliability thermometer:

Explanation of your WAYFless URL

Thanks Rhys and Phil for an excellent workshop :)  Before I left, there was time for a final RAPTOR hands moment:

RAPTOR hands

If you enjoy other forms of raptor-related humour, see Philosoraptor

See more on Know Your Meme

Open Access – time for a review of the whole model of academic journal publishing?

1 Mar

So… hello again!  Sorry for being away from the blog for a while.  To celebrate the end of the first 4 months in my new post, I’d like to record some of my recent thoughts (personal, not those of my employer whether current or former) about open access (OA).

If you’re new to the discussion about OA, you might like to pause to read this summary about open access, including a description of gold and green routes.

Last week, the House of Lords Science and Technology Committee published their report on the implementation of open access.  In all the discussion about gold vs green routes to open access, including last year’s Finch Report; it seems to be taken for granted that the current business model for academic publishing will stay the same and it’s just the sources of funding that will change.  For example, universities may move funds away from library subscriptions for e-resources and use them instead to fund an author-pays model.

But…

Wouldn’t this be a good moment to step back and see how profoundly the internet has changed the way we do (and could) distribute information in a global scholarly community?  And how this fresh look could allow us to build a completely new model for publishing and distributing knowledge?

Almost all the characteristics of the print publishing paradigm are no longer applicable: consider how the overheads, timescales, ease and cost of making copies, distribution logistics – and more – are completely different for PDFs from what they were for print journals.

So why aren’t we challenging the underlying pricing model?

I’ve heard of some libraries spending 80% of their acquisitions budget on e-resources, and only 20% on print materials.  The costs to libraries is on an unprecedented scale, especially when you consider that these hefty invoices are annually recurrent, and that under some circumstances, cancelling the subscription can incur cancellation fees (which are also recurrent) and a loss of access to the content previously paid for (unlike print journals, were you could at least hang on to the hard copies).

In my new job, I am seeing this problem from a slightly different angle too.  Consider a university which is primarily teaching-focused, but is trying to increase its research profile.  This costs money: researchers need funds for attending conferences and other academic events; if they are spending less time teaching in order to carry out their research, then someone else must be employed to ensure that the timetable is covered; and of course, they need access to a greater range of information resources to support their studies.

This isn’t just about budgets and prestige, it’s also about social mobility.  Imagine the difference between this scenario, in which many junior lecturers are pursuing their PhDs alongside a full teaching job, with that at other institutions at which there is funding to support PhD/DPhil students to study full-time.

I know that the situation for publishers is sensitive and they are trying to defend their subscription charges, but I don’t think it makes sense to apply a print-world-based pricing model and licence type directly to e-journals in a new format and a different medium.

There is no single agreed protocol for the article publishing process: peer review, impact factor, half-lives of articles (in terms of access stats), and other factors vary between disciplines; and what is true of the life sciences is not necessarily transferable to the humanities.

Furthermore, journal articles (in any format) play a different role in different disciplines: for example, original research articles in peer-reviewed journals are primary sources in the life sciences, but are often secondary sources in the humanities.  This means that journal articles occupy a different niche in the research ecology of different disciplines, and may need to be funded or administered differently.  The current publishing model and associated norms are based on practise in the sciences.

This could be an excellent opportunity to stand back and consider a new model for making available the products of research, operating a system of peer review, charging (to whom?) a proportionate fee (how much?) for hosting the content, and building in the functionality and possibilities of new information technologies.

How do we begin this conversation?

Transforming the library at St Hugh’s College

26 Oct

Today is my last day in Oxford.  This weekend I’m moving home to the north-east to start a new job at the University of Sunderland on 1st November.  So, lest we forget how St Hugh’s College Library used to look, here are some ‘before’ and ‘after’ photos…

The library then (2010)

What the library looked like in 2010

L-R from top: View of the lobby; self-issue system (no security); accumulation of students’ belongings left unattended; returns system consisted of piling up books by the office door waiting for staff to check them back in; upstairs computer and printing facilities

The library now (2012)

What the library looks like now in 2012

L-R from top: Printed spine labels using DDC23 and author/title suffix; secure item return bin; informal seating area in the lobby with recreational reading, DVDs, Living Well and other new collections; students using the group study table; students using books and computers for learning; combined printer/copier/scanner and modern computers; RFID self-service kiosk

Thank you to everyone who has contributed to this work by being a member of my library team, Library Committee, or part of the wider college and library communities in Oxford.  I’m really pleased with what we’ve achieved :)

Follow

Get every new post delivered to your Inbox.

Join 45 other followers