A new beginning; here’s why.

Rich Apodaca asks whether the new NIH OA mandate marks a new beginning, or more of the same. His argument hinges on the (admittedly unfortunate) phrase “in a manner consistent with copyright law”, and he concludes that

Neither HR 2764 nor any form of government intervention will bring widespread Open Access into being.

Here’s why I think Rich is wrong.
Point the first: Rich claims that

Most of the journals in question will be hostile to the idea of having their copyrighted material deposited into PubMed Central and so understandably won’t allow it to be done by the authors of papers or anyone else.

The available data do not support this. Of the 355 publishers indexed by SHERPA/RoMEO, 66% formally allow self-archiving; more importantly, 56% formally allow archiving after refereeing. (There’s a big gap between “formally allow” and “formally forbid”, too.) The numbers are even more OA-positive at the journal level. Those publishers between them account for 10199 journals, of which 91% are at least “pale green” — that is, allow at least preprint archiving. Well over 6000 journals, 62% of the total, are “green” — that is, allow self-archiving of refereed postprints. You can use the web interface to find out whether your favorite journal or publisher will allow you to self-archive; here’s a quick look at the big names (> 50 journals) and a few usual suspects (sorry about the jpg, I can’t make html tables to save myself):


romeo.JPG


Point the second: Rich goes on to give the following hypothetical:
Professor Gross at California University gets his manuscript approved for publication in the Journal of Nanoscale Devices (JND). Professor Gross is fully aware both of HR 2764 and JND’s refusal to deposit manuscripts into PubMed Central – the reasons why Professor Gross would choose JND anyway are interesting, but not relevant here. Along with the acceptance letter, JND requests prompt return of a signed copyright transfer agreement. Professor Gross sends in the signed form and from that point on, all rights to his article belong to JND. As is their policy, JND refuses Professor Gross permission to deposit a copy of his paper into PubMed Central within 12 months after publication.
Unless I’m missing something, neither Professor Gross nor JND have violated any laws.

Does Professor Gross have to publish in JND? Pace Rich, the good Professor’s reasons are relevant. Let’s take a look at those publication-related sins through an OA lens:

  • Greed — the OA advantage should drive the greedy to reject journals like JND which deny them the opportunity fully to profit from their own work
  • Envy — if you want your publication record to be all it can be, publish OA (either by choosing OA journals, or by self archiving)
  • Pride — if you want your science to have maximal impact, ditto
  • Wrath — STM publishing is big business with big fat profit margins; as consumers and producers, let’s at least get value for money (i.e., OA) and put the hurt on greedy publishers who won’t at least allow us to make our own work OA
  • Gluttony, Lust — see Greed, Envy, Pride
  • Sloth — for just a few keystrokes, you can increase your research impact and professional standing; why would you not?

Given all that, will the good Professor continue to kowtow before the little godlings who publish JND? Or will he simply find himself a journal that will play ball?
Point the third: Rich continues:

The assumption made by proponents of the new law seems to be that to implement the new policy, the Director of NIH will forbid publication by grant recipients in journals that don’t allow deposition of articles into PubMed Central.
How many influential scientist do you know of who would tolerate the government telling them which journals they can and can’t publish in? The minute such a misguided policy is put in place, the national scientific outcry would more than overwhelm anything Open Access proponents could muster.

How many? All of them. When a funder says “jump”, even “influential” scientists say “Was that high enough? Shall I try again?”. (Besides which, this is not “the government telling them” anything, this is a funding body making a reasonable demand.) Where scientists do have some weight to throw around is with publishers: the NIH can always get another benchmonkey, but publishers need a steady supply of authors. So if I want to publish in the Journal of Dodgy Results, which won’t allow repository archiving, and the NIH says “not if you take our money — not until they comply with the mandate”, I can: look for other funding (believe me, there ain’t a lot); fight authority (see Mellencamp, J.C., 1983); or I can try to get the editors of JDR to let me put a copy in PubMed Central after 12 months. Identifying the path of least resistance is left as an exercise for the reader.
Here again, the data (though scanty) are on my side. A 2005 survey of nearly 1300 authors found 81% of respondants reporting that they would willingly comply with a green OA mandate; a further 13% replied that they would comply unwillingly,and 5% claimed they would not comply. Not only is 94% a great deal better than the roughly 4% compliance observed while the NIH policy was voluntary, but I’ve got five bucks right here that says those 5% are full of it. If push comes to shove, they won’t be handing back any grants or handing in any letters of resignation. Most of them, confronted with the evidence, will do what scientists are supposed to do in such cases: say “oh, I was wrong”, and change their views and behaviour. The few who don’t do that will still comply, they’ll just yell at a couple of editors to make themselves feel all tough again.
(Stevan Harnad and Alma Swan have both reported that Arthur Sale’s ongoing study of institutional repositories in Australia corroborates these figures, showing that authors comply in much the same way that they claimed they would in the survey. What I’ve seen of Sale’s data is certainly consistent with that notion… but more on that later perhaps.)
So, to recap:
1. The majority of journals for which information is readily available are already compliant with the new NIH mandate; I see no reason to assume that any significant proportion of the remainder will be hostile to the policy.
2. I disagree that the NIH will not be able to enforce the policy; faced with the evidence that OA is a good idea and the fait accompli of an NIH mandate, researchers will comply and journals will have to follow suit. To believe otherwise is, I think, to give the publishing industry too much credit for being able to cow their authors.
3. Voluntary reposit policies simply don’t work; we have evidence to suggest that mandates will, and already do. (An aside: the new NIH policy joins 20 funder mandates, 11 institutional mandates, 3 departmental mandates, 5 proposed funder mandates, 1 proposed institutional mandate and 2 proposed multi-institutional mandates. Most of those include growth data in their ROARMAP entries. Why don’t we have more data on the effects of mandates?)
Happily, I can finish up on a note of agreement with Rich, who says:

The only things that will change the status quo are: (1) the availability of tools for making it happen; and (2) the realization by individual investigators that continuing to give away their hard-earned copyright makes them far less competitive than their peers who don’t.
Open Access proponents should forget about getting the Federal Government to fix the mess that modern scientific publication has become. Instead, they should focus on making Open Access-like options more attractive to scientists.

I’ve outlined my disagreements above, now let me agree with the more important points here:
1. It is vitally important that tools for OA (and Open Science) be built — tools that researchers will want to use; to see a graphic illustration of this, listen to the forlorn cry of the repository-rat
2. OA provides a host of benefits, not least the boost to individual impact and standing; the clearer this becomes, the closer we get to 100% OA
3. Modern scientific publishing is a mess, and needs fixing. Making OA more attractive to the benchmonkeys is going to be an indispensible part of that fix (see also #1).

P.S. still on hiatus… sorta. Still haven’t put that ms together so posting will remain infrequent at best.
P.P.S. see also Peter Murray-Rust’s response to Rich’s entry.

Help a blogger out?

(I post more on hiatus than when I’m supposed to be blogging, no?)
Gary Farber is in all kinds of trouble. I’m going to do a little to help him out, and ask you, O my tens of readers, to consider doing likewise, because:
1. He asked. Ceteris paribus, what else does one need?
2. He doesn’t seem to have anyone else.
The thing here is, one of Gary’s problems is intensely personal to me: I, too, have major depressive disorder. There — in Gary’s shoes — but for the grace of a god I don’t believe in, go I. I was lucky: all along, I had family and friends, and now especially I have my wife. These people stood in for me and stood up for me and picked me up and pushed me along, and I’d be dead without their selfless assistance. Really truly not-pining-for-the-fjords dead, and it really is just sheer dumb luck that I had these people in my life; I could just as easily have wound up on my own, and I wouldn’t have made it — or if I did, out of sheer orneriness, it wouldn’t have been any kind of a life. So it’s very easy for me to imagine myself in Gary’s shoes.
I can never repay my debt to those who kept me out of the pit, but I can “pay it forward” — right now, that means sending Gary a little spare cash. And asking you to consider doing the same.
(Title shamelessly stolen from Cosma. You should steal it from me, if you have a blog.)

If it won’t sink in, maybe we can pound it in…

Another brief un-hiatus, this one sparked by a question asked by Dave Munger at BPR3:

If you know of a peer-reviewed, open-access journal that does not charge a publication fee, let us know about it in the comments.

Practically every time I talk about OA, online or in meatspace, I hear “I’d like to support OA but I can’t afford it, don’t all those journals charge, like, $2500 per article?”
No. They don’t.
Everyone seems to be thinking of PLoS, never mind that they waive their fees at the drop of a hat; the assumption that most OA journals charge (high) author-side fees is both widespread and completely wrong.
In fact, more than 2/3 of the journals listed in the Directory of Open Access Journals (DOAJ) and more than 80% of OA journals published by scholarly societies charge no author-side fees at all; in contrast, more than 75% of the 247 non-DOAJ journals in a 2005 survey do charge author-side fees (page charges, colour charges, reprint charges, etc) in addition to subscription charges.
Let’s unpack those numbers a little (especially since I generated the first one myself, and so you should take a look at how I did that).
In October 2005, the Kaufman-Wills group published a commissioned survey of journal publishing practices, The Facts about Open Access. The study was initially designed to include only full OA journals (listed in the DOAJ, OA immediately upon publication) and delayed-OA (“embargo”) journals from the HighWire Press stable, but was expanded to include the full range of financial models by inclusion of journals published by the Association of Learned Professional and Scholarly Publishers (ALPSP) and the Association of American Medical Colleges (AAMC). The final report included responses from 248 DOAJ, 85 HighWire, 34 AAMC and 128 ALPSP journals and showed that:

52.8% of DOAJ journals charge no author-side fees at all. The percentage for subscription journals was much lower: ALPSP journals overall (23.4), ALPSP for-profit journals (44.9), ALPSP non-profit journals (10.1), AAMC journals (14.7), Highwire subset (17.6)

These are the figures that Kaufman and Wills summarize as “…more than half of DOAJ journals did not charge author-side fees of any type, whereas more than 75% of ALPSP, AAMC, and HW subset journals did charge author-side fees.”
So — not only do the majority of OA journals charge nothing on the author side, an even larger majority of non-OA journals do charge author-side fees. If the sample is representative, you’re less likely to have to pay to publish if you choose an OA journal than if you don’t.
When I first heard these numbers I thought, as Peter Suber did, that they should “recast the debate” around OA. In January 2006 Peter’s regular yearly predictions included this forecast:

It will start to sink in that fewer than half of OA journals charge author-side fees and that many more subscription-based journals do so than OA journals…. People will stop talking about “the OA business model” for journals as if there were just one. People will talk less about how OA journals might exclude indigent authors and compromise on peer review and talk more about how toll-access journals do so. We’ll start to document the range of models actually in use for OA journals… We’ll get more creative in finding models that suit the range of niches…

He has since called this “the worst prediction I’ve ever made”. I confess myself at something of a loss as to why the Kaufman-Wills study has not come to dominate and reconfigure the OA debate; I can only guess that profit-hungry lowlifes have successfully sidestepped it. In this year’s predictions, Peter expects more of the same:

Because both Hindawi and Medknow have both been profitable for more than year, you’d think that the fact of their success would start to sink in, with corresponding effects on attitudes toward the sustainability of OA journals and interest in their business models. But well-documented truths about OA tend to sink in very, very slowly, because they have to compete with myths, misinformation, and misunderstanding. With regret, I predict more of the same.
In 2005 the Kaufman-Wills Group discovered that the majority of OA journals charged no publication fees at all. In 2006 I predicted that that fact would start to sink in. I was dead wrong. The fact still hasn’t sunk in, and I’ve learned my lesson.
Caroline Sutton and I discovered last month that the OA journals published by learned societies follow same pattern as OA journals overall: most of them charge no publication fees. But while 52.8% of OA journals overall use no-fee business models (from Kaufman-Wills, 2005), we found that 83% of society OA journals use no-fee business models, a significantly greater fraction. However, I’m not predicting that this fact will sink in any time soon. Likewise, we found 425 societies publishing 450 OA journals, a much larger number than the societies known to oppose OA policies. But neither am I predicting that this fact will sink in any time soon. We’ll continue to hear the unargued claim that society publishers are intrinsically vulnerable to OA and predominantly opposed to it.

The Kaufman-Wills study is not the only one of its kind, either. As discussed in the quote above, just last month Peter Suber and Caroline Sutton of Co-Action Publishing released preliminary findings from their ongoing study of OA journals published by scholarly societies. They identified 468 societies which publish, between them, 450 full OA journals and 73 hybrid (“pay-for-OA”) journals. Of the full OA journals, only 75 charge author-side fees — meaning that more than 80% of society journals do not charge such fees.
Finally, there’s me. All of the above got me to wondering what proportion of journals in the entire DOAJ database charge author-side fees (since Suber and Sutton showed that when the dataset was expanded, at least among society publishers, the no-fee percentage went up considerably).
Fortunately, the DOAJ now includes a metadata field indicating whether or not a particular journal charges author-side publication fees. Unfortunately, that field is not included in the downloadable comma-delimited metadata file they make available. Fortunately, it’s not a whole lot of work to make a replacement file by copy-and-pasting from the “browse by title” page. Unfortunately, you have to do this from the new “for authors” section, because the front-page browsing interface doesn’t include the “fee/no fee” field. What’s unfortunate about that, for my purposes (though it’s a wonderful thing overall), is that the “for authors” browse does include hybrid journals, in which regular articles are subscription-only but authors can pay extra to have their work made OA. (In fact, even the logo at the top is different; on the front page you are seeing the Directory of Open Access Journals, but in the “for authors” section it becomes the Directory of Open Access and Hybrid Journals.) The front page says 2971 journals are indexed, but if you browse by title from the “for authors” page, the totals add up to 4638, the database having apparently added 1667 hybrid journals.
There’s probably a smarter way to do this using the OAI-PMH, but that syntax is as impenetrable to me as Ancient High Martian, so I simply pasted the browse-by-title pages into a text document and imported that (colon-delimited) into Excel. Then I filtered on “publication fees”, sorted by yes/no/missing and read off the totals from the row numbers. Horrible hack, but it worked.
Including hybrid journals, we get:

charge publication fees: 2185 (47%)
don’t charge pub fees: 1998 (43%)
fee information missing: 455 (10%)
total no. of journals: 4638

Given the DOAJ definition of hybrid journal, those should obviously be excluded and the data reworked. This is where a smart person would have stopped and waited for the DOAJ to autogenerate the numbers, but I went ahead and deleted the hybrid entries by hand. (Shut up. I wanted to know, OK?) That yields:

charge publication fees: 534 (18%)
don’t charge pub fees: 1980 (67%)
fee information missing: 453 (15%)
total no. of journals: 2967

The second total should presumably be 2971 and it would make sense for the “missing” totals to be the same in both sets, so either there are some errors in the database or I made a couple myself. In either case the errors appear small and make no difference to the percentages, and anyway did I mention this kept me up to 4 am? Actually I suspect some inconsistencies in the database, because the front-page total does not update as quickly as the actual entries, and because there are in fact hybrid journals which don’t charge fees (e.g. Emerald Engineering’s model).
So now we have three studies (OK, two studies and one ungainly hack) showing that a (strong) majority of OA journals do not charge author-side fees, and one of those studies further showing that a strong majority of non-full-OA journals do in fact charge author-side in addition to subscription fees.
Now, can we please put to rest the myth/FUD/whatever that there is only one OA model, the author-side fees/PLoS model? While we’re at it, let’s have a few more closely related ideas go the way of the dodo: that OA journals discriminate against indigent authors (because they charge publication fees — except that most of them don’t); that OA journals will compromise on quality (in order to collect payment for manuscripts — except that most of them don’t); that if most journals went OA, universities would have to pay more in author-side fees (which, remember, most OA journals don’t, but most non-OA journals do, charge) than they do now in subscription fees.
I swiped that list of candidates for memetic extinction from Peter Suber, and you should go read his full discussion, which offers a lot more detail, especially on that last point. Me, I’m going to take a nap and go back to my blog hiatus. But the next time you hear someone talk about the “cost” of publishing in OA journals, please point ‘em here.