RePEc and the democratization of research

December 9, 2008

In the last issue of the American Economic Review, the following article caught my eye: Restructuring Research: Communication Costs and the Democratization of University Innovation by Ajay Agrawal & Avi Goldfarb. In short, it documents who gained in electrical engineering faculties from the reduced cost of collaboration through the introduction of Bitnet, in the early Internet days. The basic result is that the middle-tier universities benefited the most. Indeed, the top ones were already well connected with each other, and the middle ones took advantage of collaborating with the top ones.

The main goal of RePEc is precisely the democratization of research. Given publication delays in Economics, if one wants to stay abreast of developments at the frontier of research, one needs to read working papers. Before the Internet, the only way to get hold of them was either if you were already at a top ranked Economics department, or if you were somehow within a club of well connected researchers. Just being aware of the most current research was a challenge for anybody outside these circles. This is what motivated Thomas Krichel, as a research assistant in 1991, to find ways to learn about new working papers, and share what he found. This initiative evolved into RePEc in 1997.

Are Elite Universities Losing Their Competitive Edge? by E. Han Kim, Adair Morse & Luigi Zingales documents that Economics faculty in elite universities where more productive at least in part due to their location in the 1970s, and that such a location effect has disappeared by the 1990s. While it is open whether RePEc has contributed to such democratization, we have always favored it: everybody should be able to learn about current research, and everybody should be able to contribute to it.


RePEc in June 2008

July 3, 2008

June was a surpisingly busy month, especially in terms of content expansion. We have now reached 600,000 works listed on RePEc, and it took only 10 months to add the last 100,000. Traffic was also heavy for the season, reaching 584,843 downloads and 2,803,705 abstract views.

The following institutions joined RePEc with an archive: World Scientific Publishing, Queens College (CUNY), GEFRA, Kobe University, Institut für Angewandte Wirtschaftsforschung (IAW), Université d’Auvergne, Universtät Freiburg, Società Italiana degli Economisti. Finally, here are the thresholds we reached this month:

140,000,000 cumulative abstract views
100,000,000 cumulative abstract views on IDEAS
45,000,000 cumulative abstract views for articles
600,000 listed works
350,000 articles listed
300,000 online articles listed
240,000 working papers listed
180,000 working paper abstracts
150,000 items with references
120,000 article abstracts
20,000 NEP reports


Where are the women?

June 19, 2008

Women have always been underrepresented in Economics. For example, regarding US faculty, the Committee on the Status of Women in the Economics Profession (CSWEP), a subcommittee of the American Economic Association, determines in its latest annual report that women represent 28% of assistant professors, 21% of associate professors and 8% of full professors in PhD granting Economics department. As a whole, they represent 19% of all Economics faculty.

The point of this post is not to complain about the low proportion of women in the profession, or about their dwindling share up the ladder, but about the lack of involvement on women in RePEc. Currently, their share is at 14.5%. It is clearly below the 19% mentioned, although it is slowly increasing (it was 13.6% a year ago and 12.7% two years ago). Why this underrepresentation?

It is of course possible that their a bias in those numbers, because the CSWEP numbers pertain only to the United States and the RePEc Author Service covers the whole world. So, let us analyze the top 1000 economists from Tom Coupé’s list. Of the men, only 22.9% are not registered taking the ranking by publications, and 32.4% with the ranking by citations (which includes quite a few non-economists). For women, the numbers are 37.2% and 44.4%. We see that top female economists are less likely to be signed up with RePEc.

Therefore, encourage women to register at the RePEc Author Service!

PS: You may wonder how these numbers are determined, as gender is not indicated when registering with the RePEc Author Service. It is inferred from first names, using a database of gender likelihood by name. For the more uncertain cases, an exception table was created using additional information, in particular from pictures on personal web pages.


Citation Accuracy

December 19, 2007

Open Access News pointed out a very interesting article in the Journal of Cell Biology, Show Me the Data. Written by that journal’s executive editor, the executive editor of Journal of Experimental Medicine, and the Executive Director of The Rockefeller University Press, it first reiterates many quality issues with journal impact factors that seem to be well-known among biologists, but I suspect that they are news to many economists. Many of these issues also hold for citation rankings for individuals. Beyond that, there are other issues that make citation data suspect. Fortunately, there are potential solutions to many of these problems.

First, it helps to describe impact factors as they are calculated by Thomson Scientific (previously the Institute of Scientific Information, or ISI). An impact factor in year t is the mean number of cites to all articles in that journal in years t-1 and t-2 divided by the number of number of research or review articles. Criticisms include

  • the data in the denominator and numerator are not consistent
  • Thomson is unclear on what exactly defines a research or review article
  • some journals have negotiated with Thomson on exactly what defines the article type
  • retracted papers are not excluded
  • of course, the mean is inflated by a few star papers
  • editors can game the system; apparently some do and some don’t (I’ve even seen this in the Wall Street Journal)

The authors go on to say that they contacted Thomson and received some of their data. They found numerous errors in how article were categorized. Further, “The total number of citations for each journal was substantially fewer than the number published” as reported by Thomson. When they requested further data from Thomson, the data still didn’t add up. They conclude “It became clear that Thomson Scientific could not or (for some as yet unexplained reason) would not sell us the data used to calculate their published impact factor.”

Their bottom line is even more clear: “If an author is unable to produce original data to verify a figure in one of our papers, we revoke the acceptance of the paper. We hope this account will convince some scientists and funding organizations to revoke their acceptance of impact factors as an accurate representation of the quality—or impact—of a paper published in a given journal. Just as scientists would not accept the findings in a scientific paper without seeing the primary data, so should they not rely on Thomson Scientific’s impact factor, which is based on hidden data.”

Besides the points reiterated and brought up in the Journal of Cell Biology, there are further accuracy issues with Thomson data. For example, to identify authors, they only use initials for the their first and middle name. As they pool papers from all fields, this is a more severe error than one might first guess. Thomson reports that Kit Baum (known to Thomson as CF Baum) has publications in the Fordham Law Review (on nuclear waste) and the Sociology of Education (on group leadership).

A further issue is Thomson’s coverage; EconLit lists some 1,240 journals in our field while the last time I checked Thomson covered but a fraction of these. I don’t have recent data for their coverage, but in total Thomson covers 8,700 journals encompassing all academic fields, so it seems doubtful that Thomas has substantially changed its economics coverage.

A further problem plaguing all citation analysis is simply extracting citation data with software. After all, citations are written for people, not machines. I haven’t seen data for Thomson on this (one wonders if it is public), but I do know that CitEc has faced a very real challenge here.

There would seem to be several solutions to these problems. First, all of us should treat impact factors and citation data with considerable caution. Basing journal rankings, tenure, promotion, and raises on uncritical acceptance of this data is a poor idea. In the extreme, one could imagine legal action in a tenure case.

Second, as the authors of the Journal of Cell Biology argue, this data should be public, just as research findings should be. One initiative here is a Petition for OA [open access] to bibliographic data. My understanding is that through a “RePEc service” like EconPapers or IDEAS, raw CitEc data can be accessed by the public. Further, CitEc works with RePEc Author Services to correct citations. Here’s one more reason to join those 15,000 who have registered with it!

Third, we should investigate putting unique identifiers into each reference so that software can easily read it. That is, besides listing the journal, its volume, and so on, it would also include a unique identifier to the cited paper. DOIs are one possibility, but it is prohibitively expensive to get a license to dispense DOIs. However, “RePEc handles,” which identify papers in RePEc, are permanent and also cover working papers. Thus, we might start including them in each reference. This highlights a further issue: there is little incentive for authors to add this to their citations as it aids others. Perhaps one step in this direction would be for sites like IDEAS, which provide references for papers in different formats like BibTeX or EndNote, could include the RePEc handle along with the current author, title, journal, etc.


Further Thoughts on “New Peer Review Systems”

November 5, 2007

Several thoughts on various points raised in New Peer Review Systems and the comments that followed.

  • In a sense, the lag in the review process might be optimal. A publication of most any sort is valuable to the author and one in a leading journal of course has a very substantial return. Journals thus have a good reason to deter papers that aren’t at all appropriate; this was pointed out by Ofer Azar, “The Slowdown in First-Response Times of Economics Journals: Can It Be Beneficial?,” Economic Inquiry, 2007, 45 (1). The constraint here would seem to be editor’s and referees’ time. In economics, the most common cost that journals impose on authors is a lengthy review process. I’d hazard a guess that bepress gets around this by their ranking of papers into different tiers; they don’t have to deter less than stellar papers as they’ll likely get a home there. This is combined with their system where authors who submit there agree to review two papers quickly (a nice example of a virtuous cycle). Another way to speed up the referee process is a system where any reader can submit comments on a paper. But, as Christian points out, this doesn’t seem to attract many comments. It turns out I’ve looked a bit at this and found 5 journals that have tried a reader rating system and none have attracted a sufficient number of comments to make it fly. From here, one option is something that ranks papers after they’ve been out, such as citations. Paul Ginsparg has some thoughts on one approach, as does Hal Varian (now the chief economist at Google). But, these might take years to generate sufficient data to render a judgment on a paper. I think many of us want something that is quicker.

    Another possibility is something like Faculty of 1000 in biology and medicine where a level of review beyond journals takes place. I very much like the summaries in their sample web pages; you don’t see that in economics. One could imagine it working on top of our working paper culture. But, I wonder if some of their success comes from the grant culture in these fields as this is a fee-base service that seems to pay the reviewers? How might one set up something similar in economics with our working paper culture? Journals would likely see it as preempting their role.

  • I agree with Preston that perhaps the most interesting part of Economics E-Journal is the open review system (all can read reviews) and the feature that allows authors to publicly respond to referee reports. Both would seem to give referees correct incentives. I would think that journals could implement this quickly and easily at low cost. Also, while deep thinking and working through a paper you are writing is extremely valuable, so is getting feedback and discussing a paper and the ideas in it. I have a first draft of a paper on these topics (see below) but in writing this blog entry I have developed some new insights. I wouldd add that prompt discussion is something that the Internet can aid for those of us without local colleagues in our fields.A very minor point on his post: if you count economics journals by the number in EconLit, there are about 1,240. In short, most any paper should certainly be published!
  • I certainly second Christian’s point about blogs and research. All the economics blogs I know of do not discuss serious research that much if at all. Much more common are discussions of current economic events and policy that members of the public find interesting. I don’t know of a one where someone might say, “Say, that paper by Sam Jones in Computational Economics” is interesting because the algorithm he used to calculate…” Also, debates by papers take years given the time to write one and respond. This seems rather silly given today’s technology; after all, journals in the their current form came about when information traveled at the speed of a horse or ship. Perhaps a blog is too quick for complete works, but I understand in law their use is leading to a relative decline in the importance of law journals. One example (first paper found in a Google search) is Guest Blogger: The Start of the Supreme Court’s 2007-08 Employment Discrimination Docket: Federal Express Corporation v. Holowecki. Yes, it discusses current events, but the topic is one that only specialists would seem to care about. You do not seem to see such in economics blogs that I am aware of.

Where does this leave RePEc? Well, I’m not really sure. It is hard to change norms in a field and I am not sure that RePEc could swing it. But, I do not think that a comment system on individual papers would get that much traction. Somehow you want to get the judgment of peers (for promotion, tenure, and annual raises) in a speedy manner. Those two criteria seem to be at odds with each other.
It turns out I have done some thinking on these issues; at the risk of self-promotion they can be found in Next Steps in the Information Infrastructure in Economics. Note that this draft was written for a conference of non-economists, so some parts will strike a very obvious cord to an economist’s ear. I have also have yet to incorporate some very useful comments that Christian kindly wrote. In a nice mix of blog and papers, further comments on the paper are greatly appreciated.


New Peer Review Systems

October 29, 2007

The traditional peer review system is to submit an article to a journal, wait for the editor to get anonymous referee reports and then deliver a decision. We all have horror stories on how inefficient this system is, both in terms of time lost (and Economics seems particularly bad here) and in terms of the arbitrariness of the process. Yet despite all the complaints and the many announcements of its imminent demise in the age of the Internet, this peer review system is still going strong.

What has been done to reform it? Editors have worked hard to reduce decision times, sometimes with success, but there seems to be a lot of habit in the slow response of referees (so editors claim). New models are tried, such as the bepress journals that provide quality ratings and thus avoid authors to submit repeatedly down the journal ladder, the new American Economic Journals that can “automatically” feed on the rejects from the AER, or Economic Inquiry that now asks referees to only provide an up or down vote, thus bypassing the revisions. While these are all important initiatives, they are after all only a variation of the original system.

We can think completely differently. Think of this blog. I rant on a topic, and then others can comment on it and openly declare whether this rant was valuable or not. Why not do this with academic work? An early attempt was done with WoPEc. This was the first RePEc service, similar to IDEAS and EconPapers today, which offered for some time on each paper’s abstract page a discussion section. Participation was minimal and there was very little value added (see an example, I could not find one that actually had comments). This aspect of WoPEc was finally abandoned. A second attempt was organized by SOLE (Society of Labor Economists), that would post every two weeks a new paper to discuss. Again, participation was small, and the project was finally abandoned.

The latest attempt is the Economics E-Journal, which allows registered users to rate and comment on discussion papers. Once the editors find that a paper has generated sufficient interest, it is promoted to the journal, where it can still be discussed. This initiative started this year, so the jury is still out whether it will be successful in the end. So far, it looks very promising.

From time to time, members of the RePEc team are approached and asked whether a discussion section could be added to our services. Given the past experience with WoPEc and the large monitoring costs involved, we are not enthusiastic. Of course if other volunteers are interested in working on this, we may think about it. But first we need to understand whether there is really a demand for this. Maybe RePEc is now too large for this and such initiative should be left to field specific initiatives (SOLE again?).