RePEc in July 2012

August 3, 2012

One project that makes particular strides lately is CitEc, our citation analysis initiative. Thanks to the collaboration of several publishers, references from articles are now being added at a substantial pace, and the citation coverage is now becoming more respectable. There is a lot of material in the pipeline, and we hope that other commercial publishers will join in this effort. Also, NEP is moving to a new location and will be off-line a few days. This blog will report on both over the coming weeks.

We counted 504,492 file downloads and 1,823,518 abstract views in July 2012, as always contingent to the pruning performed by the LogEc project. Our new participating archives for the month of July: Ottawa United Learning Academy, Nottingham Trent University (II), University of Central Florida, Europa Grande, Universidade de Santiago de Compostela (II), Vistula University, Universidad de Montevideo, University of Antwerp (II), Mathematica Policy Research, Singapore Management University, Lahore School of Economics, Universidad de Oviedo.

And finally a few thresholds we recently passed:
100000000 article abstract views
12000000 cumulative downloads through EconPapers
1100000 items available online
800000 items with abstracts
750000 links to research in registered author profiles
400000 articles with abstracts
333333 JEL coded items


When an email address goes bad

July 25, 2012

Authors register with the RePEc Author Service using their email address, which serves as a user name. That email address is also used by the service to alert them about new potential works to add to their profile and to send them monthly statistics. Occasionally, some of these emails bounce back to the service: the mailbox is full, or the address has become obsolete. What happens now?

First, EconPapers and IDEAS add a mention to the author profile page encouraging users to supply a new email address to the RePEc Author Service. IDEAS also publishes the complete list of lost authors.

Second, lost authors are considered without affiliation for ranking purposes. Indeed, a major reason why an email address goes bad is that the author has moved, in which case the affiliation is likely obsolete. Another reason may be that the author has died, in which case having an affiliation make no sense.

Third, once the email address has remained bad for a second month, the administrator of the RePEc Author Service goes fishing for a better one. This involves Internet searches, asking co-authors and former colleagues. This is repeated later if necessary.

These efforts have allowed to keep the number of lost authors remarkably low: currently 273, or 0.8% of all registered authors. To this, add 152 authors who are known to be deceased. The records of the latter continue to be maintained by volunteers, as research may still be added posthumously, or new archives may still join with works written by these scholars.

We appreciate any help in tracing lost authors, notifications about deceased authors and support in maintaining their profiles. Simply email the administrator of the RePEc Author Service.


Rules for email addresses in RePEc

July 12, 2012

Many people trust RePEc with their email address. RePEc earned this trust, we believe, by not abusing the use of these email addresses. This has been so far an implicit commitment, as no policy was established. This post now puts in words the practice since the inception of RePEc in 1997, and establishes a few additional rules.

Covered email addresses

These rules cover all email addresses that are collected and used by RePEc and its services. These addresses include those contained in metadata from RePEc archives, from author profiles in the RePEc Author Service, and subscriptions to the various NEP mailing lists.

Display of email addresses

RePEc services, if they choose to display email addresses, commit to always encrypt any public email address to prevent harvesting by robots.

Authors registered with the RePEc Author Service have the option to have their email address not displayed. The option is available as a checkbox on the “contact information” page at the RePEc Author Service. In such a case, the RePEc Author Service does not include the address in the metadata disseminated to other RePEc services.

NEP does not display any email address. Only the list maintainer (the NEP editor) has access to subscription details.

Use of email addresses

RePEc archive and series maintainers receive one monthly email from RePEc with statistics, reminders and links pertaining to their material. They may receive additional messages if a problem arises with their archive or metadata.

RePEc authors also receive a monthly email with statistics, latest citations, and news. It is possible to opt out of the monthly messages by replying to the sender. The RePEc Author Service may also send messages if it suspects an author may have some new works waiting to be claimed.

Email addresses are used as user names in the RePEc Author Service. If a RePEc service requires authentication through the RePEc Author Service, it cannot store this email address unless it is explicitly stated. The authentication form must have a link to a list of authorized services. This list is on the RePEc Author Service site.

NEP subscribers are to receive only messages pertaining to their NEP report, plus rare housekeeping messages. This policy may be amended to also include professional messages, like calls for papers, if relevant to the specific field.

None of the gathered email addresses is to be given, for a fee or not, to any third party. A yearly survey, though, may be conducted on questions relevant to the profession and/or RePEc (one call plus one reminder).

Accuracy of email contacts

It is the responsibility of archive and series maintainers to keep contact information current. This is done by maintaining appropriate coordinates in the archive and series templates of their RePEc archive (files ___arch.rdf and ___seri.rdf). Email addresses are required.

Authors and NEP subscribers are asked to maintain current coordinates so as to reduce the workload of RePEc volunteers. The latter may change an person’s email address in the RePEc Author Service or a NEP mailing lists if it appears to be obsolete.


RePEc in June 2012

July 3, 2012

First off: RePEc is now 15 years old. It was launched at a meeting in Guildford (UK) where Thomas Krichel exposed his idea to a group of people who were already doing some indexing of working papers. Soon thereafter, they started providing the relevant bibliographic metadata in the relevant format and RePEc services could use them. Thomas Krichel had previously been running various initiatives to improve the dissemination of working papers, all the way back to 1991.

Speaking of volunteer personnel, Kyle Fluegge stepped down as NEP editor. His successor is Laura Stefanescu, who is now in charge of creating the weekly list of new working papers submitted to NEP editors, who can then select those relevant to their field.

During June 2012, we counted 520,118 file downloads and 2,010,606 abstract views. We also welcomed the following new participating archives: : HERMIN, Società editrice il Mulino, Catalactica NGO, Bangor University, Missouri Valley Economic Association, Université Bordeaux 4, University of New South Wales (III), Japan Ministry of Finance, Rockwool Foundation, University of South Bohemia, Babes-Bolyai University.

Finally, the following thresholds were reached during the past month:
400000 working papers available online
100000 articles with references
15000 book chapters available online
1500 indexed journals
500 blogs linked on EconAcademics.org with discussions about RePEc material


Linking with social media and page translation

June 26, 2012

We have noticed that material on RePEc is used quite heavily on social media, in particular Facebook, Twitter and Delicious. To make it easier, IDEAS now features on every page buttons that allow a user to quickly link with some services. Currently, the following options are available: Facebook, Twitter, Delicious, StumbleUpon, Google+ and Reddit. In addition, there is a button to print and one to email the link.

The code for all this was custom written to avoid using tracking cookies that are common with similar referral services. One consequence is that the email button will only work if an email client is installed on the computer. It will not work with Gmail, Hotmail, and other web-based email services.

In addition, Google Translate has been implemented as well, which should be able to translate any page you your language of choice, even when several languages are present. Of course, only the original version is guaranteed to be accurate. Note that this service uses a tracking cookie, but virtually everyone already has a Google cookie anyway.

EconPapers will soon implement the same features.


RePEc in May 2012

June 4, 2012

We counted in the past month 620,959 file downloads and 2,338,668 abstract views. RePEc has also grown to include now over 1.2 million items. This growth comes in part from the following newly participating archives: Lucius & Lucius Verlag, BI Norwegian Business School, Superintendencia de Pensiones de Chile, Rivista Bancaria, Harvard University (II), University of Tokyo (II), Universidade Técnica de Lisboa, Université de Liège, Flinders University, Economics for Energy, Central European University, Hellenic Association of Regional Scientists, Southern Regional Science Association.

And last month, we passed the following thresholds:
200000000 cumulative abstract views on IDEAS
8000000 references extracted
1200000 research items listed on RePEc
400000 items with citations
100000 articles with JEL codes
5000 identified links from blog posts to research items on RePEc
2000 links from Wikipedia to research items on RePEc


My citation count just went down!

May 26, 2012

We sometimes get angry emails from authors pointing out that their citation count decreased. And this is quite understandable, as no one likes to see decreasing citation numbers. But it can happen, and most of the time it is for very legitimate reasons, not some conspiracy. Here is a rundown.


  1. A citing or cited item has been removed from RePEc. This is rare but does happen on occasion. We recommend against removing item in general, as discussed in a previous blog post. Hint: the fact that a working paper got published in a journal is very rarely a reason to remove its link to full text, let alone its entire entry.
  2. A RePEc archive renumbers its handles. A handle is the identifier of every item in RePEc. It is supposed to be unique and permanent, yet every month some archive has the bright idea to renumber its material. The consequence: the citation analysis needs to be completely redone, and all the other links as well. This leads to a temporary decrease in citation numbers.
  3. When different versions of some work are linked, citations are consolidated. Thus what counted as separate citations now only counts as one. And this version consolidation can happen for the citing or the cited work.
  4. The author removed some work from the author profile, for example when a paper got published. Citations to the working paper version are then temporarily lost until the article is indexed in the citation service.
  5. An error with RePEc. This can happen despite our best efforts, and it always has been temporary.

And lastly, the citation count in the statistics sent to authors is different from what you see on the web at IDEAS or the RePEc Author Services: this is because multiple versions of the same work are aggregated, and because self-citations do not count.


On RePEc traffic numbers

May 19, 2012

RePEc provides traffic statistics to authors, editors, and archive and series maintainers. LogEc displays them to the public. These downloads and abstract views are obtained from some, but not all, RePEc services which compile their server log files: Economists Online, EconPapers, IDEAS, NEP and Socionet. The raw logs are purged from anything that does not look like human traffic (robots, spiders), repeat traffic as well as anything that does not seem licit. This typically divides traffic numbers by four or five. More details are available at LogEc.

Several people have noticed that RePEc traffic numbers have exhibited a downward trend over the last couple of years. We do not have a definite answer, but we have a few possible explanations for this trend.

Tightening of criteria

The criteria for what is considered licit traffic have been tightened in July 2010, retroactively to January 2008 (see blog post). This can explain a one-time decrease in reported traffic, but not the trend.

Server caches

We noticed that several institutions run server caches of IDEAS. This means that any request to an IDEAS page comes from this server, and thus requests from separate people count as one. Worse, if there are many such requests, the server is considered a robot and none count. While this has an impact on traffic numbers, it is not believed to be a major impact.

Proxy servers

These are servers through which all web traffic of an institution is routed. The main goal is security: all computers behind the proxy server appear to have the same IP address, and a potential attacker cannot find the individual IP addresses. The impact is the same as with the above: all traffic from an institution (for example the hundreds of economists from the US Federal Reserve System) are considered to be from a single person, and possibly a illicit robot. This can explain stagnating traffic and, if the adoption of proxy servers is increasing, even decreasing traffic.

Non-reporting RePEc services

Several RePEc services are not reporting traffic logs to LogEc. If they are diverting traffic away from reporting services, the visible statistics suffer. As RePEc is getting used more and more by bibliographic services, the declining recorded traffic gives then a false image of the evolution.

Google Scholar priorities

Substantial traffic is coming from Google Scholar, which used to rely substantially on RePEc for its initial launch, at least for economics material. As Google established partnerships with commercial publishers, commercial material is now privileged in search results and RePEc is often confined to the much less prominent “other versions.”

Better usage

It could also be that users are getting more efficient at finding what they are looking for, either because they are getting better at it, or because the RePEc services have improved their websites. If one needs to read fewer abstracts until one finds the right works, this will lower traffic and increase user satisfaction.

Reduced popularity

Finally, it could be also that RePEc services are becoming less popular, given the existence of several good alternatives (which are almost all using RePEc data). The fact that raw traffic, which includes all that LogEc eliminates, is still going up would invalidate this argument, unless this increase is entirely due to an increase in robot traffic.

All in all, we are not sure why we have the decrease in the “visible” traffic numbers. Maybe our loyal readers have further suggestions.


RePEc in April 2012

May 3, 2012

The innovation of the month is the complete overhaul of EconAcademics.org, the blog aggregator for economics research. It now monitors a much longer list of blogs and selects the blog posts that discuss research. Also, IDEAS now links back from the abstract and author pages to the posts.

We counted 634,507 file downloads and 2,319,912 abstract views during last month and welcomed the following new participating RePEc archives: Kasetsart University, Review of Agrarian Studies, Brigham Young University, Karl-Franzens-Universität Graz, Bahcesehir University, JICA Research Institute, La Trobe University (II), Western Risk and Insurance Association, IGI Global, Barcelona Graduate School of Economics, Universitatea Andrei Saguna, Scientific Society of Management from Romania, Duke University (II), and Università di Trieste.

Finally, we passed the following thresholds:
3000000 matched citations
2500000 cumulative downloads through NEP
2000000 cumulative abstract views for book chapters
700000 listed articles


How does RePEc get its data?

April 24, 2012

The RePEc team regularly gets requests to from authors to add this or that item to the database, or enquiries from editors why RePEc is discriminating against their journal by not listing it. It is therefore useful to discuss again how RePEc gathers all its bibliographic data, and thus what various users can do to enhance the listings.

RePEc does not have any data entry staff, one because RePEc has a budget of zero, two because the data entry is done by the respective publishers. The same rules apply to all, whether it is a large commercial publisher with many journals or a small research center with a working paper series: they have to open a local metadata archive with bibliographic information formatted in a way that RePEc services can automatically gather and analyze on a regular basis (usually every night). So far, over 1400 archives have followed the detailed instructions necessary for participation. Authors with institutions that fail to participate in RePEc can still get their work listed, by uploading it with MPRA. They need copyright clearance for this, which is granted by most publishers, according to the list compiled by SHERPA/RoMEO.

Author profiles are also maintained by the authors themselves, by registering at the RePEc Author Service. The citation analysis (CitEc project) also depends on the collaboration of publishers, either by allowing the free download of the full texts or by providing the metadata about references separately.

The extremely decentralized nature of RePEc is what allows to reduce central costs to almost nothing and thus keep RePEc free for all: publishers, authors, and readers. The collected data can then be offered by the various RePEc services, and those bear the (small) cost of massaging the RePEc data to make it useful for everyone.


Follow

Get every new post delivered to your Inbox.

Join 48 other followers