The Matthew effect in Science – citing the most cited
The introduction to most scientific papers will probably contain something along the lines of “It is widely accepted that….”, followed by the citation of a few more or less recent reviews of the topic. Last week’s blog noted the frequency of mis-citation, and this leads, surprisingly naturally, into asking the question ‘which reviews or papers might one then cite to bolster a view of present-day knowledge on a subject, and on what basis are these chosen?’ A partial linkage between these two issues (mis-citation and choice of material to cite) comes via what Merton (1968) (with a follow-up in 1988) called the Matthew Effect, on the basis of the lines in Matthew’s Gospel (25:29) that read “For unto every one that hath shall be given, and he shall have abundance: but from him that hath not shall be taken away even that which he hath”. As phrased by Goldstone, one variant is that “In scientific journals, and at scientific conferences, new articles and papers by already-prestigious scientists usually receive far more attention than articles by scientists still on the way up, regardless of the intrinsic merit of such contributions”. Strevens suggests a mechanistic explanation of why. (Note that while greater longevity may increase one’s fame, it may even work both ways – winning a Nobel Prize can apparently extend one’s longevity (statistically) by 1-2 years!)
The Matthew effect applies to journals and papers too – a highly cited journal or paper is likely to attract more citations (and mis-citations), probably for the simple psychological reasoning that ‘if so many people cite it, it must be a reasonable paper to cite’ (and such a paper is, by definition, more likely to appear in the reference list of another paper). Clearly that reasoning can be applied whether the paper has been read or otherwise. Simkin and Roychowdhury (2005 and 2007) note that a clear pointer to the citation of a paper one has not read is if it copies a mis-citation, and an analysis of the frequency of such serial mis-citations allows one to estimate, statistically, what fraction of cited articles have actually been read – at least at or near the time of writing a paper – by the citing author. Their analyses show (at least for certain physics papers) that “about 70-90% of scientific citations are copied from the lists of references used in other papers”, and that a typical device is to start with a few recent ones plus their citations. Some aspects of this tendency in bibliometrics, especially with highly cited papers, can be detected from the power law form of the distribution of citation numbers, as in the Laws of Bradford and Lotka that I discussed before. Of course the mindless propagation of errors without checking sources properly is hardly confined to Science – a famous recent example with spoof data showed how some journalists simply copied Obituary material from Wikipedia!
Modern Web-based data mining tools and databases allow one to find the duplication of scientific (textual) content in papers fairly easily. Scientists citing papers they have not read might do well to remember this. Out of cite, out of mined, one might say <ahem>.
- Errami, M., Wren, J. D., Hicks, J. M. & Garner, H. R. (2007). eTBLAST: a web server to identify expert reviewers, appropriate journals and similar publications. Nucleic Acids Res 35, W12-5. http://invention.swmed.edu/etblast/index.shtml
- Errami, M., Hicks, J. M., Fisher, W., Trusty, D., Wren, J. D., Long, T. C. & Garner, H. R. (2008). Déjà vu: a study of duplicate citations in Medline. Bioinformatics 24, 243-9
- Errami, M., Sun, Z., Long, T. C., George, A. C. & Garner, H. R. (2009). Déjà vu: a database of highly similar citations in the scientific literature. Nucleic Acids Res 37, D921-4. http://spore.swmed.edu/dejavu/
- Goldstone, J. A. (1979). Deductive explanation of the Matthew Effect in science. Social Studies of Science 9, 385-391
- Havemann, F., Heinz, M. & Wagner-Dobler, R. (2005). Firm-like behavior of journals? Scaling properties of their output and impact growth dynamics. J Amer Soc Inf Sci Technol 56, 3-12
- Merton, R. K. (1968). The Matthew Effect in Science: The reward and communication systems of science are considered. Science 159, 56-63
- Merton, R. K. (1988). The Matthew Effect in Science II: Cumulative advantage and the symbolism of intellectual property. Isis 79, 606-623
- Rablen, M. D. & Oswald, A. J. (2008). Mortality and immortality: the Nobel Prize as an experiment into the effect of status upon longevity. J Health Econ 27, 1462-71
- Simkin, M. V. & Roychowdhury, V. P. (2005). Stochastic modeling of citation slips. Scientometrics 62, 367-384
- Simkin, M. V. & Roychowdhury, V. P. (2007). A mathematical theory of citing. J Am Soc Inf Sci Technol 58, 1661-1673
- Strevens, M. (2006). The role of the Matthew effect in science. Stud Hist Philos Sci 37, 159-170. Manuscript
Related posts (based on tags and chronology):

Scientific data visualisation and #SciFoo09
13 July 2009

What’s in a name? A tag cloud of recent blogs
08 June 2009

What’s in a name? Guest, ghost and indeed quite imaginary authorships
23 February 2009

Evaluations, Gatsby and partnerships
07 May 2013

Energy, the Russell group, Research Advisory Panel and the Research Environment
29 October 2012
You can follow any responses to this entry through the comments RSS feed. You can leave a comment, or trackback from your own site.
3 comments to 'The Matthew effect in Science – citing the most cited'
Citation in Science: Don’t Quote Me on That!
For those interested in an ongoing discussion on citation behaviours, bibiliometrics and the effectiveness, or not, of citation as a measure of research excellence, you may wish to visit:
http://network.nature.com/groups/citation-science/forum/topics
How to Maximize Citations…
I have just found some great advice on how to boost your citation count, i.e., get more citations for your publications. An interesting discussion of the so-called Matthew effect and its influence on the citation patterns can be found here.
As for the …
[...] ‘knowledge’) is supposed to be based on objective and verifiable facts, but as I have blogged before there can occasionally be a tendency to groupthink. This is a phenomenon in which, simply by [...]
Leave a comment