Do Scopus and WoS correct “old” omitted citations? / Franceschini, Fiorenzo; Maisano, Domenico Augusto Francesco; Mastrogiacomo, Luca. - In: Scientometrics. - ISSN 0138-9130. - Print. - 107:2(2016), pp. 321-335. [10.1007/s11192-016-1867-8]

Do Scopus and WoS correct “old” omitted citations?

Franceschini, Fiorenzo; Maisano, Domenico Augusto Francesco; Mastrogiacomo, Luca
2016

Abstract

Omitted citations, i.e., missing links between a cited paper and the corresponding citing papers, are a consequence of several bibliometric-database errors. To reduce these errors, databases may take two actions: (1) improving the control of the new papers to be indexed, i.e., limiting the introduction of “new” dirty data, and (2) detecting and correcting errors in the papers already indexed by the database, i.e., cleaning “old” dirty data. The latter action is probably more complicated, as it requires applying suitable error-detection procedures to a huge amount of data. Based on an extensive sample of scientific papers in the Engineering-Manufacturing field, this study focuses on old dirty data in the Scopus and WoS databases. For this purpose, a recent automated algorithm for estimating the omitted-citation rate of databases is applied to the same sample of papers in three sessions held at different times. A database's ability to clean old dirty data is evaluated by considering the variations in the omitted-citation rate from session to session. The major outcomes of this study are that: (1) both databases slowly correct old omitted citations, and (2) a small portion of initially corrected citations can, surprisingly, disappear from the databases over time.
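
The session-to-session comparison described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the formula or the algorithm, so everything here is an assumption made for illustration: the rate is taken to be the share of expected citation links that a database fails to record, and the names `omitted_rate` and `compare_sessions`, the paper identifiers, and the data are all hypothetical and are not taken from the study.

```python
from typing import Dict, FrozenSet, Tuple

# A citation link as (citing paper id, cited paper id); ids are hypothetical.
Citation = Tuple[str, str]


def omitted_rate(expected: FrozenSet[Citation], indexed: FrozenSet[Citation]) -> float:
    """Assumed definition: fraction of expected links missing from the database."""
    if not expected:
        return 0.0
    return len(expected - indexed) / len(expected)


def compare_sessions(
    expected: FrozenSet[Citation],
    earlier: FrozenSet[Citation],
    later: FrozenSet[Citation],
) -> Dict[str, FrozenSet[Citation]]:
    """Classify how omitted links changed between two query sessions."""
    omitted_before = expected - earlier
    omitted_after = expected - later
    return {
        "corrected": omitted_before - omitted_after,  # missing earlier, present later
        "lost": omitted_after - omitted_before,       # present earlier, missing later
    }


# Invented data: five links that should exist, queried in three sessions.
expected = frozenset({("A", "X"), ("B", "X"), ("C", "X"), ("D", "X"), ("E", "X")})
sessions = [
    frozenset({("A", "X"), ("B", "X"), ("C", "X")}),              # session 1
    frozenset({("A", "X"), ("B", "X"), ("C", "X"), ("D", "X")}),  # session 2
    frozenset({("A", "X"), ("B", "X"), ("C", "X"), ("E", "X")}),  # session 3
]

rates = [omitted_rate(expected, s) for s in sessions]
changes = [
    compare_sessions(expected, sessions[i], sessions[i + 1])
    for i in range(len(sessions) - 1)
]

for number, rate in enumerate(rates, start=1):
    print(f"session {number}: omitted-citation rate = {rate:.2f}")
for number, change in enumerate(changes, start=1):
    print(f"session {number} -> {number + 1}: {change}")
```

In this invented data the link `("D", "X")` is corrected between the first two sessions and missing again in the third, which is the kind of reversal the second outcome refers to. Tracking individual links rather than only the aggregate rate is what makes such reversals visible.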

Files in this item:

File: SCIENTOMETRICS Revised_SCIM-D-15-00388R1 (Accepted version DM) no yellow.doc
Open Access from 02/02/2017
Description: SCIENTOMETRICS Revised_SCIM-D-15-00388R1 (Accepted version DM)
Type: 2. Post-print / Author's Accepted Manuscript
License: Public - All rights reserved
Size: 4.56 MB
Format: Microsoft Word
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2640161