[Sigmetrics] Fractional Counting of Authors and Addresses

Loet Leydesdorff loet at leydesdorff.net
Thu Aug 27 03:08:37 EDT 2015

Fractional Counting of Authors and Addresses: Fraction.EXE


The website at http://www.leydesdorff.net/software/fraction provides a routine to
compute fractional values for the numbers of publications and citations in a
set of papers downloaded from the Web of Science (v5). First, this set can
be organized into a relational database using ISI.exe. As input, ISI.exe uses
the WoS download in tagged format, saved in the same folder under the name
"data.txt" (see http://www.leydesdorff.net/software/isi for further
explanation). The output is a set of database files (.dbf) which can be read
using Excel or SPSS. For example, authors are organized into au.dbf and
addresses into cs.dbf. (The various files are related through the field
"nr"; MS Access can be used for relational database management.)
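
As a minimal sketch of how the files relate through "nr", the following
Python snippet joins two small in-memory tables. Only the field name "nr"
comes from the description above; the other column names ("author",
"address"), the function name, and the sample rows are hypothetical. The real
.dbf files would first have to be loaded with a DBF reader.

```python
import sqlite3

def join_on_nr(authors, addresses):
    """Join author and address rows that share the same document number "nr"."""
    con = sqlite3.connect(":memory:")
    # Hypothetical two-column layouts; the real au.dbf and cs.dbf hold more fields.
    con.execute("CREATE TABLE au (nr INTEGER, author TEXT)")
    con.execute("CREATE TABLE cs (nr INTEGER, address TEXT)")
    con.executemany("INSERT INTO au VALUES (?, ?)", authors)
    con.executemany("INSERT INTO cs VALUES (?, ?)", addresses)
    rows = con.execute(
        "SELECT au.nr, au.author, cs.address "
        "FROM au JOIN cs ON au.nr = cs.nr "
        "ORDER BY au.nr, au.author, cs.address"
    ).fetchall()
    con.close()
    return rows

# Hypothetical sample data: document 1 has two authors and one address,
# document 2 has one author and two addresses.
authors = [(1, "Author A"), (1, "Author B"), (2, "Author C")]
addresses = [(1, "Univ X"), (2, "Univ Y"), (2, "Univ Z")]

for row in join_on_nr(authors, addresses):
    print(row)
```

The join yields one row per author-address pair within the same document,
which is the kind of relation a tool such as MS Access would manage.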


The resulting files can be used by fraction.exe as input. This program will
add au2.dbf and cs2.dbf to the dbf-files generated by isi.exe. 


1.        In addition to the fields in au.dbf, au2.dbf contains the fields
"tc" (times cited), "tc_au" (times cited/author), and "npub_au" (fractional
count of the publication). One can, for example, read au2.dbf into Excel and
use a pivot table to generate a fractional count of the authorships of the
authors in the set. 

2.        Cs2.dbf contains similar fields, but normalizes by the number of
unique addresses attributed to the publication in the C1 field of WoS. For
example, if a paper lists three addresses of which two are identical, each of
the two unique addresses obtains half a point (and not 1/3 and 2/3). The RP
field of WoS is not used, since it usually expands on one of the other
addresses. Records before 2008 are not used because the address fields before
this date are not consistently organized in WoS.

3.        Additionally, a scratch file core2.dbf is generated; it is identical
to core.dbf except that the fields listed under 1. and 2. are appended at
the end. This file adds no new information and can be disregarded. 
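
The counting rules under 1. and 2. can be sketched in Python as follows. This
is an illustration under stated assumptions, not the code of fraction.exe.
The function names and data layout are hypothetical. It is assumed that each
of n authors receives 1/n of the publication and tc/n of its citations, and
that addresses count as identical when their text matches exactly after
trimming whitespace; the program itself may compare addresses differently.

```python
from collections import defaultdict
from fractions import Fraction

def author_fractions(n_authors, tc):
    """Return (npub_au, tc_au) for one author of a paper with n_authors authors."""
    # Assumption: equal shares of the publication and of its citations.
    return Fraction(1, n_authors), Fraction(tc, n_authors)

def address_fractions(addresses):
    """Give each unique address an equal share of one publication."""
    unique = sorted({a.strip() for a in addresses})
    if not unique:
        return {}
    share = Fraction(1, len(unique))
    return {a: share for a in unique}

def fractional_totals(papers):
    """Sum the fractional publication counts per author, like a pivot table."""
    totals = defaultdict(Fraction)
    for authors, tc in papers:
        npub_au, _ = author_fractions(len(authors), tc)
        for name in authors:
            totals[name] += npub_au
    return dict(totals)

# Three addresses of which two are the same: each unique address gets 1/2.
print(address_fractions(["Univ X", "Univ Y", "Univ X"]))

# Hypothetical set of two papers given as (author list, times cited).
papers = [(["A", "B"], 10), (["A", "B", "C"], 6)]
print(fractional_totals(papers))
```

Exact fractions are used instead of floating-point numbers so that the shares
of a paper always sum to exactly one.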


Each subsequent run of the routine overwrites existing files with the same names.


August 26, 2015.  


Feedback is welcome!



Loet Leydesdorff 

Professor Emeritus, University of Amsterdam
Amsterdam School of Communication Research (ASCoR)

loet at leydesdorff.net ; http://www.leydesdorff.net/
Honorary Professor, SPRU (http://www.sussex.ac.uk/spru/), University of

Guest Professor, Zhejiang Univ. (http://www.zju.edu.cn/english/), Hangzhou;
Visiting Professor, ISTIC (http://www.istic.ac.cn/Eng/brief_en.html),

Visiting Professor, Birkbeck (http://www.bbk.ac.uk/), University of London;


