Program Combine.EXE for Mapping Heterogeneous Network Analysis (Co-word, Co-authorship, and Journal Analysis combined)

Loet Leydesdorff loet at LEYDESDORFF.NET
Wed Jul 21 03:42:42 EDT 2010

Program  <>
Combine.EXE for Mapping Heterogeneous Network Analysis

(Co-word, Co-authorship, and Journal Analysis combined) 


This program enables the user to generate a representation of the co-words,
coauthorship relations, and journals in a document set. Input is a set saved
using ISI’s Web of Science, and output is, among others, a file cosine.dat
in Pajek format. 


The input file has to be saved in the tagged format from the Science
Citation Index (Social Sciences Citation Index, Arts & Humanities Citation
Index) at the Web-of-Science. The default filename “savedrecs.txt” should
not be used, but “data.txt” instead. The program is based on DOS-legacy
software. It runs in a MS-Dos Command Box under Windows. The programs and
the input files have to be contained in the same folder. The output files
are written into this directory. Please, note that existing files from a
previous run are overwritten by the program. (The user is advised to save
output elsewhere if one wishes to continue with these materials.)


The routine creates the asymmetrical occurrence matrix (matrix.dbf which can
be read in excel or spss), the symmetrical affiliations (co-occurrence)
matrix (coocc.dbf, coocc.dat) and the cosine-normalized output (cosine.dbf,
cosine.dat) based on the asymmetrical occurrence matrix. Words which occur
only once in the input file are not included. If stopword.txt is made
available in the same directory, these words are also excluded. A possible
stopword list of 429 words in English is available
<> here.
(Originally from  <> This file has to be
renamed “stopword.txt”.) The variable labels are also available in
words.dbf; the order is; (1) title words; (2) author names; (3) journal


The labels in the output files (cosine.dat and coocc.dat) can be edited
(using an ASCII editor such as NotePad or WordPad) before feeding the files
into Pajek. A series of these matrices can be used for animations (in
<> the dynamic version of Visone,
SVG2Pajek or SoNIA) after saving these in Pajek. The necessary steps in
between are further explained in
<> this manual.

See for examples:

*	 <> "What Can
Heterogeneity Add to the Scientometric Map? Steps towards algorithmic
historiography" in: Festschrift for Michel Callon’s 65th birthday, Madeleine
Akrich, Yannick Barthe, Fabian Muniesa, and Philip Mustar (Eds.). Paris:
École Nationale Supérieure des Mines (forthcoming); <pdf-version
<> >
*	Eugene
<> Garfield and
Algorithmic Historiography: Co-Words, Co-Authors, and Journal Names, Annals
of Library and Informaiton Studies (forthcoming); <pdf-version
pdf> >

Available at 

Loet Leydesdorff 
Amsterdam School of Communications Research (ASCoR)
Kloveniersburgwal 48, 1012 CX Amsterdam.
Tel. +31-20-525 6598; fax: +31-842239111

 <mailto:loet at> loet at ;

Visiting Professor 2007-2010,  <>
ISTIC, Beijing; Honorary Fellow 2007-2010,  <>
SPRU, University of Sussex 
Now available:
The Knowledge-Based Economy: Modeled, Measured, Simulated, 385 pp.; US$
The Self-Organization of the Knowledge-Based Society ;
The Challenge of Scientometrics


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the SIGMETRICS mailing list