Extensions of FullText.exe and Ti.exe for co-word analysis with provisions for stop word lists and word frequency lists
Loet Leydesdorff
loet at LEYDESDORFF.NET
Thu Mar 23 04:31:56 EST 2006
Dear colleagues,
While using the programs in class, it became clear that one needs a
provision for generating word frequency lists from the texts and for
correcting for stop words. The latter facility has now been added to the
programs (at http://www.leydesdorff.net/software/ti/stopword.exe ), and the
former is provided as a hyperlink to TextSTAT-2 of the Dutch Linguistics
Department of the Free University Berlin (at
http://www.niederlandistik.fu-berlin.de/textstat/software-en.html ).
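
As a rough illustration of the two provisions mentioned above (a word
frequency list and a stop word correction), the following Python sketch counts
words across a set of titles while skipping a stop word list. It is not the
code of stopword.exe or TextSTAT; the function name, the example titles, and
the stop word list are all assumptions made for the example.

```python
import re
from collections import Counter

def word_frequencies(texts, stop_words=()):
    """Count word occurrences across texts, skipping stop words.

    Hypothetical helper for illustration only.
    """
    stop = {w.lower() for w in stop_words}
    counts = Counter()
    for text in texts:
        # Lowercase, then keep runs of letters and apostrophes as words.
        # Hyphenated forms such as "co-words" are split into two words.
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(w for w in words if w not in stop)
    return counts

# Made-up titles and stop word list, for the example only.
titles = [
    "The structure of the scientific literature",
    "Words and co-words as indicators of the structure of science",
]
stop_list = ["the", "of", "and", "as"]

freq = word_frequencies(titles, stop_list)
for word, n in freq.most_common():
    print(word, n)
```

The resulting list can be inspected to decide which frequent words to retain
for the co-word analysis and which to add to the stop word list.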
The programs themselves can be found at
http://www.leydesdorff.net/software/ti and
http://www.leydesdorff.net/software/fulltext , respectively. Advanced
users may want to note that the cosine matrices can also be replaced with
Pearson correlation matrices by reading the output file matrix.dbf into
SPSS and running the appropriate routines. In my opinion, however, some
convincing arguments have been made for using the cosine as the similarity
criterion for the visualizations.
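
To make the difference between the two similarity measures concrete, here is
a minimal Python sketch that computes both for two word vectors. It does not
reproduce what the programs or SPSS do internally; the function names and the
occurrence vectors are made up for the example. The Pearson correlation is
written as the cosine of the mean-centered vectors, which is the standard
relation between the two measures.

```python
import math

def cosine(x, y):
    """Cosine of the angle between two vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return dot / norm

def pearson(x, y):
    """Pearson correlation: the cosine of the mean-centered vectors."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    return cosine([a - mx for a in x], [b - my for b in y])

# Hypothetical occurrence vectors: one entry per document,
# 1 if the word occurs in that document and 0 otherwise.
word_a = [1, 1, 0, 0, 0, 0]
word_b = [1, 0, 1, 0, 0, 0]

print(round(cosine(word_a, word_b), 3))   # 0.5
print(round(pearson(word_a, word_b), 3))  # 0.25

# Appending documents in which neither word occurs leaves the cosine
# unchanged, whereas the Pearson correlation shifts with the mean.
longer_a = word_a + [0] * 4
longer_b = word_b + [0] * 4
cos_longer = cosine(longer_a, longer_b)
r_longer = pearson(longer_a, longer_b)
```

In sparse word/document matrices most cells are zero, so this sensitivity of
the Pearson correlation to shared zeros is one consideration when choosing
between the two measures.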
With best wishes,
Loet
________________________________
Loet Leydesdorff
Amsterdam School of Communications Research (ASCoR)
Kloveniersburgwal 48, 1012 CX Amsterdam
Tel.: +31-20- 525 6598; fax: +31-20- 525 3681
loet at leydesdorff.net ; http://www.leydesdorff.net/