Kostoff RN, Toothman DR, Eberhart HJ, Humenik JA "Text mining using database tomography and bibliometrics: A review "TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE 68 (3): 223-253 NOV 2001

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Thu Jan 10 16:16:38 EST 2002


R.N. Kostoff : E-mail: kostofr at onr.navy.mil


TITLE : Text mining using database tomography and bibliometrics: A review
AUTHORS : Kostoff RN, Toothman DR, Eberhart HJ, Humenik JA
JOURNAL : TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE 68(3):223-253 NOV 2001

 Document type: Review  Language: English     Cited References: 27    Times
Cited: 0


Abstract:
Database tomography (DT) is a textual database analysis system consisting of
two major components: (1) algorithms for extracting multiword phrase
frequencies and phrase proximities (physical closeness of the multiword
technical phrases) from any type of large textual database, to augment (2)
interpretative capabilities of the expert human analyst. DT has been used to
derive technical intelligence from a variety of textual database sources,
most recently the published technical literature as exemplified by the
Science Citation Index (SCI) and the Engineering Compendex (EC). Phrase
frequency analysis (the occurrence frequency of multiword technical phrases)
provides the pervasive technical themes of the topical databases of
interest, and phrase proximity analysis provides the relationships among the
pervasive technical themes. In the structured published literature
databases, bibliometric analysis of the database records supplements the DT
results by identifying the recent most prolific topical area authors; the
journals that contain numerous topical area papers; the institutions that
produce numerous topical area papers; the keywords specified most frequently
by the topical area authors; the authors whose works are cited most
frequently in the topical area papers; and the particular papers and
journals cited most frequently in the topical area papers. This review paper
summarizes: (1) the theory and background development of DT; (2) past
published and unpublished literature study results; (3) present application
activities; (4) potential expansion to new DT applications. In addition,
application of DT to technology forecasting is addressed. (C) 2001 Elsevier
Science Inc. All rights reserved.

Author Keywords:
database tomography, text mining, bibliometrics, innovation, information
retrieval, information extraction, cluster, taxonomies

Addresses:
Kostoff RN, Off Naval Res, 800 N Quincy St, Arlington, VA 22217 USA
Off Naval Res, Arlington, VA 22217 USA
RSIS Inc, Mclean, VA 22102 USA
NOESIS Inc, Manassas, VA USA

Publisher:
ELSEVIER SCIENCE INC, NEW YORK

IDS Number:
498UT

ISSN:
0040-1625

Cited Author            Cited Work                Volume      Page      Year

 BRADFORD SC           ENGINEERING                  137                1934
 KOSTOFF RN            5000 ONR                                        1991
 KOSTOFF RN            5440481                       US                1995
 KOSTOFF RN            COMPETITIVE INTELLIG           5         1      1994
 KOSTOFF RN            IEEE T ENG MANAGE             48       132      2001
 KOSTOFF RN            INFORMATION PROCESSI          34         1      1998
 KOSTOFF RN            J AIRCR                       37                2000
 KOSTOFF RN            J CHEM INF COMPU JAN                            2000
 KOSTOFF RN            J INFORM SCI                  23         4      1997
 KOSTOFF RN            JASIS           0415                            1999
 KOSTOFF RN            P 3 INT C MAN TECHN                             1992
 KOSTOFF RN            P PORTL INT C MAN EN                            1991
 KOSTOFF RN            SCI TECHNOLOGY INNOV
 KOSTOFF RN            SCIENTOMETRICS                43                1998
 KOSTOFF RN            SCIENTOMETRICS                40                1997
 KOSTOFF RN            SCIENTOMETRICS                39                1997
 KOSTOFF RN            TECHNOVATION                  19                1999
 KOSTOFF RN            UNPUB J SHIP RES
 LOTKA AJ              J WASH ACAD SCI               16                1926
 MACROBERTS M          SCIENTOMETRICS                36                1996
 SMALHEISER NR         ARCH GEN PSYCHIAT             55                1998
 SMALHEISER NR         COMPUT METHODS PROGR          57                1998
 SMALHEISER NR         NEUROSCI RES COMMUN           15                1994
 SWANSON DR            ARTIF INTELL                  91                1997
 SWANSON DR            COMPUTER ASSISTED SE                   217      1999
 SWANSON DR            PERSPECT BIOL MED             30                1986
 ZAMIR O               THESIS U WASHINGTON                             1999


When responding, please attach my original message
----------------------------------------------------------------------
Eugene Garfield, Ph.D. E-mail: mailto:garfield at codex.cis.upenn.edu Web site:
http://www.eugenegarfield.org
Telephone: (215)243-2205 Fax: (215)387-1266
Past President, American Society for Information Science & Technology
http://www.asis.org
Chairman Emeritus, Institute for Scientific Information ( ISI),
http://www.isinet.com  3501 Market St , Philadelphia, PA 19104-3389,
Pres.,Ed.-in-Chief, The Scientist, http://www.the-scientist.com 3535 Market
St , Philadelphia, PA 19104-3385,



More information about the SIGMETRICS mailing list