Paper available: Indexing and Retrieval of Scientific Literature -- Steve Lawrence
Stephen D. Clark 16 Feb 2000 12:58 UTC
-------- Original Message --------
Subject: Paper available: Indexing and Retrieval of Scientific
Literature
Date: Tue, 15 Feb 2000 19:40:48 -0500
From: "S t e v e _ L a w r e n c e (Steve Lawrence)"
<lawrence@research.nj.nec.com>
The following paper discusses ResearchIndex (CiteSeer). ResearchIndex
is the world's largest free full-text index of scientific literature.
http://www.neci.nec.com/~lawrence/papers.html
http://citeseer.nj.nec.com/details/lawrence99indexing.html
Indexing and Retrieval of Scientific Literature
Steve Lawrence, Kurt Bollacker, C. Lee Giles
NEC Research Institute
The web has greatly improved access to scientific literature. However,
scientific articles on the web are largely disorganized, with research
articles being spread across archive sites, institution sites, journal
sites, and researcher homepages. No index covers all of the available
literature, and the major web search engines typically do not index
the content of Postscript/PDF documents at all. This paper discusses
the creation of digital libraries of scientific literature on the web,
including the efficient location of articles, full-text indexing of
the articles, autonomous citation indexing, information extraction,
display of query-sensitive summaries and citation context, hubs and
authorities computation, similar document detection, user profiling,
distributed error correction, graph analysis, and detection of
overlapping documents. The software for the system is available at no
cost for non-commercial use.
--
Steve Lawrence - http://www.neci.nec.com/~lawrence/
http://csindex.com/ - 250,000+ computer science papers