CiteSeer is a public search engine and digital library for scientific and academic papers. A digital library is a Library in which collections are stored in digital formats (as opposed to print Microform, or other media and accessible by computers It was created by researchers Steve Lawrence, Kurt Bollacker and Lee Giles while they were at the NEC Research Institute (now NEC Labs), Princeton, New Jersey, USA. SUP YALL This is about the singer/actor For other uses see Steve Lawrence (disambiguation. Dr Kurt Bollacker is a computer scientist with a research background in the areas of Machine learning, Digital libraries, Semantic networks and Electro-cardiographic Dr C Lee Giles is the David Reese Professor at the College of Information Sciences and Technology at the Pennsylvania State University. is a Japanese multinational IT company headquartered in Minato Tokyo, Japan. is a Japanese multinational IT company headquartered in Minato Tokyo, Japan. See also Princeton Township New Jersey, Borough of Princeton New Jersey Princeton Borough New Jersey Princeton Township New Jersey this CiteSeer's goal was to actively crawl and harvest academic and scientific documents on the web and use autonomous citation indexing to permit querying by citation or by document ranking them by citation impact. A citation index is an index of Citations between publications allowing the user to easily establish which later documents cite which earlier documents Citation is the process of acknowledging or citing the author year title and locus of publication (journal book or other of a source used in a published work It is hosted on the World Wide Web at the College of Information Sciences and Technology, The Pennsylvania State University, and has over 700,000 documents, primarily in the fields of computer and information science and engineering. The World Wide Web (commonly shortened to the Web) is a system of interlinked Hypertext documents accessed via the Internet. The Pennsylvania State University (commonly known as Penn State) is a state-related, land-grant, space grant public research University Computer science (or computing science) is the study and the Science of the theoretical foundations of Information and Computation and their Information science is an interdisciplinary science primarily concerned with the collection classification, manipulation storage retrieval and dissemination
CiteSeer freely provides Open Archives Initiative metadata of all indexed documents and links indexed documents when possible to other sources of metadata such as DBLP and the ACM portal. The Open Archives Initiative (OAI is an attempt to build a "low-barrier Interoperability framework" for archives ( institutional repositories) containing Metadata ( meta data, or sometimes metainformation) is "data about data" of any sort in any media DBLP ( Digital Bibliography & Library Project) is a Computer science Bibliography website hosted at Universität Trier, in Germany The Association for Computing Machinery, or ACM, was founded in 1947 as the world's first scientific and educational Computing society
CiteSeer's goal is to improve the dissemination and access of academic and scientific literature. As a non-profit service that can be freely used by anyone, it has been considered as part of the open access movement that is attempting to change academic and scientific publishing to allow greater access to scientific literature. Open access ( OA) is free immediate permanent full-text Online access for any user web-wide to digital scientific and scholarly material primarily Academic publishing describes the subfield of Publishing which distributes academic Research and Scholarship.
The name is a pun. A 'sightseer' is a tourist who looks at the sights, so a 'cite seer' would be a researcher who looks at cited papers.
CiteSeer has not been comprehensively updated since roughly 2005 due to limitations in its architecture design. It's a representative sampling of research in computer and information science but is limited in its coverage since it only has access to papers that are freely available, usually at an authors homepage. A comparison of DBLP references versus those in CiteSeer will always be found lacking since DBLP is manually implemented bibliography. DBLP ( Digital Bibliography & Library Project) is a Computer science Bibliography website hosted at Universität Trier, in Germany As an example consider the references in DBLP for well known authors such as Alex Pentland (MIT) or Ramesh Jain (UCI) (DBLP listings for Alex Pentland - http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/p/Pentland:Alex.html or Ramesh Jain - http://www.informatik.uni-trier.de/~ley/db/indices/a-tree/j/Jain:Ramesh.html). Alex (“Sandy” Pentland is the Toshiba Professor at MIT a serial entrepreneur and is one of the most-cited authors in computer science Ramesh Chandra Jain is a Scientist and Entrepreneur whose decades long career has spanned several universities and startup companies The University of California Irvine is a public Coeducational Research university situated in Irvine, California. DBLP shows a regular number of publications (~9) each year in DBLP through 2007. DBLP ( Digital Bibliography & Library Project) is a Computer science Bibliography website hosted at Universität Trier, in Germany While CiteSeer has only one of their publications after 2000, DBLP has none of their actual publications but link to those publications on publisher websites.
A more up to date version of CiteSeer can be found at the Next Generation CiteSeer, CiteSeerx, website. It's important to note that CiteSeer-like engines and archives usually only harvest documents from publicly available websites and do not crawl publisher websites. As such authors whose documents are freely available are more likely to be represented in the index.
The CiteSeer model had been extended to cover academic documents in business with SmealSearch and in e-business with eBizSearch. SmealSearch (now BizSeer is a Web portal, Search engine and Digital library for academic business documents that was originally hosted at the defunct However, these were not maintained by their sponsors. A older version of both of these can be found at BizSeer. IST. For enhanced access and performance, mirrors of CiteSeer were made available at universities such as the Massachusetts Institute of Technology, University of Zürich and the National University of Singapore. The University of Zurich ( German: Universität Zürich) located in the city of Zurich, is the largest University in Switzerland The National University of Singapore ( Abbreviation: NUS;; Abbreviated 国大 Malay: Universiti Kebangsaan Singapura; Tamil: However, mirrors of CiteSeer proved difficult to maintain.
The Next Generation CiteSeer project, CiteSeerx, funded by the National Science Foundation and Microsoft Research, enhances CiteSeer both as a search engine and as a digital library. CiteSeer x is a public Search engine and Digital library for scientific and academic papers with a focus on computer and Information Microsoft Research (MSR is a division of Microsoft created in 1991 for researching various Computer science topics and issues As an example, CiteSeer's notion of "contribution" to acknowledgments in addition to citations, which would make it the first automatically generated acknowledgment index. In the creative arts and Scientific literature, an acknowledgment (also spelled acknowledgement) is an expression of gratitude for assistance in creating An acknowledgment index is an experimental method for analyzing the Scientific literature; it quantifies the acknowledgments in scientific journals A beta version is currently available at the CiteSeer site.
The Next Generation CiteSeer, CiteSeerx, is now available in alpha  with over one million documents indexed.