ASF
HOME | MIRROR | BROWSE |
SEARCH | E-MAIL |
Updated 30 Aug 1998 Security and Privacy Notice |
URL:/about/terms.html
Concepts and Terms
ASF concepts and base principles
- Searchers employ a range of techniques to find many kinds of
information.
- In the ASF, search is directly supported through Internet access to search services at
ASF Information Community nodes operated by ASF participants. Searchers using Internet
client software typically access these services through Web browsers and Hypertext
Transfer Protocol (HTTP) gateways or directly using GILS-aware client software. Although
the search itself occurs on the Internet, referenced content is not so constrained and may
include people, artifacts, events, maps, images, organizations, software--anything that
can be documented in a Locator Record. It is also understood that participants and other
intermediaries may exploit the ASF to support voice telephone information services,
kiosks, printed catalogs, etc.
- Information providers may support search access to primary
information content and also provide referral to sets of information elsewhere.
- In the ASF, primary information content is called a document set. Information that
characterizes document sets is represented in ASF Summary Locator Records. An ASF
Information Community consists of ASF Nodes and is delineated by the sharing of ASF
Summary Locator Records among ASF participants.
- Searches for information involve query/response interactions
between searchers and information providers.
- In the ASF, query/response interactions are standardized using the GILS Profile. This
standard search interface applies to all document sets, including sets of Locator Records.
- Each information provider may characterize its own and other
document sets and may be subject to characterization by others.
- ASF Summary Locator Records can be created through a machine-aided process that includes
automated word indexing for some types of documents (e.g., HTML, ASCII text, and other
types with proprietary add-on filters). ASF Summary Locator Records may also be created
for document sets held at sources not actively participating in the ASF. Locator records
within the ASF are always individually editable by people and searchable via a
GILS-compliant server.
- Each information provider focuses on the information needs
of particular clientele and this focus influences the selection of document sets,
referrals, and aspects of the search support provided.
- In the ASF, each participant may support distributed or centralized search of document
sets through the immediate ASF Node as well as throughout an ASF Information Community.
Searchers may begin at any server and exploit other servers through ASF Summary Locator
Records. Each ASF Node provides at least a GILS-compliant search service but may also
offer additional search capabilities, such as advanced geospatial search or multi-lingual
support.
- Each information provider may characterize document sets in
different ways to diverse clientele.
- In the ASF, a document set is part of an ASF Information Community as long as the ASF
participant provides an ASF Summary Locator Record for the document set. Each ASF
Information Community has a Fallback Node with ASF Summary Locator Records that
characterize each document set in the ASF Information Community. A document set may be
referenced in multiple ASF information communities, and may be represented by different
ASF Summary Locator Records in each.
- Each information provider supports search of contents by
characteristics, which may be unstructured (e.g., "full-text",
"pattern-matching"), or structured (e.g., Title, Author, Subject).
- Each ASF Node supports GILS-compliant search of Locator Record sets as well as document
sets. Structured searches in the GILS Profile are defined for Boolean operations on about
100 common bibliographic and metadata elements and for temporal and spatial constraints.
Unstructured search is supported only through full-text search at this time.
- Information providers employ a complex range of server
platforms, services, and search engines.
- In the ASF, the design platform is a Unix environment with the Apache HTTP server.
However, essential software components are freely distributed in source code to support
porting to additional platforms. Also, the ASF uses open systems design principles where
feasible to allow for maximum interoperability and component substitution. For example:
the network interface supports OSI as well as TCP/IP transports; the network search
interface is ISO 23950; the search engine is accessed through a generic Application
Programming Interface, the ASF characterization of document sets is adapted from the IETF
FIND protocol; and the network exchange of Locator Records is encoded in Extensible Markup
Language (XML).
- Information providers must be able to control access and
support authentication.
- In the ASF, access to documents and Locator Records is controllable through standard
network and ISO 23950 mechanisms. A mechanism is also provided for searchers to
authenticate the content served.
- Reliability of Internet information access is a major
concern.
- In the ASF, the persistence of referenced information sources is somewhat improved
through the use of Uniform Resource Names (URNs).
Term hierarchies in ASF Content, Participation, and Freeware
- Content
- Document Set
- Locator Record
- Information Community Fallback Node document set
- Information Community Participant Locator Records
- Information Community Summary Locator Records
- Participation
- Freeware
Alphabetic list of ASF Terms
- Advanced Search Facility (ASF):
- The ASF has three distinct aspects: content, participation, and software. ASF content
consists of information in the form of Locator Records and documents, with special cases
for ASF Participant Locator Records and ASF Summary Locator Records. ASF participation is
organized into ASF Information Communities and there are particular operational
responsibilities prescribed for ASF participants. ASF software is implemented at ASF
Nodes, each of which includes a suite of software components to achieve the various
requirements of the ASF.
- ASF Node:
- A single host machine, which runs the ASF software, is an ASF Node. ASF Nodes may
participate in any number of ASF Information Communities.
- Document:
- An ASF document is any information that can be indexed by an ASF Indexer Service,
including Locator Records as well as textual matter. (The ASF Indexer Service does not
handle images, numeric tables, maps, or other abstract information sources at this time.)
- Document Collection:
- The set of document sets accessible at an ASF Node.
- Document Set:
- An ASF document set is made accessible through an ASF Search Service. ASF does not
prescribe the rationale for grouping documents into sets, although common criteria include
subject, format, ownership, etc. In addition to being directly searchable through the
Search Service of an ASF Information Community node, a document set is referenced in an
ASF Information Community through an ASF Summary Locator Record. An example of a document
set is the set of HTML pages at the Internet location "http://www.usgs.gov/".
- Gatherer Service:
- An ASF Gatherer Service is a component of an ASF Search Subsystem. It compiles a list of
documents to be indexed by extracting documents from other servers (one such technique is
known as "Web crawling"). The compilation can be constrained by location, data
type, and other parameters.
- Indexer Service:
- An ASF Indexer Service is a component of an ASF Search Subsystem. It provides for
parsing documents and creating index files that allow fast search and retrieval on
document sets. The ASF Indexer Service also has the capability to generate a locator
record for each indexed document.
- Information Centroid:
- A summary of the information content contained in a document set. This might be as
simple as a list of the indexed words in a document set.
- Information Community:
- An ASF Information Community is an organization of ASF participants. The sharing of ASF
Participant Locator Records and ASF Summary Locator Records delineates its content. In
software, an ASF Information Community is implemented at ASF Nodes. The full set of
participants and summarized document sets within an ASF Information Community are
described in a compilation of Locator Records at the ASF Information Community Fallback
Node. An example of an ASF Information Community is the Clearinghouse of the U.S. National
Spatial Data Infrastructure.
- Information Community Fallback Node:
- An ASF Information Community Fallback Node provides Search Service access to the full
set of ASF Participant Locator Records and ASF Summary Locator Records within an ASF
Information Community. For example, a likely host for an ASF Information Community
fallback Server for the "U.S. Government Cabinet-level Agencies" information
community would be the Executive Office of the President.
- Information Community Node:
- An ASF Information Community node is a software component of an ASF Information
Community as operated by an ASF participant. It includes a Web browser user interface, an
ASF Search Subsystem, and an ASF Web-Z39.50 gateway with embedded GILS-aware client. ASF
provides freely distributable software for all node components, with enhanced capabilities
possibly available on a component basis.
- Information Community Node Administrator:
- A person performing administrative functions in support of an ASF Information Community
Node.
- Information Provider:
- In the ASF, an information provider is a person or organization providing access to
information as owner or intermediary, whether publicly or restricted. Every ASF
participant is an information provider. However, because Locator Records can reference
outside information sources, not every information provider is an ASF participant. For
example, the U.S. Geological Survey would be a participant in the "U.S. Government
Earth Science" information community. Outside Earth Science sources (e.g.,
commercial, educational, foreign governments) could be summarized as information providers
but would not be participants in this particular community.
- Intermediary:
- An intermediary is any information provider referencing content owned by another. ASF
does not distinguish between intermediaries and other information providers, except that
the provider role is evident by information in Locator Records. For example, the U.S.
Government Printing Office acts as an intermediary for most U.S. Geological Survey
publications.
- Locator Record:
- A Locator Record describes an information resource, the information available in the
resource, and how to obtain the information. ASF introduces ASF Participant Locator
Records and ASF Summary Locator Records for document sets. Except for how they are
constructed and exchanged with an ASF Information Community, these and all other Locator
Records are handled as any other document sets in the ASF. While Locator Records may
originate in a range of formats, all are searchable and presentable through the model
described in the GILS Profile.
- Management Service:
- The ASF Management service consists of a variety of software tools to simplify tasks
performed by ASF Information Community node administrators. Among the functions addressed
by these tools are management of document sets, URNs, and thesauri, and configuring
information communities and Z39.50 subsystem components.
- Participant:
- An ASF participant is an information provider operating at least one ASF Information
Community node. An ASF participant must be described in an ASF Participant Locator Record
on the ASF Information Community Fallback Node.
- Participant Locator Record:
- An ASF Participant Locator Record describes an ASF participant in the context of an ASF
Information Community. The initial building of an ASF Participant Locator Record is
accomplished when the participant first joins the ASF Information Community. The content
of an ASF Participant Locator Record is similar to a "GPO pathway record".
- Resource:
- Any information described in a Locator Record.
- Search API (Applications Programming Interface):
- The ASF search API is a software layer within an ASF Search Service. It provides
communication to an ASF Z39.50 Subsystem, but is generalized to support other query
mechanisms such as LDAP and SQL as well.
- Search Engine:
- This term is ambiguous in this document and in the industry itself. In its narrow usage,
a search engine is software that responds to search requests, typically by indexing some
set of information. A GILS-compliant search engine in this sense is embedded within an ASF
Search Service. A broader usage of the term "search engine" includes the means
by which information content supported by the search engine is acquired.
- Search Service:
- An ASF Search Service is a component of an ASF Search Subsystem. It provides search
access to at least one set of Locator Records, and may offer search access to other sets
of Locator Records or document sets. The ASF Search Service includes the ASF search API.
- Search Subsystem:
- An ASF Search Subsystem is a component of an ASF Node. The ASF Search Subsystem includes
an ASF Gatherer Service, an ASF Indexer Service, and an ASF Search Service.
- Summary Locator Record:
- An ASF Summary Locator Record describes a document set made available by an ASF
participant on an ASF Node. The content of an ASF Summary Locator Record is similar to
other Locator Records except that the Subject Index element of the record references a
summary object such as a centroid.
- Uniform Resource Name (URN):
- A persistent name applied to an online resource.
- URN Resolver:
- A URN resolver is a software utility function used throughout the ASF. The function can
be achieved through a variety of software and administrative mechanisms.
- Web Browser User Interface:
- An ASF Web browser user interface is a component of an ASF Node. It communicates with
the Web-Z39.50 gateway on the same ASF Node.
- Web-Z39.50 Gateway:
- An ASF Web-Z39.50 gateway is a component of an ASF Web browser interface and includes an
embedded GILS-aware client. An ASF Web-Z39.50 gateway communicates with an ASF Search
Subsystem.
- Z39.50 Subsystem:
- An ASF Z39.50 Subsystem is a component of an ASF Search Service. It includes the core
ANSI Z39.50 (ISO 23950) capabilities required, including GILS-compliance.