Chapter 1

Introduction to Information Storage and Retrieval

Information storage and retrieval

Information storage and retrieval, the systematic process of collecting and cataloging data so that

they can be located and displayed on request. Computers and data processing techniques have

made possible the high-speed, selective retrieval of large amounts of information for

government, commercial, and academic purposes. There are several basic types of information-

storage-and-retrieval systems.

Document-retrieval systems store entire documents, which are usually retrieved by title or by key

words associated with the document. In some systems, the text of documents is stored as data.

This permits full text searching, enabling retrieval on the basis of any words in the document. In

others, a digitized image of the document is stored, usually on a write-once optical disc.

Database systems store the information as a series of discrete records that are, in turn, divided

into discrete fields (e.g., name, address, and phone number); records can be searched and

retrieved on the basis of the content of the fields (e.g., all people who have a particular telephone

area code). The data are stored within the computer, either in main storage or auxiliary storage,

for ready access.

Reference-retrieval systems store references to documents rather than the documents themselves.

Such systems, in response to a search request, provide the titles of relevant documents and

frequently their physical locations. Such systems are efficient when large amounts of different

types of printed data must be stored. They have proven extremely effective in libraries, where

material is constantly changing.

Data

Data is a collection of raw facts from which conclusions may be drawn. Handwritten letters, a

printed book, a family photograph, a movie on video tape, printed and duly signed copies of

mortgage papers, a bank’s ledgers, and an account holder’s passbooks are all examples of data.

Before the advent of computers, the procedures and methods adopted for data creation and

sharing were limited to fewer forms, such as paper and film. Today, the same data can be

converted into more convenient forms such as an e-mail message, an e-book, a bitmapped image,

or a digital movie. This data can be generated using a computer and stored in strings of 0s and

1s, as shown in the Figure Bellow. Data in this form is called digital data and is accessible by the

user only after it is processed by a computer.

The Concept of Informtion Storage and Information Retreival , Thesis of Computer Science

Related documents