

Archie Search Engine
Archie was the first search engine which was created in 1990 by Alan Emtage, Bill Heelan, and J. Peter Deutsch who were students attending McGill University in Montreal. Initially their idea for a name was 'archives' but they soon shortened it and is now known as Archie. Below is an example of what the Archie Query Form looked like:


In September 1990 Alan Emtage referred to Archie as "pretty brain-damaged" although later he showed more confidence in the abilities of Archie. When Archie was first created it downloaded the directory listings of all the files on public anonymous FTP (File Transfer Protocol) sites. FTP's allowed them to exchange and manipulate files from computer to computer through a network, e.g. the Internet as it is a network protocol. There are many FTP clients and server programs available for the different operating systems. FTP has been found very popular for exchanging independent files using the operating system that is involved. With Archie this was only done on a regular basis and therefore it did not waste too many resources. All of these listings were then stored in local files which were to be searched using the Unix grep command. Grep (Global regular expression print) is a command line text search which was originally written for Unix. The grep command searches files or standard input, globally for lines which match the sentance given. These are then printed to the program's standard output, although the user must be careful as by default the grep command is case sensitive and therefore it will only accept lower case e.g. archie not Archie.
After a while they began to develop more and more efficient versions of Archie, this meant improving the front and back end of the system. When the developed system began working it took off as the system then spread from a local tool, to a network wide resource and to a popular service available from sites around the Internet. The Archie servers could then be accessed in the following ways:
- Using a local client (Archie or xarchie)
- Telneting to a server directly
- Sending queried by electronic mail
Local client: xarchie
Xarchie is an X11 browser interface for the Archie Internet Archives database which uses the Prospero virtual file system protocol. The best known application of the Archie system is to maintain the Internet Archives database. This database is already available from a number of service providers throughout the Internet and currently contains the names of over 2,100,000 files at over 1000 anonymous FTP archive sites.
Xarchie displays the relevant information which is returned from Archie with a user friendly interface making it easier for the user to understand. This allowed many people to explore FTP sites by examining the directories which were returned as query matches and then it retrieves files which the user wanted. Xarchie was designed like most other X applications, which was to be highly costomisable. In Xarchie almost all details of the interface, display appearence and command interface, can be customised using X resources.
The pitucre below shows that in the left their directories which when clicked on are displayed on the right panel. The bottom panel of Xarchie shows a set of text items where you can select the appropiate one when nessesary. The following image shows the Xarchie Help Pannel:

The next picture shows another Xarchie document. The contents of the image can all be found using the status bar at the top of the box. This provides information about the progress of a query. In the middle section it displays functions as a host location, i.e. file browser. The far right section then displays the directory which was being searched. The bottm pannel of Xarchie shows a set of text items were you can enter information which can then be browsed for. Once the return key is pressed it invokes an action, as a shortcut for selecting if from a menu.

With the World Wide Web continually growing Archie has become less and less important as new ways are being inroduced. This has happened as more efficient serach engines are now used and researchers try try the web's main search engines first or ones which they already know has the topic of their search and therefore are likely to be found on FTP servers. Archie didn’t work anywhere near as well as search engines do today although then it allowed users to search the Internet for files they were looking for, carrying out the job it was designed for. Archie became like a database of web filenames which it used to match the users queries. Archie didn’t index the content of text files and therefore in 1991 Gopher developed another program which was a search enabling this. The following picture shows a simple search in Archie:

A few Statistics:
Archie servers processed approximately 50,000 queries per day which was generated by a few thousand users worldwide. Every two months of Internet growth required another replica of Archie. A dozen Archie services had to replicate continually developing the database of records. Information shows that Archie took longer to answer queries during the week compared to a Saturday night as traffic was less.
In 1992 Archie contained roughly 2.6 million files with 150 gigabytes of information according to seobythesea.com. Also in 1992, Archie had catalogued over 200 public FTP sites. Compared to figures now that seems laughable but a decade ago, it was already beyond anyone's wildest dreams. Later in 1993 reports said that Archie was still active although they were seeing some signs of strain in handling searches. At it's peak in 1995, 30 Archie engines crawled the internet and had catalogued millions of pages. Currently there are very few Archie servers which are still in operation although one gateway can be found in Poland.
Members of the Archie group are:
Comments (0)
You don't have permission to comment on this page.