What is the Place Names Database?
The Place Names Database (KNAB) is a systematic computerized collection of data on geographical names from both Estonia and abroad that is being developed at the Institute of Estonian Language. Its purpose is to facilitate the study and standardization of geographical names by providing data on their history and modern use. It has been planned as a linguistically-oriented database, to enable to compile and prepare different gazetteers and dictionaries.
History
The beginning of the database goes back to 1988 when it started operating on mainframe computers. The first data entered were the street names of Estonia using the format of columns. In 1989 a more sophisticated and flexible text-based database structure was envisaged which continues to be used even nowadays, with many improvements. In 1990 there were already 26,000 records in the database but the transition to personal computers had negative effects because a part of the data was conserved on older tapes that could not be converted later. The actual work on the database started again in 1994 with the support of the Estonian Science Foundation; the presentation of different data elements was elaborated and old files updated. At the end of 1996 the database contained already 40,000 records  or name articles including some 37,000 variant names. The foreign name files were the first to be integrated into one systematic collection; on the basis of the data a dictionary was published in 1999.
The integration of the Estonian place names data was finished in 2000 together with a systematic treatment of names of natural features, the project was supported again by the Estonian Science Foundation.
At the end of 2000 the Place Names Database KNAB contained 63,000 name articles (additionally 86,000 variant names in the articles) -- 41,000 Estonian places name articles (incl. 28,000 variant names) and 22,000 name articles of foreign countries (incl. 58,000 variant names). 32,000 Estonian place names are integrated into one territorially organized database excluding in principle any double records; the remainder constitute different map name lists or parts that are yet under processing (e.g. the list of roads). The Internet version contains only the integrated part of the database.
Structure of the database
A complete database record (name article) would contain information on the following items.
- Principal name form, with divisions into the name core, qualifying attributes or first names (in commemorative names), numbers, generic terms and explanative terms. Principal names are usually the official names if there are any, or other preferred name forms that are linguistically verified; during further processing the order of principal names and variant names is subject to revision. The Internet version gives all names in full, without dividing the names into elements; pronunciation will be omitted.
- Variant names including parallel (official) names or obsolete or foreign-language names. Erroneous name forms from different sources are also registered.
- To each name form also the following sub-items may be supplied:
	
	-  labels, i.e. the language of the name, information on the stylistic or functional value of the name, the use of locative cases, etc. (see here for details)
	
-  name sources (see here for source formats)
	
-  comments (not given in the Internet version)
	
 
- Status of the name article: ordinary, historic name, etc. (See here for details.)
- Feature designation code (classification of the named feature: populated place, lake, river, island, etc. See the table of feature codes. This field also contains the ID of the feature in national catalogues or indexes.
- Present administrative division where the named feature belongs to: county and municipality (municipalities). For foreign countries this data field contains the country codes
- Parish or parishes (historical church parishes) where the named feature belonged to in 1918; in the case of several parishes also the abbreviation of the old county is added.
- Minor area to which the named feature belongs. This field may contain the abbreviation of a city district, the name of a populated place (in the case of streets or houses) or a larger geographical entity, e.g. the peninsula or a bay; for rivers this will contain the name of the main river or the bay where it flows into. For individual houses also addresses may be given.
- Geographical coordinates of the feature: latitude and longitude. The coordinates for features within Estonia are established on the basis of the digital version of the Estonian Basic Map (1 : 10,000), those for foreign features are, as a rule, based on the GEOnet database of the United States Board on Geographic Names. Many name records (esp. for foreign countries) do not contain any coordinates at all. Point features have the coordinates of the conventional centre (preferrably those of the historic centrepoint). In Estonia line features have the coordinates of both the beginning and end points; area features have the coordinates of the most south-western and the most north-eastern point of the minimal binding rectangle.
The Place Names Database may contain also the following information that is not given in the Internet version.
- The name of the administrative centre in the case of administrative units
- Temporal extensions of the feature designation codes, administrative divisions, etc. (this will allow to store also data on earlier administrative units where the feature belonged to, etc.)
- Short textual description of the named feature (used in dictionaries)
- Use of locative cases (either internal or external cases are associated in Estonian for names of specific populated places)
- Names of subdivisions (i.e. which features might be included, e.g. which former populated places have been annexed to present official populated places)
- Names of superior features (under which features they might be included)
- Comparisons to other named features, links
- Historical background of the named feature, the origins of the name and other explanations
Actual content of the database
As the database has been compiled over a longer period of time, it is in many ways unproportional. In general the following categories of names may be outlined.
Place names of Estonia
- Street names were the first to be entered into the database. Various sources were used, including the names of streets mentioned in Soviet-time descriptions of electoral districts, telephone books, etc. In 1991 the Statistics Board sent out a questionnaire to establish all official street names. This made it possible to collect more or less completely all the street names at that time. A list of street names was compiled also by AS Regio for the census of 2000, its data have been included here also. At present the database contains some 8900 valid street names plus hundreds of historical names. Name variants used in the 20th century have been included, especially for historical cities; for Tallinn and Tartu the data goes back to even earlier centuries, thanks to the fundamental publications by Aleksander Kivi and Niina Raid [links here and in the following will give the full bibliographic entries of the sources]. The actual list of street names for Tallinn may be considered official and exhaustive, this being updated in cooperation with the name committee of Tallinn. Data for Tartu has been expanded also independently on the basis of maps, travel guides and reference books of the 19th century and name spellings from Estonian-language newspapers of that time. Among other cities of Estonia Haapsalu, Kuressaare and Narva have also been investigated more thoroughly.
- Although the names of institutions, companies and organizations are not strictly speaking toponyms, the data on these entities have also been collected at various times, mainly in the case of Tallinn and Tartu. The names of institutions of Tallinn date back often to the 1980's when the file was originally created; it has been only partially updated (schools, libraries, medical institutions, hotels), others (shops, cafes, etc.) are still outdated and contain the note "1980. aastad" ('the 1980's'). There are 1700 name articles on institutions in the database.
- The names of populated places in the database reflect the official status of each name, i.e. all official names have been verified. (The Place Names Database was used in fact to compile the official listing published in Riigi Teataja or the official gazette.) Additionally main sources from the 20th century (incl. maps) have been analyzed for the database: listing of populated places of the census 1970, populated places in the lists of village soviets in 1945, topographical maps of the 1930's (1 : 50,000, 1 : 200,000), names of populated places in the documents of the 1922 census, list of place names in the province of Estonia 1913 and names from the 1 : 42,000 map of the beginning of the 20th century. All listings have fully been incorporated into the database and the name forms related to modern features with coordinates, if applicable. In a few cases the names in sources have not been identified, these are in the database with a label HIST. From maps all names marked as those of populated places have been included.
- The place names of Petserimaa (Pechorskiy Kray) have been under special investigation and therefore name articles of that region contain more information than usual. In addition to maps and listings of the 1920's and 1930's also some archival sources were consulted, notably the plans for name changes at the end of the 1930's.
- The names of manor houses are given mainly on the basis of a list published in 1994 ("Eesti mõisad") but with later modifications and updatings. In 2008 the data were fully updated (with some names corrected also).
- The names of farms have earlier been added casually, since 2008 this is being conducted in a more systematic way, parish by parish.
- The list of names of administrative units has been compiled quite recently, where possible, it contains also earlier non-Estonian name variants.
- The names of natural features have been collected on the basis of several official or half-official listings but it has not yet been compared to the data on newest maps (esp. the Basic Map of 1 : 20,000) and also with the card index at the Institute of Estonian Language. Therefore they should be considered provisional only.
	
	- The list of rivers and streams is based on an official listing of 1986 that included also variant names. This has been modified slightly on the basis of newest name spellings of populated places but officially the revision should be done under the guidance of the Place Names Board of Estonia. There are 1,900 articles on river names in KNAB. The coordinates are approximate.
	
- The list of lake names originates from a publication of 1934 by H. Riikoja. This was expanded by another list of lakes in 1964 (I. Kask) and a monograph on lakes by A. Mäemets in 1977. KNAB reflects the 2006 official listing of lakes.
	
- The list of island names was compiled on the basis of a book by A. Loopmann (1996) which has been considerably expanded by data from the card index at the Institute of the Estonian Language and other maps. KNAB reflects the 2008 official listing of islands (though some revision is still needed).
	
- The names of mires have been mainly taken from a map of mires of 1993 by AS Regio and the Estonian Geological Centre. New names have been added but there is still much work to do.
	
- The names of features on the shoreline include those of capes, peninsulas, bays, shoals etc. A preliminary listing of those features was prepared by the Estonian National Maritime Board on the basis of nautical charts of the first half of the 1990's. This was expanded with data from the Basic Map, other maps and the card index at the Institute of the Estonian Language.
	
 
Place names of foreign countries
The data files for foreign place names have also been compiled from various sources (the listing of Estonian exonyms, place names annex to the Russian-Estonian dictionary, etc.) and is relatively less uniform than the Estonian collection. Place names from around Estonia are represented in larger numbers than those of other more distant areas. See a table to have an overview on the distribution of place names records country by country. The following name categories should be mentioned.
- The names of first-level administrative units are given for all countries of the world, this includes names of provinces, counties, districts, etc. For Finland, Latvia, Russia, Sweden and increasingly for several other countries also ADM2 level names are represented. A separate page lists all included administrative units.
- The place names of some regions of Russia (Adygea, Ingushetia, Kabardino-Balkaria, Karachay-Cherkessia, Northern Ossetia) have been collected more thorougly having in mind also the local-language name forms; Udmurt place names from within and around Udmurtia are especially well represented.
- The place names of smaller peoples have been under special investigation, also thanks to the GeoNative website. KNAB contains names data covering Chechen, Gaelic and Welsh names, it also incorporates the  whole Basque place names list.
- The lists of exonyms have been compiled originally for the purpose of the dictionary on the geographical names of the world; as many name variants as possible have been included. In addition to Estonian exonyms the database contains also English, Finnish, French, German, Russian and other exonyms.
Important to know
The user of the database should also be aware of the following.
- This is not the official source of place names for Estonia giving the correct spellings of each name. The official status of names is reflected only in the case of names of populated places and street names of Tallinn. Official national place names register is being maintained by the Estonian Land Board.
- Although one of the final targets of the database is to provide linguistically sound and standardized name spellings, for many different reasons this is still not the case at present. Principal name forms given in the present output are therefore subject to further changes as new data will be gathered and entered.
- The place names database is being expanded and updated constantly. Some of the data included might be highly questionable, even erroneous. Still, it is believed that even with its present content the database may be utilised by critical users to get at least preliminary information on different place names.
- The user of the foreign place names data should be aware that any definition of names, national boundaries, etc. can not be viewed as the de iure recognition of the current situation. The database presents name spellings and other information for practical purposes only, reflecting mostly the actual situation. But the database does not refer to certain political entities as independent states if they are not internationally recognized (e.g. Nagornyy Karabakh, Northern Cyprus, Somaliland, etc.). In some other cases also it follows the guidelines accepted by the international community.