Loading…

The O.S.A. Project: Computerisation of the Dictionary of the Swedish Academy

The 28-volume, not yet finished dictionary, comprising about 200 million characters, is being made machine-readable by means of optical character recognition. The structure of the database is highly dependent on the very consistent format of the dictionary articles and on the intricate typography of...

Full description

Saved in:
Bibliographic Details
Published in:Literary and linguistic computing 1988, Vol.3 (3), p.166-168
Main Author: MALMGREN, S.-G.
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The 28-volume, not yet finished dictionary, comprising about 200 million characters, is being made machine-readable by means of optical character recognition. The structure of the database is highly dependent on the very consistent format of the dictionary articles and on the intricate typography of the dictionary. Different kinds of possible linguistic information retrieval from the database are presented. Not surprisingly, these are mainly of a non-semantic nature. To make possible systematic investigations on, e. g., sense development, it will probably be necessary to implement various kinds of semantic tags: this issue is also discussed.
ISSN:0268-1145
1477-4615
DOI:10.1093/llc/3.3.166