The O.S.A. Project: Computerisation of the Dictionary of the Swedish Academy
Published in: | Literary and linguistic computing 1988, Vol.3 (3), p.166-168 |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Summary: | The 28-volume, not yet finished dictionary, comprising about 200 million characters, is being made machine-readable by means of optical character recognition. The structure of the database is highly dependent on the very consistent format of the dictionary articles and on the intricate typography of the dictionary. Different kinds of possible linguistic information retrieval from the database are presented. Not surprisingly, these are mainly of a non-semantic nature. To make possible systematic investigations of, e.g., sense development, it will probably be necessary to implement various kinds of semantic tags; this issue is also discussed. |
---|---|
ISSN: | 0268-1145, 1477-4615 |
DOI: | 10.1093/llc/3.3.166 |