|
Search for language resources
|
|
Main Content
LT TTT
Information about this record
| Title: |
LT TTT
|
| ID: |
LT TTT
|
| Link to the object: | |
| Online: |
Yes
|
| Archive: | |
| Contributor: |
Andrei Mikheev (author)
Claire Grover (author)
Colin Matheson (author)
|
| Publisher: |
LTG, University of Edinburgh
|
| Description: |
The LT TTT system provides a flexible means of tokenising texts and adding markup at various levels. The main component of
the LT TTT system is a program called fsgmatch. This is a general purpose cascaded transducer which processes an input stream
deterministically and rewrites it according to a set of rules provided in a grammar file. Although it can be used to alter
the input in a variety of ways, the grammars provided with the LT TTT system are all used simply to add mark-up information.
We have provided grammars to segment texts into paragraphs, segment paragraphs into words, recognise numerical expressions,
mark up money, date and time expressions in newspaper texts, and mark up
bibliographic information in academic texts. The documentation provides a description of the rule formalis
Contact: grover@cogsci.ed.ac.uk
Documentation: online
Platform: Solaris
Distribution: Online
Price (Academic, Commercial, Multi-user) : free , to negotiate , to negotiate
|
| Other date: |
2000
|
| Other type: |
Annotation Tools , Corpus Analysis , Part-of-Speech Tagging , Processing Mark-Up Languages , Tokenization
|
| Complete OLAC record: | |
| Link for this page: |

