OLAC Language Resource Catalog

LT TTT
Title:
LT TTT
ID:
LT TTT
Link to the object:
Online:
Yes
Contributor:
Andrei Mikheev (author)
Claire Grover (author)
Colin Matheson (author)
Publisher:
LTG, University of Edinburgh
Description:
The LT TTT system provides a flexible means of tokenising texts and adding markup at various levels. Its main component is a program called fsgmatch, a general-purpose cascaded transducer that processes an input stream deterministically and rewrites it according to a set of rules provided in a grammar file. Although it can be used to alter the input in a variety of ways, the grammars provided with the LT TTT system are all used simply to add markup information. We have provided grammars to segment texts into paragraphs, segment paragraphs into words, recognise numerical expressions, mark up money, date and time expressions in newspaper texts, and mark up bibliographic information in academic texts. The documentation provides a description of the rule formalis
Contact: grover@cogsci.ed.ac.uk
Documentation: online
Platform: Solaris
Distribution: Online
Price (Academic, Commercial, Multi-user): free, to negotiate, to negotiate
Other date:
2000
Other type:
Annotation Tools, Corpus Analysis, Part-of-Speech Tagging, Processing Mark-Up Languages, Tokenization
Archive: The Natural Language Software Registry