Tagging accurately – Don’t guess if you know

Tagging accurately – Don’t guess if you know
Fourth Conference on Applied Natural Language Processing
Association for Computational Linguistics
Germany, 1994
Pasi Tapanainen; Atro Voutilainen
http://aclweb.org/anthology/A/A94

We discuss combining knowledge-based (or rule-based) and statistical part-of-speech taggers.

We use two mature taggers, ENGCG and Xerox Tagger, to independently tag the same text and combine the results to produce a fully disambiguated text.
In a 27000 word test sample taken from a previously unseen corpus we achieve 98.5 % accuracy.
This paper presents the data in detail.

We describe the problems we encountered in the course of combining the two taggers and discuss the problem of evaluating taggers.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s