Akira UshiodaDavid A. EvansTed GibsonAlex Waibel
We describe a mechanism for automatically acquiring verb subcategorization frames and their frequencies in a large corpus. A tagged corpus is first partially parsed to identify noun phrases and then a finear grammar is used to estimate the appropriate subcategorization frame for each verb token in the corpus. In an experiment involving the identification of six fixed subcategorization frames, our current system showed more than 80% accuracy. In addition, a new statistical approach substantially improves the accuracy of the frequency estimation.
Michael R. BrentRobert C. Berwick
Dipankar DasAsif EkbalSivaji Bandyopadhyay
Jeremy YallopAnna KorhonenTed Briscoe