CSAIL Event Calendar: Previous Series
Dealing with the Complexities of Syntactic Parsing in Hebrew: Addressing Agreement, Word-segmentation and Rich Morphology in a Fast Dependency Parser
Speaker: Yoav Goldberg , Ben Gurion University
I will describe my experience with designing a syntactic parser for Hebrew, a language with rich morphology and a small treebank. After describing some of the characteristics that make automatic syntactic processing of Hebrew challenging and discuss some data representation issues, I will present some solutions to these challenges. These include improvements of a semi-supervised broad-coverage tagger, and a greedy dependency parser which can accommodate rich feature-sets and cope with noisy data while remaining fast. I will also briefly discuss a constituency parsing system that performs joint morphological segmentation and syntactic parsing. The work on Hebrew brought about solutions that work well also for English, I will point to these results when appropriate.