Natural Language Parsing: Treebanks and the Knowledge of Language

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Thursday November 17, 2011
      2:05 pm - 3:05 pm
  • Location: Technology Square Research Building (TSRB) Auditorium/Classroom 175
  • Phone:
  • URL:
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact

Kyoko Masuda, Associate Professor of Japanese & Linguistics, School of Modern Languages

kyoko.masuda@modlangs.gatech.edu

Summaries

Summary Sentence: Dr. Sandiway Fong presents "Natural Language parsing: Treebanks and the Knowledge of Language"

Full Summary: A public talk by Dr. Sandiway Fong, Natural Language Parsing: Treebanks and the Knowledge of Language.Please Note: This talk will be delivered via Skype

A Public Talk presented by Dr. Sandiway Fong

(Please Note - This talk will be delivered via Skype)

Over the past two decades, there has been increasing use of linguistically annotated sentence collections, such as the Penn Treebank (PTB), for constructing statisticallybased parsers across a variety of languages. Such Treebank-based parsers are trained on and exploit syntactic regularities in (manually-annotated) phrase structures - statistically interpolating where data is missing - in order to synthesize a most-likely parse when presented with novel (and pre-existing) sentences. Within this framework, one must rely on the promissory note that all necessary grammatical knowledge is encapsulated and statistically extractable from the Treebank corpus. In this talk we discuss the practical limits of this approach, cognitive implications for the problem of language acquisition, and finally, on-going Treebank work for a bilingual English/Arabic roboceptionist (Hala), a joint project with Carnegie Mellon University
(Qatar and Pittsburgh).

Co-sponsored by the School of Modern Languages, Ivan Allen College Dean's Office and the School of Interactive Computing.

Biography of Dr. Fong:  Sandiway Fong received his B.Sc. in Computing Science, at Imperial College of Science and Technology, University of London. He received an S.M. in 1986 at MIT, where he worked in the Artificial Intelligence Laboratory.  After working at IBM's Watson Research Center, he returned to MIT  for his Ph.D. which was awarded in 1991.  Upon graduation Dr. Fong joined the NEC Research Institute to work on natural language processing, and machine translation.  In 2003, he moved to the University of Arizona, where he is now an Associate Professor.  His research interests are at the intersection of computer science and formal linguistics, with a focus on multilingual parsing, ontolinguistics, computational lexical semantics and computational morphology.

Related Links

Additional Information

In Campus Calendar
Yes
Groups

Ivan Allen College of Liberal Arts, School of Modern Languages

Invited Audience
No audiences were selected.
Categories
Seminar/Lecture/Colloquium
Keywords
Dr. Sandiway Fong, language, Modern Languages, TSRB
Status
  • Created By: Carol Silvers
  • Workflow Status: Published
  • Created On: Oct 10, 2011 - 2:21pm
  • Last Updated: Oct 7, 2016 - 9:56pm