Hoifung Poon



Address:
Dept. of Computer Science & Engineering
University of Washington
Box 352350
Seattle, WA 98195-2350

Telephone: (206) 543-8086
Fax: (206) 543-2969
Email: <first name > at cs dot washington dot edu
Office: 318 Allen Center


I am now a researcher in the NLP group at Microsoft Research. Please visit my new website.

In June, I defended my Ph.D. dissertation in the Department of Computer Science and Engineering at the University of Washington. My advisor is Pedro Domingos.

My research interest are in statistical relational learning (SRL) and natural language processing (NLP). SRL is an emerging direction that combines logic and probability to handle both complexity and uncertainty in large-scale machine learning. Recent work has shown that SRL can compactly specify very large and complex probabilistic models and substantially improve predictive accuracy. In addition, my thesis work shows that SRL can help breach the bottleneck of large-scale machine learning: the scarcity of labeled examples. To compensate for the lack of label examples, my research has applied SRL to leverage indirect supervision via joint inference, where the labels of some objects can be used to predict the labels of others. This has enabled me to develop state-of-the-art unsupervised learning algorithms for a variety of NLP problems ranging from segmenting words to inducing ontologies.

These advances are based on Markov logic, a leading unifying framework for SRL. I have developed efficient learning and inference algorithms for Markov logic that can handle problems with millions of variables and billions of dependencies.

I am particularly excited about the long-standing goal of harnessing human knowledge by automatically understanding texts, a.k.a. machine reading. I have developed USP, an end-to-end machine reading system that can read text, extract knowledge, and answer questions, all without any labeled examples. In a machine reading experiment, USP extracted five times as many correct answers compared to state of the art such as TextRunner, and raised accuracy from below 60% to 91%.



I recently gave a tutorial in NAACL-2010 on Markov logic in natural language processing. It was the most popular tutorial in the conference (thanks all who came!). Here are the slides.


Publications

Software


Last modified: September, 2011