Michael J. Cafarella

Email: username mjc, found at cs dot washington dot edu.

Physical mail:
Mike Cafarella
University of Washington
Department of Computer Science and Engineering
Box 352350
Seattle, WA 98195-2350

Office: 482 Allen Center

I am a final-year graduate student in the Department of Computer Science and Engineering at the University of Washington. My research interests are databases, information retrieval and extraction, and machine learning/data mining. I am particularly interested in extracting and managing Web data.

I was looking for positions in both academia and industrial research, to start in the fall of 2009. I have since accepted a faculty position at the University of Michigan!

My advisors are Oren Etzioni and Dan Suciu. I've collaborated with many fellow students, most recently with Michele Banko, Chris Re, and Nodira Khoussainova. (And, from other universities, Daisy Zhe Wang, Eugene Wu, and Yang Zhang.) I have also completed two research projects at Google with Alon Halevy.

I've earned degrees from Brown and the University of Edinburgh, Scotland.

Before grad school, I worked at Marimba (later bought by BMC) and Tellme Networks (later bought by Microsoft). I also co-started the Nutch and Hadoop open-source search projects with Doug Cutting (but the demands of grad school have finally pushed me to emeritus status).

Recent news (6/11/09): The TextRunner project, which I started with Michele Banko and Oren Etzioni (and which has since been carried forward by many others), has been covered in MIT's Tech Review.


Publications

2009

2008

2007

2006

2005

2004


Teaching

I TA'ed CSE454, Advanced Internet and Web Services, in Winter '04 and Autumn '06. I really enjoyed helping to teach this class; if you're a UW student, give it a shot.

Personal


Last modified: January 13, 2009