I am an Associate Professor in the Department of Computing Science, University of Alberta.
My areas of research are knowledge extraction, data management, information retrieval, and natural language processing. I also work on machine learning methods applied to these fields. I have supervised graduate-level research on the problems of named entity recognition, entity typing, and disambiguation; open relation extraction from text; understanding social processes in Wikipedia article authoring; mining citation networks; and semistructured data management.
I am passionate about open linked data and the Semantic Web, and I work with Digital Humanities colleagues on creating, indexing, and processing knowledge graphs built from heritage, cultural, scholarly, and literary works.
I am also passionate about data management systems and applications, and I was a founding member and principal investigator of the NSERC Business Intelligence Network, working on information extraction from text.
I teach databases, information extraction and retrieval, and programming. Assignments, code and lectures are available on my Teaching "blog".
I recently served as an associate editor of IEEE Transactions on Knowledge and Data Engineering (TKDE) and of Computational Intelligence.
Occasionally, I serve as a reviewer for ACM Computing Reviews.
My students and I often contribute open-source research code. Some recent projects:
- EXEMPLAR (relation extraction)
- WNED (named entity disambiguation)
- PRIMA (information retrieval on the Internet Archive)
- LECTOR+ (information extraction from Wikipedia)
I am thrilled to count on the generous support of Diffbot.
In person: Office ATH 4-51