I am a computer science PhD student in the Department of Electrical Engineering and Computer Science at Northwestern University. My advisor is Prof. Doug Downey, and our lab is WebSAIL. My research interests include natural language processing and machine learning. I have been working on information extraction from Wikipedia and scholarly articles.
Currently I am focusing on statistical language modelling. I am exploring how data from dictionaries and encyclopedias can be used to improve the natural language capability of language models.
This project aims to build generative models of definitions in dictionaries and encyclopedias. The project is part of my current research. I am exploring deep learning algorithms and other language models.
My colleague, Chandra, has extended WebSAIL Wikifier to extract entities from tables on websites. The algorithm here is considered an improved version of the original project, though I have not yet applied it to normal articles. For more information and resources, see the TabEL project page.
TextJoiner is a system that allows users to interactively query and perform joins on facts expressed in Wikipedia using text patterns like "cities such as $x". Just language models and word embeddings.
The backend might be unavailable. Apart from taking part in discussions, I mainly implemented the UI for this project.
This project extracted Wikipedia tables and used machine learning to enable table exploration via "search" and "join". WikiTables allows users to find and view columns from different tables side by side, and potentially discover interesting correlations. Demo: Join and Search.
Again, I mainly implemented the UI for this project.