"It is a capital mistake to theorize before one has
data." (Arthur Conan Doyle)
Research
I am interested in stochastic modeling of natural language. For this, I often use the EM
Algorithm. [more]
Keywords
Artificial Intelligence: Computational Linguistics, Natural
Language Processing (NLP), Parsing, Lexical Semantics;
Statistics: Probabilistic Modeling, Statistical Inference, Maximum-Likelihood
Estimation, Expectation-Maximization (EM) Algorithm;
Machine Learning: Unsupervised Learning, Soft Clustering.
Selected Papers
-
Head-Driven PCFGs with Latent-Head Statistics. In Proceedings of the 9th
International Workshop on Parsing Technologies (IWPT 2005).
(pdf)
[presentation]
-
Inducing Head-Driven PCFGs with Latent Heads: Refining a Tree-bank
Grammar for Parsing. In Proceedings of the 16th European
Conference on Machine Learning (ECML 2005),
LNCS series.
(pdf)
-
A Tutorial on the Expectation-Maximization Algorithm Including
Maximum-Likelihood Estimation and EM Training of Probabilistic
Context-Free Grammars. Presented at the 15th European Summer
School in Logic, Language, and Information (ESSLLI 2003).
(pdf)
-
A Novel Disambiguation Method For Unification-Based Grammars Using
Probabilistic Context-Free Approximations. With Bernd Kiefer and
Hans-Ulrich Krieger. In Proceedings of COLING 2002.
(pdf)
-
Inside-Outside Estimation Meets Dynamic EM. In Proceedings of the
7th International Workshop on Parsing Technologies (IWPT 2001).
(pdf)
[presentation]
-
Inducing Probabilistic Syllable Classes using Multivariate
Clustering. With Karin Müller and Bernd Möbius. In
Proceedings of ACL 2000. (pdf)
[presentation]
-
Lexicalized Stochastic Modeling of Constraint-Based Grammars using
Log-Linear Measures and EM Training. With Stefan Riezler, Jonas
Kuhn, and Mark Johnson. In Proceedings of ACL 2000.
(pdf)
[presentation]
-
Using a Probabilistic Class-Based Lexicon for Lexical Ambiguity
Resolution. With Stefan Riezler and Mats Rooth. In Proceedings
of COLING 2000. (pdf)
[presentation]
-
Inside-Outside Estimation of a Lexicalized PCFG for German. With
Franz Beil, Glenn Carroll, Stefan Riezler, and Mats Rooth. In
Proceeding of ACL 1999. (pdf)
-
Inducing a Semantically Annotated Lexicon via EM-Based Clustering.
With Mats Rooth, Stefan Riezler, Glenn Carroll, and Franz Beil. In
Proceeding of ACL 1999.
(pdf)
[presentation]
More
Last updated: October 2007.