skip to content

Home

Faculty of Modern and Medieval Languages and Linguistics

 

Anna Korhonen webpage

Please see below for:

  • Publications
  • Media
  • Activities
  • Teaching
  • Projects
  • People
  • Biography
  • Research
  • News

Anna Korhonen

 

image

Reader in Computational Linguistics
Co-Director of the Language Technology Lab

Department of Theoretical and Applied Linguistics (DTAL)
Faculty of English Building, 9 West Road
Cambridge CB3 9DB, UK
Office: TR-12
Phone: (+44) 1223 767 389

Affiliated scientist
University of Cambridge Computer Laboratory
William Gates Building, 15 JJ Thomson Avenue
Cambridge CB3 0FD, UK

Email:
anna.korhonen @ cl.cam.ac.uk

News

MPhil students:

 

 

Prospective PhD students:

  • I supervise PhD students at DTAL and the Computer Laboratory. Please take a look at my current research interests and ongoing projects, and read the departmental pages on postgraduate opportunities before contacting me. If you are interested in pursuing a PhD and want to apply for PhD funding from Cambridge, please contact me well in advance (e.g. for entry in October 2016 please get in touch with me by October-November 2015 the latest).

 

Research

My research has principally been in the area of Natural Language Processing and Computational Linguistics. Some current areas of interest include:

 

  • lexical acquisition
  • computational semantics
  • computational models of discourse
  • lexical and domain adaptation
  • statistical and machine learning approaches for NLP
  • text mining
  • multilingual NLP
  • NLP for biomedicine
  • NLP for real-world applications
  • computational models of human language learning
  • computational neuro-linguistics

Biography

I am a Reader in Computational Linguistics at the University of Cambridge. I am based at the Department of Theoretical and Applied Linguistics (DTAL) where I co-direct the Language Technology Laboratory (LTL). I am also affiliated with the Computer Laboratory.

People

Current PhD students:

Current postdocs:

Past PhD students and postdocs:

 

Projects

 

Current projects:

  • LEXICAL - Lexical Acquisition across Languages
    Funded by ERC (2015-2020)
    Working with Roi Reichart, Martha Palmer and Ivan Vulić.
  • ENRICH - Enriched phrasal representations for improved language understanding
    Google Faculty Award (2015-2016)
    Working with Felix Hill and Yoshua Bengio.
  • LION - Literature-based discovery for cancer biology
    Funded by MRC (2015-2018)
    Working with Masashi Narita and Ulla Stenius.
  • PheneBank - automatic extraction and validation of a database of human phenotype-disease associations from the scientific literature.
    Funded by MRC (2015-2018)
    Working with Nigel Collier.
  • CRAB - Using Text Mining to Aid Cancer Risk Assessment.
    Funded by MRC, EU and FSA and FORMAS in Sweden
    (2008-). Working with Ulla Stenius, Johan Hogberg, Ilona Silins, Lin Sun and Yufan Guo.

Past projects:

  • The Education First-Cambridge Learner Corpus of English - a data driven approach to second language learning.
    Funded by EF and Isaac Newton Trust (2010-2015).
    Working with Dora Alexopoulou, Brechtje Post and Jeroen Geertzen.
  • Developing Lexical Resources for Natural Language Processing Applications.
    University Research Fellowship.
    Funded by the Royal Society (2005-2014).
  • PANACEA - Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies.
    Funded by EU FP7 (2010-2012).
    Working with Laura Rimell, and project partners UPF (Spain), CNR-ILC (Italy), ILSP (Greece), Linguatec (Germany), DCU (Ireland)
  • Lexical Acquisition for the Biomedical Domain
    Funded by EPSRC (2009-2012).
    Working with Lin Sun, Diarmuid Ó Séaghdha, and Tom Lippincott.
  • Developing Multilingual Technologies for Automatic Lexical Acquisition.
    Funded by Isaac Newton Trust (2010-2012).
    Working with Tim Van de Cruys and Thierry Poibeau.
  • COMPLEX - Computational Natural Language Processing and the Neuro-Cognition of Language.
    Co-funded by EPSRC, ESRC and MRC (2008-2011).
    Working with with Lorraine K. Tyler, William Marslen-Wilson, and Paula Buttery.
  • Developing Multilingual Technologies for Automatic Lexical Acquisition.
    Funded by British Council (2008-2009).
    Working with Thierry Poibeau.
  • ACLEX - Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications.
    Funded by EPSRC (2005-2008).
    Working with Ted Briscoe and Judita Preiss.
  • Using Automatic Verb Classification to Aid Event Extraction.
    JSPS Postdoctoral Fellowship.
    Funded by the Japan Society for the Promotion of Science (2004-2005)
  • FLYSLIP - Integrating Literature, Experiments and Curation in Drosophila Genomics Research.
    Funded by BBSRC (2004-2007).
    With Ted Briscoe, Simone Teufel, and Rachel Drysdale.

Teaching

In 2015-2016, I am teaching the following courses

Computer Laboratory:

 

DTAL:

 

Activities

 

Current activities:

Recent activities:

Media

Mining the Language of Science. Research Horizons. November 18, 2011.

Computer System Developed to Analyse the Cancer Risk of a Chemical. CNN News. November 21, 2011.

 

Publications

 

2015

Imran Ali, Ilona Silins, Yufan Guo, Imran Ali, Johan Högberg, Ulla Stenius and Anna Korhonen. 2015. Grouping chemicals for health risk assessment: a text mining-based case study of polychlorinated biphenyls (PCBs) . Toxicology Letters. doi:10.1016/j.toxlet.2015.11.003.

Simon Baker, Ilona Silins, Yufan Guo, Imran Ali, Johan Högberg, Ulla Stenius and Anna Korhonen. 2015. Automatic Semantic Classification of Scientific Literature According to the Hallmarks of Cancer. Bioinformatics Oct 9. pii: btv585.
LINK

Felix Hill, Kyunghyun Cho, Anna Korhonen and Joshua Bengio. 2015. Learning to Understand Phrases by Embedding the Dictionary. arxiv preprint arxiv:1504.00548. Accepted for publication in the Transactions of the Association for Computational Linguistics (TACL).
LINK

Felix Hill, Roi Reichart and Anna Korhonen. 2015. SimLex-999: Evaluating Semantic Models with (Genuine) Similarity Estimation. Accepted for publication in Computational Linguistics. arxiv preprint arxiv:1408:3456
LINK
Accompanying dataset

Aline Villavicencio, Thierry Poibeau, Bob Berwick, and Alessandro Lenci. 2015. Proceedings of the EMNLP 2015 Workshop on Cognitive Aspects of Language Learning.
LINK

Jeroen Geertzen, Theodora Alexopoulou, Brechtje Post, and Anna Korhonen. 2015. Native language effects on pronunciation accuracy in L2 English Accepted for the International Symposium on Monolingual and Bilingual Speech 2015 (ISMBS 2015).
LINK

Theodora Alexopoulou, Jeroen Geertzen, Anna Korhonen and Detmar Meurers. 2015. Relativisors and animacy in L2 English. Accepted for the Second Language Research Forum (SLRF) 2015. Atlanta, Georgia.
LINK

Theodora Alexopoulou, Jeroen Geertzen, Anna Korhonen and Detmar Meurers. 2015. Exploring big educational learner corpora for SLA research: perspectives on relative clauses. International Journal of Learner Corpus Research,1(1), 96-129. doi: 10.1075/ijlcr.1.1.04ale.
LINK

Jussi Karlgren, Jimmy Callin, Kevyn Collins-Thompson, Amaru Cuba Gyllensten, Ariel Ekgren, David Jurgens, Anna Korhonen, Fredrik Olsson, Magnus Sahlgren and Hinrich Schütze. 2015. Evaluating Learning Language Representations. To appear in the Proceedings of CLEF 2015 Conference and Labs of the Evaluation Forum.
LINK

Yufan Guo, Roi Reichart and Anna Korhonen. 2015. Unsupervised Declarative Knowledge Induction for Constraint-Based Learning of Information Structure in Scientific Documents. Transactions of the Association for Computational Linguistics, TACL(3):131-143.
LINK

Douwe Kiela, Yufan Guo, Ulla Stenius and Anna Korhonen. 2015. Unsupervised Discovery of Information Structure in Biomedical Documents . Bioinformatics April 1;31(7):1084-92. doi: 10.1093/bioinformatics/btu758.
LINK

Anna Korhonen, Yufan Guo, Meliha Yetisgen-Yildiz, Ulla Stenius, Masashi Narita and Pietro Lio. 2015. Improving Literature-Based Discovery with Text Mining. In Proceedings of CIBB 2015. Cambridge, UK.
LINK

2014

Felix Hill, Roi Reichart and Anna Korhonen. 2014. Multi-Modal Models for Concrete and Abstract Concept Meaning. Transactions of ACL (TACL). Volume 2, 2014.
LINK

Felix Hill and Anna Korhonen. 2014. Learning Abstract Concepts from Multi-Modal Data: Since You Probably Can't See What I Mean. In Proceedings of EMNLP 2014. Doha, Qatar.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2014. Probabilistic distributional semantics with latent variable models. Computational Linguistics 40(3): 587-631.
LINK

Simon Baker, Roi Reichart and Anna Korhonen. 2014. An Unsupervised Model for Instance Level Subcategorization Acquisition. In Proceedings of EMNLP 2014, Doha, Qatar.
LINK

Yufan Guo, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg, Ulla Stenius and Anna Korhonen. 2014. CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment. In Proceedings of Coling 2014 (a demo paper), Dublin, Ireland.
LINK

Douwe Kiela, Felix Hill, Anna Korhonen and Stephen Clark. 2014. Improving multi-modal representations using image dispersion: Why less is sometimes more. In Proceedings of ACL 2014. Baltimore, USA.
LINK

Felix Hill and Anna Korhonen. 2014. Concreteness and subjectivity as dimensions of lexical meaning. In Proceedings of ACL 2014. Baltimore, USA.
LINK

Carolina Scarton, Lin Sun, Karin Kipper-Schuler, Magali Sanches Duran, Martha Palmer and Anna Korhonen. 2014. Verb Clustering for Brazilian Portuguese. 15th International Conference in Computational Linguistics and Intelligent Text Processing. In Lecture Notes in Computer Science. Vol. 8404. Springer. 25-40.
LINK

Xiao Jiang, Yufan Guo, Jeroen Geertzen, Theodora Alexopoulou, Lin Sun and Anna Korhonen. 2014. Native Language Identification Using Large, Longitudinal Data. In Proceedings of LREC. Reykjavik, Iceland.
LINK

Ilona Silins, Anna Korhonen and Ulla Stenius. 2014. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool. Front Pharmacol. 2014 Jun 23;5:145. doi: 10.3389/fphar.
LINK

Ilona Silins, Anna Korhonen, Yufan Guo and Ulla Stenius. 2014. A text mining approach for chemical risk assessment and cancer research. In Proceedings of Eurotox 2014. Edinburgh, UK.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2014. Automatic extraction of property norm-like data from large text corpora. Cognitive Science, 38: 638-682. doi: 10.1111/cogs.12091.
LINK

2013

 

Felix Hill, Anna Korhonen and Christian Bentz. 2013. A quantitative empirical analysis of the abstract/concrete distinction. Cognitive Science. Issue Cognitive Science, 38 (1): 162-177. doi: 0.1111/cogs.12076.
LINK

Ekaterina Shutova, Barry Devereux and Anna Korhonen. 2013. Conceptual Metaphor Theory Meets the Data: A Corpus-based Human Annotation Study. Language Resources and Evaluation.
LINK

Ekaterina Shutova, Jakub Kaplan, Simone Teufel and Anna Korhonen. 2013. A Computational Model of Logical Metonymy. ACM Transactions on Speech and Language Processing. 10(3). 11.
LINK

Jeroen Geertzen, Theodora Alexopoulou and Anna Korhonen. 2013. Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT). In Proceedings of the 31st Second Language Research Forum (SLRF), Carnegie Mellon, Cascadilla Press.
LINK

Roi Reichart and Anna Korhonen. 2013. Improved Lexical Acquisition through DPP-based Verb Clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK

Lin Sun, Diana McCarthy and Anna Korhonen. 2013. Diathesis alternation approximation for verb clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK

Felix Hill, Douwe Kiela and Anna Korhonen. 2013. Concreteness and corpora: A theoretical and practical analysis. In Proceedings of the ACL 2013 Workshop on Cognitive Modelling and Computational Linguistics, Sofia, Bulgaria.
LINK

Felix Hill, Christian Bentz and Anna Korhonen. 2013. Large-scale empirical analyses of concreteness. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2013. Minimally Supervised Learning for Unconstrained Conceptual Property Extraction. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK

Yufan Guo, Roi Reichart and Anna Korhonen. 2013. Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK

Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2013. A Tensor-based Factorization Model of Semantic Compositionality. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK

Yufan Guo, Ilona Silins, Ulla Stenius and Anna Korhonen. 2013. Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review. Bioinformatics (2013) 29 (11): 1440-1447.
LINK

Thomas Lippincott, Laura Rimell, Karin Verspoor and Anna Korhonen. 2013. Approaches to verb subcategorization for biomedicine . Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 212-227.
LINK

Thomas Lippincott, Laura Rimell, Helen L. Johnson, Karin Verspoor and Anna Korhonen. 2013. Acquisition and evaluation of verb subcategorization resources for biomedicine. Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 228-237.
LINK

Aline Villavicencio, Thierry Poibeau, Anna Korhonen and Afra Alishahi. 2013. Cognitive Aspects of Computational Language Acquisition. Springer.
LINK

Thierry Poibeau, Aline Villavicencio, Anna Korhonen and Afra Alishahi. 2013. Computational Modeling as a Methodology for Studying Human Language Learning . In Cognitive Aspects of Computational Language Acquisition. Springer.
LINK

Anna Korhonen. 2013. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora. In the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.
LINK

2012

 

Roi Reichart and Anna Korhonen. 2012. Document and Corpus Level Inference For Unsupervised Learning of Information Structure of Scientific Documents. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Tim Van de Cruys, Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Multi-way Tensor Factorization for Unsupervised Lexical Acquisition. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Ekaterina Shutova, Tim van de Cruys and Anna Korhonen. 2012. Unsupervised Metaphor Paraphrasing Using a Vector Space Model. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Danish Contractor, Yufan Guo and Anna Korhonen. 2012. Using Argumentative Zones for Extractive Summarization of Scientific Articles. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Yufan Guo, Ilona Silins, Roi Reichart and Anna Korhonen. 2012. CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Ekaterina Shutova, Simone Teufel and Anna Korhonen. 2012. Statistical Metaphor Processing. Computational Linguistics, 39(2).
LINK

Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Learning syntactic verb frames using graphical models. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012). Jeju, Korea.
LINK

Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Hogberg and Ulla Stenius. 2012. Exocrine pancreatic carcinogenesis and autotaxin expression. PLoS ONE 7(8): e43209.
LINK

Anna Korhonen, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg and Ulla Stenius. 2012. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS ONE 7(4):e33427.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Modelling selectional preferences in a lexical hierarchy. In Proceedings of the 1st Joint Conference on Lexical and Computational Semantics (*SEM 2012). Montreal, QC.
LINK

Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Merging Lexicons for Higher Precision Subcategorization Frame Acquisition. Proceedings of the LREC 2012 Workshop on Language Resource Merging, Istanbul, Turkey.
LINK

Ilona Silins, Anna Korhonen, Johan Hogberg and Ulla Stenius. 2012. Data and Literature Gathering in Chemical Cancer Risk Assessment. Integrated Environmental Assessment and Management. 2012, Jan 3.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2012. Semi-supervised learning for automatic conceptual property extraction. Proceedings of the NAACL 2012 Cognitive Modeling and Computational Linguistics Workshop.
LINK

Omri Abend, Chris Biemann, Anna Korhonen, Ari Rappoport, Roi Reichart and Anders Sogaard. 2012. Proceedings of the EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP.
LINK

Robert Berwick, Anna Korhonen, Thierry Poibeau and Aline Villavicencio. 2012. Proceedings of the EACL Workshop on Computational Models of Language Acquisition and Loss.
LINK

Theodora Alexopoulou, Jeroen Geertzen, Anna Korhonen and Detmar Meurers. 2012. L1 effects in L2 English relative clauses: evidence from corpus production In Book of Abstracts of the 22nd Annual Conference of the European Second Language Association (EUROSLA-22).
LINK

2011

 

Yufan Guo, Anna Korhonen, Ilona Silins and Ulla Stenius. 2011. Weakly-supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine? Bioinformatics 2011; doi: 10.1093/bioinformatics/btr536.
LINK

Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Exploring subdomain variation in biomedical language. BMC Bioinformatics 12:212.
LINK

Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg and Ulla Stenius. 2011. A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment. BMC Bioinformatics 2011, 12:69.
LINK

Lin Sun and Anna Korhonen. Hierarchical Verb Clustering Using Graph Factorization. 2011. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Yufan Guo, Anna Korhonen and Thierry Poibeau. 2011. A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2011. Latent Vector Weighting for Word Meaning in Context Edinburgh. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Probabilistic models of similarity in syntactic context. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Omri Abend, Anna Korhonen, Ari Rappoport and Roi Reichart. 2011. Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP.
LINK

Barry Devereux, Anna Korhonen, Paula Buttery and Lorraine Tyler. 2011. The role of verb subcategorization frames and selectional preferences in sentence processing: an investigation using corpus-derived measures. Multidisciplinary Workshop on the mental representation of verbal argument structure. Paris, France.
LINK

Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Parsing sentences are unlikely: corpus-based analyses of the neural processing of verbs. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK

Jie Zhuang, Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Lexical and syntactic competition effects in verb processing: evidence from corpus-based statistics. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2011. Automatic extraction of property norm-like features from large text corpora with gold standard, human and semantic-similarity evaluations. AMLaP. Paris, France.
LINK

2010

 

Anna Korhonen. 2010. Automatic Lexical Classification - Bridging Research and Practice. In Philoshophical Transactions A of the Royal Society. 368: 3621-3632.
LINK

Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen. 2010. Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data. Research on Language and Computation.
PDF

Lin Sun, Thierry Poibeau, Anna Korhonen and Cedric Messiant. 2010. Investigating the cross-linguistic potential of VerbNet -style classification. In Proceedings of Coling. Beijing, China.
PDF

 

Ekaterina Shutova, Lin Sun and Anna Korhonen. 2010. Metaphor Identification Using Verb and Noun Clustering. In Proceedings of Coling. Beijing, China.
PDF

Tom Lippincott, Diarmuid O Seaghdha, Lin Sun and Anna Korhonen. 2010. Exploring variation across biomedical subdomains. In Proceedings of Coling. Beijing, China.
PDF

Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. Large-Scale Acquisition of Feature-Based Conceptual Representations from Textual Corpora. In Proceedings of the Annual Meeting of the Cognitive Science Society.
PDF

Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Lin Sun and Ulla Stenius. 2010. Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes. In Proceedings of bio-NLP 2010. Uppsala, Sweden
PDF

Sandeep Kadekar, Ilona Silins, Anna Korhonen, Johan Hogberg, Kristian Dreij, and Ulla Stenius. 2010. Carcinogen-induced inflammation and pancreatic cancer. In Proceedings of the 101th Annual Meeting of the American Association for Cancer Research. Washington, D.C., USA.
PDF

Colin Kelly, Barry Devereux and Anna Korhonen. 2010. Acquiring Human-like Feature-Based Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF

Barry Devereux, Colin Kelly and Anna Korhonen. 2010. Using fMRI Activation to Conceptual Stimuli to Evaluate Methods for Extracting Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF

Brian Murphy, Kai-min Kevin Chang and Anna Korhonen. 2010. Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
LINK

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2010. Annotating the Enron Email Corpus with Number Senses. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). Valletta, Malta.
PDF

Barry Devereux, Colin Kelly, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. The Acquisition of Unconstrained Feature-Based Conceptual Representations from Corpora. The Rovereto Workshop on Concepts, Actions, and Objects: Functional and Neural Perspectives.
PDF

2009

 

Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine Learning and Linguistics. In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.
PDF

Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009. The First Step in the Development of Text Mining Technology for Cancer Risk Assessment: Identifying and Organizing Scientific Evidence in Risk Assessment Literature. In BMC Bioinformatics 2009, 10:303.
PDF

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009. Number Sense Disambiguation. In Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF

Lin Sun and Anna Korhonen. 2009. Improving Verb Clustering with Automatically Acquired Selectional Preferences. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF

Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009. User-Driven Development of Text Mining Resources for Cancer Risk Assessment. In Proceedings of BioNLP. Boulder, Colorado.
PDF

Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009. Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications. North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Boulder, Colorado.
PDF

Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009. Improved Cancer Risk Assessment Using Text Mining. In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF

Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009. Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF

Anna Korhonen. 2009. Automatic Lexical Acquisition: Bridging Research and Practice. In the FLaReNet Workshop. Vienna, Austria.
PDF

2008

 

Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008. The Choice of Features for Classification of Verbs in Biomedical Texts. In Proceedings of Coling 2008. Manchester, UK.
PDF

Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008. A New Challenge for Text Mining: Cancer Risk Assessment. In Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF

Anna Korhonen, Ian Lewin, Ilona Silins, Johan Hogberg, and Ulla Stenius. 2008. CRAB - Cancer Risk Assessment and Biomedical Text Mining. In Proceedings of the European Conference on Computational Biology. Sardinia, Italy.
LINK

Andreas Vlachos, Zoubin Ghahramani and Anna Korhonen. 2008. Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF

Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008. LexSchem: A Large Subcategorization Lexicon for French Verbs. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC). Marrakech, Morocco.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008. A Large-Scale Classification of English Verbs. In the Journal of Language Resources and Evaluation. 42(1). 21-40.
LINK

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Verb Class Discovery from Rich Syntactic Data. In Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics. Haifa, Israel.
PDF

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint Conference on Natural Language Processing. Hyderabad, India.
PDF

2007

Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF

Paula Buttery and Anna Korhonen. 2007. I will shoot your shopping down and you can shoot all my tins - Automatic Lexical Acquisition from the CHILDES Database. In Proceedings of ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

Paula Buttery, Aline Villavicencio and Anna Korhonen. 2007. The proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

2006

 

Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006. Automatic Classification of Verbs in Biomedical Texts. In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF

Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006. Zone Analysis in Biology Articles as a Basis for Information Extraction. In the International Journal of Medical Informatics on Natural Language Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes. In Proceedings of EURALEX. Turin, Italy.
DOC

Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006. A Large Subcategorization Lexicon for Natural Language Processing Applications. In Proceedings of the 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. Extending VerbNet with Novel Verb Classes. In Proceedings of 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF

2005

 

Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005. Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In Computer Speech and Language. 19(4). 365-377.
LINK

Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. Ann Arbor, Michigan.
PDF

Timothy Baldwin, Anna Korhonen and Aline Villavicencio. 2005. Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition. Ann Arbor, Michigan.
PDF

Paula Buttery and Anna Korhonen. 2005. Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech. In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes. Saarbrucken, Germany.
PDF

2004

 

Judita Preiss and Anna Korhonen. 2004. WSD for Subcategorization Acquisition Task Description. In Proceedings of the ACL SENSEVAL-3 Workshop. Barcelona, Spain.
PDF

Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen. 2004. Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing. Barcelona, Spain.
PDF

Anna Korhonen and Ted Briscoe. 2004. Extended Lexical-Semantic Classification of English Verbs. In Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics. Boston, MA.
PDF

2003

Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering Polysemic Subcategorization Frame Distributions Semantically. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 64-71.
PDF

Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization Acquisition using Word Sense Disambiguation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 48-55.
PDF

Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio. 2003. Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment. Sapporo, Japan.
PDF

2002

 

Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet. In Proceedings of the COLING Workshop on Building and Using Semantic Networks. Taipei, Taiwan.
PDF

Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems. In Proceedings of the Sixth Conference on Natural Language Learning. Taipei, Taiwan. 91-97.
PDF

Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition. In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition. Philadelphia, USA. 51-58.
LINK

Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF

Judita Preiss, Anna Korhonen and Ted Briscoe. 2002. Subcategorization Acquisition as an Evaluation Method for WSD. In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF

Anna Korhonen. 2002. Subcategorization Acquisition. PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge.
PDF

2000

 

Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help Subcategorization Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 216-223.
PDF

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Is Hypothesis Testing Useful for Subcategorization Acquisition? Technical Report UCAM-CL-TR-491. Computer Laboratory, University of Cambridge.
PDF

1999

 

Melanie Baljko and Anna Korhonen. 1999. Proceedings of the ACL 1999 Student Session. University of Maryland, Maryland.
PDF

1998

 

Anna Korhonen. 1998. Automatic Extraction of Subcategorization Frames from Corpora - Improving Filtering with Diathesis Alternations. In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing. Saarbrucken, Germany. 49-56.
PDF

Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation in Diathesis Alternations. In Proceedings of the ALC-COLING 98. Montreal, Canada. 1493-1495.
PDF

1997

Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments. '97 Sparkle WP5 Deliverable.
PDF

Anna Korhonen. 1997. Acquiring Subcategorization from Textual Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS

© Anna Korhonen. Last updated: July 2015