- Publications
- Media
- Activities
- Teaching
- Projects
- People
- Biography
- Research
- News
Anna Korhonen
Reader in Computational Linguistics
Co-Director of the Language Technology Lab
Department of Theoretical and Applied Linguistics (DTAL)
Faculty of English Building, 9 West Road
Cambridge CB3 9DB, UK
Office: TR-12
Phone: (+44) 1223 767 389
Affiliated scientist
University of Cambridge Computer Laboratory
William Gates Building, 15 JJ Thomson Avenue
Cambridge CB3 0FD, UK
Email:
anna.korhonen @ cl.cam.ac.uk
News
MPhil students:
- MPhil students, please see my 2015-16 ACS project proposals on Biomedical Information Processing.
- During Lent 2015-16, I will be teaching Biomedical Information Processing with Pietro Lio'. This interdisciplinary course is part of the MPhil in Advanced Computer Science but we also welcome attendees from other departments in Cambridge. Please contact us if you are interested in attending!
- MPhil students in Cambridge have now the opportunity to follow the Language Sciences Interdisciplinary Programme (LSIP). I am the LSIP coordinator at DTAL. If you are an MPhil student and wish to take part in this programme, please get in touch with me.
Prospective PhD students:
- I supervise PhD students at DTAL and the Computer Laboratory. Please take a look at my current research interests and ongoing projects, and read the departmental pages on postgraduate opportunities before contacting me. If you are interested in pursuing a PhD and want to apply for PhD funding from Cambridge, please contact me well in advance (e.g. for entry in October 2016 please get in touch with me by October-November 2015 the latest).
Research
My research has principally been in the area of Natural Language Processing and Computational Linguistics. Some current areas of interest include:
- lexical acquisition
- computational semantics
- computational models of discourse
- lexical and domain adaptation
- statistical and machine learning approaches for NLP
- text mining
- multilingual NLP
- NLP for biomedicine
- NLP for real-world applications
- computational models of human language learning
- computational neuro-linguistics
Biography
I am a Reader in Computational Linguistics at the University of Cambridge. I am based at the Department of Theoretical and Applied Linguistics (DTAL) where I co-direct the Language Technology Laboratory (LTL). I am also affiliated with the Computer Laboratory.
- Royal Society University Research Fellow, the Computer Laboratory and DTAL, University of Cambridge (2005-2014)
- JSPS Postdoctoral Fellow, National Institute of Informatics, Tokyo, Japan (2004-2005)
- Visiting researcher, University of Pennsylvania, Department of Computer and Information Science (2004)
- Post-doctoral researcher, University of Cambridge Computer Laboratory (2001-2003)
- PhD in Computer Science, University of Cambridge Computer Laboratory, Trinity Hall) (1998-2001)
- MPhil in Computer Speech and Language Processing, Department of Engineering, University of Cambridge (1996-1997)
- MA in Theoretical Linguistics, University of Reading, School of Linguistics and Applied Language Studies (1994-1995)
People
Current PhD students:
Current postdocs:
Past PhD students and postdocs:
- Yufan Guo
- Colin Kelly
- Ian Lewin
- Thomas Lippincott
- Diarmuid Ó Séaghdha
- Roi Reichart
- Laura Rimell
- Ekaterina Shutova
- Ilona Silins
- Lin Sun
- Tim Van de Cruys
Projects
Current projects:
- LEXICAL - Lexical Acquisition across Languages
Funded by ERC (2015-2020)
Working with Roi Reichart, Martha Palmer and Ivan Vulić. - ENRICH - Enriched phrasal representations for improved language understanding
Google Faculty Award (2015-2016)
Working with Felix Hill and Yoshua Bengio. - LION - Literature-based discovery for cancer biology
Funded by MRC (2015-2018)
Working with Masashi Narita and Ulla Stenius. - PheneBank - automatic extraction and validation of a database of human phenotype-disease associations from the scientific literature.
Funded by MRC (2015-2018)
Working with Nigel Collier. - CRAB - Using Text Mining to Aid Cancer Risk Assessment.
Funded by MRC, EU and FSA and FORMAS in Sweden
(2008-). Working with Ulla Stenius, Johan Hogberg, Ilona Silins, Lin Sun and Yufan Guo.
Past projects:
- The Education First-Cambridge Learner Corpus of English - a data driven approach to second language learning.
Funded by EF and Isaac Newton Trust (2010-2015).
Working with Dora Alexopoulou, Brechtje Post and Jeroen Geertzen. - Developing Lexical Resources for Natural Language Processing Applications.
University Research Fellowship.
Funded by the Royal Society (2005-2014). - PANACEA - Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies.
Funded by EU FP7 (2010-2012).
Working with Laura Rimell, and project partners UPF (Spain), CNR-ILC (Italy), ILSP (Greece), Linguatec (Germany), DCU (Ireland) - Lexical Acquisition for the Biomedical Domain
Funded by EPSRC (2009-2012).
Working with Lin Sun, Diarmuid Ó Séaghdha, and Tom Lippincott. - Developing Multilingual Technologies for Automatic Lexical Acquisition.
Funded by Isaac Newton Trust (2010-2012).
Working with Tim Van de Cruys and Thierry Poibeau. - COMPLEX - Computational Natural Language Processing and the Neuro-Cognition of Language.
Co-funded by EPSRC, ESRC and MRC (2008-2011).
Working with with Lorraine K. Tyler, William Marslen-Wilson, and Paula Buttery. - Developing Multilingual Technologies for Automatic Lexical Acquisition.
Funded by British Council (2008-2009).
Working with Thierry Poibeau. - ACLEX - Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications.
Funded by EPSRC (2005-2008).
Working with Ted Briscoe and Judita Preiss. - Using Automatic Verb Classification to Aid Event Extraction.
JSPS Postdoctoral Fellowship.
Funded by the Japan Society for the Promotion of Science (2004-2005) - FLYSLIP - Integrating Literature, Experiments and Curation in Drosophila Genomics Research.
Funded by BBSRC (2004-2007).
With Ted Briscoe, Simone Teufel, and Rachel Drysdale.
Teaching
In 2015-2016, I am teaching the following courses
Computer Laboratory:
DTAL:
- Computational Linguistics Seminar, MPhil in Theoretical and Applied Linguistics
Activities
Current activities:
- Editorial Board member (an action editor) for Transactions of the Association for Computational Linguistics (TACL) (2015-)
- Editorial Board member for Linguistic Issues in Language Technology (2014-)
- Editorial Board member for Computational Linguistics (2011-2013)
- Secretary of SIGLEX (2013-2016)
- Advisory Board member of SIGDAT (2013-)
- A member of Association for Computational Linguistics
- A member of Cambridge Language Sciences
- A member of Cambridge Neuroscience
- A member of Cambridge Cancer Centre
- A member of PublicHealth@Cambridge
- A member of Cambridge Big Data
Recent activities:
- Co-chair for the EMNLP 2015 Workshop on Cognitive Aspects of Language Learning with Aline Villavicencio, Thierry Poibeau, Bob Berwick, and Alessandro Lenci
- Area Chair for *SEM 2014
- Program Co-Chair for EMNLP 2013 with Tim Baldwin
- Publicity Co-Chair for ACL 2013
- Shared Task Co-Chair for *SEM 2013 with Malvina Nissim
- Member of the Executive Board for SIGLEX (2010-2013)
- Area Chair for EMNLP-CoNLL-2012
- Co-chair for the EACL 2012 ROBUS-UNSUP Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP with Roi Reichart, Omri Abend, Ari Rappoport, Anders Soegaard, and Chris Biemann
- Co-chair for the EACL 2012 Workshop on Computational Models of Language Acquisition and Loss with Aline Villavicencio, Thierry Poibeau, and Bob Berwick
- Co-chair for the The EMNLP 2011 Workshop on Unsupervised Learning in NLP with Roi Reichart, Omri Abend and Ari Rappoport
- Co-chair for the The NIPS 2011 Workshop MLINI - Machine Learning and Interpretation in Neuroimaging
- Co-chair for the NAACL-HLT-2010 Workshop on Computational Neurolinguistics with Brian Murphy and Kai-min Kevin Chang
- Co-chair for the Interdisciplinary Workshop on Verbs - The Identification and Representation of Verb Features, Scuola Normale Superiore, Pisa, November 4-5, 2010 with Sabine Schulte im Walde, Aline Villavicencio, Alessandro Lenci, Alissa Melinger, and Pier Marco Bertinetto
- Area Chair for EACL-2009
- Co-organizer for the Nordic Conference in Computational Linguistics 2009
- Co-chair for the ACL-2007 Workshop on Cognitive Aspects of Computational Language Acquisition with Paula Buttery and Aline Villavicencio
- Co-organizer for the ESSLLI-2006 Course in Data-driven Methods for Acquiring Linguistic Information with Tim Baldwin, Aline Villavicencio and Valia Kordoni
- Co-organizer for the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition with Tim Baldwin and Aline Villavicencio
- Co-organizer for the ACL-2004 Workshop on Multiword Expressions: Integrating Processing with Takaaki Tanaka, Aline Villavicencio and Francis Bond
- Co-organizer for SENSEVAL-3 task with Judita Preiss
- Co-editor for the Computer Speech and Language Special Issue on Multiword Expressions with Aline Villavicencio, Francis Bond and Diana McCarthy
- Co-organizer for the ACL-2003 workshop on Multiword Expressions: Analysis, Acquisition and Treatment with Francis Bond, Diana McCarthy and Aline Villavicencio
Media
Mining the Language of Science. Research Horizons. November 18, 2011.
Computer System Developed to Analyse the Cancer Risk of a Chemical. CNN News. November 21, 2011.
Publications
2015
Imran Ali, Ilona Silins, Yufan Guo, Imran Ali, Johan Högberg, Ulla Stenius and Anna Korhonen. 2015. Grouping chemicals for health risk assessment: a text mining-based case study of polychlorinated biphenyls (PCBs) . Toxicology Letters. doi:10.1016/j.toxlet.2015.11.003.
Simon Baker, Ilona Silins, Yufan Guo, Imran Ali, Johan Högberg, Ulla Stenius and Anna Korhonen. 2015. Automatic Semantic Classification of Scientific Literature According to the Hallmarks of Cancer. Bioinformatics Oct 9. pii: btv585.
LINK
Felix Hill, Kyunghyun Cho, Anna Korhonen and Joshua Bengio. 2015. Learning to Understand Phrases by Embedding the Dictionary. arxiv preprint arxiv:1504.00548. Accepted for publication in the Transactions of the Association for Computational Linguistics (TACL).
LINK
Felix Hill, Roi Reichart and Anna Korhonen. 2015. SimLex-999: Evaluating Semantic Models with (Genuine) Similarity Estimation. Accepted for publication in Computational Linguistics. arxiv preprint arxiv:1408:3456
LINK
Accompanying dataset
Aline Villavicencio, Thierry Poibeau, Bob Berwick, and Alessandro Lenci. 2015. Proceedings of the EMNLP 2015 Workshop on Cognitive Aspects of Language Learning.
LINK
Jeroen Geertzen, Theodora Alexopoulou, Brechtje Post, and Anna Korhonen. 2015. Native language effects on pronunciation accuracy in L2 English Accepted for the International Symposium on Monolingual and Bilingual Speech 2015 (ISMBS 2015).
LINK
Theodora Alexopoulou, Jeroen Geertzen, Anna Korhonen and Detmar Meurers. 2015. Relativisors and animacy in L2 English. Accepted for the Second Language Research Forum (SLRF) 2015. Atlanta, Georgia.
LINK
Theodora Alexopoulou, Jeroen Geertzen, Anna Korhonen and Detmar Meurers. 2015. Exploring big educational learner corpora for SLA research: perspectives on relative clauses. International Journal of Learner Corpus Research,1(1), 96-129. doi: 10.1075/ijlcr.1.1.04ale.
LINK
Jussi Karlgren, Jimmy Callin, Kevyn Collins-Thompson, Amaru Cuba Gyllensten, Ariel Ekgren, David Jurgens, Anna Korhonen, Fredrik Olsson, Magnus Sahlgren and Hinrich Schütze. 2015. Evaluating Learning Language Representations. To appear in the Proceedings of CLEF 2015 Conference and Labs of the Evaluation Forum.
LINK
Yufan Guo, Roi Reichart and Anna Korhonen. 2015. Unsupervised Declarative Knowledge Induction for Constraint-Based Learning of Information Structure in Scientific Documents. Transactions of the Association for Computational Linguistics, TACL(3):131-143.
LINK
Douwe Kiela, Yufan Guo, Ulla Stenius and Anna Korhonen. 2015. Unsupervised Discovery of Information Structure in Biomedical Documents . Bioinformatics April 1;31(7):1084-92. doi: 10.1093/bioinformatics/btu758.
LINK
Anna Korhonen, Yufan Guo, Meliha Yetisgen-Yildiz, Ulla Stenius, Masashi Narita and Pietro Lio. 2015. Improving Literature-Based Discovery with Text Mining. In Proceedings of CIBB 2015. Cambridge, UK.
LINK
2014
Felix Hill, Roi Reichart and Anna Korhonen. 2014. Multi-Modal Models for Concrete and Abstract Concept Meaning. Transactions of ACL (TACL). Volume 2, 2014.
LINK
Felix Hill and Anna Korhonen. 2014. Learning Abstract Concepts from Multi-Modal Data: Since You Probably Can't See What I Mean. In Proceedings of EMNLP 2014. Doha, Qatar.
LINK
Diarmuid Ó Séaghdha and Anna Korhonen. 2014. Probabilistic distributional semantics with latent variable models. Computational Linguistics 40(3): 587-631.
LINK
Simon Baker, Roi Reichart and Anna Korhonen. 2014. An Unsupervised Model for Instance Level Subcategorization Acquisition. In Proceedings of EMNLP 2014, Doha, Qatar.
LINK
Yufan Guo, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg, Ulla Stenius and Anna Korhonen. 2014. CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment. In Proceedings of Coling 2014 (a demo paper), Dublin, Ireland.
LINK
Douwe Kiela, Felix Hill, Anna Korhonen and Stephen Clark. 2014. Improving multi-modal representations using image dispersion: Why less is sometimes more. In Proceedings of ACL 2014. Baltimore, USA.
LINK
Felix Hill and Anna Korhonen. 2014. Concreteness and subjectivity as dimensions of lexical meaning. In Proceedings of ACL 2014. Baltimore, USA.
LINK
Carolina Scarton, Lin Sun, Karin Kipper-Schuler, Magali Sanches Duran, Martha Palmer and Anna Korhonen. 2014. Verb Clustering for Brazilian Portuguese. 15th International Conference in Computational Linguistics and Intelligent Text Processing. In Lecture Notes in Computer Science. Vol. 8404. Springer. 25-40.
LINK
Xiao Jiang, Yufan Guo, Jeroen Geertzen, Theodora Alexopoulou, Lin Sun and Anna Korhonen. 2014. Native Language Identification Using Large, Longitudinal Data. In Proceedings of LREC. Reykjavik, Iceland.
LINK
Ilona Silins, Anna Korhonen and Ulla Stenius. 2014. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool. Front Pharmacol. 2014 Jun 23;5:145. doi: 10.3389/fphar.
LINK
Ilona Silins, Anna Korhonen, Yufan Guo and Ulla Stenius. 2014. A text mining approach for chemical risk assessment and cancer research. In Proceedings of Eurotox 2014. Edinburgh, UK.
LINK
Colin Kelly, Barry Devereux and Anna Korhonen. 2014. Automatic extraction of property norm-like data from large text corpora. Cognitive Science, 38: 638-682. doi: 10.1111/cogs.12091.
LINK
2013
Felix Hill, Anna Korhonen and Christian Bentz. 2013. A quantitative empirical analysis of the abstract/concrete distinction. Cognitive Science. Issue Cognitive Science, 38 (1): 162-177. doi: 0.1111/cogs.12076.
LINK
Ekaterina Shutova, Barry Devereux and Anna Korhonen. 2013. Conceptual Metaphor Theory Meets the Data: A Corpus-based Human Annotation Study. Language Resources and Evaluation.
LINK
Ekaterina Shutova, Jakub Kaplan, Simone Teufel and Anna Korhonen. 2013. A Computational Model of Logical Metonymy. ACM Transactions on Speech and Language Processing. 10(3). 11.
LINK
Jeroen Geertzen, Theodora Alexopoulou and Anna Korhonen. 2013. Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT). In Proceedings of the 31st Second Language Research Forum (SLRF), Carnegie Mellon, Cascadilla Press.
LINK
Roi Reichart and Anna Korhonen. 2013. Improved Lexical Acquisition through DPP-based Verb Clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK
Lin Sun, Diana McCarthy and Anna Korhonen. 2013. Diathesis alternation approximation for verb clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK
Felix Hill, Douwe Kiela and Anna Korhonen. 2013. Concreteness and corpora: A theoretical and practical analysis. In Proceedings of the ACL 2013 Workshop on Cognitive Modelling and Computational Linguistics, Sofia, Bulgaria.
LINK
Felix Hill, Christian Bentz and Anna Korhonen. 2013. Large-scale empirical analyses of concreteness. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK
Colin Kelly, Barry Devereux and Anna Korhonen. 2013. Minimally Supervised Learning for Unconstrained Conceptual Property Extraction. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK
Yufan Guo, Roi Reichart and Anna Korhonen. 2013. Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK
Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2013. A Tensor-based Factorization Model of Semantic Compositionality. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK
Yufan Guo, Ilona Silins, Ulla Stenius and Anna Korhonen. 2013. Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review. Bioinformatics (2013) 29 (11): 1440-1447.
LINK
Thomas Lippincott, Laura Rimell, Karin Verspoor and Anna Korhonen. 2013. Approaches to verb subcategorization for biomedicine . Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 212-227.
LINK
Thomas Lippincott, Laura Rimell, Helen L. Johnson, Karin Verspoor and Anna Korhonen. 2013. Acquisition and evaluation of verb subcategorization resources for biomedicine. Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 228-237.
LINK
Aline Villavicencio, Thierry Poibeau, Anna Korhonen and Afra Alishahi. 2013. Cognitive Aspects of Computational Language Acquisition. Springer.
LINK
Thierry Poibeau, Aline Villavicencio, Anna Korhonen and Afra Alishahi. 2013. Computational Modeling as a Methodology for Studying Human Language Learning . In Cognitive Aspects of Computational Language Acquisition. Springer.
LINK
Anna Korhonen. 2013. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora. In the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.
LINK
2012
Roi Reichart and Anna Korhonen. 2012. Document and Corpus Level Inference For Unsupervised Learning of Information Structure of Scientific Documents. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Tim Van de Cruys, Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Multi-way Tensor Factorization for Unsupervised Lexical Acquisition. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Ekaterina Shutova, Tim van de Cruys and Anna Korhonen. 2012. Unsupervised Metaphor Paraphrasing Using a Vector Space Model. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Danish Contractor, Yufan Guo and Anna Korhonen. 2012. Using Argumentative Zones for Extractive Summarization of Scientific Articles. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Yufan Guo, Ilona Silins, Roi Reichart and Anna Korhonen. 2012. CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Ekaterina Shutova, Simone Teufel and Anna Korhonen. 2012. Statistical Metaphor Processing. Computational Linguistics, 39(2).
LINK
Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Learning syntactic verb frames using graphical models. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012). Jeju, Korea.
LINK
Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Hogberg and Ulla Stenius. 2012. Exocrine pancreatic carcinogenesis and autotaxin expression. PLoS ONE 7(8): e43209.
LINK
Anna Korhonen, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg and Ulla Stenius. 2012. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS ONE 7(4):e33427.
LINK
Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Modelling selectional preferences in a lexical hierarchy. In Proceedings of the 1st Joint Conference on Lexical and Computational Semantics (*SEM 2012). Montreal, QC.
LINK
Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Merging Lexicons for Higher Precision Subcategorization Frame Acquisition. Proceedings of the LREC 2012 Workshop on Language Resource Merging, Istanbul, Turkey.
LINK
Ilona Silins, Anna Korhonen, Johan Hogberg and Ulla Stenius. 2012. Data and Literature Gathering in Chemical Cancer Risk Assessment. Integrated Environmental Assessment and Management. 2012, Jan 3.
LINK
Colin Kelly, Barry Devereux and Anna Korhonen. 2012. Semi-supervised learning for automatic conceptual property extraction. Proceedings of the NAACL 2012 Cognitive Modeling and Computational Linguistics Workshop.
LINK
Omri Abend, Chris Biemann, Anna Korhonen, Ari Rappoport, Roi Reichart and Anders Sogaard. 2012. Proceedings of the EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP.
LINK
Robert Berwick, Anna Korhonen, Thierry Poibeau and Aline Villavicencio. 2012. Proceedings of the EACL Workshop on Computational Models of Language Acquisition and Loss.
LINK
Theodora Alexopoulou, Jeroen Geertzen, Anna Korhonen and Detmar Meurers. 2012. L1 effects in L2 English relative clauses: evidence from corpus production In Book of Abstracts of the 22nd Annual Conference of the European Second Language Association (EUROSLA-22).
LINK
2011
Yufan Guo, Anna Korhonen, Ilona Silins and Ulla Stenius. 2011. Weakly-supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine? Bioinformatics 2011; doi: 10.1093/bioinformatics/btr536.
LINK
Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Exploring subdomain variation in biomedical language. BMC Bioinformatics 12:212.
LINK
Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg and Ulla Stenius. 2011. A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment. BMC Bioinformatics 2011, 12:69.
LINK
Lin Sun and Anna Korhonen. Hierarchical Verb Clustering Using Graph Factorization. 2011. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Yufan Guo, Anna Korhonen and Thierry Poibeau. 2011. A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2011. Latent Vector Weighting for Word Meaning in Context Edinburgh. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Probabilistic models of similarity in syntactic context. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Omri Abend, Anna Korhonen, Ari Rappoport and Roi Reichart. 2011. Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP.
LINK
Barry Devereux, Anna Korhonen, Paula Buttery and Lorraine Tyler. 2011. The role of verb subcategorization frames and selectional preferences in sentence processing: an investigation using corpus-derived measures. Multidisciplinary Workshop on the mental representation of verbal argument structure. Paris, France.
LINK
Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Parsing sentences are unlikely: corpus-based analyses of the neural processing of verbs. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK
Jie Zhuang, Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Lexical and syntactic competition effects in verb processing: evidence from corpus-based statistics. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK
Colin Kelly, Barry Devereux and Anna Korhonen. 2011. Automatic extraction of property norm-like features from large text corpora with gold standard, human and semantic-similarity evaluations. AMLaP. Paris, France.
LINK
2010
Anna Korhonen. 2010. Automatic Lexical Classification - Bridging Research and Practice. In Philoshophical Transactions A of the Royal Society. 368: 3621-3632.
LINK
Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen. 2010. Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data. Research on Language and Computation.
PDF
Lin Sun, Thierry Poibeau, Anna Korhonen and Cedric Messiant. 2010. Investigating the cross-linguistic potential of VerbNet -style classification. In Proceedings of Coling. Beijing, China.
PDF
Ekaterina Shutova, Lin Sun and Anna Korhonen. 2010. Metaphor Identification Using Verb and Noun Clustering. In Proceedings of Coling. Beijing, China.
PDF
Tom Lippincott, Diarmuid O Seaghdha, Lin Sun and Anna Korhonen. 2010. Exploring variation across biomedical subdomains. In Proceedings of Coling. Beijing, China.
PDF
Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. Large-Scale Acquisition of Feature-Based Conceptual Representations from Textual Corpora. In Proceedings of the Annual Meeting of the Cognitive Science Society.
PDF
Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Lin Sun and Ulla Stenius. 2010. Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes. In Proceedings of bio-NLP 2010. Uppsala, Sweden
PDF
Sandeep Kadekar, Ilona Silins, Anna Korhonen, Johan Hogberg, Kristian Dreij, and Ulla Stenius. 2010. Carcinogen-induced inflammation and pancreatic cancer. In Proceedings of the 101th Annual Meeting of the American Association for Cancer Research. Washington, D.C., USA.
PDF
Colin Kelly, Barry Devereux and Anna Korhonen. 2010. Acquiring Human-like Feature-Based Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF
Barry Devereux, Colin Kelly and Anna Korhonen. 2010. Using fMRI Activation to Conceptual Stimuli to Evaluate Methods for Extracting Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF
Brian Murphy, Kai-min Kevin Chang and Anna Korhonen. 2010. Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
LINK
Stuart Moore, Anna Korhonen and Sabine Buchholz. 2010. Annotating the Enron Email Corpus with Number Senses. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). Valletta, Malta.
PDF
Barry Devereux, Colin Kelly, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. The Acquisition of Unconstrained Feature-Based Conceptual Representations from Corpora. The Rovereto Workshop on Concepts, Actions, and Objects: Functional and Neural Perspectives.
PDF
2009
Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine Learning and Linguistics. In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.
PDF
Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009. The First Step in the Development of Text Mining Technology for Cancer Risk Assessment: Identifying and Organizing Scientific Evidence in Risk Assessment Literature. In BMC Bioinformatics 2009, 10:303.
PDF
Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009. Number Sense Disambiguation. In Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF
Lin Sun and Anna Korhonen. 2009. Improving Verb Clustering with Automatically Acquired Selectional Preferences. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF
Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009. User-Driven Development of Text Mining Resources for Cancer Risk Assessment. In Proceedings of BioNLP. Boulder, Colorado.
PDF
Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009. Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications. North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Boulder, Colorado.
PDF
Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009. Improved Cancer Risk Assessment Using Text Mining. In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF
Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009. Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF
Anna Korhonen. 2009. Automatic Lexical Acquisition: Bridging Research and Practice. In the FLaReNet Workshop. Vienna, Austria.
PDF
2008
Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008. The Choice of Features for Classification of Verbs in Biomedical Texts. In Proceedings of Coling 2008. Manchester, UK.
PDF
Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008. A New Challenge for Text Mining: Cancer Risk Assessment. In Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF
Anna Korhonen, Ian Lewin, Ilona Silins, Johan Hogberg, and Ulla Stenius. 2008. CRAB - Cancer Risk Assessment and Biomedical Text Mining. In Proceedings of the European Conference on Computational Biology. Sardinia, Italy.
LINK
Andreas Vlachos, Zoubin Ghahramani and Anna Korhonen. 2008. Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF
Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008. LexSchem: A Large Subcategorization Lexicon for French Verbs. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC). Marrakech, Morocco.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008. A Large-Scale Classification of English Verbs. In the Journal of Language Resources and Evaluation. 42(1). 21-40.
LINK
Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Verb Class Discovery from Rich Syntactic Data. In Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics. Haifa, Israel.
PDF
Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint Conference on Natural Language Processing. Hyderabad, India.
PDF
2007
Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF
Paula Buttery and Anna Korhonen. 2007. I will shoot your shopping down and you can shoot all my tins - Automatic Lexical Acquisition from the CHILDES Database. In Proceedings of ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF
Paula Buttery, Aline Villavicencio and Anna Korhonen. 2007. The proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF
2006
Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006. Automatic Classification of Verbs in Biomedical Texts. In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF
Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006. Zone Analysis in Biology Articles as a Basis for Information Extraction. In the International Journal of Medical Informatics on Natural Language Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes. In Proceedings of EURALEX. Turin, Italy.
DOC
Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006. A Large Subcategorization Lexicon for Natural Language Processing Applications. In Proceedings of the 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. Extending VerbNet with Novel Verb Classes. In Proceedings of 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF
2005
Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005. Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In Computer Speech and Language. 19(4). 365-377.
LINK
Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. Ann Arbor, Michigan.
PDF
Timothy Baldwin, Anna Korhonen and Aline Villavicencio. 2005. Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition. Ann Arbor, Michigan.
PDF
Paula Buttery and Anna Korhonen. 2005. Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech. In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes. Saarbrucken, Germany.
PDF
2004
Judita Preiss and Anna Korhonen. 2004. WSD for Subcategorization Acquisition Task Description. In Proceedings of the ACL SENSEVAL-3 Workshop. Barcelona, Spain.
PDF
Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen. 2004. Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing. Barcelona, Spain.
PDF
Anna Korhonen and Ted Briscoe. 2004. Extended Lexical-Semantic Classification of English Verbs. In Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics. Boston, MA.
PDF
2003
Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering Polysemic Subcategorization Frame Distributions Semantically. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 64-71.
PDF
Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization Acquisition using Word Sense Disambiguation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 48-55.
PDF
Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio. 2003. Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment. Sapporo, Japan.
PDF
2002
Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet. In Proceedings of the COLING Workshop on Building and Using Semantic Networks. Taipei, Taiwan.
PDF
Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems. In Proceedings of the Sixth Conference on Natural Language Learning. Taipei, Taiwan. 91-97.
PDF
Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition. In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition. Philadelphia, USA. 51-58.
LINK
Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF
Judita Preiss, Anna Korhonen and Ted Briscoe. 2002. Subcategorization Acquisition as an Evaluation Method for WSD. In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF
Anna Korhonen. 2002. Subcategorization Acquisition. PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge.
PDF
2000
Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help Subcategorization Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 216-223.
PDF
Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF
Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Is Hypothesis Testing Useful for Subcategorization Acquisition? Technical Report UCAM-CL-TR-491. Computer Laboratory, University of Cambridge.
PDF
1999
Melanie Baljko and Anna Korhonen. 1999. Proceedings of the ACL 1999 Student Session. University of Maryland, Maryland.
PDF
1998
Anna Korhonen. 1998. Automatic Extraction of Subcategorization Frames from Corpora - Improving Filtering with Diathesis Alternations. In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing. Saarbrucken, Germany. 49-56.
PDF
Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation in Diathesis Alternations. In Proceedings of the ALC-COLING 98. Montreal, Canada. 1493-1495.
PDF
1997
Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments. '97 Sparkle WP5 Deliverable.
PDF
Anna Korhonen. 1997. Acquiring Subcategorization from Textual Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS
© Anna Korhonen. Last updated: July 2015