Group leader „Natural Language Processing“ at HITS “Honorarprofessor” at the Computational Linguistics Department at Heidelberg University
Research Interest
Linguistics:
- Text and Dialogue
- Pragmatics
Computational Linguistics:
- Anaphora and Coreference Resolution
- Generation of Referring Expressions
- Modeling Local (and maybe also Global) Coherence
- Discourse and Dialogue Structure (though I don’t believe in it)
Natural Language Processing:
- Automatic Summarization
- Concept Disambiguation, Entity Linking, Cross-document Coreference Resolution
- Information Extraction
- Knowledge Acquisition, Ontology Learning
- Natural Language Generation Systems
Curriculum Vitae
2017-2018 Scientific Director at HITS
2017/18 Program Co-Chair Workshops on Ethics in NLP at ACL
2015 PC Co-Chair of the ACL’s flagship conference ACL-IJCNLP ’15 in Beijing, China, July 26-31, 2015
Since 2010 “Honorarprofessor” in the Computational Linguistics Department at the University of Heidelberg
Since 2003 Member of the EML Research, now HITS
2000-2003 Member of the EML European Media Laboratory
1997-1999 PostDoc at the Institute for Research in Cognitive Science at the University of Pennsylvania, Philadelphia, US
1996 PhD at the University of Freiburg, Germany
2023
2022
- Yu J, Khosla S, Manuvinakurike R, Levin L, Ng V, Poesio M, Strube M, Rosé C (2022). Proceedings of the CODI-CRAC 2022 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, Proceedings of the CODI-CRAC 2022 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, Gyeongju, Republic of Korea, October 2022 1590
- Yu J, Khosla S, Manuvinakurike R, Levin L, Ng V, Poesio M, Strube M, Rosé C (2022). The CODI-CRAC 2022 shared task on anaphora, bridging, and discourse deixis in dialogue, Proceedings of the CODI-CRAC 2022 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, Gyeongju, Repbulic of Korea, October 2022, pp. 1–14 1591
- Braud C, Hardmeier C, Li JJ, Loaciga S, Strube M, Zeldes A (2022). Proceedings of the 3rd Workshop on Computational Approaches to Discourse, Proceedings of the 3rd Workshop on Computational Approaches to Discourse, Gyeongju, Republic of Korea, October 2022 1589
- Chai H, Moosavi NS, Gurevych I, Strube M (2022). Evaluating coreference resolvers on community-based question answering: From rule-based to state of the art, Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 16–17 Octrober, 2022, pp.61–73 1592
- Liang S, Kades K, Fink M, Full P, Weber T, Kleesiek J, Strube M, Maier-Hein K (2022). Fine-tuning BERT Models for Summarizing German Radiology Findings, Proceedings of the 4th Clinical Natural Language Processing Workshop, Seattle, Washington, July 2022 1498
- Chai H, Strube M (2022). Incorporating Centering Theory into Neural Coreference Resolution, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Seattle, Washington, July 2022 1496
- Jeon S, Strube M (2022). Entity-based Neural Local Coherence Modeling, In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL). Dublin, Ireland, May 2022 1471
2021
- López F, Pozzetti B, Trettel S, Strube M, Wienhard A (2021). Vector-valued Distance and Gyrocalculus on the Space of Symmetric Positive Definite Matrices., In Proceedings of the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Online, December 6-12, 2021. 1287
- Fatima M, Strube M (2021). A Novel Wikipedia based Dataset for Monolingual and Cross-lingual Summarization, In Proceedings of the Third Workshop on New Frontiers in Summarization. Punta Cana, Dominican Republic, November 10, 2021. 1288
- Jeon S, Strube M (2021). Countering the Influence of Essay Length in Neural Essay Scoring., In Proceedings of the Second Workshop on Sustainable NLP. Punta Cana, Domincan Republic, November 10, 2021. 1289
- Khosla S, Yu J, Manuvinakurike R, Ng V, Poesio M, Strube M, Rosé C (2021). The CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, In Proceedings of the CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue. Punta Cana, Domincan Republic, November 10, 2021. 1500
- Braud C, Hardmeier C, Li JJ, Louis A, Strube M, Zeldes A (2021). Proceedings of the 2nd Workshop on Computational Approaches to Discourse, Punta Cana, Dominican Repbulich and Online 1501
- Strube M (2021). Computerwissenschaften und The Circle — The Circle und Computerwissenschaften, In Kempter, Klaus und Martina Engelbrecht (Eds.): Krise(n) der Moderne. Über Literatur und Zeitdiagnostik. Universitätsverlag Winter, Heidelberg, Germany. 451-460. 1167
- López F, Pozzetti B, Trettel S, Strube M, Wienhard A (2021). Symmetric Spaces for Graph Embeddings: A Finsler-Riemannian Approach, In Proceedings of the 38th International Conference on Machine Learning 1266
- Lopez F, Pozzetti B, Trettel S, Strube M, Wienhard A (2021). Symmetric Spaces for Graph Embeddings: A Finsler-Riemannian Approach, In Proceedings of the 38th International Conference on Machine Learning, vol. 139 of Proceedings of Machine Learning Research, pp. 7090–7101, Eds: Meila, Marina and Zhang, Tong, PMLR 1463
2020
- Jeon S, Strube M (2020). Incremental Neural Lexical Coherence Modeling, In Proceedings of the 28th International Conference on Computational Linguistics (COLING), Online, December 2020, pp. 6752–6758 1156
- Braud C, Hardmeier C, Li JJ, Louis A, Strube M (2020). Proceedings of the First Workshop on Computational Approaches to Discourse, Online 1502
- Mathews K, Strube M (2020). A large harvested corpus of location metonymy, In Proceedings of the 12th International Conference on Language Resources and Evaluation, Marseille, France, 11–16 May 2020 1042
- Jeon S, Strube M (2020). Centering-based Neural Coherence Modeling with Hierarchical Discourse Segments, In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, November 2020, pp. 7458-7472 1144
- López F, Strube M (2020). A Fully Hyperbolic Neural Model for Hierarchical Multi-class Classification, In Proceedings of Findings of the Association for Computational Linguistics: EMNLP 2020, Online, November 2020, pp. 460-475 1145
- Müller M, Ghosh S, Rey M, Wittig U, Müller W, Strube M (2020). Reconstructing Manual Information Extraction with DB-to-Document Backprojection: Experiments in the Life Science Domain, In Proceedings of the First Workshop on Scholarly Document Processing, Online, November 2020, pp. 81-90. 1147
- Chai H, Zhao W, Eger S, Strube M (2020). Evaluation of Coreference Resolution Systems Under Adversarial Attacks, In Proceedings of the First Workshop on Computational Approaches to Discourse, Online, November 2020, pp. 154-159. 1148
2019
- Sekulić I, Strube M (2019). Adapting deep learning methods for mental health prediction on social media, In Proceedings of the 5th Workshop on Noisy User-generated Text, Hong Kong, 4 November 2019, pp. 322-327 1044
- Zhu Y, Heinzerling B, Vulic I, Strube M, Reichart R, Korhonen A (2019). On the importance of subword information for morphological tasks in truly low-resource languages, In Proceedings of the 23rd Conference on Computational Natural Language Learning, Hong Kong, 3-4 November 2019, pp. 216-226 1043
- López F, Heinzerling B, Strube M (2019). Fine-Grained Entity Typing in Hyperbolic Space, In Proceedings of The Fourth Workshop on Representation Learning for NLP (Rep4NLP) @ ACL 2019, Florence, Italy, 2 August 2019, pp. 169-180 1040
- Heinzerling B, Strube M (2019). Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation, In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July – 2 August 2019, pp. 273-291 1036
- Moosavi NS, Born L, Poesio M, Strube M (2019). Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary Detection, In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July – 2 August 2019, pp. 4168-4178 1039
2018
- Hou Y, Markert K, Strube M (2018). Unrestricted bridging resolution, Computational Linguistics 44(2):237-284 395
- Heinzerling B, Strube M (2018). BPEmb: Tokenization-free pre-trained subword embeddings in 275 languages, In Proceedings of the 11th International Conference on Language Resources and Evaluation, Miyazaki, Japan, 7–12 May 2018 394
- Kirilin A, Strube M (2018). Exploiting a speaker’s credibility to detect fake news, In Workshop on Data Science, Journalism and Media, London, UK, 20 August 2018 396
- Mesgar M, Strube M (2018). A neural local coherence model for text quality assessment, In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October – 4 November 2018, pp. 4328-4339 397
- Müller M, Strube M (2018). Transparent, efficient, and robust word embedding access with WOMBAT, In Proceedings of the 27th International Conference on Computational Linguistics: System Demonstrations, Santa Fé, New Mexico, 20–26 August 2018, pp. 53-57 400
- Moosavi NS, Strube M (2018). Using linguistic features to improve generalization in neural coreference resolvers, In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October – 4 November 2018, pp. 193-203 401
- Suter J, Strube M (2018). Extending and exploiting the entity graph for analysis, classification and visualization of German texts, In Proceedings of the 14th Conference on Natural Language Processing (KONVENS), Vienna, Austria, 17–19 September 2018, pp. 136-140 403
- Alfano M, Hovy D, Mitchell M, Strube M (2018). Proceedings of the 2nd ACL Workshop on Ethics in Natural Language Processing, New Orleans, Louis., 5 June 2018, http://aclweb.org/anthology/W18-0800.pdf 406
2017
- Born L, Mesgar M, Strube M (2017). Using a graph-based coherence model in document-Level machine translation, In Proceedings of the 3rd Workshop on Discourse in Machine Translation, Copenhagen, Denmark, 8 September 2017, pp. 26-35 278
- Heinzerling B, Strube M, Lin C (2017). Trust, but verify! Better entity linking through automatic verification, In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Valencia, Spain, 3–7 April 2017, pp. 828-838 279
- Heinzerling B, Moosavi NS, Strube M (2017). Revisiting selectional preferences for coreference resolution, In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 7–11 September 2017, pp. 1343-1350 280
- Hovy D, Spruit S, Mitchell M, Bender EM, Strube M, Wallach H (2017). Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, Valencia, Spain, 4 April 2017, http://www.aclweb.org/anthology/W17-16.pdf 281
- Judea A, Strube M (2017). Event argument identification on dependency graphs with bidirectional LSTMs, In Proceedings of the 8th International Joint Conference on Natural Language Processing, Taipei, Taiwan, 27 November – 1 December 2017, pp. 822-831 282
- Kurohashi S, Strube M (2017). Proceedings of the IJCNLP 2017, Tutorial Abstracts, Taipei, Taiwan, 27 November 2017, http://www.aclweb.org/anthology/I17-5.pdf 283
- Moosavi NS, Strube M (2017). Use generalized representations, but do not forget surface features, In Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes, Valencia, Spain, 4 April 2017, pp. 1-7 284
- Moosavi NS, Strube M (2017). Lexical features in coreference resolution: To be used with caution, In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (vol. 2: Short Papers), Vancouver, B.C., Canada, 30 July –4 August 2017 285
2016
- Resch B, Summa A, Zeile P, Strube M (2016). Citizen-centric urban planning through extracting emotion information from Twitter in an interdisciplinary space-time-linguistics algorithm, UP 1(2):114 196
- Heinzerling B, Judea A, Strube M (2016). HITS at TAC KBP 2015: Entity discovery and linking, and event nugget detection, In Proceedings of the Text Analysis Conference, National Institute of Standards and Technology, Gaithersburg, Maryland, USA, 16–17 November 2015 189
- Judea A, Strube M (2016). Incremental global event extraction, In Proceedings of the 26th International Conference on Computational Linguistics, Osaka, Japan, 11–16 December 2016, pp. 2279-2289 190
- Mesgar M, Strube M (2016). Lexical coherence graph modeling using word embeddings, In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, Cal., 12–17 June 2016, pp. 1414-1423 191
- Moosavi NS, Strube M (2016). Search space pruning: A simple solution for better coreference resolvers, In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, Cal., 12–17 June 2016, pp. 1005-1011 192
- Moosavi NS, Strube M (2016). Which coreference evaluation metric do you trust? A proposal for a link-based entity aware metric, In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), Berlin, Germany, 7–12 August 2016, pp. 632-642 193
- Parveen D, Mesgar M, Strube M (2016). Generating coherent summaries of scientific articles using coherence patterns, In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, Tex., 1–5 November 2016, pp. 772-783 194
- Remse M, Mesgar M, Strube M (2016). Feature-rich error detection in scientific writing using logistic regression, In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, San Diego, Cal., 16 June 2016, pp. 162-171 195
- Summa A, Resch B, Strube M (2016). Microblog emotion classification by computing similarity in text, time, and space, In Proceedings of the Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media, Osaka, Japan, 12 December 2016, pp. 153-162 197