Information Extraction from Texts and Text Classification

Natural Language Processing (NLP) is part of many applications. It is especially important in text mining, in particular for automated information extraction and tagging, automated question answering, and text summarization. Its uses extend well beyond these tasks, for example to the design of more comfortable user interfaces and the automated translation of texts into other languages.


RapidMiner (YALE)
RapidMiner Conditional Random Fields Plugin
RapidMiner Information Extraction Plugin
Word Vector Tool and RapidMiner Word Vector Tool Plugin


Euler, Timm
Jungermann, Felix
Klinkenberg, Ralf
Morik, Katharina
Rössler, Marc

Past Master's Theses


Poelitz/Bartz/2014a Poelitz, Christian. Enhancing the possibilities of corpus-based investigations: Word sense disambiguation on query results of large text corpora. In Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), pages 42-46, 2014.
Jungermann/2012a Jungermann, Felix. About the Exploration of Data Mining Techniques using Structured Features for Information Extraction. Technische Universität Dortmund, 2012.
Tomanek/2010a Tomanek, Katrin. Resource-Aware Annotation through Active Learning. University of Dortmund, 2010.
Had/etal/2009a Had, Martin and Jungermann, Felix and Morik, Katharina. Relation Extraction for Monitoring Economic Networks. In Horacek, H. and Metais, E. and Munoz, R. and Wolska, M. (editors), Proceedings of the 14th International Conference on Applications of Natural Language to Information Systems (NLDB), Vol. 5723, pages 103--114, Springer, 2009.
Jungermann/Morik/2008a Jungermann, Felix and Morik, Katharina. Enhanced services for targeted information retrieval by event extraction and data mining. No. 04/2008, Sonderforschungsbereich 475, University of Dortmund, 2008.
Jungermann/2007a Jungermann, Felix. Named entity recognition without domain-knowledge using conditional random fields. In van Someren, Marten and Katrenko, Sophia and Adriaans, Pieter (editors), Workshop Notes of the Machine Learning for Natural Language Processing Workshop, pages 16--17, 2007.
Jungermann/2007b Jungermann, Felix and Roessler, Marc. NER in Texts without Domain- or Language-Knowledge using Wikipedia. In intelligenza artificiale, Vol. Anno IV, No. 2, pages 75--76, 2007.
Roessler/etal/2007a Roessler, Marc and Wagner, Andreas and Jungermann, Felix and Hoeppner, Wolfgang. Applying WALU to annotate Named Entities in Italian texts. In intelligenza artificiale, Vol. Anno IV, No. 2, pages 77--78, 2007.
Jungermann/2006a Jungermann, Felix. Named Entity Recognition mit Conditional Random Fields. Computer Science, University of Dortmund, 2006.
Roessler/Morik/2005a Roessler, Marc and Morik, Katharina. Using Unlabeled Texts for Named-Entity Recognition. In Tobias Scheffer and Stefan Rüping (editors), ICML Workshop on Multiple View Learning, 2005.
Morik/99c Morik, Katharina. Folien zur Vorlesung Künstliche Intelligenz. 1999.
Hoelscher/98a Hölscher, Markus. Informationsextraktion aus Freitext-Einträgen einer Datenbank. Fachbereich Informatik, Universität Dortmund, 1998.
Aruca/etal/97a Aruca, Ayse and Bankmann, Tino and Becks, Andreas and Bullerdick, Kai and Frank, Steffen and Goren, Leyla and Klahold, Stefan and Klinkenberg, Ralf and Ritthoff, Oliver and Tuben, Uwe. Endbericht der Projektgruppe 281 "MOSES" (Medizinisch Orientiertes SprachevaluationsSystem). Fachbereich Informatik, Universität Dortmund, Dortmund, Germany, 1997.
Morik/97a Morik, Katharina. Einführung in die Künstliche Intelligenz. 1997.
Morik/95b Morik, Katharina. Natürlichsprachliche Systeme. 1995.
Morik/87a Morik, Katharina. Discourse Models, Dialog Memories, and User Models. In Computational Linguistics, 1987.
Morik/87b Morik, Katharina. User Models and Conversational Settings -- Modelling the User's Wants. In Wahlster, W. and Kobsa, A. (editors), User Models in Dialog Systems, pages 364--385, Springer, 1987.
Hoeppner/etal/84a Hoeppner, Wolfgang and Morik, Katharina and Marburger, Heinz. Talking it Over: The Natural Language Dialog System HAM-ANS. In Cooperative Interfaces to Information Systems, pages 189--258, Springer, 1986.
Busemann/etal/85a Busemann, Stephan and Hoeppner, W. and Marburger, H. and Morik, K. Representing and Processing Copula and Full-Verb Sentences in HAM-ANS. In GWAI-85. German Workshop on Artificial Intelligence, 1985.
Morik/85a Morik, Katharina. Ansätze der sprachorientierten KI-Forschung. In LDV-Forum, No. 2, 1985.
Morik/85b Morik, Katharina. Partnermodellierung in Beratungsdialogen. In Sprachverarbeitung in Information und Dokumentation 10. Jahrestagung der Gesellschaft für Linguistische Datenverarbeitung, 1985.
Morik/85c Morik, Katharina. User Modelling, Dialog Structure, and Dialog Strategy in HAM-ANS. In Procs. of 2nd EACL, 1985.
Morik/84a Morik, Katharina. Partnermodellierung und Interessenprofile bei Dialogsystemen der Künstlichen Intelligenz. In Claus-Rainer Rollinger (editor), Probleme des (Text)-Verstehens. Ansätze der Künstlichen Intelligenz, pages 249--264, 1984.
Morik/84b Morik, Katharina. Customers' Requirements for Natural Language Systems: Results of an Inquiry. In International Journal of Man Machine Studies, 1984.
Hoeppner/etal/83a Hoeppner, Wolfgang and Morik, Katharina. Das Dialogsystem HAM-ANS: Worauf basiert es, wie funktioniert es und wem antwortet es?. In Linguistische Berichte, No. 88, 1983.
Hoeppner/etal/83b Hoeppner, Wolfgang and Christaller, Thomas and Marburger, Heinz and Morik, Katharina and Nebel, Bernhard and O'Leary, Mike and Wahlster, Wolfgang. Beyond Domain-Independence: Experience with the Development of a German Language Access System to Highly Diverse Background Systems. In Procs. of the 8th IJCAI, 1983.
Morik/83a Morik, Katharina. Demand and Requirements for Natural Language Systems -- Results of an Inquiry. In Proceedings of the 8th International Joint Conference on Artificial Intelligence (IJCAI), 1983.
Morik/Rollinger/83a Morik, Katharina and Rollinger, Claus-Rainer. Partnermodellierung im Evidenzraum. In GWAI-83. 7th German Workshop on Artificial Intelligence, 1983.
Christaller/etal/82a Christaller, Thomas and von Hahn, W. and Hoeppner, W. and Marburger, H. and Morik, K. Wissensbasierter natürlichsprachlicher Zugang zu unterschiedlichen Diskursbereichen mit dem KI-System HAM-ANS. In R. Slama (editor), Workshop Sprachverarbeitung, Gesellschaft für Mathematik und Datenverarbeitung, Bonn, Addison-Wesley, 1982.
Morik/82a Morik, Katharina. Überzeugungssysteme der Künstlichen Intelligenz. Validierung vor dem Hintergrund linguistischer Theorien über implizite Äußerungen. Tübingen, Max Niemeyer, 1982.
Morik/81b Morik, Katharina. Verarbeitung von externer und interner Situation in Überzeugungssystemen. In Siekmann, Jörg H. (editor), German Workshop on Artificial Intelligence GWAI-81, No. 47, pages 287--296, Berlin u.a., Springer, 1981.