Within ULM-3, Tommaso Caselli & Roser Morante & Chantal van Son work on stories and world views as a key to understanding language.
Texts are not simply collections of stand-alone factual statements; these statements always reflect a certain perspective and together form a multi-dimensional storyline. The research that is being conducted within ULM-3 can be divided into two research lines, perspectives and storylines, both of which play an important role in human understanding and interpretation of texts and should play an equally important role when processing texts by machines.
ULM-3 closely collaborates with the Quality and Perspectives in Deep Data (QuPiD2) project and with the VENI-project Reading between the lines: identifying implicit perspectives through linguistic analyses.
Perspectives
Texts are inherently subjective; they are written with a certain perspective in mind on the topic, person or event, which is reflected in the selection of information and the way in which it is presented. ULM-3 develop software to detect these perspectives in texts and to represent the output according to a formal model that can represent what is said about a topic, a person or an event and how this is said in and by various sources, making it possible to place alternative perspectives next to each other. The model and detection software take into account linguistic phenomena such as attribution, modality, negation, factuality, sentiment, as well as phenomena related to discourse and rhetorical structure.
Annotation Guidelines: We are developing guidelines for the annotation of perspectives, which can be found at GitHub VUA-Perspectives.
The Reading Machine, by Daniel Libeskind is a fabrication of the “Reading Wheel” published in 1588 by Agostino Ramelli in his Le diverse et artificiose machine del capitano Agostino Ramelli
Storylines
Narratives are at the heart of information sharing. Ever since people began to share their experiences, they have connected them to form narratives. When reading texts, such as the news, humans build up a story over time by integrating the incoming information with the known, removing duplication, resolving conflicts and ordering relevant events in time. They also create an explanatory and causal scheme for what happened and relate the actors involved to these schemes. The second research line of ULM-3 is constructing these storylines, taking into account temporal, causal and subjective dimensions. ULM-3 investigates how storylines should be represented, how they can be extracted automatically, and how they can be evaluated.
Events
ULM-3 was or is involved as (co-)organizer in the following events:
- Workshop on Extra-Propositional Aspects of Meaning (ExProM) in Computational Linguistics, collocated with COLING 2016, Osaka, Japan
- The First Workshop on Computing News Storylines (CNewS 2015), collocated with ACL-IJCNLP 2015, Beijing, China
- SemEval-2015 Task 9: CLIPEval Implicit Polarity of Events, collocated with NACCL 2015, Denver, Colorado
- The 2nd Workshop on Processing Extra-propositional Aspects of Meaning (ExProM) 2015, collocated with NACCL 2015, Denver, Colorado
- The 2nd Workshop on Computing News Storylines (CNewsS 2016) which will be held on the 5th of November in Austin, Texas (USA), collocated with EMNLP 2016
Selected publications
- Caselli, T. and R. Morante (2016). VUACLTL: A CRF Approach to the 2016 Clinical TempEval Task. In Proceedings of SEMEVAL 2016. To appear.
- Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo, and Piek Vossen (2016). Perspective Based Local Agreement and Disagreement in Online Debate. In Proceedings of the 3rd Workshop on Argument Mining. July, Berlin, Germany. To appear.
- Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo and Piek Vossen (2016). “GRaSP: A Multilayered Annotation Scheme for Perspectives.” In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), May, Portorož, Slovenia. (pdf)
- Tommaso Caselli, Antske Fokkens, Roser Morante, and Piek Vossen (2015). “SPINOZA VU: An NLP Pipeline for Cross Document TimeLines.” In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, pp. 787-791.(pdf)
- Anne-Lyse Minard, Manuela Speranza, Rachele Sprugnoli and Tommaso Caselli (2015). “FacTA: Evaluation of Event Factuality and Temporal Anchoring.” In Proceedings of the Second Italian Conference on Computational Linguistics CLiC-it 2015. December, Trento, Italy.(pdf)
- Piek Vossen,Tommaso Caselli and Panagiota Kontzopoulou. “Storylines for structuring massive streams of news.” In Proceedings of the First Workshop on Computing News Storylines. Beijing, China. (pdf)