2016 The 2nd Workshop on Noisy User-generated Text (W-NUT)

December 11 2016, Osaka, Japan (last year at ACL)

We have received 27 main workshop submissions! We look forward to seeing you at COLING!

Best papers:

Veracity Computing from Lexical Cues and Perceived Certainty Trends
Uwe Reichel and Piroska Lendvai

Name Variation in Community Question Answering Systems
Anietie Andy, Satoshi Sekine, Mugizi Rwebangira and Mark Dredze

NEW! WNUT 2017 will be co-located with EMNLP!

WNUT focuses on Natural Language Processing applied to noisy user-generated text, such as that found in social media, web forums, online reviews, clinical records and language learner essays. This year, there will be two shared tasks: 1) Geolocation Prediction in Twitter and 2) Named Entity Recognition in Twitter.

Time:    9:00-17:00

Workshop Organizers

Invited Speakers


 Invited talk:
 9:10DISAANA and D-SUMM: Large-scale Real Time NLP Systems for Analyzing Disaster Related Reports in Tweets
Kentaro Torisawa
 Research presentations
 9:55Private or Corporate? Predicting User Types on Twitter
Nikola Ljubešić and Darja Fišer
 10:05From Noisy Questions to Minecraft Texts: Annotation Challenges in Extreme Syntax Scenario
Héctor Martínez Alonso, Djamé Seddah and Benoît Sagot
 10:15Disaster Analysis using User-Generated Weather Report
Yasunobu Asakura, Masatsugu Hangyo and Mamoru Komachi
 10:25Veracity Computing from Lexical Cues and Perceived Certainty Trends
Uwe Reichel and Piroska Lendvai
 10:35Exploring Word Embeddings for Unsupervised Textual User-Generated Content Normalization
Thales Felipe Costa Bertaglia and Maria das Graças Volpe Nunes
 10:45Name Variation in Community Question Answering Systems
Anietie Andy, Satoshi Sekine, Mugizi Rwebangira and Mark Dredze
 10:55Lightning Talks / Posters (60 sec. / talk)
 Text normalization for endangered languages: A shared task challenge
Patrick Littell, Shobhana Chelliah and Gina-Anne Levow
 Filtering Dialectal Arabic Text in Two Large Scale Annotation Projects
Wajdi Zaghouani, Nizar Habash, Houda Bouamor, Ossama Obeid, Sawsan Alqahtani, Mona Diab and Kemal Oflazer
 Whose Nickname is This? Recognizing Politicians from Their Aliases
Wei-Chung Wang, Hung-Chen Chen, Zhi-Kai Ji, Hui-I Hsiao, Yu-Shian Chiu and Lun-Wei Ku
 Towards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators
Ajit Jain, Girish Kasiviswanathan and Ruihong Huang
 Unsupervised Stemmer for Arabic Tweets
Fahad Albogamy and Allan Ramsay
 Topic Stability over Noisy Sources
Jing Su, Derek Greene and Oisin Boydell
 Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance
Julie Pain, Jessie Levacher, Adam Quinquenel and Anja Belz
 Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets
Billal Belainine, Alexsandro Fonseca and Fatiha Sadat
 A Simple but Effective Approach to Improve Arabizi-to-English Statistical Machine Translation
Marlies van der Wees, Arianna Bisazza and Christof Monz
 How Document Pre-processing affects Keyphrase Extraction Performance
Florian Boudin, Hugo Mougard and Damien Cram
 Japanese Text Normalization with Encoder-Decoder Model
Taishi Ikeda, Hiroyuki Shindo and Yuji Matsumoto
 Invited talk:
 14:00From Entity Linking to Question Answering – Recent Progress on Semantic Grounding Tasks
Ming-Wei Chang
 Shared task session
 14:45Results of the WNUT16 Named Entity Recognition Shared Task
Benjamin Strauss, Bethany Toma, Alan Ritter, Marie-Catherine de Marneffe and Wei Xu
 15:55Bidirectional LSTM for Named Entity Recognition in Twitter Messages
Nut Limsopatham and Nigel Collier
 15:05Twitter Geolocation Prediction Shared Task of the 2016 Workshop on Noisy User-generated Text
Bo Han, Afshin Rahimi, Leon Derczynski and Timothy Baldwin
 15:15CSIRO Data61 at the WNUT Geo Shared Task
Gaya Jayasinghe, Brian Jin, James Mchugh, Bella Robinson and Stephen Wan
 15:25Shared task poster session
 Learning to recognise named entities in tweets by exploiting weakly labelled data
Kurt Junshean Espinosa, Riza Theresa Batista-Navarro and Sophia Ananiadou
 Feature-Rich Twitter Named Entity Recognition and Classification
Utpal Kumar Sikdar and Björn Gambäck
 Learning to Search for Recognizing Named Entities in Twitter
Ioannis Partalas, Cédric Lopez, Nadia Derbas and Ruslan Kalitvianski
 DeepNNNER: Applying BLSTM-CNNs and Extended Lexicons to Named Entity Recognition in Tweets
Fabrice Dugas and Eric Nichols
 ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition.
Michel Naim Gerguis, Cherif Salama and M. Watheq El-Kharashi
 UQAM-NTL: Named entity recognition in Twitter messages
Ngoc Tan LE, Fatma Mallek and Fatiha Sadat
 Semi-supervised Named Entity Recognition in noisy-text
Shubhanshu Mishra and Jana Diesner
 Geolocation Prediction in Twitter Using Location Indicative Words and Textual Features
Lianhua Chi, Kwan Hui Lim, Nebula Alam and Christopher J. Butler
 A Simple Scalable Neural Networks based Model for Geolocation Prediction in Twitter
Yasuhide Miura, Motoki Taniguchi, Tomoki Taniguchi and Tomoko Ohkuma
 Invited talk:
 16:10Processing non-canonical or noisy text: fortuitous data to the rescue
Barbara Plank
 16:55Awards and closing


For posters, COLING provides: A self-standing poster panel measuring 180 cm wide and 210 cm high. Sizes up to B0 landscape (1456 mm x 1030 mm; 57.32 inch x 40.55 inch) would be suitable.

Important Dates

Call for Papers

We seek submissions of regular papers on original and unpublished work (same 8-page limit plus 2-page reference as COLING main conference). 1-page abstracts on work-in-progress or work published elsewhere are also welcome and will *not* be included in the conference proceedings. All accepted submissions will be presented as posters. Additionally, selected submissions will be presented orally. The shared-task participants are also encouraged (but not required) to submit system description papers and present posters; the top systems will be invited (but not required) to present orally.

Topics of interest include but are not limited to:

All submissions should conform to COLING 2016 style guidelines. Long and short paper submissions must be anonymized. Abstract submissions should include author information (and where the work was published in a footnote on front page, if applicable). Please submit your papers at https://www.softconf.com/coling2016/WNUT/.

Shared task #1: Geolocation Prediction in Twitter

This new shared task focuses on the prediction of geolocation on the basis of text data (including Twitter metadata).
Task #1 Details
Registration: here
Evaluation Period: 18 September 2016
Contacts: Afshin Rahimi, Bo Han, Leon Derczynski and Tim Baldwin

Shared task #2: Named Entity Recognition in Twitter


Task #2 Details
Registration: click here
Evaluation Period: September 9-16
Contacts: Benjamin Strauss, Bethany E. Toma, Marie-Catherine de Marneffe, and Alan Ritter

Program Committee

Sponsored by    Microsoft Research logo