ACL 2015 Workshop on Noisy User-generated Text (W-NUT)

July 31 2015, CNCC 303A, Beijing, China.

This workshop focuses on core Natural Language Processing tasks applied to noisy user-generated text, such as that found in social media, web forums, online reviews and language learner essays. The workshop will host two shared tasks: 1) Named Entity Recognition in Twitter and 2) Normalization of Noisy Text.

We would like to thank the speakers, presenters and attendees for making WNUT-2015 a success (printable poster). See you in 2016.

Best Paper Awards:

Challenges of studying and processing dialects in social media
Anna Jørgensen, Dirk Hovy and Anders Søgaard

Five Shades of Noise: Analyzing Machine Translation Errors in User-Generated Text
Marlies van der Wees, Arianna Bisazza and Christof Monz


Invited Speakers

Program

Friday, July 31, 2015

9:00–10:30Invited Talks
9:00–9:45Text Mining of Social Media: Going beyond the Text and Only the Text
Tim Baldwin
9:45–10:30Where is Language?
Anders Søgaard
10:30–11:00Coffee Break
11:00–12:30Long Papers and Abstracts
11:00–11:15Learning finite state word representations for unsupervised Twitter adaptation of POS taggers
Julie Wulff and Anders Søgaard
11:15–11:30Towards POS Tagging for Arabic Tweets
Fahad Albogamy and Allan Ramasy
11:30–11:45Minority Language Twitter: Part-of-Speech Tagging and Analysis of Irish Tweets
Teresa Lynn, Kevin Scannell and Eimear Maguire
11:45–11:00Challenges of studying and processing dialects in social media
Anna Jørgensen, Dirk Hovy and Anders Søgaard
12:00–12:15Toward Tweets Normalization Using Maximum Entropy
Mohammad Arshi Saloot, Norisma Idris, Liyana Shuib, Ram Gopal Raj and AiTi Aw
12:15–12:30Five Shades of Noise: Analyzing Machine Translation Errors in User-Generated Text
Marlies van der Wees, Arianna Bisazza and Christof Monz
12:30–14:00Poster Session and Lunch
Learning finite state word representations for unsupervised Twitter adaptation of POS taggers
Julie Wulff and Anders Søgaard
Towards POS Tagging for Arabic Tweets
Fahad Albogamy and Allan Ramasy
Minority Language Twitter: Part-of-Speech Tagging and Analysis of Irish Tweets
Teresa Lynn, Kevin Scannell and Eimear Maguire
Challenges of studying and processing dialects in social media
Anna Jørgensen, Dirk Hovy and Anders Søgaard
Toward Tweets Normalization Using Maximum Entropy
Mohammad Arshi Saloot, Norisma Idris, Liyana Shuib, Ram Gopal Raj and AiTi Aw
Five Shades of Noise: Analyzing Machine Translation Errors in User-Generated Text
Marlies van der Wees, Arianna Bisazza and Christof Monz
 A Normalizer for UGC in Brazilian Portuguese
Magali Sanches Duran, Maria das Graças Volpe Nunes and Lucas Avanço
 USFD: Twitter NER with Drift Compensation and Linked Data
Leon Derczynski, Isabelle Augenstein and Kalina Bontcheva
Enhancing Named Entity Recognition in Twitter Messages Using Entity Linking
Ikuya Yamada, Hideaki Takeda and Yoshiyasu Takefuji
Improving Twitter Named Entity Recognition using Word Representations
Zhiqiang Toh, Bin Chen and Jian Su
 NRC: Infused Phrase Vectors for Named Entity Recognition in Twitter
Colin Cherry, Hongyu Guo and Chengbi Dai
 IITP: Multiobjective Differential Evolution based Twitter Named Entity Recognition
Md Shad Akhtar, Utpal Kumar Sikdar and Asif Ekbal
Multimedia Lab @ ACL WNUT NER Shared Task: Named Entity Recognition for Twitter Microposts using Distributed Word Representations
Fréderic Godin, Baptist Vandersmissen, Wesley De Neve and Rik Van de Walle
 Data Adaptation for Named Entity Recognition on Tweets with Features-Rich CRF
Tian Tian, Marco Dinarelli and Isabelle Tellier
 Hallym: Named Entity Recognition on Twitter with Word Representation
Eun-Suk Yang and Yu-Seop Kim
 IHS_RD: Lexical Normalization for English Tweets
Dmitry Supranovich and Viachaslau Patsepnia
 Bekli:A Simple Approach to Twitter Text Normalization.
Russell Beckley
 NCSU-SAS-Ning: Candidate Generation and Feature Engineering for Supervised Lexical Normalization
Ning Jin
 DCU-ADAPT: Learning Edit Operations for Microblog Normalisation with the Generalised Perceptron
Joachim Wagner and Jennifer Foster
 LYSGROUP: Adapting a Spanish microtext normalization system to English.
Yerai Doval Mosquera, Jesús Vilares and Carlos Gómez-Rodríguez
 IITP: Hybrid Approach for Text Normalization in Twitter
Md Shad Akhtar, Utpal Kumar Sikdar and Asif Ekbal
 NCSU_SAS_WOOKHEE: A Deep Contextual Long-Short Term Memory Model for Text Normalization
Wookhee Min and Bradford Mott
NCSU_SAS_SAM: Deep Encoding and Reconstruction for Normalization of Noisy Text
Samuel Leeman-Munk, James Lester, and James Cox
 USZEGED: Correction Type-sensitive Normalization of English Tweets Using Efficiently Indexed n-gram Statistics
Gábor Berend and Ervin Tasnádi
14:00–15:30Shared Task Session
14:00–14:30Shared Tasks of the 2015 Workshop on Noisy User-generated Text: Twitter Lexical Normalization and Named Entity Recognition
Timothy Baldwin, Marie-Catherine de Marneffe, Bo Han, Young-Bum Kim, Alan Ritter and Wei Xu
14:30–14:45Enhancing Named Entity Recognition in Twitter Messages Using Entity Linking
Ikuya Yamada, Hideaki Takeda and Yoshiyasu Takefuji
14:45–15:00Improving Twitter Named Entity Recognition using Word Representations
Zhiqiang Toh, Bin Chen and Jian Su
15:00–15:15Multimedia Lab @ ACL WNUT NER Shared Task: Named Entity Recognition for Twitter Microposts using Distributed Word Representations
Fréderic Godin, Baptist Vandersmissen, Wesley De Neve and Rik Van de Walle
15:15–15:30NCSU_SAS_SAM: Deep Encoding and Reconstruction for Normalization of Noisy Text
Samuel Leeman-Munk, James Lester and James Cox
15:30–16:00Coffee Break
16:00–17:30Invited Talks
16:00–16:45Automated Grammatical Error Correction for Language Learners: Where are we, and where do we go from there?
Joel Tetreault
16:45–17:30Are Minority Dialects "Noisy Text"?: Implications of Social and Linguistic Diversity for Social Media NLP
Brendan O’Connor

Important Dates

check this website or follow the organizers @cocoweixu @alan_ritter @BrooklynHAN on Twitter for updates

Call for Papers

We seek submissions of long papers on original and unpublished work (up to 8 pages of content plus 2 extra pages for references). Abstracts (2-4 pages including references) on work-in-progress or work published elsewhere are also welcome and will *not* be included in the conference proceedings. All accepted submissions will be presented as posters. Additionally, a small number of selected submissions will be presented orally.

Topics of interest include but are not limited to:

All submissions should conform to ACL 2015 style guidelines. Long paper submissions must be anonymized. Abstract submissions should include author information (and where the work was published in a footnote on front page, if applicable). Please submit your papers at https://www.softconf.com/acl2015/WNUT/.

Shared task #1: Named Entity Recognition in Twitter


ad

Task #1 Details: here
Register by clicking here
Evaluation Period: May 1 - May 8
Contacts: Marie-Catherine de Marneffe, Young-Bum Kim and Alan Ritter

Shared task #2: Normalization of Noisy Text


ad

Task #2 Details: here
Register by clicking here
Evaluation Period: May 7- May 11
Contact: Bo Han, Tim Baldwin

Workshop Organizers

Program Committee


Registration Awards


Thanks to the kind support of our sponsors, WNUT 2015 will provide several student registration grants ($150 each) to cover the registration fee and lunch for students to attend the workshop. Both undergraduate and graduate students are eligible to apply. Award receipts must attend the full-day WNUT workshop (9:00 - 17:30 July 31st, 2015) at China National Convention Center (CNCC) in Beijing. The application is due on July 26th. The workshop organization committee will review the applications and notify the winners by July 28th via email. Please email your application to xwe@cis.upenn.edu that includes the following documents :

- Copy of valid student ID
- Copy of proof of travel plan to Beijing, if the student is not currently enrolled in universities in Beijing.
- 500-word essay in the topic "Why do I deserve the registration grant?", including student's basic information (full name, email, which year in what program in which university and department).

ACL 2015 WNUT Workshop Student Registration Awards winners are:
Sponsored by    Microsoft Research logo      IBM Research logo