The 8th Workshop on Asian Language Resources

 
 


COLING 2010
21-22 August, 2010

 

 
               

 

Language resources play a central role in statistical and learning-based approaches to natural language processing. Therefore recent research has put great emphasis in building these resources for target languages. Parallel resources across various languages are also being developed for multilingual processing. These include lexica and corpora with multiple levels of annotations. Though significant progress has been achieved in modeling few of the Asian languages, with the wider spread of ICT use across the region, there is a growing interest in this field from other linguistic communities. As research in the field matures across Asia, there is a growing need for developing language resources. However the region is not only short in the linguistic resources for more than 2200 language spoken in the region, there is also lack of experience in the researchers to develop these resources. As the efforts to develop the linguistic resources increases, there is also need to coordinate the efforts to develop common frameworks and processes so that these resources can be used by various groups of researchers equally effectively. The workshop is organised under the Asian Language Resources Committee (ALRC) of AFNLP with the following goals

  • To chart and catalogue the status of Asian Language Resources

  • To investigate and discuss the problems related to the standards and specification on creating and sharing various levels of language resources

  • To promote a dialogue between developers and users of various language resources in order to address any gaps in language resources and practical applications, and to nurture collaboration in their development and use

  • To provide opportunity for researchers from Asia to collaborate with researchers in other regions

To achieve these goals, we call for the technical, strategy, policy and survey papers concerning, but not limited to the following issues.

  • Text corpora, speech corpora, Lexicons, Grammars, Machine-readable dictionaries, Ontologies

  • Infrastructure for constructing and sharing language resources

  • Exchange and annotation schemata, Exchange formats

  • Standards or specifications for language resources

  • Standards or specifications for content management

  • Language resources for basic NLP tasks (word segmentation, named entity recognition, syntactic analysis, semantic analysis, discourse analysis, speech recognition, speech synthesis, etc.)

  • Language Resources for HLT applications (such as information retrieval, information extraction, question answering, machine translation, speech translation, etc.)

  • Strategies and priorities cooperation and collaboration

  • Licensing and copyright issues

This workshop is one of a series of workshops organized by Asian Language Resources Committee (ALRC) of Asian Federation of NLP (AFNLP; www.AFNLP.org). The last workshop was held in conjunction with ACL/IJCNLP 2009 at Singapore. Followings are the series of the workshops since the beginning:

  • Tokyo, Japan, under the name of Symposium on Language Resources in Asia, 2001

  • Tokyo, Japan, in conjunction with the 6th Natural Language Processing Pacific Rim Symposium, National Center of Sciences, 2001

  • Taipei, Taiwan, in conjunction with Coling2002

  • Sanya City, Hainan Island, China, in conjunction with IJCNLP2004

  • Jeju Island, Korea, in conjunction with IJCNLP2005

  • Hyderabad, India, in conjunction with IJCNLP2008

  • Singapore, in conjunction with ACL/IJCNLP 2009

The intended length of the workshop: 2 days

webmaster@crulp.org