computational linguistics

UBY - A Large-Scale Unified Lexical-Semantic Resource (UBY 1.0) released

Submitted by judithek on Fri, 03/30/2012 - 08:03

We are pleased to announce the release of UBY 1.0 -
a large-scale lexical-semantic resource for natural language processing (NLP)
based on the ISO standard Lexical Markup Framework (LMF):

http://www.ukp.tu-darmstadt.de/data/uby/

UBY combines a wide range of information from expert-constructed and collaboratively constructed resources for English and German.
Currently, UBY holds structurally and semantically interoperable versions of nine resources in two languages:

* English WordNet, Wiktionary, Wikipedia, FrameNet and VerbNet,

Announcing Glottolog/Langdoc, a knowledge base of 175k references for (mostly) underdescribed languages

Submitted by snordhoff on Mon, 03/26/2012 - 05:19

We are happy to announce Glottolog/Langdoc, a comprehensive knowledge base of 104k languoids and 175k references for the Semantic Web.

In linguistics as well as in the Semantic Web world, it is important to clearly identify the concepts one is talking about. Glottolog/Langdoc takes this insight as a starting point and provides 104k Uniform Resource Identifiers (URIs) for languoids and 175k for references to descriptive literature focusing on underdescribed languages.

NSF announces Building Community and Capacity for Data-Intensive Research in the SBE Sciences

Submitted by terry on Wed, 02/29/2012 - 07:21

The NSF Directorates for Social, Behavioral & Economic Sciences (SBE) and Education & Human Resources (EHR), together with the Office of Cyberinfrastructure (OCI), recently announced a solicitation for Building Community and Capacity for Data-Intensive Research (http://www.nsf.gov/pubs/2012/nsf12538/nsf12538.htm?WT.mc_id=USNSF_25&WT....) with a proposal deadline of 2012-05-22. Here are some snippets from the solicitation.

SSWL (Syntactic Structures of the World's Languages)

Submitted by cc116 on Thu, 11/10/2011 - 00:07

SSWL (Syntactic Structures of the World's Languages) is an open-ended database of syntactic, morphological and semantic properties. Each language is characterized by a set of property-value pairs (e.g., Object Verb: Yes) and by examples that illustrate these property-value pairs. A rich variety of search functions is available, as well as mapping and the creation of similarity trees. The database is open-ended in the sense that (a) new language experts may sign up to add new languages, and (b) new properties may be added.
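
To make the property-value idea concrete, here is a minimal sketch of how such records could be compared. It is only an illustration under stated assumptions: the language names, the properties other than "Object Verb", all values, and the `similarity` function are hypothetical placeholders, and nothing here reflects SSWL's actual data or how it computes its similarity trees.

```python
from typing import Dict

# Hypothetical toy records: names and values are placeholders, not SSWL data.
# Only the property name "Object Verb" is taken from the announcement.
LANGUAGES: Dict[str, Dict[str, str]] = {
    "Language A": {"Object Verb": "Yes", "Property 2": "No", "Property 3": "Yes"},
    "Language B": {"Object Verb": "Yes", "Property 2": "Yes", "Property 3": "Yes"},
    "Language C": {"Object Verb": "No", "Property 2": "Yes"},
}


def similarity(a: Dict[str, str], b: Dict[str, str]) -> float:
    """Fraction of properties set for both languages that share the same value.

    This is an assumed, simple agreement measure chosen for illustration.
    """
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    agree = sum(1 for prop in shared if a[prop] == b[prop])
    return agree / len(shared)


if __name__ == "__main__":
    names = sorted(LANGUAGES)
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            score = similarity(LANGUAGES[first], LANGUAGES[second])
            print(f"{first} vs {second}: {score:.2f}")
```

Pairwise scores of this kind could feed a standard clustering step to draw a tree. Only properties that are set for both languages are compared, so a language with few recorded properties is not penalized for missing values.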

EURALEX conference

Submitted by jannebj on Tue, 10/11/2011 - 07:39

Call for papers
Papers, posters and software demonstrations are invited on all topics of lexicography, including, but not limited to, the following fields, which are the main focus of the congress:
• Lexicography and National Identity
• Indigenous Languages and Lexicography
• Corpus-driven Lexicography
• Lexicography in Language Technology
• Multilingual Lexicography
• Lexicography and Semantic Theory
• Terminology, LSP and Lexicography
• Reports on Lexicographical and Lexicological Projects
• Other topics

New book on language variation infrastructure

Submitted by jannebj on Wed, 10/05/2011 - 13:34

Dear colleagues,

You may want to learn about the book "Language Variation Infrastructure. Papers on selected projects" (2011), based on talks from the Workshop on Research Infrastructure for Linguistic Variation (RiLiVS) held at the University of Oslo. I think most of you will find the papers interesting.

The book is freely downloadable from the website of Oslo Studies in Language (OSLA):
https://www.journals.uio.no/index.php/osla/issue/view/6
You can choose to download the whole book or just individual chapters.

This is the list of contents:

Adapting a Scientific Workflow Infrastructure to Linguistics

Submitted by Richard Littauer on Thu, 09/29/2011 - 11:10

In linguistics (and similar social sciences), there are no standard 'workflow workbenches' that non-programmers can use to develop, use, and share their workflows. However, as linguistics becomes an increasingly data-intensive science, computational linguists are using computational pipelines in their research in order to facilitate their main work.

RELISH-Symposium “Rendering Endangered Lexicons Interoperable through Standards Harmonization”, Frankfurt, October 10, 2011 “RELISH meets LOEWE”

Submitted by nevskaya on Tue, 08/16/2011 - 10:31

The RELISH project promotes language-oriented research by addressing a two-pronged problem: (1) the lack of harmonization between digital standards for lexical information in Europe and America, and (2) the lack of interoperability among existing lexicons of endangered languages, in particular those created with the Shoebox lexicon building software. The cooperation partners in the RELISH project are the University of Frankfurt (FRA), the Max Planck Institute for Psycholinguistics (MPI), and Eastern Michigan University, the host of the Linguist List (ILIT).

LRL 2011: Addressing the Gaps in Language Resources and Technologies

Submitted by ebender on Thu, 08/11/2011 - 13:56

From the call for papers:

Date: 25.11.2011, half-day (afternoon) + banquet
LTC Workshop paper submission deadline: 05-Sep-2011

Theme: Covering the existing gaps in Language Resources (LR) and Technologies (LT) is a challenging task not only for the so-called "Less-Resourced Languages" (LRL) but also for the technologically more advanced ones. We therefore believe that this workshop will be of general interest to all. Still, the needs of the less-resourced languages deserve specific consideration in order to reduce the imbalance among languages.

Topics:

NSF and other agencies announce the National Robotics Initiative

Submitted by terry on Wed, 07/27/2011 - 08:01

The National Science Foundation (NSF) announced the National Robotics Initiative (http://www.nsf.gov/publications/pub_summ.jsp?org=ENG&ods_key=nsf11553) on 24 June; the solicitation is unusual in that it involves collaboration with the National Institutes of Health (NIH), the United States Department of Agriculture (USDA) and the National Aeronautics and Space Administration (NASA).