data sharing

A Grand Challenge for Linguistics: Scaling Up and Integrating Models

In response to NSF's call for White Papers in the SBE 2020 Initiative, Jeff Good and I have submitted a paper outlining our take on Cyberinfrastructure for Linguistics, why its necessary, and how it can come about. The abstract:

The Endangered Languages Archive (ELAR) at SOAS

The Endangered Languages Archive (ELAR) at SOAS preserves and disseminates digital documentation of endangered languages around the world, especially (but not limited to) the outcomes of ELDP-funded projects. ELAR's recently re-launched website is designed specifically to suit the needs of endangered languages archiving, using "Web 2.0" methods to implement a nuanced access control system and make the site user-friendly for a range of audiences.

Abney & Bird's Grand Challenge: The Human Language Project

Steven Abney and Steven Bird published a provocative paper (.pdf) at ACL 2010 calling on the computational linguistics community to work to create a "Universal Corpus", an undertaking that they compare in both scale and potential impact to the Human Genome Project. Here is the abstract:

RELISH Meeting in Nijmegen

On 4–5 August, the RELISH project held a workshop on lexicon tools and lexical standards. Slides from many of the presentations are posted on the workshop site.

A linguist’s perspective on Creative Commons’ data sharing whitepaper

Edit: this post on (legal aspects of) data sharing by Creative Commons' Kaitlin Thaney is also highly recommended.

Conference on Electronic Grammaticography

Dates: 11-Feb-2011 - 12-Feb-2011
Location: Leipzig, Germany
Contact Person: Sebastian Nordhoff
Meeting Email: sebastian_nordhoff at eva.mpg.de
General Web Site: http://www.eva.mpg.de/lingua/conference/11-grammaticography2011
Call for Papers: http://www.eva.mpg.de/lingua/conference/11-grammaticography2011/files/ca...
Abstract deadline: 1-Oct-2010

This meeting will bring together field linguists, computer scientists, and publishers with the aim of exploring production and dissemination of grammatical descriptions in electronic/hypertextual format.

Language Description Heritage (LDH) open access digital library

it is my pleasure to announce the Language Description Heritage (LDH) open access digital library, available online at

http://ldh.livingsources.org

The LDH is being compiled at the Max Planck Society in Germany, specifically at the MPI for Evolutionary Anthropology in Leipzig in cooperation with the Max Planck Digital Library in Munich.

Language Description Heritage (LDH) open access digital library

Dear colleagues,

it is my pleasure to announce the Language Description Heritage (LDH) open access digital library, available online at

http://ldh.livingsources.org

The LDH is being compiled at the Max Planck Society in Germany, specifically at the MPI for Evolutionary Anthropology in Leipzig in cooperation with the Max Planck Digital Library in Munich.

NSF Software Infrastructure for Sustained Innovation (SI**2)

On March 16, the National Science Foundation announced the Software Infrastructure for Sustained Innovation (SI**2) Program Solicitation 10-551 at http://www.nsf.gov/pubs/2010/nsf10551/nsf10551.pdf. This is an NSF-wide solicitation, led by the Office of Cyberinfrastructure (OCI).

The World Loanword Database goes online: interview with Robert Forkel

The World Loanword Database (WOLD, http://wold.livingsources.org/), edited by Martin Haspelmath and Uri Tadmor and published by the Max Planck Digital Library (http://www.mpdl.mpg.de/) is a new digital resource for linguists that allows tracing the origin of loan words.

We had the oportunity to interview WOLD web developer Robert Forkel and ask him about the design philosophy and technology behind the platform. Soon (in about 1-2 weeks) we will also post an interview with Martin Haspelmath on the potential of WOLD for data-driven linguistic research.

Syndicate content
Powered by Drupal, an open source content management system