Cyberling Blog | Information about cyberinfrastructure and data sharing in linguistics and language sciences

Cyberling 2009 workshop final report online

Submitted by ebender on Fri, 03/20/2015 - 11:07

The Cyberling 2009 Workshop report is now available online for download, thanks to the LSA:
http://www.linguisticsociety.org/files/Cyberling2009Workshop_FinalReport.pdf

PHOIBLE Online

Submitted by stiv on Mon, 09/15/2014 - 15:46

We are pleased to announce the release of PHOIBLE Online, a repository of cross-linguistic phonological inventory data:

http://phoible.org/

Summer School "Coding for Language Communities"

Submitted by pbouda on Thu, 01/23/2014 - 08:02

From the 11th to the 15th of August, the Centro Interdisciplinar de Documentação Linguística e Social will hold another summer school in Minde (Portugal), dedicated to the topic "Coding for Language Communities" (CLC 2014).

NSF solicitation "Building Community and Capacity for Data-Intensive Research ... (BCC-SBE/EHR)"

Submitted by terry on Thu, 12/12/2013 - 11:59

in funding opportunities

The NSF Directorates for Social, Behavioral and Economic Sciences (SBE) and for Education and Human Resources (EHR) recently issued the third and final in a series of joint solicitations for Building Community and Capacity for Data-Intensive Research.

Review of Glottolog 2.0

Submitted by stiv on Sat, 08/24/2013 - 03:07

Back in February of 2012, Sebastian Nordhoff (MPI-EVA) announced on Cyberling the launch of Glottolog/Langdoc, a comprehensive database of bibliographic data about the world’s languages.[1] Unfortunately, given the ephemeral nature of Web resources, links in that announcement, such as "Give me all works about Zulu", now return 404 errors. These broken links result from the release of the new Glottolog 2.0, however, and they are sure to be fixed, if they haven’t been already.[2]

Free Science Blog

Submitted by ebender on Tue, 02/19/2013 - 17:09

Members of the Cyberling community might be interested in the Free Science Blog, for discussion of open access publishing.

Crowdsourcing WALS using Linked Data

Submitted by snordhoff on Mon, 09/03/2012 - 05:22

The World Atlas of Language Structures project (http://wals.info) is one of the landmarks of digital linguistics. It covers 192 features across 2678 languages. However, the resulting data matrix is very sparse: of the 514176 possible datapoints, only about 68000, or 13%, are filled.
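The sparsity figure can be checked with a few lines of Python. This is only a sketch of the arithmetic, using the numbers quoted in the post; the variable names are illustrative, and the count of filled datapoints is the post's approximation ("about 68000"), not an exact figure.

```python
# Figures quoted in the post; `filled` is approximate ("about 68000").
features = 192
languages = 2678
filled = 68_000

# Size of the full feature-by-language matrix.
possible = features * languages

# Fraction of cells that actually hold a value.
density = filled / possible

print(f"possible datapoints: {possible}")
print(f"share filled: {density:.1%}")
print(f"empty cells: {possible - filled}")
```

The product of features and languages reproduces the 514176 possible datapoints, and the filled share rounds to the 13% stated above.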

Interview: New blog for experimental statistics in corpus linguistics

Submitted by ebender on Tue, 06/19/2012 - 22:12

An interview with Sean Wallis, author of http://corplingstats.wordpress.com/:

What led you to set up the blog?

The blog comes from several sources. My research background is in cognitive science and AI, and in particular machine learning applied to scientific research, and statistics is a key component of that. I have been involved in regular debates about the role of statistical evidence in corpus linguistics over the years, so (for example) you will find some of the same experimental design themes about choice in our 2002 book, http://www.ucl.ac.uk/english-usage/projects/ice-gb/book.htm. I am not a linguist "by trade" but a methodologist, so I can only work by collaborating with and learning from others.

NSF/OCI Data Infrastructure Building Blocks (DIBBs) solicitation

Submitted by terry on Fri, 06/15/2012 - 06:16

in funding opportunities

The National Science Foundation's Office of Cyberinfrastructure has announced a new solicitation, Data Infrastructure Building Blocks (DIBBs), which among other things is a successor to its 2007-08 INTEROP solicitation. It has three tracks: Conceptualization, Implementation, and Interoperability (the first with a 26 July 2012 deadline, the second and third with a 30 August 2012 deadline).

Announcing Glottolog/Langdoc, a knowledge base of 175k references for (mostly) underdescribed languages

Submitted by snordhoff on Mon, 03/26/2012 - 05:19

We are happy to announce Glottolog/Langdoc, a comprehensive knowledge base of 104k languoids and 175k references for the Semantic Web.

In linguistics as well as in the Semantic Web world, it is important to clearly identify the concepts one is talking about. Glottolog/Langdoc takes this insight as a starting point and provides 104k Uniform Resource Identifiers (URIs) for languoids and 175k for references to descriptive literature focusing on underdescribed languages.