by Peter Wittenburg
Participation in externally funded projects is very important for the TLA (The Language Archive) team for the usual reasons: (1) ensure funding to maintain existing software and add new functionalities – both being essential to maintain software; (2) participate in open competitions to show and to improve competence; (3) open new opportunities in a dynamic IT landscape. In this respect TLA was very successful during the last months, although the effort to form stable consortia and to come to proper proposals was considerable. We were part of 6 proposals from which 5 were accepted. It is a pity that the CLARICLE proposal which was meant to support the CLARIN ERIC in its construction efforts was not accepted.
CLARIN D (BMBF)
Common Language and Technology Research Infrastructure
2011 – 2016
The follow-up project for the German D-SPIN (CLARIN) has been granted and will start officially at 1.5.2011. The new CLARIN D will participate in building the language resource and tools infrastructure and is therefore part of the European CLARIN ERIC initiative which will become a legal entity in 2011. In this initiative TLA will become one of the strong centers, improve some of the already started frameworks and add new ones that will turn out to be important for building and maintaining a useful research infrastructure enabling e-Humanities. Since we have reported frequently about CLARIN we refer for further information to the web-site.
Data Service Infrastructure for the Social Sciences and Humanities
2011 – 2014
This project brings together all 5 ESFRI research infrastructure initiatives in the social sciences and humanities (SSH) represented each by some centers: CLARIN, DARIAH, CESSDA, ESS, SHARE. The goal is to determine areas of possible synergies in the infrastructure development and to work on a few concrete joint activities. The rationale behind this idea is that a) double developments should be prevented, b) initiatives should mutually benefit from the advanced work of the others and c) to establish joint integrated domains where this makes sense for the SSH users. Joint activities will be along the following dimensions: understanding the different architectural solutions, assessing and improving data and metadata quality, setting up a tools and services forum, improve the quality of survey data, locate and improve data preservation and curation services, develop a joint shared data access and enrichment framework (AAI, PIDs, joint Metadata, Workflow implementations, joint annotation framework), jointly work on legal and ethical aspects, carry out much training and education work, work on disseminating the results.
For TLA this is a very interesting opportunity to disseminate resources and tools to other disciplines and integrate good components from others in the CLARIN infrastructure. This project is expected to start after the summer time in 2011.
Innovative Networking in Infrastructure for Endangered Languages
2011 – 2014
This project will strengthen our international activities which where started in the DOBES project on the one hand and in CLARIN on the other. Together with the University of Cologne and colleagues from Poznan and Budapest we will start the following activities in the area of endangered language documentation and archiving: (1) setup 3 new regional archives and run annual workshops with all experts active in the current and coming regional centers; (2) organize best practice meetings with international guests and summer schools, (3) work out educational material to go into schools to get pupils’ attention. In all infrastructure aspects the CLARIN agreements will be of relevance.
For TLA it is an excellent opportunity to extend its archiving network and it is of course fo great importance to spread the CLARIN messages. More about this project will be said in a separate article. This project is expected to start in June/July 2011.
European Data Infrastructure
2011 – 2014
EUDAT is a first consequence of the report “Riding the Wave” of the EC’s High Level Expert Group on Scientific Data in so far as it brings together 13 community driven infrastructure initiatives and 10 data centers to build a first prototype of a Collaborative Data Infrastructure (CDI). In such a CDI the community infrastructures take care of user oriented services on data, the data centers take care of common horizontal data services which are the same or at least very similar for all research disciplines and where both need to address topics such as data curation and establishment of trust between all stakeholders. CLARIN is one of the communities being selected in this project of strategic relevance. It has been understood worldwide that our efforts to take care of research data in terms of their preservation and in order to maintain accessibility need to be strengthened. Therefore EUDAT will focus on professional and robust common services such as: (1) providing an easy deposit for all involved researchers, (2) setup a distributed architecture allowing the participating centers to easily store large data volumes for preservation and access purposes (which includes a safe replication of data), (3) working on a policy-rules based replication at logical level of collections, (4) testing generic web services execution frameworks. This project is expected to start at 1.10.2011.
Rahmenbedingungen einer disziplinübergreifenden Forschungsdateninfrastruktur
2011 – 2014
This project can be compared with the EUDAT project in so far as it tries to define the basis and roadmap for a future data infrastructure for the research domain in Germany. Whiel EUDAT is already meant to come up with concrete services, Radieschen will make many interviews with experts from different stakeholders which will be analyzed in a few major dimensions with the goal to come up with a suggestion how the Collaborative Data Infrastructure can be realized in Germany with its federal organization structure. This project will start at 1.5.2011
- More information on the existing Regional Archives set up by TLA can be found in our flyer on the topic (pdf).
- More information on CLARIN can be found on the CLARIN web site.
- More information on the CLARIN ERIC can be found in this issue of the CLARIN newsletter (pdf).
- Additional information on PARADE the project idea that became EUDAT can be found at the PARADE web site.
- More information on EUDAT can be found in this presentation by Kimmo Koski (ppt).
- Riding the Wave – How Europe can gain from the rising tide of scientific data” is the final report of the European Commission’s High Level Expert Group on Scientific Data. It is available for dowload (pdf).