The CHILDES project provides a large database of first and second language acquisition data from over 30 languages in a constant format, called CHAT. There are also programs for Windows and Macintosh that permit analysis of this database as well as alignment of text to speech and video.
TLA-team: the CHILDES project provides the CLAN Toolkit that can carry out a variety of very useful linguistic analysis functions on whole collections of CHAT formatted files. The CHAT format is aging, but it is being used by many researchers worldwide and can thus be seen as a best practice format. (see below under CLAN).