Research in speech technologies requires constant data collection and annotation. Over the years, several efforts have accordingly been made to collect data in the meeting environment. For speaker diarization of meetings in particular, meeting databases must be accurately transcribed into speaker segments. A few such databases are already available, and a few more are currently being recorded and transcribed. Some of them are:

