Welcome to the AMI Meeting Corpus
|[19 Dec. 2007] AMI annotation and metadata available in several formats.
|[15 Oct. 2007] An AMI corpus mirror has been set up at Edinburgh (UK).|
|[25 Sept. 2007] AMI annotation zip archive version 1.3 is now available.|
Register here for free
The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. For a gentle introduction to the corpus, see the corpus overview. To directly access the data, login or register here for free and go to the Access AMI corpus section. Around two-thirds of the data has been elicited using a scenario in which the participants play different roles in a design team, taking a design project from kick-off to completion over the course of a day. The rest consists of naturally occurring meetings in a range of domains. Detailed information can be found in the documentation section.
Synchronised recording devices:
Although the AMI Meeting Corpus was created for the uses of a consortium that is developing meeting browsing technology, it is designed to be useful for a wide range of research areas. The downloads on this website include videos that are suitable for most purposes, but higher resolution videos are available for researchers engaged in video processing. You can get samples of the the high resolution videos sent to you on a DVD by completing the form (DVD Taster online order).
All of the signals and transcription, and some of the annotations, have been released publicly under the AMI Meeting Corpus license, very similar to the Creative Commons Attribution NonCommercial ShareAlike 2.5 License. The remaining annotations generated for the original project will be released in stages by January 2007, but annotations may arise out of the ShareAlike licensing for some time to come.