[Show abstract][Hide abstract] ABSTRACT: The recording, multimodal analysis and archiving of meetings introduce new challenges for research in multimedia information management. Meetings involve multiple media that can be aligned together. This requires a global annotation framework. In particular, meetings often deal with documents, either projected or discussed, which can be aligned with the audio and video streams. This article presents a research agenda for bridging the gap between documents, several types of annotations of documents, and multimodal annotations of audio and video streams. It also presents the task of authoring meeting minutes as a means to evaluate multimodal annotations and the alignment of documents with other media.