In this study, the method to summarise TV drama based on the psychological unfolding is proposed. In past researches, video summarisation has been achieved almost by way of choosing representative frames in visual track and never based on the mental side of video content. In this work, the video structure which consists of audio & visual tracks (actor's utterances, BGM, background sound, effect ... [Show full abstract] sounds and shots) is modeled and the track-structure-based video summarization method is proposed. To extract the temporal feature patterns of the track structure correspondent with the specific psychological content, each tracks is first quantified by calculating existence ratio etc., and second the intra/inter track feature patterns are determined based on empirical knowledges. The proposed method is implemented and examined in the subjective experiment.