Department of Computer Engineering2024-11-092017978145035081510.1145/3092912.31228012-s2.0-85028566680https://hdl.handle.net/20.500.14288/1628From a user interaction perspective, speech and sketching make a good couple for describing motion. Speech allows easy specification of content, events and relationships, while sketching brings in spatial expressiveness. Yet, we have insufficient knowledge of how sketching and speech can be used for motion-based video retrieval, because there are no existing retrieval systems that support such interaction. In this paper, we describe a Wizard-of-Oz protocol and a set of tools that we have developed to engage users in a sketch-and speech-based video retrieval task. We report how the tools and the protocol fit together using "retrieval of soccer videos" as a use case scenario. Our software is highly customizable, and our protocol is easy to follow. We believe that together they will serve as a convenient and powerful duo for studying a wide range of multi-modal use cases.pdfComputer engineeringCharacterizing user behavior for speech and sketch-based video retrieval interfacesConference proceedinghttps://doi.org/10.1145/3092912.3122801N/ANOIR01359