Abstract: State-of-the-art approaches to Query-by-Example speech search are usually based on acoustic word embedding (AWE), representing variable-length speech segments with fixed-dimensional vectors ...
Unsupervised video Object-Centric Learning (OCL) is promising as it enables object-level scene representation and dynamics modeling as we humans do. Mainstream video OCL methods adopt a recurrent ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results