Abstract: Speech and gesture recognition has become a critical feature of today's applications, playing a central role in accessibility, learning, and human-computer interfaces. However, real-scene ...