Emotions govern so much of human behavior, so why are voice agents taking so long to catch up? What will an emotional voice interface look like, and how will the ability to perceive and express emotions shape the development of voice interfaces?