Look Mom, No Hands: Implementing Speech-Enabled Applications

Speech-based technology has evolved to the point where it is robust enough for emergency responders to use.


Challenges and solutions
Considering the enormous advantages speech-based applications offer emergency workers, it is a wonder they were not adopted sooner. Emergency workers have not relied on speech-based systems for data retrieval and report filing because a number of technological hurdles have plagued efforts to implement such systems. These hurdles fall into three areas: application development, user interface development and environmental challenges.

In the past, the enormous breadth of the English language made application development for speech-based platforms extremely difficult and limited. Efforts to convert existing data into a format compatible with speech-based applications required extensive hand-coding and a great deal of oversight and management. Even relatively simple applications could require hundreds of pages of code to implement basic data entry.

Today, software tools exist that streamline the coding process and provide automated development and troubleshooting for interactions between information systems and speech recognition (SR) engines. These tools speed up the process of speech-enabling information systems and make it easier to continually update and modify SR platforms as the system is deployed.

Because reliable speech-based systems are still in the introductory market phase, voice user interfaces (VUIs) have yet to be perfected for field use. A VUI differs substantially from a graphical user interface (GUI), and until recently, best practices for VUI design had not been codified. Just as early attempts to build GUIs were often laughable, early VUI designs have too often been confusing, awkward and frustrating for users. (Users have very high expectations for machine-based SR because they have spoken only to people, who have remarkable flexibility and capability for disambiguation compared to computers.)

Many years of experience with SR applications in a variety of settings, including millions of telephone-based interactions with a diverse user base, have made it possible to codify best practices for building, testing and refining VUIs. Moreover, many of these best practices are built into reusable speech grammars and application architectures. While VUIs still require feedback from emergency professionals in the field to maximize performance, existing systems are sturdy enough to be applied in real-life scenarios without jeopardizing the safety of the user.

Given the often inhospitable environments in which emergency professionals work, past speech-based technology platforms were inappropriate for use by emergency personnel because the systems failed too frequently. Today's generation of voice recognition technology, however, is robust and rugged enough to work effectively in the harshest of environments. For example, emergency workers often operate in high-noise environments that make it difficult for SR engines to function at their best. Today's SR systems incorporate microphone array and noise suppression technologies that can overcome many of these challenges.

According to Max Patterson, a graduate of the U.S. Secret Service Protective Operations Briefing, and a former police chief for both the Albion, Michigan, and Windsor, Connecticut, police departments, "Integrated voice-enabled capability in mobile environments is particularly important in law enforcement. Officers can be alerted if the car is stolen, or if there is any other potential danger before pulling the vehicle over. The ability to use voice rather than a keyboard to enter and access information makes the VideoWitness Patrol Car System — using Vangard Voice AccuSPEECH technology — the ideal solution for smaller law enforcement agencies. It has a much smaller footprint, costs much less than the larger systems currently available and delivers all the functionality of those larger systems."

Ready to go
After a long gestation period, including more than 20 years of laboratory research and development, automated SR is finally ready for widespread deployment. While the technology has yet to be fully adopted by government agencies, speech-based record-keeping and data retrieval systems offer a number of advantages for emergency personnel.
