Mobile devices accelerate speech processing research
Professor of Practice Tom Bäckström, who began at Aalto University Department of Signal Processing and Acoustics in August, believes that in the future phone calls will be possible with devices other than mobile phones.
‘We have many devices around us. Why not take advantage of the resources of all of them rather than making calls from a single phone', asks Bäckström.
Three things are needed in order for this to succeed.
'The first mathematical modelling challenge is how to isolate the desired signal from the many microphones that are available. Another challenge is achieving interoperability between random devices. The third involves privacy protection and data security, which means how we ensure that a phone call only goes to the right person.'
Bäckström and his research group are currently focusing on the first challenge: how to isolate the desired signal from the many microphones that are available. Protecting privacy is not the most difficult task, because a compression algorithm has already been identified as a potential solution for encryption.
Scientific work in the field ends up in products
Bäckström spent eight years living in Germany and has experience at the Fraunhofer research institute and the Friedrich-Alexander University that operates in close co-operation with it. At Fraunhofer, he was involved in developing a standard for better speech quality, which has just been completed.
'These standards help ensure that phones from different manufacturers can communicate with each other. The 3GPP Enhanced Voice Services (EVS) standard provides better audio compression and increases the efficiency of infrastructure use because it is the first speech coding standard that supports packet-switched networks like LTE,' says Bäckström.
In addition to standardisation work, Bäckström expects the future to offer more scientific work and major development as mobile devices help move theory through the software development process to products. Mobile devices present new challenges for the sector as the scientific foundation has to be rebuilt and updated.
'The field can process and revise existing theory and achieve a lot with small discoveries. Speech processing is a great field, because highly mathematical concepts and scientific work in general are incorporated into products in a relatively short time.'
Bäckström wants to remind people that acoustics and speech processing is a very broad field. The hottest topics in speech processing at this time include synthesis and speech recognition.
'Although speech recognition is exciting, it is only one area of speech processing. I encourage students to become familiar with the whole field and I'll try to fuel their enthusiasm,' says Bäckström in closing.
Photo: Aino Huovio
Read more news
Research Council of Finland establishes a Center of Excellence in Quantum Materials
The Centre, called QMAT, creates new materials to power the quantum technology of coming decades.
Major funding powers development of next-generation machine technology aimed at productivity leap in export sectors
The BEST research project is developing new types of sealing, bearing, and damping technology.
The TAIMI project builds an equal working life – a six-year consortium project seeks solutions to recruitment and skill challenges
Artificial intelligence (AI) is changing skill requirements, the population is aging, and the labor shortage is deepening. Meanwhile, the potential of international experts often remains unused in Finland. These challenges in working life are addressed by the six-year TAIMI project funded by the Strategic Research Council, and implemented by a broad consortium.