2019 marks the third year in a row that Samsung R&D Institute Poland, in partnership with the UK’s University of Edinburgh (UEDIN), has received accolades at the International Workshop on Spoken Language Translation (IWSLT).
This is one of the top two global workshops on automatic language translation, along with the Workshop on Machine Translation (WMT).
This year, Samsung R&D Institute Poland won first place in two categories, the first being text-to-text translation from English to Czech and the second – an end-to-end system translating English speech into German text.
For the text-to-text translation category, researchers worked to develop a model to translate the transcript of a spoken English-language TED Talk into Czech. Developing their winning model required the Samsung team to develop large, filtered corpora from which to work and generate as much synthetic data as possible.
The work done by the Samsung R&D Institute Poland team, together with additional modelling help from UEDIN, was selected as the best in the challenge by human evaluators. This means that the translations produced by Samsung R&D Institute Poland’s system scored the highest both in fluency and adequacy.
Samsung R&D Institute Poland’s participation in their second winning category this year, the end-to-end translation system from English to German, was a first for the team. The task was to produce a German-language transcription of an English-language TED Talk audio recording. This task required the development of a single model that could take an audio file input and subsequently produce a translated transcription.
It was made more difficult by the deficiency of the provided audio sources, compared to typical speech recognition task. Samsung R&D Institute Poland proposed several innovative methods for end-to-end speech translation that mitigated this source paucity, obtaining a state-of-the-art result with their final system that won them first place in the challenge.
For more security news visit here.