Language Tech Market News
The Home of Multilingual Intelligence
Curated by LT-Innovate
Scooped by LT-Innovate

#ASR + Data 'Creation' to Preserve the Seneca Language

"Under the hood, it is data. With many Native languages, you don't have that volume of data," Jimerson said, explaining that some languages, while spoken, may not have as many formal linguistic tools—dictionaries, grammatical materials or extensive classes for non-native speakers, similar to those for Spanish or Chinese. "One of the most expensive and time-consuming processes of documenting language is collecting and transcribing it. We are looking at taking deep networks and maybe changing the architecture, making some synthetic data to create more data, but how do you make this work in deep learning? How do you augment data you already have?"


Horizon2020 #ML-driven #ASR Technology Solution for Air Traffic Controllers

Currently, several speech recognition modules require a manual adaptation to local needs caused by acoustic and language variabilities such as regional accents, phraseology deviations and local constraints. MALORCA proposed a general, cheap and effective solution to automate this re-learning, adaptation and customisation process. This involves automatically learning local speech recognition and ATCO models from radar and speech data recordings.

Salesforce Launches an #ASR-driven Speech Assistant for Enterprises

Salesforce has released Einstein Voice, a part of its Einstein AI platform that enables sales managers to dictate memos and navigate cloud services hands-free.


@Speechmatics Partners with Clique to Expand HD Conferencing Options To 75 Languages

Clique, a Los Angeles-based leader in the API economy and Communications Platform as a Service (CPaaS), is partnering with UK ASR company Speechmatics. The partnership will bring chat, voice recording and transcription to HD voice conversations worldwide for anyone using the Clique API.


A Euro Debut For Automated Closed-Captioning Tool

The enCaption4 automated closed-captioning tool will be showcased by ENCO at IBC2018. It marks the European debut of the engine for enCaption4, which ENCO says “takes real-time accuracy to unprecedented new levels, along with workflow efficiencies that range from NDI compatibility to captioning for live and on-demand streams.”


Say ‘Aloha’: A Closer Look at Facebook’s Voice Ambitions

Facebook is developing its own speech recognition feature under the name Aloha for both the Facebook and Messenger apps, as well as external hardware — likely the video chat smart speaker it’s developing.

High-quality Actor-dubbing Synchronization Is Here

Researchers have developed a system using artificial intelligence that can edit the facial expressions of actors to accurately match dubbed voices, saving time and reducing costs for the film industry. It can also be used to correct gaze and head pose in video conferencing, and enables new possibilities for video postproduction and visual effects.

UK Regional Accents Under Threat from Voice Assistants?

Voice assistants including Google Home, Alexa, Cortana and Siri could be suppressing regional accents, according to the Life Science Centre in Newcastle. A study found that 79% of people with a regional accent said they had to alter how they spoke in order to communicate with their devices.


Battling Population, Gender and Accent Bias In Speech Recognition Tech

"We see three different biases that are developing in the voice ecosystem — they are underrepresented demographics. So these are people in areas where the market is not powerful enough for bigger companies to reach. There is also gender biases and there's accent biases. So for instance, if you are a fluent, native English speaker in, say, Ireland, it's going to be a lot harder to use Siri than it is if you are a North American male," said Michael Henretty, Mozilla's Common Voice project lead.


Extracting Audio Signals from Visual Information

Researchers at MIT, Microsoft, and Adobe have developed an algorithm that can reconstruct an audio signal by analyzing minute vibrations of objects depicted in video. In one set of experiments, they were able to recover intelligible speech from the vibrations of a potato-chip bag photographed from 15 feet away through soundproof glass.
In other experiments, they extracted useful audio signals from videos of aluminum foil, the surface of a glass of water, and even the leaves of a potted plant. The researchers will present their findings in a paper at this year’s Siggraph, the premier computer graphics conference.


Explore Apollo - Transcription Completed of U.S. Moon Missions' Massive Analog Audio Recordings 

Researchers at the Center for Robust Speech Systems (CRSS) in the Erik Jonsson School of Engineering and Computer Science (ECS) received a National Science Foundation grant in 2012 to develop speech-processing techniques to reconstruct and transform the massive archive of audio into Explore Apollo, a website that provides public access to the materials. The project, in collaboration with the University of Maryland, included audio from all of Apollo 11 and most of the Apollo 13, Apollo 1 and Gemini 8 missions.

Automatic Punctuator to Improve Speech-to-Text Output

Mengyi Shan, a mathematics student at Harvey Mudd College in California, USA, trained recurrent neural networks to restore missing punctuation in text. At the moment, the model can only deal with commas and full stops, the most common and easiest of English's punctuation marks. “In NLProc problems such as automatic speech recognition (ASR), the generated text is normally unpunctuated, which is hard for further recognition or analysis. Thus punctuation restoration is a small but crucial problem that deserves our attention,” she explained last month.
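
Punctuation restoration is often framed as sequence labeling: each word gets a tag saying which mark, if any, follows it. The sketch below shows only that data framing for commas and full stops. It is an illustrative assumption, not Shan's actual setup; the names `to_training_pairs` and `apply_labels` are hypothetical, and the recurrent network that would predict the labels is left out.

```python
# Hypothetical label scheme: the mark that follows each word, or "O" for none.
LABELS = {",": "COMMA", ".": "PERIOD"}
MARKS = {"COMMA": ",", "PERIOD": ".", "O": ""}


def to_training_pairs(punctuated_text):
    """Turn punctuated text into (lowercased word, label) pairs."""
    pairs = []
    for token in punctuated_text.split():
        label = LABELS.get(token[-1], "O")
        word = token.rstrip(",.").lower()
        if word:  # skip tokens that were punctuation only
            pairs.append((word, label))
    return pairs


def apply_labels(words, labels):
    """Re-insert punctuation from predicted labels (here: any label list)."""
    return " ".join(word + MARKS[label] for word, label in zip(words, labels))


pairs = to_training_pairs("Hello, world. This is a test.")
words = [word for word, _ in pairs]
labels = [label for _, label in pairs]
restored = apply_labels(words, labels)
```

A model trained on such pairs would see only `words` (the unpunctuated ASR-style input) and learn to predict `labels`; `apply_labels` then turns its predictions back into readable text.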


@ScreenSystems to Launch New Subtitling Tech with @Speechmatics 

Screen Systems (UK) has been working with the Cambridge company Speechmatics to create “the holy grail” of subtitling: software that uses artificial intelligence to give what Mr Wales calls “the most accurate speech to text engine ever seen.” The new product is being launched at a trade show in Amsterdam this September.
But aren’t there other tech companies out there with the same mission? Yes, Mr Wales admitted - “but not to the level we are. We’ve been doing subtitling for over 40 years, so we’re the experts. We’re not scared of the competition, because we realise how difficult this industry is.”


@SnatchBot Goes Multi-National With Dublin Office

Israel's SnatchBot, which claims to be the fastest growing chatbot company in the world, has opened a new office in Dublin, Ireland. SnatchBot CMO and Dublin office manager Conor Kostick explains the thinking behind the move. ‘Obviously, the European market is hugely important to SnatchBot and Dublin has an attractive high-tech infrastructure, with state-of-the-art optical networks and world-class connectivity.’


Tech Stack for Discovering & Indexing Podcast Episodes with Amazon Transcribe and Comprehend

Quick overview of the services used in this application:
- Amazon Transcribe: Converts the audio files into a text transcription, including the timestamp of each word spoken. The demo also uses two cool features of Amazon Transcribe: speaker identification and custom vocabularies.
- Amazon Comprehend: Finds insights and relationships in text using natural language processing (NLP). It can extract key phrases, places, people, brands, and events.
- AWS Step Functions: Coordinates the workflow of processing the podcasts and scheduling Lambda functions.
- AWS Lambda: Provides a serverless compute platform to handle the application logic, in this case written in Python.
- Amazon Elasticsearch Service: Provides a managed Elasticsearch cluster for searching the podcast transcripts.
- Amazon Cognito: Provides user authentication for the Kibana user interface to Amazon ES.
- AWS CloudFormation: Provides infrastructure as code in the AWS environment to simplify the deployment of the components.
- Amazon S3: Provides shared storage for audio and text files.
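
As a rough illustration of the indexing step, the sketch below turns per-word timestamps into a word-to-timestamps lookup that a search index could store. It assumes the transcript follows Amazon Transcribe's JSON output layout (`results.items` entries with `start_time`, `type` and `alternatives`); the sample data and the `build_word_index` helper are hypothetical and are not taken from the application described above.

```python
from collections import defaultdict

# Hand-written sample in the assumed Transcribe output layout (not real job output).
SAMPLE_TRANSCRIPT = {
    "results": {
        "transcripts": [{"transcript": "Speech to text, speech"}],
        "items": [
            {"start_time": "0.5", "end_time": "0.8", "type": "pronunciation",
             "alternatives": [{"confidence": "0.99", "content": "Speech"}]},
            {"start_time": "0.9", "end_time": "1.0", "type": "pronunciation",
             "alternatives": [{"confidence": "0.98", "content": "to"}]},
            {"start_time": "1.1", "end_time": "1.5", "type": "pronunciation",
             "alternatives": [{"confidence": "0.97", "content": "text"}]},
            {"type": "punctuation",
             "alternatives": [{"confidence": "0.0", "content": ","}]},
            {"start_time": "2.1", "end_time": "2.6", "type": "pronunciation",
             "alternatives": [{"confidence": "0.95", "content": "speech"}]},
        ],
    }
}


def build_word_index(transcribe_output):
    """Map each lowercased spoken word to the start times (seconds) where it occurs."""
    index = defaultdict(list)
    for item in transcribe_output["results"]["items"]:
        if item["type"] != "pronunciation":
            continue  # punctuation items carry no timestamps
        word = item["alternatives"][0]["content"].lower()
        index[word].append(float(item["start_time"]))
    return dict(index)


word_index = build_word_index(SAMPLE_TRANSCRIPT)
```

In a pipeline like the one listed, a Lambda function could run a step of this kind on each finished transcript before writing documents to the search cluster, so that a search hit can link straight to the moment in the audio.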


Tencent Releases Open AI Platform for Industries Including #NLP & #ASR from @WeChat

Tencent’s open platform marks a major campaign for the Shenzhen-based firm to get its AI expertise adopted freely by individual developers and enterprises. It will enable enterprises across various industries to harness the capabilities of Tencent’s AI Lab, which is focused on machine-learning technology; super app WeChat, which provides speech recognition and natural language processing; and its Computer Vision Research Centre, which is focused on facial recognition technology.


@Trint on Automated Transcription & Translation for News Media

Trint recently teamed up with the Associated Press, and they’ve rolled out a new extension for captions on Adobe Premiere Pro CC. They’re also currently working on a Translation Project, which has secured Google DNI funding. In our conversation, Jeff Kofman of Trint told us about his career change from reporter to startup CEO, how he manages people, and how to integrate translation into the workflow.


How Smart Hubs Could Answer Follow-up Questions in #VA Contexts

In the paper “Contextual Slot Carryover for Disparate Schemas,” Gupta and colleagues describe a solution: a neural network that automatically learns to map one skill’s slots to another’s. The findings will be presented at the upcoming Interspeech conference in Hyderabad, India in September.


Flipkart's Indian-Language Challenge to the Big Assistants

Walmart-backed Flipkart has just issued a challenge to Amazon's Alexa and Google Assistant. The home-grown e-commerce giant today announced that it has acquired Bengaluru-based artificial intelligence (AI) startup Liv.ai, which has developed a platform that converts speech to text in nine regional languages apart from English. With this move, the e-tailer hopes to soon offer an end-to-end conversational shopping experience for its users.


Voice and #ASR Software Market to Hit $6.9B by 2025

The global market for voice and speech recognition software will increase from $1.1 billion in 2017 to $6.9 billion by 2025, according to a new report from Tractica. The leading use cases for the technology during that period will include healthcare (clinical documentation), automotive virtual digital assistants (VDAs), and voice commerce and customer service applications.
Additional use cases will include smart home controls, security and authentication, and voice search, among others, the report notes.


Voice-Command Services in Korean Hotels

SK Telecom said that it had installed its NUGU AI platform in 44 rooms of the Vista Walkerhill Hotel in eastern Seoul. This enables guests to use voice commands to control lighting, curtains and temperature, and to find checkout and breakfast times as well as other information about hotel facilities.
Since the release of the NUGU platform in September 2016, the firm has released about 40 kinds of services using the platform.


Using Location Info to Improve Siri’s #ASR Language Models

This article describes how we improved Siri’s ability to recognize names of local POIs by incorporating knowledge of the user’s location into our speech recognition system. Customized language models that take the user’s location into account are known as geolocation-based language models (Geo-LMs). These models enable Siri to better estimate the user’s intended sequence of words by using not only the information provided by the acoustic model and a general LM (like in standard ASR) but also information about the POIs in the user’s surroundings.
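
One simple way to picture the idea is to mix a general LM with a location-specific one when scoring competing hypotheses. The sketch below uses plain linear interpolation over whole-phrase probabilities, which is a deliberate simplification and not the method the Siri article describes; all names, scores and probabilities are made up for illustration.

```python
import math


def interpolated_log_prob(phrase, general_lm, geo_lm, geo_weight=0.5, floor=1e-9):
    """Log of a linear mix of general-LM and geo-LM phrase probabilities."""
    p_general = general_lm.get(phrase, floor)
    p_geo = geo_lm.get(phrase, floor)
    return math.log((1.0 - geo_weight) * p_general + geo_weight * p_geo)


def pick_hypothesis(acoustic_log_scores, general_lm, geo_lm, geo_weight=0.5):
    """Return the hypothesis with the best combined acoustic + LM log score."""
    return max(
        acoustic_log_scores,
        key=lambda hyp: acoustic_log_scores[hyp]
        + interpolated_log_prob(hyp, general_lm, geo_lm, geo_weight),
    )


# Invented numbers: the acoustic model slightly prefers the wrong phrase,
# and the local POI name is rare in the general LM but common near the user.
acoustic = {"peets coffee": -4.2, "pizza coffee": -4.0}
general_lm = {"peets coffee": 1e-6, "pizza coffee": 2e-6}
geo_lm = {"peets coffee": 1e-2}

without_geo = pick_hypothesis(acoustic, general_lm, geo_lm, geo_weight=0.0)
with_geo = pick_hypothesis(acoustic, general_lm, geo_lm, geo_weight=0.5)
```

With the geo weight at zero the acoustically favored phrase wins; once the location-specific probabilities are mixed in, the local POI name overtakes it, which is the effect the excerpt attributes to Geo-LMs.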


Evaluating Speech-to-Text Services from AWS, Microsoft and Google

For the moment, these speech-to-text services are likely to complement -- rather than replace -- other input modalities. Still, they can provide value, especially by indexing large blocks of audio for compliance and customer service purposes or automatically generating captions for audio and video streams. In cases where accuracy is paramount, developers should bake these tools into workflows that complement human transcribers. Developers can also use recording samples from existing sources to test the accuracy of these engines -- similar to an approach taken by Florida Institute of Technology researchers who developed a tool to analyze the quality of the different cloud speech engines.
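
A common way to test engine accuracy against existing recordings is word error rate (WER): the word-level edit distance between a human reference transcript and the engine output, divided by the reference length. The sketch below is a minimal version with whitespace tokenization and lowercasing only; it is not the Florida Institute of Technology tool, and the engine names are placeholders.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")
    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def rank_engines(reference, outputs):
    """Sort (engine name, WER) pairs from most to least accurate."""
    scored = [(name, word_error_rate(reference, text)) for name, text in outputs.items()]
    return sorted(scored, key=lambda pair: pair[1])


reference = "the cat sat on the mat"
ranking = rank_engines(reference, {
    "engine_a": "the cat sat on mat",      # placeholder engine output
    "engine_b": "the cat sat on the mat",  # placeholder engine output
})
```

A real comparison would also normalize punctuation and number formats before scoring, since engines differ in how they render them.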


Amazon Patents a Real-time Speech Accent "Translator"

Amazon has applied for a patent for an audio system that detects the accent of a speaker and changes it to the accent of the listener, perhaps helping eliminate communication barriers in many situations and industries. The patent doesn’t mean the company has made it (or necessarily that it will be granted), but there’s also no technical reason why it can’t do so.


Snapchat Launches First #ASR Lenses

Snapchat has just launched new lenses that respond to voice commands. The company says it’s now rolling out a series of lenses that will animate when users speak simple English words like “hi,” “love,” “yes,” “no,” and “wow.”
