Conversational Interaction Technology
32.6K views | +0 today
Conversational Interaction Technology
CIT keeps you up-to-date on news for the Conversational Interaction Technology Innovation Alliance (CITIA)
Curated by LT-Innovate
Your new post is loading...
Your new post is loading...
Scooped by LT-Innovate
Scoop.it!

By the Skin of Your Voice: #ASR for Noisy Environments

An Israeli startup called VocalZoom wants to examine skin to comprehend what we say. When we talk, the skin on our faces makes subtle vibrations too slight to be noticed by the naked human eye. While experimenting with an instrument known as an interferometer, VocalZoom CEO Tal Bakish and his team noticed it could detect peculiar measurements. “When it measures the face, we found out that the vibrations were caused only by the speaker’s voice and were not affected at all by any background voice. At this point we realized that we have a disruptive technology to extract the voice of speaker in any noisy condition.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Why Voice Control Could be Next Big Tech Trend in UK Radio

When Amazon was planning its top-secret UK launch, it contacted Radioplayer with an interesting proposal. It wanted us to build a Radioplayer ‘skill’ (that’s what it calls ‘apps’ on the Echo), optimised for the UK radio market. We’re keen to learn more about voice-control (partly because it will be crucial in car dashboards in the future), so we built a simple ‘skill’ which enables listeners to play a station of their choice, and ask for recommended radio.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Bots That Can Talk Will Help Us Get More Value from Analytics

Gartner predicts that by 2018, advanced NLG will be integrated into the majority of smart data discovery platforms and that 20% of business content will be generated by machines.
more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

3 Open Source Smart-Speaker Projects

There are a few open source tools for voice control out there already, and it wouldn’t surprise me if the field grows as the technology becomes more pervasive.

LT-Innovate's insight:

Commoditising Echo-type technology

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Global Automotive Voice and Speech Recognition System Market 2016-2020

The Report Global Automotive Voice and Speech Recognition System Industry 2016, Trends and Forecast Report provides information on pricing, market analysis, shares, forecast, and company profiles for key industry participants.
more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Open Access to Chinese #ASR via Free APIs

Baidu is making available Chinese language APIs for its four key speech technologies: Long Utterance Speech Recognition, Far-Field Speech Recognition, Expressive Speech Synthesis and Wake Word. The announcement coincides with the three-year anniversary of Baidu's speech API launch.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

New #TTS Techs Raise New Voice Security Concerns

Adobe’s VoCo is not commercially available yet, and Google still is working on its WaveNet project, but they both could have some security and privacy implications. Voice authentication systems rely on many factors, not just the user’s voiceprint, in order to identify a subject. But being able to change the words in a fragment of speech or generate it out of whole cloth could help an attacker trick such systems.
LT-Innovate's insight:

Security measures obviously need to be multi-vector

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Lipreading Technology for Crime-Solving and Entertainment

Researchers at the University of East Anglia are trialling visual speech recognition technology that could reconstruct conversations captured on video even where there is no sound. “Lip-reading is one of the most challenging problems in artificial intelligence,“ said Professor Richard Harvey, of UEA’s School of Computing Sciences. The research has concentrated on training machines to recognise the appearance and shape of humans lips as they form words and sentences.

“Potentially, a robust lip-reading system could be applied in a number of situations, from criminal investigations to entertainment,” said Helen Bear, the lead author of the study into visual speech recognition.

LT-Innovate's insight:

Survey of how AI is impacting detective work of all kinds.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Review of French Fabriq's Smart Speaker System

Fabriqis the maker of a new, small-sized smart speaker with Alexa functionality built right in. At just $50, it's the same price as the Amazon Echo Dot, but it comes with a few additional advantages. For starters, it's a more powerful speaker than the Dot, and it features a built-in battery, too, letting you unplug the the thing and take it with you for up to 5 hours on a charge. You can also sync multiple speakers together over Wi-Fi for simultaneous playback -- the Echo Dot can't do anything like that.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

#AI software That Estimates a Person’s Age Based on Their Speech

The IBM team has developed a system to estimate the age of someone who is speaking, and the company says it’s published the most accurate results of any system yet with an average error rate of 4.7 years. The system’s success rate even exceeded that of human listeners in a prior study, according to the company.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Temporary Skin Sensor Could Improve Noisy-Environment #ASR

A sensor, created by University of Colorado Boulder and Northwestern University scientists, latches onto the skin and monitors the wearer’s heartbeat by picking up sound waves as they move through body tissue and fluids. Essentially, it functions like a mini-stethoscope. Attaching it to your throat could produce more accurate speech recognition data, for example. This could be especially helpful in a noisy environment, where a regular microphone’s recording quality is degraded. The researchers suggest this could enhance communication in loud environments by first responders, ground controllers, or security agents.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

China-US Partnership to Deliver Conversational AI Robots Globally

Nuance (US) has announced that ROOBO (China) will be leveraging the Nuance Mix NLU development platform, which provides voice and NLU capabilities, to create conversational and cognitive interfaces for ROOBO robots and devices. Additionally, Nuance and ROOBO will be collaborating on future integrations of advanced Artificial Intelligence (AI) to advance conversational interfaces for intelligent IoT devices.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Results of 2016 GOV.UK Assistive Tech Survey

Back in May we launched an online survey which ran for 6 weeks. The survey asked users about what devices, web browsers and assistive technology they use to access GOV.UK.
LT-Innovate's insight:

Statistics on usage of assistive tech - including ASR - by people consulting UK government website. Nuance comes out top for speech. 

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Big Web Company Launches Developer Services in #TTS and #NLProc to Spark Innovation

Amazon Lex — which refers to what’s in between “Alexa” — lets developers build conversational interfaces into apps by using voice and text. The same conversational engine that powers Alexa is now available to any developer, making it easy to bring sophisticated, natural language ‘chatbots’ to new and existing applications.”

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

First Mass Notification Service to Include UK #Text to Speech Voices With Regional Accents

As one of the few remaining UK owned and operated mass notification companies, Alert Cascade's latest platform release builds on the UK theme by including regional TTS voices designed and engineered by Cereproc, based in Edinburgh.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

UK Company Releases World's First Commercial Cognitive Web Site

UK company Volume's Digital Concierge, powered by IBM Watson uses a conversation service to allow text-to-speech interaction between a human and a Smart Machine. The cognitive application called 'LUSY' is trained in a single domain—the company - Volume. The aim is to provide all the information on the company through natural dialogue and Q&A. 

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Deep-Domain Conversational AI

MindMeld (US) - Deep-Domain Conversational AI describes the AI technology which is required to build voice and chat assistants which can demonstrate deep understanding of any knowledge domain. Deep-Domain Conversational AI relies on state-of-the-art machine learning approaches, big data techniques to manage large amounts of training data, as well as the curation of custom knowledge graphs which capture important domain knowledge for any application.

LT-Innovate's insight:

The future of communication with bots has been solved. It is "deep"

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

UK Project: Speech Technology for Articulation Rehabilitation

To address the difficulties in delivering articulation therapy, the STAR (Speech Technology for Articulation Rehabilitation) project, funded by the National Institute of Health Research, is developing an app based on novel speech recognition technology. Researchers at Barnsley Hospital and the University of Sheffield have developed automatic speech recognition technology that can objectively ‘score’ speech productions. Research has shown that feedback based on these scores can be used to help people modify their speech productions.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

#ASR in Healthcare Overview

Experts says future versions will offer enhanced features that will make clinicians more productive and allow them to be more patient-focused. "It is a very, very exciting time for speech recognition and related technology, especially in health care information—a ton is going on," says Peter Mahoney, senior vice president and general manager of Dragon and clinical documentation for the health care division of Nuance Communications.

LT-Innovate's insight:

Talks about the emergence of "ambient intelligence" with ASR constantly listening to doctor-patient conversations to provide useful background or corrective information. US-oriented.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Size of the Smart Speakers Market

A new study by Juniper Research has found that revenue from smart audio hardware will more than triple over the next four years, rising from an estimated $1.4 billion this year to over $5.5 billion by 2020.
more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Global Live Chat Software Market Expected to Grow at CAGR 7.98% to 2020

The analysts forecast the global live chat software market to grow at a CAGR of 7.98% during the period 2016-2020. Live chat is a real-time communication between two users via computer. It is appropriate for low to moderately complex product support and is used by many organizations. It can also be a means of surveying customers without being intrusive.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

#ASR Still Has Rough Road Ahead for Drivers

According to the J.D. Power 2016 U.S. Initial Quality Study, 23 percent of all problems reported by car buyers involved infotainment, and voice-recognition systems remain a huge part of the problem. Not just for older motorists. Voice-based commands are among the top five "difficult-to-use" cockpit technologies for Generation X, Generation Y and baby boomers.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

New K Graph-driven Conversational Platform for Enterprises

MindMeld's new Deep-Domain Conversational AI platform makes it possible for companies to create voice and chat assistants that can demonstrate knowledge and expertise around any custom content domain. The platform offers capabilities such as broad vocabulary natural language understanding, question answering across any knowledge graph, dialogue management and dialogue state tracking, and large scale training data generation and management, with cloud-based or on-premises deployment.

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

New Crowd-as-a Service Private Beta #NLProc/Speech Platform

DefinedCrowd (US/JP) released the latest phase of speech processing program templates and introduced their first image collection and enrichment data pipeline, marking the first step into the exploding AI-based image enrichment market. All of these new pipelines enable enterprises to define and distribute speech and natural language programs for processing through the DefinedCrowd global Crowd-as-a-Service platform. 

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Conversational Accountability of AI Decision-making

Making the artificial intelligence process understandable to most people will require yet more artificial intelligence. "There will need to be a kind of conversation between a person and a machine, just like you would have a conversation among two people," Banavar (IBM) says. Each question and answer could lead to another question and answer, which would require a very smart language interface. Imagine asking Siri or Alexa not about the weather forecast or movie times, but about why you were denied a mortgage. The AI would have to go methodically through your credit history and explain how each item came into play.

LT-Innovate's insight:

Analyses among many things the need for AI decisions to be "explainable" under the new EU personal data protection law to become effective in 2018, and the role a natural language interface would play in enabling this process

more...
No comment yet.
Scooped by LT-Innovate
Scoop.it!

Artificial Lip-reading More Accurate than Humans But Still Needs Work

A new paper (pdf) from the University of Oxford (with funding from Alphabet’s DeepMind) details an artificial intelligence system, called LipNet, that watches video of a person speaking and matches text to the movement of their mouth with 93.4% accuracy.
more...
No comment yet.