<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;A0QCQXwyeSp7ImA9WhRUFkU.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708</id><updated>2012-01-27T13:36:00.291-05:00</updated><category term="ROI" /><title>Speech Analytics</title><subtitle type="html" /><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://speechanalytics.blogspot.com/" /><link rel="next" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default?start-index=26&amp;max-results=25&amp;redirect=false&amp;v=2" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>41</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/blogspot/OzfbY" /><feedburner:info uri="blogspot/ozfby" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;C0UCQ345eSp7ImA9WhdXF0Q.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-6405808409364702899</id><published>2011-08-31T07:47:00.000-04:00</published><updated>2011-08-31T07:47:42.021-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-08-31T07:47:42.021-04:00</app:edited><title>Speech Recognition Leaps Forward - Is it a revolution?</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/vaU5pYnm6venYhl9UYOUyscC37c/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/vaU5pYnm6venYhl9UYOUyscC37c/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/vaU5pYnm6venYhl9UYOUyscC37c/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/vaU5pYnm6venYhl9UYOUyscC37c/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;&lt;a href="http://www.blogger.com/post-create.g?blogID=1030722385446106708" name="versalTop"&gt;&lt;/a&gt;Great news from Microsoft about substantial progress in LVCSR. &lt;/div&gt;&lt;div class="MsoNormal"&gt;Please comment if you experience this technology and if you indeed view it as a revolution.&lt;/div&gt;&lt;div class="MsoNormal"&gt;Thx, Ofer&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="MsoNormal"&gt;See details in: &lt;a href="http://research.microsoft.com/en-us/news/features/speechrecognition-082911.aspx"&gt;http://research.microsoft.com/en-us/news/features/speechrecognition-082911.aspx&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-size: large;"&gt;&lt;b&gt;Speech Recognition Leaps Forward&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;By &lt;a href="mailto:msreditr@microsoft.com"&gt;Janie Chang&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;August 29, 2011 12:01 AM PT&lt;/div&gt;During &lt;a href="http://interspeech2011.org/"&gt;Interspeech 2011&lt;/a&gt;, the 12th annual Conference of the International Speech Communication Association being held in Florence, Italy, from Aug. 28 to 31, researchers from Microsoft Research will present work that dramatically improves the potential of real-time, speaker-independent, automatic speech recognition. &lt;br /&gt;
&lt;a href="http://research.microsoft.com/en-us/people/dongyu/default.aspx"&gt;Dong Yu&lt;/a&gt;, researcher at &lt;a href="http://research.microsoft.com/en-us/labs/redmond/default.aspx"&gt;Microsoft Research Redmond&lt;/a&gt;, and &lt;a href="http://research.microsoft.com/en-us/people/fseide/"&gt;Frank Seide&lt;/a&gt;, senior researcher and research manager with &lt;a href="http://research.microsoft.com/en-us/labs/asia/default.aspx"&gt;Microsoft Research Asia&lt;/a&gt;, have been spearheading this work, and their teams have collaborated on what has developed into a research breakthrough in the use of artificial neural networks for large-vocabulary speech recognition. &lt;br /&gt;
&lt;h1&gt;&lt;a href="http://www.blogger.com/post-create.g?blogID=1030722385446106708" name="ID0E3C"&gt;&lt;/a&gt;The Holy Grail of Speech Recognition &lt;/h1&gt;Commercially available speech-recognition technology is behind applications such as voice-to-text software and automated phone services. Accuracy is paramount, and voice-to-text typically achieves this by having the user “train” the software during setup and by adapting more closely to the user’s speech patterns over time. Automated voice services that interact with multiple speakers do not allow for speaker training because they must be usable instantly by any user. To cope with the lower accuracy, they either handle only a small vocabulary or strongly restrict the words or patterns that users can say. &lt;br /&gt;
The ultimate goal of automatic speech recognition is to deliver out-of-the-box, speaker-independent speech-recognition services—a system that does not require user training to perform well for all users under all conditions. &lt;br /&gt;
&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;span class="imageblock"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;“This goal has increased importance in a mobile world,” Yu says, “where voice is an essential interface mode for smartphones and other mobile devices. Although personal mobile devices would be ideal for learning their user’s voices, users will continue to use speech only if the initial experience, which is before the user-specific models can even be built, is good.”&lt;br /&gt;
Speaker-independent speech recognition also addresses other scenarios where it’s not possible to adapt a speech-recognition system to individual speakers—call centers, for example, where callers are unknown and speak only for a few seconds, or web services for speech-to-speech translation, where users would have privacy concerns over stored speech samples.&lt;br /&gt;
&lt;h1&gt;&lt;a href="http://www.blogger.com/post-create.g?blogID=1030722385446106708" name="ID0EMD"&gt;&lt;/a&gt;Renewed Interest in Neural Networks&lt;/h1&gt;Artificial neural networks (ANNs), mathematical models of the low-level circuits in the human brain, have been a familiar concept since the 1950s. The notion of using ANNs to improve speech-recognition performance has been around since the 1980s, and a model known as the ANN-Hidden Markov Model (ANN-HMM) showed promise for large-vocabulary speech recognition. Why then, are commercial speech-recognition solutions not using ANNs? &lt;br /&gt;
“It all came down to performance,” Yu explains. “After the invention of discriminative training, which refines the model and improves accuracy, the conventional, context-dependent Gaussian mixture model HMMs (CD-GMM-HMMs) outperformed ANN models when it came to large-vocabulary speech recognition.”&lt;br /&gt;
Yu and members of the &lt;a href="http://research.microsoft.com/en-us/groups/srg/"&gt;Speech&lt;/a&gt; group at Microsoft Research Redmond became interested in ANNs when recent progress in building more complex “deep” neural networks (DNNs) began to show promise at achieving state-of-the-art performance for automatic speech-recognition tasks. In June 2010, intern George Dahl, from the University of Toronto, joined the team, and researchers began investigating how DNNs could be used to improve large-vocabulary speech recognition.&lt;br /&gt;
“George brought a lot of insight on how DNNs work,” Yu says, “as well as strong experience in training DNNs, which is one of the key components in this system.” &lt;br /&gt;
A speech recognizer is essentially a model of fragments of sounds of speech. An example of such sounds are “phonemes,” the roughly 30 or so pronunciation symbols used in a dictionary. State-of-the-art speech recognizers use shorter fragments, numbering in the thousands, called “senones.” &lt;br /&gt;
Earlier work on DNNs had used phonemes. The research took a leap forward when Yu, after discussions with principal researcher &lt;a href="http://research.microsoft.com/en-us/people/deng/"&gt;Li Deng&lt;/a&gt; and &lt;a href="http://research.microsoft.com/en-us/people/alexac/"&gt;Alex Acero&lt;/a&gt;, principal researcher and manager of the Speech group, proposed modeling the thousands of senones, much smaller acoustic-model building blocks, directly with DNNs. The resulting paper&lt;i&gt;, &lt;a href="http://research.microsoft.com/pubs/144412/DBN4LVCSR-TransASLP.pdf"&gt;Context-Dependent Pre-trained Deep Neural Networks for Large Vocabulary Speech Recognition&lt;/a&gt;&lt;/i&gt; by Dahl, Yu, Deng, and Acero, describes the first hybrid context-dependent DNN-HMM (CD-DNN-HMM) model applied successfully to large-vocabulary speech-recognition problems. &lt;br /&gt;
“Others have tried context-dependent ANN models,” Yu observes, “using different architectural approaches that did not perform as well. It was an amazing moment when we suddenly saw a big jump in accuracy when working on voice-based Internet search. We realized that by modeling senones directly using DNNs, we had managed to outperform state-of-the-art conventional CD-GMM-HMM large-vocabulary speech-recognition systems by a relative error reduction of more than 16 percent. This is extremely significant when you consider that speech recognition has been an active research area for more than five decades.”&lt;br /&gt;
The team also accelerated the experiments by using general-purpose graphics-processing units to train and decode speech. The computation for neural networks is similar in structure to 3-D graphics as used in popular computer games, and modern graphics cards can process almost 500 such computations simultaneously. Harnessing this computational power for neural networks contributed to the feasibility of the architectural model.&lt;br /&gt;
In October 2010, when Yu presented the paper to an internal Microsoft Research Asia audience, he spoke about the challenges of scalability and finding ways to parallelize training as the next steps toward developing a more powerful acoustic model for large-vocabulary speech recognition. Seide was excited by the research and joined the project, bringing with him experience in large-vocabulary speech recognition, system development, and benchmark setups. &lt;br /&gt;
&lt;h1&gt;&lt;a href="http://www.blogger.com/post-create.g?blogID=1030722385446106708" name="ID0ELE"&gt;&lt;/a&gt;Benchmarking on a Neural Network&lt;/h1&gt;“It has been commonly assumed that hundreds or thousands of senones were just too many to be accurately modeled or trained in a neural network,” Seide says. “Yet Yu and his colleagues proved that doing so is not only feasible, but works very well with notably improved accuracy. Now, it was time to show that the exact same CD-DNN-HMM could be scaled up effectively in terms of training-data size.”&lt;br /&gt;
The new project applied CD-DNN-HMM models to speech-to-text transcription and was tested against Switchboard, a highly challenging phone-call transcription benchmark recognized by the speech-recognition research community. &lt;br /&gt;
First, the team had to migrate the DNN training tool to support a larger training data set. Then, with help from Gang Li, research software-development engineer at Microsoft Research Asia, they applied the new model and tool to the Switchboard benchmark with more than 300 hours of speech-training data. To support that much data, the researchers built giant ANNs, one of which contains more than 66 million inter-neural connections, the largest ever created for speech recognition. &lt;br /&gt;
The subsequent benchmarks achieved an astonishing word-error rate of 18.5 percent, a 33-percent relative improvement compared with results obtained by a state-of-the-art conventional system. &lt;br /&gt;
&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;span class="imageblock"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;“When we began running the Switchboard benchmark,” Seide recalls, “we were hoping to achieve results similar to those observed in the voice-search task, between 16- and 20-percent relative gains. The training process, which takes about 20 days of computation, emits a new, slightly more refined model every few hours. I impatiently tested the latest model every few hours. You can’t imagine the excitement when it went way beyond the expected 20 percent, kept getting better and better, and finally settled at a gain of more than 30 percent. Historically, there have been very few individual technologies in speech recognition that have led to improvements of this magnitude.”&lt;br /&gt;
The resulting paper, titled &lt;i&gt;&lt;a href="http://research.microsoft.com/pubs/153169/CD-DNN-HMM-SWB-Interspeech2011-Pub.pdf"&gt;Conversational Speech Transcription Using Context-Dependent Deep Neural Networks&lt;/a&gt;&amp;nbsp;&lt;/i&gt;by Seide, Li, and Yu, is scheduled for presentation on Aug. 29. The work already has attracted considerable attention from the research community, and the team hopes that taking the paper to the conference will ignite a new line of research that will help advance the state of the art for DNNs in large-vocabulary speech recognition. &lt;br /&gt;
&lt;h1&gt;&lt;a href="http://www.blogger.com/post-create.g?blogID=1030722385446106708" name="ID0EFF"&gt;&lt;/a&gt;Bringing the Future Closer&lt;/h1&gt;With a novel way of using artificial neural networks for speaker-independent speech recognition, and with results a third more accurate than what conventional systems can deliver, Yu, Seide, and their teams have brought fluent speech-to-speech applications much closer to reality. This innovation simplifies speech processing and delivers high accuracy in real time for large-vocabulary speech-recognition tasks. &lt;br /&gt;
“This work is still in the research stages, with more challenges ahead, most notably scalability when dealing with tens of thousands of hours of training data. Our results represent just a beginning to exciting future developments in this field,” Seide says. “Our goal is to open possibilities for new and fluent voice-based services that were impossible before. We believe this research will be used for services that change how we work and live. Imagine applications such as live speech-to-speech translation of natural, fluent conversations, audio indexing, or conversational, natural language interactions with computers.” &lt;br /&gt;
&lt;div class="MsoNormal"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-6405808409364702899?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/-E8IgU65VKA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/6405808409364702899/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=6405808409364702899" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6405808409364702899?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6405808409364702899?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/-E8IgU65VKA/speech-recognition-leaps-forward-is-it.html" title="Speech Recognition Leaps Forward - Is it a revolution?" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2011/08/speech-recognition-leaps-forward-is-it.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEAARX86eCp7ImA9WhZUGUk.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-1751468951908035846</id><published>2011-06-13T02:32:00.000-04:00</published><updated>2011-06-13T02:32:24.110-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-06-13T02:32:24.110-04:00</app:edited><title>iOS Speech Recognition Settings Confirm Nuance-Apple Partnership</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/cTovM64NH1B0uG8d3OlfHJYtRoI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/cTovM64NH1B0uG8d3OlfHJYtRoI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/cTovM64NH1B0uG8d3OlfHJYtRoI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/cTovM64NH1B0uG8d3OlfHJYtRoI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Interesting post @ &lt;a href="http://www.macrumors.com/2011/06/11/ios-speech-recognition-settings-confirm-nuance-apple-partnership/"&gt;http://www.macrumors.com/2011/06/11/ios-speech-recognition-settings-confirm-nuance-apple-partnership/&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://cdn.macrumors.com/article-new/2011/06/nuance1.jpg" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img alt="" class="aligncenter size-full wp-image-314487" height="371" src="http://cdn.macrumors.com/article-new/2011/06/nuance1.jpg" title="nuance" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A couple of screenshots &lt;a href="http://twitter.com/#%21/chronicwire"&gt;posted on Twitter&lt;/a&gt; by &lt;a href="http://twitter.com/#%21/chronicwire"&gt;@ChronicWire&lt;/a&gt;  reveals hidden Nuance preferences found in the latest internal iOS  builds that confirms that Apple has been actively working on building in  speech recognition into iOS.  &lt;br /&gt;
&lt;br /&gt;
Rumors of a Nuance-Apple partnership had been &lt;a href="http://www.macrumors.com/2011/05/06/apple-negotiating-deal-with-nuance-for-speech-recognition-in-ios-5/"&gt;heavy&lt;/a&gt; in the weeks prior to WWDC, though no announcements were made during the keynote.   Later, &lt;a href="http://techcrunch.com/2011/06/06/apple-nuance-wwdc-keynote/"&gt;comments&lt;/a&gt; by Robert Scoble indicated that the deals were simply not completed in time for WWDC but were still in the works:&lt;br /&gt;
&lt;div class="quote"&gt;I  was told weeks ago by my source (same one who told me Twitter would be  integrated deeply into the OS) that Siri wouldn't be done in time. Maybe  for this fall's release of iPhone 5? After all, they need to have some  fun things to demo for us in August, no?&lt;/div&gt;The source of the screenshots (&lt;a href="http://twitter.com/#%21/chronic"&gt;@Chronic&lt;/a&gt; / &lt;a href="http://twitter.com/#%21/sonnydickson"&gt;@SonnyDickson&lt;/a&gt;)  has been known to have legitimate sources in the past.  So, it seems  certain that Apple is actively working on bringing Nuance speech  recognition into iOS, perhaps as early as iOS 5 this fall.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-1751468951908035846?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/eXw3NUmWi9A" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/1751468951908035846/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=1751468951908035846" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1751468951908035846?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1751468951908035846?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/eXw3NUmWi9A/ios-speech-recognition-settings-confirm.html" title="iOS Speech Recognition Settings Confirm Nuance-Apple Partnership" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2011/06/ios-speech-recognition-settings-confirm.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUEEQn85eip7ImA9WhZUGE0.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-5210162691467348537</id><published>2011-06-11T10:46:00.000-04:00</published><updated>2011-06-11T10:46:43.122-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-06-11T10:46:43.122-04:00</app:edited><title>Again: Nuance Slaps Vlingo With Another Patent Lawsuit Over Voice Recognition Technology</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/a4KbhnzFGr_AzUBPeIl-dsSqcYY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/a4KbhnzFGr_AzUBPeIl-dsSqcYY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/a4KbhnzFGr_AzUBPeIl-dsSqcYY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/a4KbhnzFGr_AzUBPeIl-dsSqcYY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;I guess Nuance is trying again to acquire Vlingo (given its standard sue before acquire strategy). &lt;br /&gt;
&lt;br /&gt;
See below from Techcrunch: &lt;a href="http://techcrunch.com/2011/06/09/nuance-sues-vlingo-again-over-voice-recognition-patents/#comments"&gt;http://techcrunch.com/2011/06/09/nuance-sues-vlingo-again-over-voice-recognition-patents/#comments&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Well, this is interesting. &lt;a href="http://www.nuance.com/"&gt;Nuance,&lt;/a&gt; a company that develops imaging and voice recognition technologies, is once again suing competitor &lt;a href="http://www.vlingo.com/"&gt;Vlingo&lt;/a&gt;, which also develops a voice search technology and is backed by Yahoo, &lt;a href="http://gigaom.com/2009/09/16/att-backs-vlingo-as-nuance-lawsuit-looms/"&gt;AT&amp;amp;T&lt;/a&gt; and Charles River Ventures. &lt;br /&gt;
According to the suit, which we’ve embedded below, Nuance claims  Vlingo is infringing on number of Nuance’s patents including U.S. patent  no. 6,487,534 B1, which relates to a “Distributed Client-Server Speech  Recognition System.” By making, using, selling, offering to sell, and or  importing its products and services related to speech recognition,  Nuance says Vlingo is infringing on its patent. &lt;br /&gt;
Nuance is also claiming that Vlingo is infringing on that U.S. patent  no. 6,785,653 B1, which is titled “Distributed Voice Web Architecture  and Associated Components and Methods,” U.S. patent no. 6,839,669 B1,  titled “Performing Actions Identified in Recognized Speech;”  U.S.  patent number No. 7,058,573 B1, titled “Speech Recognition System to  Selectively Utilize Different Speech Recognition Techniques Over  Multiple Speech Recognition Passes;” and U.S. patent no. 7,127,393 B2,  titled “Dynamic Semantic Control of a Speech Recognition System.”&lt;br /&gt;
Nuance is requesting that Vlingo pay damages for infringing and  profiting off the patents, but it’s unclear what the dollar amount of  these damages are. &lt;br /&gt;
The two companies have a bit of a storied past. Nuance &lt;a href="http://www.xconomy.com/boston/2008/06/19/vlingos-ceo-fires-back-at-nuance-over-patent-lawsuit-says-when-they-couldnt-win-yahoos-business-this-was-their-reaction/"&gt;slapped Vlingo&lt;/a&gt; with a patent lawsuit back in 2008. Vlingo then &lt;a href="http://www.xconomy.com/boston/2010/07/26/vlingo-buys-patents-from-bellevue-based-intellectual-ventures-as-defense-in-nuance-lawsuit-hopes-for-horse-trade/"&gt;bought&lt;/a&gt; a number of patents last year relating to voice and speech recognition, that aimed to force Nuance to drop its suit.&lt;br /&gt;
Dave Grannan, CEO of Vlingo, recently compared the act of competing with Nuance to&lt;br /&gt;
“having a venereal disease that’s in remission.” He tells &lt;a href="http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2011/05/22/BU8B1JIICK.DTL"&gt;Bloomberg BusinessWeek,&lt;/a&gt;  “We crush them whenever we go head-to-head with them. But just when  you’re thinking life is great – boom, there’s a sore on your lip.”  Gross. &lt;br /&gt;
Nuance is a &lt;a href="http://techcrunch.com/2011/05/10/nuance-buys-enterprise-print-management-software-developer-equitrac-for-157m-in-cash/"&gt;massive company&lt;/a&gt; with a $6 billion market cap and is a formidable competitor. In fact, Apple appears to be &lt;a href="http://techcrunch.com/2011/06/06/apple-nuance-wwdc-keynote/"&gt;licensing&lt;/a&gt; Nuance’s technology in OS X Lion. And we heard that Nuance was  &lt;a href="http://techcrunch.com/2011/05/06/apple-nuance-ios-siri/"&gt;in negotiations&lt;/a&gt; with Apple for a partnership to license and use the company’s voice recognition technology, though Nuance was &lt;a href="http://techcrunch.com/2011/06/06/apple-nuance-wwdc-keynote/"&gt;missing&lt;/a&gt; from the lineup of products revealed this week’s WWDC conference. And we’ve &lt;a href="http://techcrunch.com/2011/05/09/apple-nuance-data-center-deal/"&gt;learned&lt;/a&gt; that Apple may already be using Nuance technology in their new massive data center in North Carolina. &lt;br /&gt;
Photo Credit/&lt;a href="http://www.flickr.com/photos/kwdesigns/734124559/"&gt;Flickr/KWDesigns&lt;/a&gt;&lt;br /&gt;
&lt;div style="font-size: 10px; text-align: center; width: 100%;"&gt;&lt;a href="http://www.scribd.com/doc/57457828"&gt;View this document on Scribd&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-5210162691467348537?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/tEXNLlHrJPU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/5210162691467348537/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=5210162691467348537" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5210162691467348537?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5210162691467348537?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/tEXNLlHrJPU/again-nuance-slaps-vlingo-with-another.html" title="Again: Nuance Slaps Vlingo With Another Patent Lawsuit Over Voice Recognition Technology" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2011/06/again-nuance-slaps-vlingo-with-another.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0EMRXs5fip7ImA9Wx9XF04.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-6564903583028319615</id><published>2011-01-11T02:41:00.000-05:00</published><updated>2011-01-11T02:41:24.526-05:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-01-11T02:41:24.526-05:00</app:edited><title>The Search for a Clearer Voice -  How Google's Voice Search is getting so good.</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/xhOemhDYqMcNibDhAVX7Q58jL00/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xhOemhDYqMcNibDhAVX7Q58jL00/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/xhOemhDYqMcNibDhAVX7Q58jL00/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xhOemhDYqMcNibDhAVX7Q58jL00/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;An interesting post by Paul Boutin: http://www.technologyreview.com/blog/guest/26242/?p1=A2&lt;br /&gt;
&lt;br /&gt;
It raises again the issue of talking with the right (= US) accent to your phone.&lt;br /&gt;
&lt;br /&gt;
&lt;h3&gt;The Search for a Clearer Voice&lt;/h3&gt;&lt;div class="intro"&gt;How Google's Voice Search is getting so good.&lt;/div&gt;&lt;div class="info"&gt;Paul Boutin 01/10/2011&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Smart phones are great at a lot of things, with one exception: Typing  on a touch screen or a downsized keyboard is still frustrating compared  to a full-size computer keyboard. That's probably why Google says that,  even before the release of its new personalized &lt;a href="http://googlemobile.blogspot.com/2010/12/voice-search-gets-personal.html" target="_blank"&gt;Voice Search&lt;/a&gt; app for Android in mid-December, one in four mobile searches were already input by voice rather than from a keyboard.&lt;br /&gt;
The improved Voice Search takes speech recognition to its next level:  Google's servers will now log up to two years of your voice commands in  order to more precisely parse exactly what you're saying.&lt;br /&gt;
In tests on the new app, which appeared in Google's Android Market a  week before Christmas, the app originally got about three out of five  searches correct. After a few days, the ratio crept up to four out of  five. It's surprisingly good at searches that involve common nouns  ("heathen child lyrics") and what search experts call vertical searches  for popular topics like airline flights and movie listings. Voice Search  knows "United Flight 714" and "True Grit show times 90066" when it  hears them. Less successful are searches involving people's names. In  repeated attempts to Google up WikiLeaks founder Julian Assange, Voice  Search got no closer than "wikileaks founder julian of songs."&lt;br /&gt;
How does it work? Rather than try to use the phone itself to do  speech recognition, Voice Search digitizes the user's input commands and  sends them off to Google's gargantuan server farms. There, the spoken  words are broken down and compared both to statistical models of what  words other people mean when they utter those syllables, plus a history  of the user's own voice commands, through which Google refines its  matching algorithm for that particular voice. The app recognizes five  different flavors of English—American, British, Australian, Indian and  South African—plus Afrikaans, Cantonese, Czech, Dutch, French, German,  Italian, Japanese, Korean, Mandarin, Polish, Portugese, Russian,  Spanish, Turkish, and Zulu.&lt;br /&gt;
The tricky part—and the motive for a personalized search app—is that  human voices vary wildly between men and women, between young people and  old people, and among those with various accents and dialects. By  storing hundreds, perhaps thousands of what speech recognition experts  call "utterances" by the same person over months of use, Voice Search  can better guess at what that particular person is saying. &lt;br /&gt;
That mathematical model used to recognize phrases was refined over three years using voice samples from Google's now-defunct &lt;a href="http://www.technologyreview.com/Biztech/19050/" target="_blank"&gt;GOOG-411&lt;/a&gt;  automated directory assistance service, which the company operated from  2007 through late last year specifically to capture a wide-ranging set  of voice samples for analysis. The company's &lt;a href="http://www.technologyreview.com/communications/21696/" target="_blank"&gt;first Voice Search app&lt;/a&gt;, for iPhone only, was launched a year after GOOG-411 in November 2008.&lt;br /&gt;
Voice Search doubles as a spoken-command system for the phone. As shown in &lt;a href="http://www.youtube.com/watch?v=gGbYVvU0Z5s" target="_blank"&gt;this video&lt;/a&gt;,  it understands commands such as, "Send mail to Mike LeBeau. How's life  in New York treating you? The weather's beautiful here." The app will  find LeBeau in your contacts—it's better at matching names here than in a  Web search, because it's working with a limited set—and will fill in  the subject line with your first sentence. You can speak additional text  into the message, or edit it with the phone's keyboard, before sending  it. &lt;br /&gt;
Google has clearly put a lot of effort into its speech recognition  technology. But the impact on it bottom line is obvious: By removing the  aggravation of typing on tiny keys, the company hopes to get customers  to reach for its search and e-mail services much more often.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-6564903583028319615?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/lta_bBpHNDU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/6564903583028319615/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=6564903583028319615" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6564903583028319615?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6564903583028319615?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/lta_bBpHNDU/search-for-clearer-voice-how-googles.html" title="The Search for a Clearer Voice -  How Google's Voice Search is getting so good." /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2011/01/search-for-clearer-voice-how-googles.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0YHSHc-fCp7ImA9Wx5WFE0.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-8575403781655302617</id><published>2010-09-25T03:52:00.000-04:00</published><updated>2010-09-25T03:52:19.954-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-09-25T03:52:19.954-04:00</app:edited><title>New speech analysics whitepaper</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/_8LegNdEYU7H9xJtEIeMjdVs-ko/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/_8LegNdEYU7H9xJtEIeMjdVs-ko/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/_8LegNdEYU7H9xJtEIeMjdVs-ko/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/_8LegNdEYU7H9xJtEIeMjdVs-ko/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&amp;nbsp;Nice whitepaper from io speech analysics.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://images.tmcnet.com/tmc/whitepapers/documents/whitepapers/2010/3013-speech-analytics-simple-definition.pdf"&gt;http://images.tmcnet.com/tmc/whitepapers/documents/whitepapers/2010/3013-speech-analytics-simple-definition.pdf&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-8575403781655302617?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/DJz-GGhd52s" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/8575403781655302617/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=8575403781655302617" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/8575403781655302617?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/8575403781655302617?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/DJz-GGhd52s/new-speech-analysics-whitepaper.html" title="New speech analysics whitepaper" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2010/09/new-speech-analysics-whitepaper.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUYHSX88eyp7ImA9Wx5WEks.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-5557563605659742150</id><published>2010-09-23T14:38:00.001-04:00</published><updated>2010-09-23T14:38:58.173-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-09-23T14:38:58.173-04:00</app:edited><title>Nuance Launches Dragon Dictate for Mac -- Simply Smarter Speech Recognition</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/w288yeOI7j8udUGFuLyClPksQ0M/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/w288yeOI7j8udUGFuLyClPksQ0M/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/w288yeOI7j8udUGFuLyClPksQ0M/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/w288yeOI7j8udUGFuLyClPksQ0M/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;div id="above"&gt;     &lt;/div&gt;          &lt;div id="above"&gt;&lt;div class="aboveleft"&gt;&lt;br /&gt;  &lt;div class="headlines assetContainer pressrelease"&gt;&lt;h1&gt;&lt;br /&gt;&lt;/h1&gt;&lt;h2&gt;Dragon  Speech Application for Mac OS X Taps New Dragon 11 Engine, Allows Mac  Users to Create Content and Interact with Their Favorite Applications by  Voice &lt;/h2&gt;  &lt;/div&gt; &lt;/div&gt;      &lt;/div&gt;                            &lt;p&gt;            &lt;/p&gt;         &lt;p&gt;            &lt;/p&gt; &lt;p&gt; &lt;/p&gt;&lt;div class="imageSmall" style=""&gt;  &lt;div style="text-align: right;"&gt;    &lt;img alt="" id="image201" src="http://www.marketwatch.com/story/story/RenderImage?guid=4d20b9eb2ae944b9a59cadb71b0dea6a&amp;amp;imageID=201" /&gt;  &lt;/div&gt; &lt;/div&gt;           &lt;p&gt; BURLINGTON, Mass., Sep 20, 2010 (BUSINESS WIRE) -- Nuance Communications, Inc.       &lt;span id="quote2066987264" class="quotepeekbase bgQuote down"&gt;        (&lt;span class="symbol"&gt;&lt;a href="http://www.marketwatch.com/investing/stock/NUAN" title="Nuance Communications Inc"&gt;NUAN&lt;/a&gt;&lt;/span&gt;        &lt;b&gt;&lt;span class="data bgLast symbol"&gt;14.88&lt;/span&gt;&lt;/b&gt;,        &lt;span class="data bgChange symbol"&gt;-0.31&lt;/span&gt;,        &lt;span class="data bgPercentChange symbol"&gt;-2.04%&lt;/span&gt;)      &lt;/span&gt;, a leading provider of        speech solutions, today unveiled Dragon Dictate for Mac version 2.0, the        newest addition to the Dragon family of speech recognition products        which includes Dragon NaturallySpeaking for the PC and Dragon Mobile        Apps. Dragon Dictate for Mac -- the first major desktop product for Mac        OS X from the Dragon family following Nuance's acquisition of MacSpeech        earlier this year -- makes it easier than ever to create documents and        emails, search the web, navigate the Mac desktop, and interact with        popular Mac applications -- all by voice.            &lt;/p&gt;         &lt;p&gt; Dragon Dictate for Mac, version 2.0, improves on the accuracy and        performance of MacSpeech Dictate 1.5, leveraging the new Dragon 11        engine, which also powers the recently-announced Dragon        NaturallySpeaking 11 for the PC. Dictate 2.0 offers a        more streamlined set-up, revamped Mac user interface, and dynamic new        voice commands for dictation, editing, navigation and proofreading.        Dragon Dictate for Mac also "learns" better than any previous version of        Dictate, responds faster to spoken commands and supports Dragon Voice        Shortcuts(TM) for searching the web, email and Mac desktop by voice.            &lt;/p&gt;         &lt;p&gt; "Recognizing the important opportunity within the Mac community,        we set out earlier this year to bring Dragon to the Mac, working closely        with our experts from the MacSpeech team, and drawing on our history and        proven success with Dragon," said Peter Mahoney, senior vice president        and general manager for Dragon at Nuance. "We've maintained the elements        of Dictate that are most important to our Mac customers, such as the        native Mac interface, and integrated many features of Dragon        NaturallySpeaking 11 to bring improved accuracy as well as smarter        command and control capabilities to this discerning audience of new        Dragon customers."            &lt;/p&gt;         &lt;p&gt; Dragon for PC, Mobile, and now the Mac            &lt;/p&gt;         &lt;p&gt; For years, people from all walks of life have used Dragon        NaturallySpeaking on the PC to be more productive, save time and capture        their ideas at the speed of thought -- at home, in the office, the        courtroom, the classroom, the exam room, or even on the road. Most        recently, with Nuance's        Dragon Mobile Apps on the BlackBerry, Apple iPad,        iPhone and iPod touch, millions of people now communicate quickly and        easily on their mobile devices, simply using speech recognition. Dragon        Dictate for Mac is the first Mac member of the Dragon desktop family,        allowing Mac users to speak their minds to:            &lt;/p&gt;         &lt;p&gt; - Turn talk into text -- Dragon Dictate for Mac        lets users simply speak to produce text. Speak thoughts and watch the        words appear on screen, inside almost any Mac application -- up to three        times faster than typing -- with an astounding recognition accuracy rate        of up to 99 percent, right out of the box.            &lt;/p&gt;         &lt;p&gt; - Unlock creativity -- Transform ideas into        text at the speed of thought with Dragon Dictate for Mac. Users can let        their creativity flow from brain to voice to produce emails, blog posts,        essays, and more.            &lt;/p&gt;         &lt;p&gt; - Work comfortably -- With Dragon Dictate for        Mac, control the Mac in a relaxed, hands-free manner without being        tethered to the keyboard. Instead of using a mouse, just speak commands        to launch and control applications. Move the cursor or click anywhere on        screen, simply by voice.            &lt;/p&gt;         &lt;p&gt; - Multi-task -- Dragon Dictate for Mac allows        users to tell the Mac what to do, with commands like "Reply To This        Message" or "Open Microsoft Word" or "Search Google for Italian        restaurants" to work faster and smarter. In addition, it's easy to        create custom voice commands that automate complex workflows on the Mac.            &lt;/p&gt;         &lt;p&gt; - Work your way -- Dragon Dictate for Mac can        be customized with a personal vocabulary and voice commands that reflect        an individual's work style.            &lt;/p&gt;         &lt;p&gt; In addition to the powerful voice commands that MacSpeech Dictate        1.5 already offers for controlling popular Mac applications such as        Safari, Mail, iCal and iChat, the new Dragon Dictate for Mac 2.0 adds        more ways to interact with the Mac by voice:            &lt;/p&gt;         &lt;p&gt; - Dragon Voice Shortcuts(TM) for Search -- Offers        an easy way to search for information, files and content anywhere on        your Mac or on the Web by using single voice commands. For example:            &lt;/p&gt;         &lt;p&gt; - "Search Google for hula dancing lessons"            &lt;/p&gt;         &lt;p&gt; - "Search Bing for wedding dress repair"            &lt;/p&gt;         &lt;p&gt; - "Search Yahoo for gondola rentals"            &lt;/p&gt;         &lt;p&gt; - "Search Mail for RSVP"            &lt;/p&gt;         &lt;p&gt; - "Search Mac for history term paper"            &lt;/p&gt;         &lt;p&gt; - New Editing Commands -- A new suite of        commands in Dragon Dictate for Mac makes it easy to edit documents. With        commands that are designed to simplify editing, users can select and        delete text, insert new text, capitalize text, and more.            &lt;/p&gt;         &lt;p&gt; - Proofreading -- The new Proofreading commands        in Dragon Dictate for Mac take advantage of the powerful Text-to-Speech        capabilities built into Mac OS X. Simply dictate text, and ask Dragon        Dictate for Mac to read it back.            &lt;/p&gt;         &lt;p&gt; - Surrounding Punctuation -- New commands in        Dragon Dictate for Mac enable users to put punctuation around certain        words or groups of words, using a simple command, such as:            &lt;/p&gt;         &lt;p&gt; - "Put Quotes Around "            &lt;/p&gt;         &lt;p&gt; - "Put Parentheses Around "            &lt;/p&gt;         &lt;p&gt; - "Put Brackets Around "            &lt;/p&gt;         &lt;p&gt; - Voice Navigation -- Dragon Dictate for Mac        lets users control the Mac's cursor and mouse actions by voice, giving        new options for interacting with their desktop and precise handling of        mouse pointer-based tasks. New features include:            &lt;/p&gt;         &lt;p&gt; - MouseGrid: Placement of the cursor can be        accomplished with MouseGrid commands            &lt;/p&gt;         &lt;p&gt; - Mouse Click: Speak a command, with optional that        designate one or more keys such as Command, Option, Shift or Caps Lock            &lt;/p&gt;         &lt;p&gt; - Mouse Movement: New commands move the mouse pointer        by voice            &lt;/p&gt;         &lt;p&gt; - Flexible Microphones -- Dragon Dictate for        Mac users who have multiple microphones, such as a wired and wireless        headset, can easily switch between microphones within the same voice        profile.            &lt;/p&gt;         &lt;p&gt; Pricing, Availability &amp;amp; System Requirements            &lt;/p&gt;         &lt;p&gt; Dragon Dictate for Mac is available immediately starting at        $199.99 through Nuance's Web site as well as its global network of        reseller partners, software retailers and professional sales        organizations. Existing MacSpeech Dictate customers and Dragon        NaturallySpeaking customers can purchase Dragon Dictate for Mac starting        at $49.99 and $99.99 respectively for a limited time. For        additional information on features, editions, pricing and volume        licensing programs, please visit &lt;a href="http://www.nuance.com/dragon/mac"&gt;www.nuance.com/dragon/mac&lt;/a&gt;.            &lt;/p&gt;         &lt;p&gt; Dragon Dictate for Mac requires an Intel-based Mac, Mac OS X 10.6        Snow Leopard or greater, 3GB of available hard drive space, 2GB of RAM        recommended and an Internet connection for product registration. Dragon        Dictate for Mac comes complete with a Nuance-approved USB microphone        headset.            &lt;/p&gt;         &lt;p&gt; Trademark reference: Nuance, Dragon, Dragon Voice Shortcuts,        Dragon Dictate, Dragon NaturallySpeaking, and the Nuance logo are        registered trademarks or trademarks of Nuance Communications, Inc. or        its affiliates in the United States and/or other countries. All other        company names or product names referenced herein may be the property of        their respective owners.            &lt;/p&gt;         &lt;p&gt; About Nuance Communications, Inc.            &lt;/p&gt;         &lt;p&gt; Nuance is a leading provider of speech, imaging and customer        interaction solutions for businesses and consumers around the world. Its        technologies, applications and services make the user experience more        compelling by transforming the way people interact with information and        how they create, share and use documents. Every day, millions of users        and thousands of businesses experience Nuance's proven applications and        professional services. For more information, please visit: &lt;a href="http://www.nuance.com/"&gt;www.nuance.com&lt;/a&gt;.            &lt;/p&gt;         &lt;p&gt; SOURCE: Nuance Communications, Inc.            &lt;/p&gt;         &lt;pre style="display: inline;"&gt;Nuance Communications, Inc.  Erica Hill, 781-565-5218  erica.hill@nuance.com  Twitter : @DragonTweets, @macspeech  &lt;a href="http://www.facebook.com/dragonnaturallyspeaking"&gt;http://www.facebook.com/dragonnaturallyspeaking&lt;/a&gt;or  InkHouse for Nuance Communications, Inc.  Lisa Mokaba, 781-791-4570  nuance@inkhouse.net    &lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-5557563605659742150?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/S7-VXAQVI9M" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/5557563605659742150/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=5557563605659742150" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5557563605659742150?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5557563605659742150?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/S7-VXAQVI9M/nuance-launches-dragon-dictate-for-mac.html" title="Nuance Launches Dragon Dictate for Mac -- Simply Smarter Speech Recognition" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2010/09/nuance-launches-dragon-dictate-for-mac.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUcHRnY-eCp7ImA9Wx5WEks.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-6095962011919304574</id><published>2010-09-23T14:36:00.000-04:00</published><updated>2010-09-23T14:37:17.850-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-09-23T14:37:17.850-04:00</app:edited><title>UTOPY Recommended by Ovum’s 2010 Contact Center Speech Analytics Market Report</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/ehlJsocJaNXwUzo0ikXDB842d5M/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/ehlJsocJaNXwUzo0ikXDB842d5M/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/ehlJsocJaNXwUzo0ikXDB842d5M/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/ehlJsocJaNXwUzo0ikXDB842d5M/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;span class="bylinedate"&gt;- September 20, 2010&lt;/span&gt;&lt;div id="newsbyline"&gt; &lt;/div&gt;           &lt;h3&gt;&lt;p&gt;UTOPY, the leading provider of Voice of the Customer  and Performance Optimization solutions powered by Speech Analytics,  announced that it has been recognized for its leadership in  recording-system-independent Speech Analytics by Ovum’s 2010 contact  center Speech Analytics market report.&lt;/p&gt;&lt;/h3&gt;           &lt;p&gt;Titled “Decision Matrix: Selecting a Speech Analytics  Vendor,” the report examines the competitive dynamics within the Speech  Analytics market in order to help businesses select a vendor based on  its technology strength, reputation among customers and impact on the  market.&lt;/p&gt; &lt;p&gt;In the report, an independent technology analyst provides a  comprehensive view of vendor capabilities and advises on those that  businesses should explore, consider and most importantly, shortlist. The  report cited the advantages provided by UTOPY’s ability to integrate  with virtually any call recording system. UTOPY also gained high marks  for its core technology, interface and packaged applications, and earned  additional praise for its support of a wide range of languages,  dialects, and accents.&lt;/p&gt; &lt;p&gt;“We are pleased to receive this recommendation from Ovum,” said Roy  Twersky, President and CEO at UTOPY. “We believe this further  underscores UTOPY’s unwavering commitment to developing innovative,  analytics-driven solutions that provide contact centers with unsurpassed  insight into customer-agent conversations, which can be leveraged to  maximize performance and profitability.”&lt;/p&gt; &lt;p&gt;Find more on Utopy at &lt;a href="http://www.speechanalytics.com/"&gt;www.speechanalytics.com&lt;/a&gt;.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-6095962011919304574?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/JJ6ROlkl7kw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/6095962011919304574/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=6095962011919304574" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6095962011919304574?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6095962011919304574?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/JJ6ROlkl7kw/utopy-recommended-by-ovums-2010-contact.html" title="UTOPY Recommended by Ovum’s 2010 Contact Center Speech Analytics Market Report" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2010/09/utopy-recommended-by-ovums-2010-contact.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEMDRH45fip7ImA9WxFQE0w.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-7376452298666123939</id><published>2010-05-08T06:59:00.001-04:00</published><updated>2010-05-08T07:01:15.026-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-05-08T07:01:15.026-04:00</app:edited><title>Microsoft translating telephone</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/g8lOLSohuiVP9hRVE6_QZBXeMZw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/g8lOLSohuiVP9hRVE6_QZBXeMZw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/g8lOLSohuiVP9hRVE6_QZBXeMZw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/g8lOLSohuiVP9hRVE6_QZBXeMZw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;object height="385" width="640"&gt;&lt;param name="movie" value="http://www.youtube.com/v/QtCRTUmYDoo&amp;amp;color1=0xb1b1b1&amp;amp;color2=0xd0d0d0&amp;amp;hl=en_US&amp;amp;feature=player_embedded&amp;amp;fs=1"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed src="http://www.youtube.com/v/QtCRTUmYDoo&amp;amp;color1=0xb1b1b1&amp;amp;color2=0xd0d0d0&amp;amp;hl=en_US&amp;amp;feature=player_embedded&amp;amp;fs=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" height="385" width="640"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;See also &lt;a href="http://news.cnet.com/8301-13860_3-20004451-56.html"&gt;http://news.cnet.com/8301-13860_3-20004451-56.html&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-7376452298666123939?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/hr1rqSkTU0g" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/7376452298666123939/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=7376452298666123939" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7376452298666123939?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7376452298666123939?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/hr1rqSkTU0g/microsoft-translating-telephone.html" title="Microsoft translating telephone" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2010/05/microsoft-translating-telephone.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUIHR3Y9eCp7ImA9WxBWFUo.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-7207137736291334571</id><published>2010-02-07T15:00:00.003-05:00</published><updated>2010-02-07T15:12:16.860-05:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-02-07T15:12:16.860-05:00</app:edited><title>Yet another speech translation story - now from Google</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/4Wm9D8FxFRvpxdU1mhh50yxiAeM/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/4Wm9D8FxFRvpxdU1mhh50yxiAeM/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/4Wm9D8FxFRvpxdU1mhh50yxiAeM/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/4Wm9D8FxFRvpxdU1mhh50yxiAeM/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Not only speech to text from cellulars but also translation. Google is aiming high but does their existing voice to text really works?&lt;br /&gt;&lt;br /&gt;&lt;h1 class="heading"&gt;&lt;a href="http://technology.timesonline.co.uk/tol/news/tech_and_web/personal_tech/article7017831.ece"&gt;&lt;span style="font-size:85%;"&gt;http://technology.timesonline.co.uk/tol/news/tech_and_web/personal_tech/article7017831.ece&lt;/span&gt;&lt;/a&gt;&lt;br /&gt;&lt;/h1&gt;&lt;h1 class="heading"&gt;&lt;br /&gt;&lt;/h1&gt;&lt;h1 class="heading"&gt;Google leaps  language barrier with translator phone&lt;/h1&gt;  &lt;!-- END: Module - Main Heading --&gt;    &lt;!--CMA user Call Diffrenet Variation Of Image --&gt; &lt;!-- BEGIN: Module - M24 Article Headline with portrait image (b) --&gt; &lt;script type="text/javascript" src="http://technology.timesonline.co.uk/tol/js/m24-image-browser.js"&gt;&lt;/script&gt; &lt;script type="text/javascript"&gt; &lt;!-- /* Global variables that are used for "image browsing". Used on article pages to rotate the images of a story. */ var sImageBrowserImagePath = ''; var aArticleImages = new Array(); var aImageDescriptions = new Array(); var aImageEnlargeLink = new Array(); var aImageEnlargePopupWidth = '500'; var aImageEnlargePopupHeight = '500'; var aImagePhotographer = new Array(); var nSelectedArticleImage = 0; var i=0; var aImageAltText= new Array();  //--&gt; &lt;/script&gt; &lt;script type="text/javascript"&gt; &lt;!-- aArticleImages[i] = 'http://www.timesonline.co.uk/multimedia/archive/00681/Phone360_681565a.jpg'; //--&gt; &lt;/script&gt; &lt;script type="text/javascript"&gt; &lt;!-- aImageDescriptions[i] = ''; //--&gt; &lt;/script&gt; &lt;!--Don't Display undifined test for credit --&gt; &lt;script type="text/javascript"&gt; &lt;!-- aImageAltText[i] = "Young woman talking on mobile phone" ;  aImageAltText[i] = aImageAltText[i].replace(/&amp;quot;/g,"\""); //--&gt;  &lt;/script&gt; &lt;script type="text/javascript"&gt; &lt;!-- aImageEnlargeLink[i] = 'http://www.timesonline.co.uk/multimedia/archive/00681/Phone360_681565a.jpg'; i=i+1; //--&gt;  &lt;/script&gt;  &lt;div&gt; &lt;!-- Print Author name from By Line associated with the article --&gt;  &lt;span class="byline"&gt; Chris Gourlay &lt;/span&gt; &lt;/div&gt; &lt;div id="related-article-links"&gt;    &lt;div class="padding-left-right-5 padding-bottom-5"&gt; &lt;!-- BEGIN: M19 - Article tools --&gt; &lt;span id="comment-count" class="float-left global-comment-seperator"&gt;&lt;ul&gt;&lt;li class="your-comment"&gt;&lt;a onclick="'s_objectID="" id="comment-link" class="link-999" rel="nofollow" href="http://technology.timesonline.co.uk/tol/news/tech_and_web/personal_tech/article7017831.ece#comment-have-your-say"&gt;8  Comments&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/span&gt;  &lt;span class="float-left padding-left-8 padding-top-2"&gt; &lt;script type="text/javascript"&gt; gSiteLife.Recommend("ExternalResource", "7017831","http://www.timesonline.co.uk/tol/news/tech_and_web/personal_tech/article7017831.ece"); &lt;/script&gt;&lt;div id="Recommend1265572691286" class="Recommend"&gt;&lt;div style="display: inline;"&gt;&lt;div class="Recommend_Container"&gt;        &lt;a href="http://technology.timesonline.co.uk/tol/news/tech_and_web/personal_tech/article7017831.ece#none" class="SiteLife_Recommend" onclick="'s_objectID=""&gt;Recommend?  (6) &lt;/a&gt;    &lt;/div&gt;&lt;/div&gt;&lt;/div&gt;   &lt;/span&gt;    &lt;!-- &lt;div class="padding-bottom-10"&gt;&lt;/div&gt; --&gt; &lt;!-- END: M19 - Article tools --&gt; &lt;!-- BEGIN: Portrait image --&gt;  &lt;div class="float-right padding-left-9 bg-fff"&gt; &lt;div class="article-portrait-image"&gt; &lt;div id="dynamic-image-holder"&gt;&lt;img title="Young woman talking on mobile  phone" src="http://www.timesonline.co.uk/multimedia/archive/00681/Phone360_681565a.jpg" alt="Young woman talking on mobile phone" border="0" width="185" /&gt;&lt;/div&gt; &lt;!-- Remove following &lt;div&gt; to not show photographer information --&gt; &lt;!-- Remove following &lt;div&gt; to not show image description --&gt; &lt;!-- Tip: This &lt;div&gt; here with the id "dynamic-image-navigation" is used so that the innerHTML can be written to by the JS call below. --&gt; &lt;div id="pagination-container" class="pagination-container"&gt; &lt;script type="text/javascript"&gt; &lt;!-- fCreateImageBrowser(nSelectedArticleImage,'portrait',"/tol/"); //--&gt; &lt;/script&gt; &lt;/div&gt; &lt;/div&gt;  &lt;/div&gt; &lt;!-- END: Portrait image --&gt;  &lt;/div&gt; &lt;/div&gt; &lt;!-- END: Module - Module - M24 Article Headline with portrait image (b) --&gt; &lt;!-- BEGIN: Module - Main Article --&gt; &lt;!-- Check the Article Type and display accordingly--&gt; &lt;!-- Print Author image associated with the Author--&gt; &lt;!-- Print the body of the article--&gt; &lt;div id="region-column1-layout2"&gt;&lt;style type="text/css"&gt; div#related-article-links p a, div#related-article-links p a:visited { color:#06c; }  &lt;/style&gt;  &lt;div id="related-article-links"&gt; &lt;!-- Pagination --&gt; &lt;p&gt;GOOGLE is developing software for the first phone capable of  translating foreign languages almost instantly — like the Babel Fish in  The Hitchhiker’s Guide to the Galaxy.&lt;/p&gt; &lt;p&gt;By building on existing technologies in voice recognition and  automatic translation, Google hopes to have a basic system ready within a  couple of years. If it works, it could eventually transform  communication among speakers of the world’s 6,000-plus languages.&lt;/p&gt; &lt;p&gt;The company has already created an automatic system for translating  text on computers, which is being honed by scanning millions of  multi-lingual websites and documents. So far it covers 52 languages,  adding Haitian Creole last week.&lt;/p&gt; &lt;p&gt;Google also has a voice recognition system that enables phone users  to conduct web searches by speaking commands into their phones rather  than typing them in.&lt;/p&gt; &lt;!--#include file="m63-article-related-attachements.html"--&gt;  &lt;!-- BEGIN: Module - M63 - Article Related Attachements --&gt; &lt;script type="text/javascript" src="http://technology.timesonline.co.uk/tol/js/picture-gallery.js"&gt;&lt;/script&gt; &lt;script type="text/javascript"&gt; function slideshowPopUp(url) { pictureGalleryPopupPic(url); return false; } &lt;/script&gt; &lt;!-- BEGIN: Comment Teaser Module --&gt; &lt;div class="float-left related-attachements-container"&gt; &lt;!-- END: Comment Teaser Module --&gt;  &lt;!-- BEGIN: Module - M63 - Article Related Package --&gt; &lt;!-- END: Module - M63 - Article Related Package --&gt; &lt;div class="related-attachements-top padding-top-10"&gt; &lt;h3 class="section-heading"&gt;Related Links&lt;/h3&gt; &lt;/div&gt; &lt;div class="related-attachements-side padding-top-7 padding-bottom-10  padding-right-7"&gt; &lt;div class="padding-bottom-5 padding-top-3"&gt; &lt;ul class="chevron-list chevron-blue"&gt;&lt;li&gt;&lt;a onclick="'s_objectID="" href="http://technology.timesonline.co.uk/tol/news/world/us_and_americas/article7015973.ece" class="link-666"&gt; Fears over Google alliance with spy-master &lt;/a&gt;&lt;/li&gt;&lt;/ul&gt; &lt;form name="relatedLinksform" action="" method="post"&gt; &lt;/form&gt; &lt;ul class="chevron-list chevron-blue"&gt;&lt;li&gt;&lt;a onclick="'s_objectID="" href="http://technology.timesonline.co.uk/tol/news/tech_and_web/article6022902.ece" class="link-666"&gt; Village mob thwarts Google Street View car &lt;/a&gt;&lt;/li&gt;&lt;/ul&gt; &lt;form name="relatedLinksform" action="" method="post"&gt; &lt;/form&gt; &lt;/div&gt; &lt;/div&gt; &lt;!-- BEGIN: POLL --&gt; &lt;!--This block will execute if an article of type Poll is attached--&gt;  &lt;!-- END : POLL --&gt; &lt;!-- BEGIN: DEBATE--&gt; &lt;!-- END: DEBATE--&gt;   &lt;/div&gt; &lt;!-- END: Module - M63 - Article Related Attachements --&gt; &lt;p&gt;Now it is working on combining the two technologies to produce  software capable of understanding a caller’s voice and translating it  into a synthetic equivalent in a foreign language. Like a professional  human interpreter, the phone would analyse “packages” of speech,  listening to the speaker until it understands the full meaning of words  and phrases, before attempting translation.&lt;/p&gt; &lt;p&gt;“We think speech-to-speech translation should be possible and work  reasonably well in a few years’ time,” said Franz Och, Google’s head of  translation services.&lt;/p&gt; &lt;p&gt;“Clearly, for it to work smoothly, you need a combination of  high-accuracy machine translation and high-accuracy voice recognition,  and that’s what we’re working on.&lt;/p&gt; &lt;p&gt;“If you look at the progress in machine translation and corresponding  advances in voice recognition, there has been huge progress recently.”&lt;/p&gt; &lt;p&gt;Although automatic text translators are now reasonably effective,  voice recognition has proved more challenging.&lt;/p&gt; &lt;p&gt;“Everyone has a different voice, accent and pitch,” said Och. “But  recognition should be effective with mobile phones because by nature  they are personal to you. The phone should get a feel for your voice  from past voice search queries, for example.”&lt;/p&gt; &lt;p&gt;The translation software is likely to become more accurate the more  it is used. And while some translation systems use crude rules based on  the grammar of languages, Google is exploiting its vast database of  websites and translated documents to improve the accuracy of its system.&lt;/p&gt; &lt;p&gt;“The more data we input, the better the quality,” said Och. There is  no shortage of help. “There are a lot of language enthusiasts out  there,” he said.&lt;/p&gt; &lt;p&gt;However, some experts believe the hurdles to live translation remain  high. David Crystal, honorary professor of linguistics at Bangor  University, said: “The problem with speech recognition is the  variability in accents. No system at the moment can handle that  properly.&lt;/p&gt; &lt;p&gt;“Maybe Google will be able to get there faster than everyone else,  but I think it’s unlikely we’ll have a speech device in the next few  years that could handle high-speed Glaswegian slang.&lt;/p&gt; &lt;p&gt;“The future, though, looks very interesting. If you have a Babel  Fish, the need to learn foreign languages is removed.”&lt;/p&gt; &lt;p&gt;In the Hitchhiker’s Guide to the Galaxy, the small, yellow Babel Fish  was capable of translating any language when placed in the ear. It  sparked a bloody war because everyone became able to understand what  other people were saying.&lt;/p&gt; &lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-7207137736291334571?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/WfNnlx-7_-U" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/7207137736291334571/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=7207137736291334571" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7207137736291334571?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7207137736291334571?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/WfNnlx-7_-U/yet-another-speech-translation-story.html" title="Yet another speech translation story - now from Google" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>2</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2010/02/yet-another-speech-translation-story.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0QMSX09eSp7ImA9WxNQFE8.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-1994468019338753186</id><published>2009-09-20T02:44:00.002-04:00</published><updated>2009-09-20T02:49:48.361-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-09-20T02:49:48.361-04:00</app:edited><title>Nuance and Lawsuits - again</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/bL_9puCpcK1K-_w5btQXN0ce82o/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bL_9puCpcK1K-_w5btQXN0ce82o/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/bL_9puCpcK1K-_w5btQXN0ce82o/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bL_9puCpcK1K-_w5btQXN0ce82o/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;h2 id="post-10798"&gt;&lt;span style="font-family: verdana;font-size:100%;" &gt;&lt;span style="font-weight: normal;"&gt;In addition the the IBM engine which has strong patent behind it, Vlingo is leveraging new relations with AT&amp;amp;T as an investor and as a speech engine (Watson) provider in prep for the Nuance lawsuit. So now the IBM engine is licensed to Vlingo via Nuance (and this is why theuy are suing them). Moreover, Vlingo is not using this engine anymore. It seems like lawyers tricks rather than technology.&lt;/span&gt;&lt;/span&gt;&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;&lt;h2 id="post-10798"&gt;&lt;br /&gt;&lt;/h2&gt;&lt;h2 id="post-10798"&gt;&lt;br /&gt;&lt;/h2&gt;&lt;h2 id="post-10798"&gt;AT&amp;amp;T Backs Vlingo as Nuance Lawsuit Looms&lt;/h2&gt;   &lt;div class="meta group"&gt;     &lt;p&gt;&lt;br /&gt;&lt;span class="edit"&gt; &lt;/span&gt;  &lt;/p&gt;&lt;/div&gt;   &lt;div class="main"&gt;  &lt;p&gt;AT&amp;amp;T has taken a minority stake in Vlingo in a move that could have major repercussions for Nuance’s patent infringement suit against the voice navigation startup. As part of the deal, Vlingo will integrate its offerings with &lt;a href="http://www.research.att.com/viewProject.cfm?prjID=49" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://www.research.att.com/viewProject.cfm?prjID=49');"&gt;AT&amp;amp;T’s Watson&lt;/a&gt;, a core speech recognition technology that serves as a foundation for voice-activated products. Vlingo will all but abandon the IBM-developed technology that it had been using — and which is at the heart of Nuance’s lawsuit.&lt;/p&gt; &lt;p&gt;“Our goal is to move everything to AT&amp;amp;T,” Vlingo CEO Dave Grannan told me this morning. “If Nuance decides to proceed, they’ll essentially be suing us for violating patents — and this is the crazy thing — and the alleged violation occurs in the IBM engine Nuance licensed to us and, by the way, we don’t use anymore.”&lt;/p&gt; &lt;p&gt;Nuance executives were not immediately available for comment.&lt;/p&gt; &lt;p&gt;Grannan said the Watson engine is “superior” to the IBM-developed technology and will help Vlingo build better voice navigation offerings. AT&amp;amp;T and Vlingo will begin rolling out new products later this year for AT&amp;amp;T customers and plan to market the joint solution to other industry players, including device manufacturers and other carriers.&lt;/p&gt; &lt;p&gt;The cutthroat nature of the speech recognition space underscores the increasing attention it’s attracting from investors, as voice is positioned &lt;a href="http://gigaom.com/2008/06/24/i-talk-vlingo-listens/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://gigaom.com/2008/06/24/i-talk-vlingo-listens/');"&gt;as a superior navigation tool to device keypads and touchscreens&lt;/a&gt; -– especially for users behind the wheel. A Boston-area outfit, Vlingo began to gain traction in early 2008 when Yahoo tapped it to add a speech recognition component to its oneSearch offering. The agreement –- which followed Vlingo’s $6.5 million Series A round in 2007 -– saw Yahoo lead a second round of funding that brought in $20 million.&lt;/p&gt; &lt;p&gt;But Vlingo isn’t the only startup getting funded; the space has attracted hundreds of millions of dollars, and competition is fierce. Microsoft Corp. joined the field two years ago with its $800 million acquisition of Tellme, whose technology it’s using to incorporate speech into its upcoming Windows Mobile 6.5. Google has invested heavily as well, and is deploying voice navigation with its Android platform. As for the voice recognition companies themselves, outside of Vlingo, smaller players such as V-Enable and Promptu are also vying for attention. Meanwhile Nuance, the dominant pure play on the field, has spent $1 billion or so in acquisitions.&lt;/p&gt; &lt;p&gt;- By Colin Gibbs | &lt;a href="http://gigaom.com/2009/09/16/att-backs-vlingo-as-nuance-lawsuit-looms/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://gigaom.com/2009/09/16/att-backs-vlingo-as-nuance-lawsuit-looms/');"&gt;Gigaom&lt;/a&gt;&lt;/p&gt;  &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-1994468019338753186?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/__Y16bJqfTU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/1994468019338753186/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=1994468019338753186" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1994468019338753186?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1994468019338753186?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/__Y16bJqfTU/nuance-and-lawsuits-again.html" title="Nuance and Lawsuits - again" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/09/nuance-and-lawsuits-again.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkQMSX89eCp7ImA9WxJUFUQ.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-6821762283878271068</id><published>2009-07-14T15:01:00.002-04:00</published><updated>2009-07-14T15:06:28.160-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-14T15:06:28.160-04:00</app:edited><title>Nuance acquires Jott (without suing them first :-)</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/a-2b_hWMBcFS1SqU4xF8b9Vzxic/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/a-2b_hWMBcFS1SqU4xF8b9Vzxic/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/a-2b_hWMBcFS1SqU4xF8b9Vzxic/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/a-2b_hWMBcFS1SqU4xF8b9Vzxic/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Aiming to improve at the human based transcription, Nuance just acquired Jott. It is not the technology,  so why? Probably minimal price for Jott assets (not disclosed) and some methodology and interfaces for some external applications such as salesforce.com.&lt;br /&gt;If you know more, share with us this info.&lt;br /&gt;&lt;br /&gt;&lt;div id="y-content"&gt;&lt;div class="col3"&gt;&lt;div class="mod content-wrapper"&gt;&lt;div id="y-article-hd"&gt; &lt;h1&gt;Nuance Acquires Jott, Expands Mobile Portfolio&lt;/h1&gt;&lt;h2&gt;     &lt;p class=" bwtextaligncenter"&gt;       &lt;i&gt;&lt;b&gt;Innovative Jott Service to Deliver Powerful New Voice-to-Text        Capabilities to Mobile Operators and En&lt;/b&gt;&lt;/i&gt;&lt;/p&gt;&lt;/h2&gt;&lt;/div&gt;&lt;div id="y-article-bd"&gt;&lt;!-- /.mod.related-companies --&gt;                           &lt;p&gt;BURLINGTON, Mass. &amp;amp; SEATTLE--(BUSINESS WIRE)--Nuance Communications, Inc. (NASDAQ: &lt;a href="http://finance.yahoo.com/q;_ylt=AjH1pI_647Iqz5p.Wtekhznjba9_?s=nuan&amp;amp;d=t" class="yltasis"&gt;NUAN&lt;/a&gt; - &lt;a href="http://finance.yahoo.com/q/h;_ylt=AqBiIzh.Nb.7pvJ2iAFrk0Ljba9_?s=nuan" class="yltasis"&gt;News&lt;/a&gt;) today announced it has        expanded its Mobile Division voice services portfolio with the        acquisition of Jott. Jott is the innovator behind the popular Jott        Assistant, the simple and easy-to-use service that enables users to        create notes, set reminders and appointments, send email and text        messages, and post to their favorite web services – all by voice, from        any device.&lt;/p&gt;                                                                           &lt;!--- Insert the sidebar information --&gt;                                &lt;div id="y-article-related" class="mod-group"&gt;&lt;br /&gt;&lt;span id="yfs_module_params_0" class="yfs_module_params"&gt;&lt;/span&gt;            &lt;/div&gt; &lt;!-- Article Related Media --&gt;                         &lt;p&gt;“Jott’s voice-to-text offerings have experienced a groundswell of        adoption and positive industry recognition since the company’s        inception, and we’re thrilled about the opportunity to expand our market        reach and our voice services portfolio,” said Michael Thompson, senior        vice president and general manager, Nuance Mobile. “Together we will        deliver a range of new services to our mobile operator and enterprise        customers.”&lt;/p&gt;&lt;p&gt;The combined Nuance and Jott teams will focus on several key        voice-to-text initiatives:&lt;/p&gt;&lt;ul&gt;&lt;li class="bwlistitemmarginbottom"&gt;         The innovative Jott Assistant service has been adopted by hundreds of          thousands of users, providing vast insight into the demands of today’s          mobile users. To further extend the power of Jott across the mobile          mass market, Nuance plans to package and offer Jott Assistant to          mobile operators as part of its voice services portfolio, including          Nuance Voicemail-to-Text.       &lt;/li&gt;&lt;li class="bwlistitemmarginbottom"&gt;         By combining its easy-to-use voice services with email, text messaging          and a variety of web services, Jott’s service has advanced mobile          productivity in the enterprise market. Nuance, together with its          Enterprise Unified Communications partners, will offer a secure,          highly scalable and differentiated Enterprise package including Nuance          Voicemail-to-Text, Messaging, and Collaboration.       &lt;/li&gt;&lt;li class="bwlistitemmarginbottom"&gt;         As used in Jott for Salesforce, Jott provides open APIs that allow for          voice integration with third party CRM providers and other critical          enterprise applications that require mobile access. Nuance will          continue to mainstream and expand the CRM partner program through its          existing CRM partnerships, enabling enterprise application providers          to better meet the needs of the evolving professional mobile market.       &lt;/li&gt;&lt;/ul&gt;&lt;p&gt;“We’ve seen dramatically increased demand for our mobile voice        solutions, because they offer real business value, are easy to deploy,        and are a delight to use,” said John Pollard, co-founder, Jott. “Nuance        has consistently delivered groundbreaking mobile applications to        billions of people worldwide. Our combined expertise will bring        innovative and differentiated voice services to a variety of markets        with tremendous scale.”&lt;/p&gt;&lt;p&gt;All of Jott’s services, including Jott Assistant, Jott Voicemail and        Jott for Salesforce, will remain available, and existing customers will        experience no interruptions in service. For more information and to        access Jott services, visit &lt;a href="http://us.lrd.yahoo.com/_ylt=AhnlG0EG9_pMfAQRKly3cJLjba9_/SIG=14o7l0frv/**http%3A//cts.businesswire.com/ct/CT%3Fid=smartlink%26url=http%253A%252F%252Fwww.jott.com%26esheet=6005590%26lan=en_US%26anchor=www.jott.com%26index=1" class="yltasis"&gt;www.jott.com&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;&lt;b&gt;About Nuance Communications, Inc.&lt;/b&gt;&lt;/p&gt;&lt;p&gt;Nuance is a leading provider of speech and imaging solutions for        businesses and consumers around the world. Its technologies,        applications and services make the user experience more compelling by        transforming the way people interact with information and how they        create, share and use documents. Every day, millions of users and        thousands of businesses experience Nuance’s proven applications and        professional services. For more information, please visit: &lt;a href="http://us.lrd.yahoo.com/_ylt=Anca3a69BCfH0E_DitGKuBPjba9_/SIG=14tnp1d12/**http%3A//cts.businesswire.com/ct/CT%3Fid=smartlink%26url=http%253A%252F%252Fwww.nuance.com%252F%26esheet=6005590%26lan=en_US%26anchor=Nuance.com%26index=2" class="yltasis"&gt;Nuance.com&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;&lt;b&gt;About Jott&lt;/b&gt;&lt;/p&gt;&lt;p&gt;Headquartered in Seattle, WA, Jott Networks is the world leader in        mobile voice-to-text applications. Jott allows individuals and        businesses to easily capture thoughts, send emails and text messages,        set reminders, organize lists, and post to web services and business        applications – all with their voice, using any phone. Jott also converts        voicemail into email and text messages, making voicemail a more        productive tool. Since being founded in 2006 by John Pollard and Shree        Madhavapeddi, Jott has made world-class voice transcription accessible        to anyone with a cell phone. For more information on the Jott service,        visit &lt;a href="http://us.lrd.yahoo.com/_ylt=ArCwq.6ev6DcJfY03QTGb5Pjba9_/SIG=14oq49obl/**http%3A//cts.businesswire.com/ct/CT%3Fid=smartlink%26url=http%253A%252F%252Fwww.Jott.com%26esheet=6005590%26lan=en_US%26anchor=www.Jott.com%26index=3" class="yltasis"&gt;www.Jott.com&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-6821762283878271068?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/B-Yl1HGaAMU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/6821762283878271068/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=6821762283878271068" title="3 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6821762283878271068?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6821762283878271068?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/B-Yl1HGaAMU/nuance-acquires-jott-without-suing-them.html" title="Nuance acquires Jott (without suing them first :-)" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>3</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/07/nuance-acquires-jott-without-suing-them.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUEEQH87fip7ImA9WxJVFkQ.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-944959239355344993</id><published>2009-07-03T15:19:00.002-04:00</published><updated>2009-07-04T03:46:41.106-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-04T03:46:41.106-04:00</app:edited><title>Google voice mail transcription quality - can you trust it?</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/rjI0MUwTzh4G5kdIAnT2atWA-2w/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/rjI0MUwTzh4G5kdIAnT2atWA-2w/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/rjI0MUwTzh4G5kdIAnT2atWA-2w/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/rjI0MUwTzh4G5kdIAnT2atWA-2w/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Recently  &lt;a href="http://gadgetwise.blogs.nytimes.com/author/david-f-gallagher/" class="url fn" title="See all posts by David F. Gallagher"&gt;David F. Gallagher&lt;/a&gt; wrote in the NY times about Google voice. David asked people to leave him messages and he posted the automatic transcriptions. While performing good in many cases, the results are far from accurate and in many cases one cannot understand the gist of the message.&lt;br /&gt;See information at   &lt;a href="http://gadgetwise.blogs.nytimes.com/2009/06/29/pushing-the-limits-of-googles-speech-recognition/"&gt;pushing the limits of googles speech recognition &lt;/a&gt;  and &lt;a href="http://gadgetwise.blogs.nytimes.com/2009/07/01/messing-with-googles-speech-recognition-part-2/"&gt;messing with googles speech recognition part 2.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;To some extent everybody knows it is experimental and do not really trust it. Is it useful? Probably just for fun and for people who do not want any human to be involved in their private message transcription like SpinVox, Nuance or other such vendors.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-944959239355344993?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/lfXnxgXSdX4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/944959239355344993/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=944959239355344993" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/944959239355344993?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/944959239355344993?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/lfXnxgXSdX4/google-voice-mail-transcription-quality.html" title="Google voice mail transcription quality - can you trust it?" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/07/google-voice-mail-transcription-quality.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DE4AQ3o7fSp7ImA9WxJQEU0.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-3692918085310774768</id><published>2009-05-23T15:22:00.002-04:00</published><updated>2009-05-23T15:29:02.405-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-05-23T15:29:02.405-04:00</app:edited><title>“Mainstream” Speech Analytics</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/O7hJa8UycVazrYk8dz4oSZ4m6H4/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/O7hJa8UycVazrYk8dz4oSZ4m6H4/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/O7hJa8UycVazrYk8dz4oSZ4m6H4/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/O7hJa8UycVazrYk8dz4oSZ4m6H4/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;div id="companyLogosTop"&gt;               Recently Verint announced a new offering for the "mainstream" market. Many people do not understand the importance of this announcement.&lt;br /&gt;&lt;br /&gt;For years we are pushing speech anlaytics to the market and the adoption cycle is pretty long and thus maybe expensive. It requires experts to learn the system and get th best out of it. Eventually there are many success stories for speech analytics. However for a technology to become widely use there is a need for additional simplicity - make sure it can be operated by laymen, work out of the box and provide benefits without much learning. This is exactly the step that Verint is taking with the new system and I bet it will overachieve any other offering in the market.&lt;br /&gt;&lt;br /&gt;If you consider deploying a speech analytics solution, even for a small contact center - this should be your first alternative.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://mms.businesswire.com/bwapps/mediaserver/ViewMedia?mgid=173799&amp;amp;vid=2" alt=" Verint Systems Inc." style="border: 0pt none ;" /&gt;                       &lt;/div&gt;             &lt;!-- start main story div --&gt;                  &lt;div class="story_dateline"&gt;             &lt;a class="permalink" href="http://www.businesswire.com/news/google/20090513005427/en" onclick="return false;" title="To save a permanent link to this news, right-click (Ctl-click on a Mac) and choose the command to copy the link, link location or shortcut."&gt;              May 13, 2009 08:30 AM Eastern Daylight Time              &lt;img class="icon" src="http://www.businesswire.com/images/icons/icon_permalink.gif" alt="" /&gt;       &lt;/a&gt;            &lt;/div&gt;                                      &lt;h1 class="epi-fontLg"&gt;       &lt;b&gt;Verint Witness Actionable Solutions Takes Speech Analytics        “Mainstream” with New Impact 360 Speech Analytics Solution&lt;/b&gt;     &lt;/h1&gt;                                      &lt;div id="story_subheadline"&gt;                  &lt;p&gt;       &lt;b&gt;Breakthrough Technology Enables Cost-Effective Deployment of Speech        Analytics, Helping Companies React Rapidly to Changes in Customer        Behavior, Reduce Expenses and Enhance the Customer Experience&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;New Speech Analytics “Essentials” Solution Enables Small and        Medium-Sized Contact Centers to Attain Speech Analytics Benefits        Previously Out of Reach&lt;/b&gt;     &lt;/p&gt;            &lt;/div&gt;                  &lt;!-- GENRE NOTES BEGIN --&gt;           &lt;div class="story_genre_notes"&gt;                 &lt;div&gt;Driving Innovation 2009&lt;/div&gt;&lt;div&gt;Verint Witness Actionable Solutions’ 13th Annual Global User Conference&lt;/div&gt;&lt;div&gt;Loews Lake Las Vegas Resort, Nevada&lt;/div&gt;           &lt;/div&gt;           &lt;!-- GENRE NOTES END --&gt;             &lt;!-- start story body --&gt;            &lt;p&gt;LAS VEGAS &amp;amp; MELVILLE, N.Y.--(&lt;a href="http://www.businesswire.com/"&gt;BUSINESS WIRE&lt;/a&gt;)--&lt;b&gt;Verint&lt;/b&gt;&lt;sup&gt;&lt;b&gt;®&lt;/b&gt;&lt;/sup&gt;&lt;b&gt; Systems Inc. &lt;/b&gt;today announced the        availability of its new Impact 360&lt;sup&gt;®&lt;/sup&gt; Speech Analytics        solutions. Simple and cost effective to deploy, the software from Verint&lt;sup&gt;®&lt;/sup&gt;        Witness Actionable Solutions&lt;sup&gt;®&lt;/sup&gt; is bringing speech analytics        technology into mainstream contact center operations without costly        setup overhead, lengthy consulting engagements and the need for        interpretation by separate analyst staff.     &lt;/p&gt;     &lt;p&gt;       The new Impact 360 Speech Analytics enables businesses to mine recorded        customer interactions to surface the intelligence essential for building        effective cost containment and customer service strategies. Designed to        provide rapid insight into changes in customer behaviors and patterns,        the solution can deliver value right out-of-the-box by helping remove        guesswork from the customer service equation. Impact 360 Speech        Analytics proactively identifies call drivers — along with the related        product, process and service issues that often originate in areas        outside the contact center, such as back-office functions — emerging        trends, opportunities and competitive influences, and can then make that        business intelligence available enterprise-wide.     &lt;/p&gt;     &lt;p&gt;       The newly introduced Impact 360 Speech Analytics Essentials solution        enables businesses with small to medium-sized contact centers to        cost-effectively achieve these speech analytics benefits that were        previously out of reach.     &lt;/p&gt;     &lt;p&gt;       “We believe this is a technological breakthrough that has the potential        to create a new wave of adoption for speech analytics, breaking down        previous barriers to entry, such as set-up and configuration,” explains        Nancy Treaster, senior vice president and general manager, Verint        Witness Actionable Solutions. “We know that the contact center holds a        wealth of valuable business intelligence, and speech analytics is an        automated, powerful solution that can help companies not only get to        that information, but then determine what actions to take. Impact 360        Speech Analytics Essentials allows organizations with small and        medium-sized centers to analyze captured conversations and rapidly        benefit from valuable insight.”     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;Robust, New Functionality Extends Verint Leadership in Analytics&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       The patent-protected Impact 360 Speech Analytics Essentials, and the        Impact 360 Advanced Speech Analytics solution, leverage both phonetics        and LVCSR recognition — the best combination of both speed and accuracy        — adding meaning and context to every word and phrase identified in        every call processed without predefinition of terms.     &lt;/p&gt;     &lt;p&gt;       Key, patent-pending functionality featured in Impact 360 Speech        Analytics Essentials, as well as Verint’s Impact 360 Advanced Speech        Analytics solution, is driven by the company’s proprietary Complete        Semantic Index™ technology that features such functionality as:     &lt;/p&gt;     &lt;ul&gt;&lt;li class="bwlistitemmarginbottom"&gt;         &lt;i&gt;&lt;b&gt;Automated Trend Analysis, Surfacing Changes in Customer Behaviors&lt;/b&gt;&lt;/i&gt;       &lt;/li&gt;&lt;/ul&gt;     &lt;p&gt;       Using hundreds of thousands of term and phrase combinations, the new        Complete Semantic Index automatically identifies significant changes in        customer behavior as expressed within recorded customer interactions.        Such changes are proactively surfaced by the software’s Automated Trend        Analysis, which identifies increases or decreases in terms and phrases        used during customer/agent conversations. Delivered daily, trend reports        can be tracked for up to 18 months.     &lt;/p&gt;     &lt;ul&gt;&lt;li class="bwlistitemmarginbottom"&gt;         &lt;i&gt;&lt;b&gt;Guided Search and Context Visualization&lt;/b&gt;&lt;/i&gt;       &lt;/li&gt;&lt;/ul&gt;     &lt;p&gt;       With the Complete Semantic Index, users do not need to know in advance        what terms to search. Intuitive search engine-like, guided search        capabilities — including contextual suggestions and search visualization        functionality — help business users find relevant calls quickly to        determine the underlying causes of rising call volumes, costs and        customer dissatisfaction.     &lt;/p&gt;     &lt;ul&gt;&lt;li class="bwlistitemmarginbottom"&gt;         &lt;i&gt;&lt;b&gt;Analytics-Driven Unification with Workforce Optimization Suite          Via Native Integrations&lt;/b&gt;&lt;/i&gt;       &lt;/li&gt;&lt;/ul&gt;     &lt;p&gt;       The Impact 360 Speech Analytics solutions, part of Verint’s        patent-protected, unified Impact 360 Workforce Optimization suite, can        use the content of captured customer interactions to route contacts of        interest to users throughout the enterprise, such as quality        supervisors, marketing managers and customer retention teams.     &lt;/p&gt;     &lt;p&gt;       “We’re excited to introduce this solution, as Verint is fulfilling a        critical market need in enabling businesses with small and medium-sized        centers to reap the same type of intelligence that their larger        counterparts have used for years — all at an attractive price point, and        at a time in our economy in which cost containment, retention and the        customer experience are paramount,” adds Treaster.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;Specifications and Availability&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       Optimally sized for centers with 50 to 300 agent seats, Impact 360        Speech Analytics Essentials can operate on a single box. As businesses’        needs change and grow, the solution can be easily upgraded to Verint’s        advanced speech offering designed for larger centers, scaling up to        50,000 seats.     &lt;/p&gt;     &lt;p&gt;       In addition to the new Complete Semantic Index, Automated Trend Analysis        and Guided Search functionality, the Impact 360 Advanced Speech        Analytics solution for larger center operations includes Automated Root        Cause Analysis with patent-pending TellMeWhy™ functionality. With this        capability, the solution can identify potential underlying drivers of        specific call types, such as customer complaints or long calls,        prioritize the top five root cause groups and automatically suggest the        top instigators for each call set.     &lt;/p&gt;     &lt;p&gt;       The Impact 360 Advanced Speech Analytics solution also features Speech        Analytics-driven Scorecards designed to help centers better manage        performance by balancing cost drivers with customer satisfaction        drivers. Peer-based agent comparisons factor in key performance        indicators (KPIs), such as customer complaints and repeat call drivers,        that are identified from speech analytics results, along with proactive        alerts on defined thresholds. Other functionality includes native        business integrations with Impact 360 Quality Monitoring’s Smart Inbox&lt;sup&gt;™&lt;/sup&gt;—        which delivers recorded interactions directly to the desktop based on        defined criteria that can include speech analytics categories — and        Impact 360 Data Analytics, which analyzes call attributes to help        uncover contact scenarios that can positively or negatively impact        organization and/or individual KPI achievements.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;About Verint Witness Actionable Solutions&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       Verint&lt;sup&gt;®&lt;/sup&gt; Witness Actionable Solutions&lt;sup&gt;®&lt;/sup&gt; is the        leader in analytics-driven workforce optimization software and services.        Its solutions are designed to help organizations capture customer        intelligence, uncover business trends, discover the root cause of        employee and customer behavior, and optimize the customer experience.        From contact centers to remote office, branch and back-office        operations, its award-winning, next-generation Impact 360&lt;sup&gt;®&lt;/sup&gt;        Workforce Optimization suite is the industry’s most unified solution set        — featuring quality monitoring and recording, workforce management,        speech analytics, data analytics, customer feedback surveys, performance        management, eLearning and coaching. Impact 360 helps improve the entire        customer service delivery network, powering the right decisions to help        ensure service excellence and transform organizations into        customer-centric enterprises.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;About Verint Systems Inc.&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       Verint Systems Inc. (VRNT.PK), headquartered in Melville, New York, is a        leading provider of Actionable Intelligence&lt;sup&gt;®&lt;/sup&gt; solutions for an        optimized enterprise and a safer world. Today, more than 10,000        organizations in over 150 countries rely on Verint solutions to perform        more effectively, build competitive advantage and enhance the security        of people, facilities and infrastructure. Visit us at our website &lt;a target="_blank" href="http://cts.businesswire.com/ct/CT?id=smartlink&amp;amp;url=http%3A%2F%2Fwww.verint.com&amp;amp;esheet=5963461&amp;amp;lan=en_US&amp;amp;anchor=www.verint.com&amp;amp;index=1" shape="rect"&gt;www.verint.com&lt;/a&gt;.     &lt;/p&gt;     &lt;p&gt;       This press release contains "forward-looking statements" within the        meaning of the Private Securities Litigation Reform Act of 1995,        including statements regarding expectations, predictions, views,        opportunities, plans, strategies, beliefs, and statements of similar        effect relating to Verint Systems Inc. These forward-looking statements        are not guarantees of future performance and they are based on        management's expectations that involve a number of risks and        uncertainties, any of which could cause actual results to differ        materially from those expressed in or implied by the forward-looking        statements. For a detailed discussion of these risk factors, see the        Company's Current Report on Form 8-K filed with the Securities and        Exchange Commission on September 10, 2007, as supplemented by our        Current Reports on Form 8-K filed on November 5, 2007, January 16, 2008,        and April 9, 2008 and the Form NT-10Q filed on April 16, 2009. The        forward-looking statements contained in this press release are made as        of the date of this press release and, except as required by law, the        Company assumes no obligation to update or revise them or to provide        reasons why actual results may differ.     &lt;/p&gt;     &lt;p&gt;       VERINT, the VERINT logo, ACTIONABLE INTELLIGENCE, POWERING ACTIONABLE        INTELLIGENCE, WITNESS ACTIONABLE SOLUTIONS, STAR-GATE, RELIANT, VANTAGE,        X-TRACT, NEXTIVA, ULTRA, AUDIOLOG, WITNESS, the WITNESS logo, IMPACT        360, the IMPACT 360 logo, IMPROVE EVERYTHING, EQUALITY, CONTACTSTORE,        EYRETEL, BLUE PUMPKIN SOFTWARE, BLUE PUMPKIN, the BLUE PUMPKIN logo,        EXAMETRIC and the EXAMETRIC logo, CLICK2STAFF, STAFFSMART, AMAE SOFTWARE        and the AMAE logo are trademarks and registered trademarks of Verint        Systems Inc. Other trademarks mentioned are the property of their        respective owners.     &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-3692918085310774768?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/x3IXdVXN2sc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/3692918085310774768/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=3692918085310774768" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/3692918085310774768?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/3692918085310774768?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/x3IXdVXN2sc/mainstream-speech-analytics.html" title="“Mainstream” Speech Analytics" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/05/mainstream-speech-analytics.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0cCQXk-fSp7ImA9WxVaFEU.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-1587399755375419064</id><published>2009-04-11T16:12:00.002-04:00</published><updated>2009-04-11T16:17:40.755-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-04-11T16:17:40.755-04:00</app:edited><title>Google's 'Voice Search' comes to India</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/-gG9JHQj2mUJfbXbE_GeYiX42ew/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/-gG9JHQj2mUJfbXbE_GeYiX42ew/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/-gG9JHQj2mUJfbXbE_GeYiX42ew/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/-gG9JHQj2mUJfbXbE_GeYiX42ew/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;span class="Verdanabold_blueL20"&gt;Nice post on &lt;/span&gt;&lt;span class="Verdanabold_blueL20"&gt;siliconindia: &lt;/span&gt;&lt;span class="Verdanabold_blueL20"&gt;http://www.siliconindia.com/shownews/Googles_Voice_Search_comes_to_India-nid-55075.html&lt;br /&gt;I specifically liked the example to search for whether :) - must be phonetic.. Anyway given the heavy accents of the south and north and various sub languages, there is a lot of work for google india for just collecting all this data.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Google's 'Voice Search' comes to India&lt;/span&gt; &lt;div id="namepost"&gt;  &lt;span class="Verdana12Ash"&gt;By&lt;/span&gt;   &lt;span class="Verdana12Ash"&gt; siliconindia news bureau   &lt;/span&gt; &lt;/div&gt; &lt;div class="Verdana12Ash" id="whenpostedmain"&gt; &lt;div class="vvvl12liteAsh" id="postdate1"&gt;Friday,10 April 2009, 15:09 hrs &lt;/div&gt;&lt;br /&gt;&lt;/div&gt;   Bangalore: Now search is just a call. Google has set path for a new era in the speech analytics with 'voice search', which it recently introduced in Hyderabad, Delhi, Mumbai and Bangalore. One just needs to speak a single word into the phone, such as 'whether' or 'hotel' and one can get the top results. &lt;div id="logphoto" style="padding: 10px 10px 10px 0px; float: left;"&gt;              &lt;img src="http://www.siliconindia.com/news/newsimages/Voice%281%29.jpg" width="194" height="287" /&gt;&lt;br /&gt;      &lt;div style="width: 200px; line-height: 15px;"&gt;       &lt;span class="Verdana10Ash999"&gt;              &lt;/span&gt;      &lt;/div&gt;           &lt;/div&gt;   &lt;br /&gt;&lt;br /&gt;The 'voice search' uses a combination of automated voice recognition engine and operators to provide this facility. To make the service faster and better, Google is also experimenting with voice recognition technology, which will ensure 24-hour support. Currently, the automated system offers results in English, but the operator-driven system offers results in only Hindi and Telegu.&lt;br /&gt;&lt;br /&gt;"This is in line with our mission of making information universally useful and accessible, be it at home or on the go," explains Hugo Barra, Group Product Manager, Google Mobile. Not all those who make queries, though, will get accurate results, since the project is still in its pilot stage, reported Business Standard.&lt;br /&gt;&lt;br /&gt;Google's logic is a simple one. Mobiles outnumber personal computers (PCs) in India. Besides, just about 5-7 percent of the population has an internet connection, including those who use surf the net via their mobiles. "Voice enables India to reach non-web users in local languages even as our core strength is search," Barra adds.&lt;br /&gt;&lt;br /&gt;In the U.S., Western Europe and Japan, the 'voice search' feature is available under the Google Mobile App for the iPhone. It is also available on the Android-based T-Mobile G1, and was introduced on the BlackBerry as a free download last month.&lt;br /&gt;&lt;br /&gt;In India, though, Google plans to extend the technology to other cities once it is confident in the quality of its speech recognition technology "in any region of the country", since the number of languages and accents in India are very diverse and distinct from each other.&lt;br /&gt;&lt;br /&gt;The company is currently not making any revenue on this service in India though it monetises this in the U.S., Western Europe and Japan through ads which appear when the results show up. It does not charge the user for information received or for connecting them to businesses. The local business information used by Google is the same as that on local search. Data is continuously being added, and Google is collecting feedback from the users, Barra explained.&lt;br /&gt;&lt;br /&gt;Talking about the company's mobile strategy, Vinay Goel, Country Head of Products, Google India, said, "There has been a significant increase of mobile search users in 2009. We believe that users graduate from plain voice search to an SMS-based one and finally to internet-based search, which is our goal."&lt;br /&gt;&lt;br /&gt;The search giant has also made Orkut (its social networking site) and Google Maps available on mobiles. Besides, it also has the Google Latitude (to help locate one's friends) feature as an opt-in download.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-1587399755375419064?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/QRPyYjSokb4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/1587399755375419064/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=1587399755375419064" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1587399755375419064?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1587399755375419064?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/QRPyYjSokb4/googles-voice-search-comes-to-india.html" title="Google's 'Voice Search' comes to India" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/04/googles-voice-search-comes-to-india.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0QGQnY5cSp7ImA9WxVbFUw.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-7754918900261587888</id><published>2009-03-31T09:19:00.003-04:00</published><updated>2009-03-31T13:08:43.829-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-31T13:08:43.829-04:00</app:edited><title>Google Voice should hire Spinvox for message transcription</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/7OzOIlLDFZF8QRrZoggij2TMRxo/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/7OzOIlLDFZF8QRrZoggij2TMRxo/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/7OzOIlLDFZF8QRrZoggij2TMRxo/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/7OzOIlLDFZF8QRrZoggij2TMRxo/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;div dir="ltr"&gt;I just saw this post at:  &lt;a href="http://abrilliantblog.com/2009/03/google-voice-should-hire-spinvox-for-message-transcription/" target="_blank"&gt;http://abrilliantblog.com/2009/03/google-voice-should-hire-spinvox-for-message-transcription/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I guess it is the laymen feedback on the google voice service and exactly what I anticipated at:&lt;br /&gt;&lt;a name="7570773065778461402"&gt;&lt;/a&gt;&lt;span style="font-size:85%;"&gt;&lt;a href="http://speechanalytics.blogspot.com/2009/03/google-voice-new-threat-to-reputation.html"&gt;Google Voice - the new threat to the reputation of speech recognition system.&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;Google Voice should hire Spinvox for message transcription&lt;/h2&gt;     &lt;h4&gt;March 30th, 2009&lt;a href="http://abrilliantblog.com/2009/03/google-voice-should-hire-spinvox-for-message-transcription/#comments"&gt;&lt;/a&gt;&lt;/h4&gt;    &lt;div class="entry"&gt;     &lt;p&gt;&lt;a href="http://abrilliantblog.com/wp-content/uploads/2009/03/spinvox_logo_white.gif" title="spinvox_logo_white.gif"&gt;&lt;img src="http://abrilliantblog.com/wp-content/uploads/2009/03/spinvox_logo_white.thumbnail.gif" alt="spinvox_logo_white.gif" /&gt;&lt;/a&gt;I have been using &lt;a href="http://www.grandcentral.com/"&gt;GrandCentral &lt;/a&gt;for years and when they announced &lt;a href="http://www.google.com/voice"&gt;Google Voice&lt;/a&gt; I was elated.  I could not wait for the new features including my absolute favorites being SMS and voice mail being transcribed to text and emailed and texted to you.&lt;/p&gt; &lt;p&gt;Well the SMS is great but the voice transcription is dreadful.  I didn't call someone back last week mistakenly thinking it was a telemarketer.  It was &lt;a href="http://twitter.com/gomonews"&gt;Bena Roberts&lt;/a&gt; of &lt;a href="http://www.gomonews.com/"&gt;GoMoNews&lt;/a&gt; fame and a very important mover and shaker in the business.  This kind of transcription mistake is not acceptable.  I have a suggestion for Google Voice that I really hope they listen to, OUTSOURCE TO &lt;a href="http://www.spinvox.com/"&gt;SPINVOX&lt;/a&gt;.  Now I don't work for SpinVox, but I've used it and on dozens of friends very high regard and reference for the SpinVox service I'm willing to give this advice.&lt;/p&gt; &lt;p&gt;&lt;a href="http://www.google.com/"&gt;Google&lt;/a&gt; don't allow Google Voice to tarnish a perfectly fantastic service because you're not willing to spend a few dollars.  Go test out SpinVox and if it beats your transcription (I guarantee it will) then please switch over.&lt;/p&gt;        &lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-7754918900261587888?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/S7fF4u_6plQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/7754918900261587888/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=7754918900261587888" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7754918900261587888?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7754918900261587888?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/S7fF4u_6plQ/google-voice-should-hire-spinvox-for.html" title="Google Voice should hire Spinvox for message transcription" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/google-voice-should-hire-spinvox-for.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QMRnY8fSp7ImA9WxVbFEQ.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-6190666726359466460</id><published>2009-03-31T05:22:00.002-04:00</published><updated>2009-03-31T05:23:07.875-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-31T05:23:07.875-04:00</app:edited><title>TelStrat - TelStrat’s Engage™ Suite Now Understands Speech Analytics</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/NX7b4QEBKbcLlvA4Q4uT9pjlXww/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/NX7b4QEBKbcLlvA4Q4uT9pjlXww/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/NX7b4QEBKbcLlvA4Q4uT9pjlXww/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/NX7b4QEBKbcLlvA4Q4uT9pjlXww/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;div dir="ltr"&gt;               &lt;p&gt; &lt;em&gt;Engage Analyze expands TelStrat's contact center product portfolio with true phonetic speech analytics intelligence&lt;/em&gt; &lt;/p&gt;  &lt;p&gt; &lt;strong&gt;Orlando, Florida – March 30, 2009 –&lt;/strong&gt; TelStrat, a global supplier of comprehensive contact center solutions, business call recording products, and leading-edge access network systems, today chose VoiceCon Orlando 2009 as the venue to announce Engage Analyze, the latest addition to its industry-leading contact center solution suite. Engage Analyze provides advanced speech analytics that equip organizations to transform voice calls into knowledge that can help them improve efficiency, increase compliance, and gain the competitive advantage. &lt;/p&gt;  &lt;p&gt;Engage Analyze indexes and audio mines words and phrases buried in calls using a patented Phonetic Audio Search and Recognition Engine. Unlike older, less efficient speech-to-text approaches, phonetic speech search is not dependent on finite dictionary and grammar models which require constant maintenance. This makes it easy to accurately search for new competitors, product names, slang, and other dynamically changing terms. &lt;/p&gt;  &lt;p&gt;Phonetic search technology also makes Engage Analyze fast – much faster than speech-to-text systems. Pre-processing or indexing of content is typically 60-80 times faster than real time, more than an order of magnitude faster than Large Vocabulary Conversational Speech Recognition (LVCSR) speech-to-text systems. Subsequent searches for words or phrases are incredibly quick, averaging over 30,000 times faster than real time and reaching rates up to 80,000 times faster. &lt;/p&gt;  &lt;p&gt;Speech-to-text systems rely on limited, statistical sampling of calls, typically 3-5% of call volume, due to cost and complexity. The technology in Engage Analyze makes it possible to audio mine up to 100% of calls, in real time if desired. The product does all this without the massive computing power necessary for comparable LVCSR systems. With Engage Analyze, contact centers can now accurately analyze and recognize trends over thousands of hours of customer calls. &lt;/p&gt;  &lt;p&gt;"With Engage Analyze we're bringing our customers advanced technology that provides them with a powerful tool to enhance business and customer intelligence, "said TelStrat President Kevin Smith. "The search speed and recognition capabilities make this product a market leader, and we've made it affordable for organizations of virtually any size." &lt;/p&gt;  &lt;p&gt;Engage Analyze is the newest component of TelStrat's Engage Contact Center Suite. Engage Suite blends full-featured voice and screen recording; intuitive agent performance evaluation, tracking and coaching; powerful agent scripting and call automation; sophisticated workforce forecasting and scheduling; and now, advanced speech analytics. It addresses each major aspect of contact center operations. Designed to benefit any 'center of contact', Engage Suite is ideal whether used by a large telemarketing firm or a small company's support staff. &lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.telstrat.com/content/view/399/350/" target="_blank"&gt;http://www.telstrat.com/content/view/399/350/&lt;/a&gt; &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-6190666726359466460?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/cK3-d1i-Zrw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/6190666726359466460/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=6190666726359466460" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6190666726359466460?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6190666726359466460?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/cK3-d1i-Zrw/telstrat-telstrats-engage-suite-now.html" title="TelStrat - TelStrat’s Engage™ Suite Now Understands Speech Analytics" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/telstrat-telstrats-engage-suite-now.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUYCQ34_eyp7ImA9WxVbFE4.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-5978348183621362179</id><published>2009-03-30T14:06:00.002-04:00</published><updated>2009-03-30T14:19:22.043-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-30T14:19:22.043-04:00</app:edited><title>Salesforce.com speech recognition integration by Jott</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/vSU_DegexkctByFl1rK7bRIdD9A/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/vSU_DegexkctByFl1rK7bRIdD9A/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/vSU_DegexkctByFl1rK7bRIdD9A/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/vSU_DegexkctByFl1rK7bRIdD9A/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Jott recently announced a speech interface to salesforce.com. By leveraging their speech to text services, they came with an offering to enterprise users. It is another example how a speech to text API (human based on automatic) can be easily integrated into other saas applications. See also &lt;a href="http://speechanalytics.blogspot.com/2009/03/spinvox-open-api.html"&gt;SpinVox Open API&lt;/a&gt;. A key around this service will be what happen when the data is inaccurate due to transcription error (weather human or machine error). What is the feedback that users will recieve from the system and what is the fix process for such error. I guess the Jott people can highlight some of that but so can similar service providers like SpinVox, Nuance, Google etc.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h1 class="style4"&gt;&lt;span style="font-size:100%;"&gt;Jott Networks Introduces Jott for Salesforce, Makes Mobile CRM Input Insanely Simple With Voice-to-text&lt;/span&gt;&lt;/h1&gt;  &lt;span class="content"&gt;   &lt;p&gt;SEATTLE, March 19 /PRNewswire/ -- Jott Networks today announced the addition of Jott for Salesforce to its expanding line of mobile productivity services. The new service uses Jott's high-quality voice-to-text technology, and allows sales professionals to make a simple call on any phone to directly input opportunity updates, take quick notes, and set reminders and appointments - all hands-free. Jott is offering a one-month free trial of Jott for Salesforce and it comes with a free subscription to Jott Assistant Pro, Jott's widely acclaimed mobile productivity tool. &lt;/p&gt;    &lt;p&gt;Jott CEO and co-founder John Pollard said, "There are already over a thousand businesses that use Jott's other services to get more done on the go. These same businesses wanted us to provide integration with more critical business applications and Jott for Salesforce is the first in that line." He added, "Sales professionals can now use our best-in-class voice-to-text technology to avoid the hassle of cramped and clumsy mobile interfaces and frustratingly slow connection speeds. Sales teams spend more time selling and less time typing reports, and sales managers see greater adoption of Salesforce and receive better, fresher forecast data." Jott for Salesforce includes the following features:&lt;/p&gt;    &lt;p&gt; &lt;b&gt;Features for Sales Professionals&lt;/b&gt; &lt;/p&gt;    &lt;p&gt; &lt;/p&gt;&lt;ul&gt;&lt;li&gt; &lt;b&gt;Update opportunities and accounts - &lt;/b&gt;Use your voice to&lt;b&gt; &lt;/b&gt;quickly update entire opportunities with a simple flow or individual forecast fields with shortcuts. Data ends up in specific accounts with no cutting, pasting or forwarding required.&lt;/li&gt;&lt;li&gt; &lt;b&gt;Take quick notes - &lt;/b&gt;While they are still fresh, speak quick notes about accounts and opportunities and add tasks to your Salesforce dashboard.&lt;/li&gt;&lt;li&gt; &lt;b&gt;Schedule appointments and set reminders - &lt;/b&gt;Use your voice to book a meeting on your Salesforce.com or Outlook calendar, and set reminders so you never forget.&lt;/li&gt;&lt;li&gt; &lt;b&gt;Get confirmation of everything - &lt;/b&gt;Every update you leave comes with a confirmation email/text message so you know for certain that data was entered into your accounts.&lt;/li&gt;&lt;/ul&gt;       &lt;p&gt; &lt;b&gt;Features for Managers&lt;/b&gt; &lt;/p&gt;    &lt;p&gt; &lt;/p&gt;&lt;ul&gt;&lt;li&gt; &lt;b&gt;Set up in minutes - &lt;/b&gt;Jott for Salesforce is incredibly easy to set up. There are no desktop or phone downloads, and it requires no changes to your existing Salesforce set-up. &lt;/li&gt;&lt;li&gt; &lt;b&gt;Scale easily - &lt;/b&gt;Jott for Salesforce was built for scale. With nothing to download or maintain, and no new equipment to buy, it easily accommodates individuals or organization-wide rollouts.&lt;/li&gt;&lt;li&gt; &lt;b&gt;No training necessary -&lt;/b&gt; While training is available to help teams get the most out of Jott for Salesforce, only a few simple commands are needed to get started.&lt;/li&gt;&lt;/ul&gt;       &lt;p&gt; &lt;b&gt;Pricing and Availability&lt;/b&gt; &lt;/p&gt;  &lt;p&gt;Jott for Salesforce is available today from the App Exchange on Salesforce.com and from the Jott.com web site. It takes just a few minutes to set up, and is compatible with all carriers and all mobile phones in the US and Canada. Jott for Salesforce's pricing is straightforward and affordable at $25 per user per month. For that fee, users can send unlimited updates into Salesforce with no need to worry about overage charges.&lt;/p&gt;    &lt;p&gt;For more information on Jott Salesforce and other Jott services, please visit &lt;a href="http://www.trafficresults.com/click-rabbit.php?acctid=ifM5E8V/Lzk=&amp;amp;docid=SF8610619032009-1&amp;amp;redirect=1&amp;amp;url=http://www.jott.com/"&gt;www.Jott.com&lt;/a&gt;.  &lt;/p&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-5978348183621362179?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/zHJh9kGNk68" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/5978348183621362179/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=5978348183621362179" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5978348183621362179?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5978348183621362179?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/zHJh9kGNk68/salesforcecom-speech-recognition.html" title="Salesforce.com speech recognition integration by Jott" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/salesforcecom-speech-recognition.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkUARXkycSp7ImA9WxVbFEw.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-7832109332287710779</id><published>2009-03-29T03:57:00.003-04:00</published><updated>2009-03-30T07:57:24.799-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-30T07:57:24.799-04:00</app:edited><title>Visual Speech Recognition: Lip Segmentation and Mapping</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/5aujzYt71yHzSLdbj3iyyyeOQBc/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/5aujzYt71yHzSLdbj3iyyyeOQBc/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/5aujzYt71yHzSLdbj3iyyyeOQBc/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/5aujzYt71yHzSLdbj3iyyyeOQBc/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.gfxhost.ru/out.php/t3082_51I2ByjrN0cL.jpg"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 194px; height: 250px;" src="http://www.gfxhost.ru/out.php/t3082_51I2ByjrN0cL.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;I just saw this free ebook which maybe of  interest for the speech analytics community. See the link below.&lt;br /&gt;&lt;br /&gt;Enjoy.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Description: The unique research area of audio-visual speech recognition has attracted much interest in recent years as visual information about lip dynamics has been shown to improve the performance of automatic speech recognition systems, especially in noisy environments.&lt;br /&gt;&lt;br /&gt;Visual Speech Recognition: Lip Segmentation and Mapping presents an up-to-date account of research done in the areas of lip segmentation, visual speech recognition, and speaker identification and verification. A useful reference for researchers working in this field, this book contains the latest research results from renowned experts with in-depth discussion on topics such as visual speaker authentication, lip modeling, and systematic evaluation of lip features.&lt;br /&gt;&lt;br /&gt;http://www.dbebooks.biz/engine/redirect.php?url=http://paid4share.net/file/13767/9781605661865-1605661864-rar.html&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-7832109332287710779?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/sSdGi14Imzs" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/7832109332287710779/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=7832109332287710779" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7832109332287710779?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7832109332287710779?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/sSdGi14Imzs/visual-speech-recognition-lip.html" title="Visual Speech Recognition: Lip Segmentation and Mapping" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>2</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/visual-speech-recognition-lip.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkQAQX4_eyp7ImA9WxVUFUU.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-3014608034209109927</id><published>2009-03-20T15:40:00.005-04:00</published><updated>2009-03-20T16:19:00.043-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-20T16:19:00.043-04:00</app:edited><title>SpinVox Open API</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/NcMdnT79snWPhSiF42GALWCamtA/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/NcMdnT79snWPhSiF42GALWCamtA/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/NcMdnT79snWPhSiF42GALWCamtA/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/NcMdnT79snWPhSiF42GALWCamtA/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;div id="companyLogosTop"&gt;           &lt;br /&gt;Congrats to SpinVox for this important move. Opening your system to others can drive speech applications faster leveraging many more developers and creative minds. Whether it is human transcription or machine transcription, this move separates the speech processing part from the application part and push for SaaS speech enabled applications.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://mms.businesswire.com/bwapps/mediaserver/ViewMedia?mgid=176233&amp;amp;vid=2" alt=" SpinVox" style="border: 0pt none ;" /&gt;                       &lt;/div&gt;             &lt;!-- start main story div --&gt;                  &lt;div class="story_dateline"&gt;             &lt;a class="permalink" href="http://www.businesswire.com/news/google/20090318005204/en" onclick="return false;" title="To save a permanent link to this news, right-click (Ctl-click on a Mac) and choose the command to copy the link, link location or shortcut."&gt;              March 18, 2009 07:30 AM Eastern Daylight Time              &lt;img class="icon" src="http://www.businesswire.com/images/icons/icon_permalink.gif" alt="" /&gt;       &lt;/a&gt;            &lt;/div&gt;                                    &lt;br /&gt;&lt;br /&gt;&lt;h1&gt;       &lt;br /&gt;&lt;br /&gt;SpinVox to Demonstrate Open API Applications at CTIA Wireless 2009     &lt;/h1&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;                             &lt;div id="story_subheadline"&gt;                  &lt;p class="bwtextaligncenter"&gt;       Pre-Registrations For SpinVox Create Fuel Co-Development Program and        Confirm Demand for Voice Conversion Services from Web-Based Technology        Developers Wanting to Build Speech 2.0 Applications     &lt;/p&gt;            &lt;/div&gt;                  &lt;!-- GENRE NOTES BEGIN --&gt;           &lt;div class="story_genre_notes"&gt;                 &lt;div&gt;CTIA Wireless 2009&lt;/div&gt;           &lt;/div&gt;           &lt;!-- GENRE NOTES END --&gt;             &lt;!-- start story body --&gt;            &lt;p&gt;LONDON &amp;amp; NEW YORK--(&lt;a href="http://www.businesswire.com/"&gt;BUSINESS WIRE&lt;/a&gt;)--SpinVox, the global leader in voice to content messaging, will showcase        three brand new Speech 2.0 applications at CTIA Wireless 2009, to be        held April 1-3 in Las Vegas. The applications have been developed in        less than a month to demonstrate the power of SpinVox Create, an open        API (Application Programming Interface) to the SpinVox Voice Message        Conversion System™ (VMCS), the world’s largest commercial speech        platform.     &lt;/p&gt;     &lt;p&gt;       SpinVox Create was announced at Mobile World Congress, Barcelona in        February 2009 and developers were invited to pre-register their interest        in SpinVox Create in advance of its launch via a web registration page - &lt;a target="_blank" href="http://cts.businesswire.com/ct/CT?id=smartlink&amp;amp;url=http%3A%2F%2Fwww.spinvox.com%2Fdeveloper&amp;amp;esheet=5919818&amp;amp;lan=en_US&amp;amp;anchor=www.spinvox.com%2Fdeveloper&amp;amp;index=1" shape="rect"&gt;www.spinvox.com/developer&lt;/a&gt;     &lt;/p&gt;     &lt;p&gt;       SpinVox Create will be launched as a key part of a two-stage corporate        API strategy that will also be announced at CTIA and rolled out by        SpinVox in the first half of 2009.     &lt;/p&gt;     &lt;p&gt;       Nearly 100 developers have already registered interest in SpinVox Create        and, of these, 20 have been selected by SpinVox to be part of the        co-development program. Those selected include business efficiency,        personal productivity, games and social networking applications.     &lt;/p&gt;     &lt;p&gt;       SpinVox Create is a simple, straightforward API that leverages SpinVox’s        commercial speech platform – which is growing quickly with more than 30        million users - to enable any developer with Web access to quickly build        commercial speech applications. It also enables SpinVox to collaborate        with third parties to expand the Speech 2.0 market and foster further        innovations in voice that complement SpinVox’s existing platform        development services for Enterprise application partners and Carrier        networks.     &lt;/p&gt;     &lt;p&gt;       “We’ve been impressed by both the quality and quantity of responses to        our pre-registration announcement,” says SpinVox co-founder and CEO        Christina Domecq. “We are clearly seeing an increased demand for voice        conversion services from technology developers who recognize that a        speech interface enables the most natural form of communication, and who        want to build Speech 2.0 applications with best-in-class open products.”     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;Demonstration of Voice innovation on Apple, Nokia, and Windows Mobile        platforms&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       Three applications will be demonstrated at CTIA Wireless 2009. These are        based on Apple iPhone, Nokia Series 60, and Microsoft Windows Mobile        platforms.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;`Travel Blog`&lt;/b&gt;, a Windows Mobile 6.0 Application developed by        Singapore-based Global Idealogy Corporation lets you tag and post your        photographs using just your voice . You can select photographs through        the application, speak a message, attach the converted text to the        photographs, post it on blog, social networking websites or send it as        email or MMS.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;`Speak-a-Text`,&lt;/b&gt; a Nokia Series 60 Application developed by        UK-based Symbian Platinum Partner, Savage Minds, incorporates the        ability to speak a text which is converted to text and placed into the        menu structure of the phone software.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;`Memo`&lt;/b&gt;, an iPhone Application, developed by UK-based SpinVox        allows iPhone users to speak a memo through the iPhone application and        after conversion into text by the SpinVox VMCS the memo resides on the        iPhone for instant access whenever needed.     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;Drive the next upturn&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       SpinVox has already received pre-registrations for SpinVox Create across        the globe and looking ahead expects rapid uptake of the API particularly        in Silicon Valley where SpinVox Web 2.0 services such as SpinVox Blog        and SpinVox Social Networks have been increasingly popular.     &lt;/p&gt;     &lt;p&gt;       Adds Domecq, “SpinVox has created a new category - carrier-grade voice        conversion - and now is helping talented developers take advantage of        the next growth opportunity in speech. The potential for innovation        between carriers and the web is enormous – along with our own        innovations we're now delivering a platform for creation of market        changing applications and supporting their transformation to        carrier-grade services. Speech 2.0 applications will be one of a cluster        of innovations that will drive the next upturn as people are        increasingly enabled to re-discover the power of their voice.”     &lt;/p&gt;     &lt;p&gt;       &lt;b&gt;About SpinVox&lt;/b&gt;     &lt;/p&gt;     &lt;p&gt;       SpinVox® is the world's largest privately-held speech technology        company, providing the only voice to text messaging services which are        used daily by millions of people and whose user base has grown over        twenty-fold in the last 12 months.     &lt;/p&gt;     &lt;p&gt;       Through significant innovations in voice and network technologies which        are protected by over 40 patents worldwide, SpinVox has converged the        two most natural forms of communication - voice and text - to create the        fastest-growing form of messaging: Voice-to-Content™.     &lt;/p&gt;     &lt;p&gt;       SpinVox services are available directly on &lt;a target="_blank" href="http://cts.businesswire.com/ct/CT?id=smartlink&amp;amp;url=http%3A%2F%2Fwww.spinvox.com&amp;amp;esheet=5919818&amp;amp;lan=en_US&amp;amp;anchor=www.spinvox.com&amp;amp;index=2" shape="rect"&gt;www.spinvox.com&lt;/a&gt;        and through leading carriers and through new media, Unified        Communications and other service providers globally.     &lt;/p&gt;     &lt;p&gt;       Implemented as a carrier-class cloud service, SpinVox is proven to be        able to easily create value from everyday user behavior using voice and        deliver rapid and easy implementation of low input, sustained high        reward services.     &lt;/p&gt;     &lt;p&gt;       At the heart of SpinVox is its ground-breaking Voice Message Conversion        System™ (VMCS), which works by combining state-of-the-art speech        technologies with a live-learning language process. Developed by the        Cambridge, UK- based SpinVox Advanced Speech Group; VMCS now serves        users across five continents in English, French, Spanish, German,        Portuguese and Italian.     &lt;/p&gt;     &lt;p&gt;       SpinVox is now live with Alltel, Cincinnati Bell, Sasktel, Rogers        Wireless, Telus, Telstra, Vodacom South Africa, Vodafone Spain, Movistar        Chile, Skype and Livejournal.     &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-3014608034209109927?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/L0pEhWjaRp4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/3014608034209109927/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=3014608034209109927" title="11 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/3014608034209109927?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/3014608034209109927?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/L0pEhWjaRp4/spinvox-open-api.html" title="SpinVox Open API" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>11</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/spinvox-open-api.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D04MRXgyfip7ImA9WxVUE0o.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-5532506685248100043</id><published>2009-03-18T07:11:00.004-04:00</published><updated>2009-03-18T07:33:04.696-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-18T07:33:04.696-04:00</app:edited><title>SpinVox response for Google voice - where is the Nuance response?</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/9F32rHt2KBhA4nxvH9diVma1pfY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/9F32rHt2KBhA4nxvH9diVma1pfY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/9F32rHt2KBhA4nxvH9diVma1pfY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/9F32rHt2KBhA4nxvH9diVma1pfY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;As per the recent &lt;a href="http://speechanalytics.blogspot.com/2009/03/google-voice-new-threat-to-reputation.html"&gt;google voice voicemail transcription announcement&lt;/a&gt;, SpinVox is reponding -&lt;br /&gt;In a recent post by Rich Tehrani from TMC, he quote a response from SpinVox about Google voice:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: left;"&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;Google is entering a marketplace that continues to be led by SpinVox, the world's largest privately-held speech technology company. We're excited by the launch of Google Voice because it will demonstrate the benefits of speech-to-text conversion and validate its deployment as a network service to an increased audience. We have already launched carrier-grade services with 13 operators - including recently with Skype - on five continents and SpinVox is in use by in excess of 30 million users. SpinVox's 97 percent accuracy in conversion is now the benchmark around the world. - Christina Domecq, co-founder and CEO of SpinVox&lt;/span&gt;&lt;/blockquote&gt;&lt;/div&gt;&lt;br /&gt;And this 97% are human results. I am not sure that Google will post any response but we need to wait for usability feedbacks from users.&lt;br /&gt;&lt;br /&gt;I further wonder about Nuance's response to Google Voice..?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-5532506685248100043?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/gjpxy1M_cVI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/5532506685248100043/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=5532506685248100043" title="7 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5532506685248100043?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/5532506685248100043?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/gjpxy1M_cVI/spinvox-response-for-google-voice-where.html" title="SpinVox response for Google voice - where is the Nuance response?" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>7</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/spinvox-response-for-google-voice-where.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUcCSX8yeCp7ImA9WxVUEUk.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-7570773065778461402</id><published>2009-03-15T13:03:00.004-04:00</published><updated>2009-03-15T14:51:08.190-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-15T14:51:08.190-04:00</app:edited><title>Google Voice - the new threat to the reputation of speech recognition system</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/ICP7pkgXBbGVcglOnsrY86MTAcU/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/ICP7pkgXBbGVcglOnsrY86MTAcU/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/ICP7pkgXBbGVcglOnsrY86MTAcU/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/ICP7pkgXBbGVcglOnsrY86MTAcU/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Google posted last week the new enhancements for GrandCentral and actually its evolution to Google Voice.  It is in general an application to  better manage your voice communications.&lt;br /&gt;&lt;br /&gt;The new application improves the way you use your phone. You can get transcripts of your voicemail (see the video below) and archive and search all of the SMS text messages you send and receive. You can also use the service to make low-priced international calls and easily access &lt;a href="http://www.google.com/goog411/"&gt;Goog-411&lt;/a&gt; directory assistance. It is an addition to the GrandCentral standard features&lt;br /&gt;including a single number to ring your home, work, and mobile phones, a central voicemail inbox that you could access on the web, and the ability to screen calls by listening in live as callers leave a voicemail. You'll find these features, and more, in the Google Voice preview. Check out the &lt;a href="https://www.google.com/voice/about"&gt;features page&lt;/a&gt; for videos and more information on how these features work. It is great to have visual voicemail and this will enahance further the iPhone and simialr smartphones.&lt;br /&gt;&lt;br /&gt;&lt;object width="425" height="344"&gt;&lt;param name="movie" value="http://www.youtube.com/v/oFVXAqFNgic&amp;amp;color1=0xb1b1b1&amp;amp;color2=0xcfcfcf&amp;amp;hl=en&amp;amp;feature=player_embedded&amp;amp;fs=1"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;embed src="http://www.youtube.com/v/oFVXAqFNgic&amp;amp;color1=0xb1b1b1&amp;amp;color2=0xcfcfcf&amp;amp;hl=en&amp;amp;feature=player_embedded&amp;amp;fs=1" type="application/x-shockwave-flash" allowfullscreen="true" width="425" height="344"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Google is taking its voice recognition into prime time and into a very delicate position as it exposes all the voice transcripts load and clear with no option for human correction. I can only guess what will happen to voicemails in foreign language, heavy accent etc. We've all seen the mistakes in the Google speech recognition system used on for mining presidential campaign in the US and trust me, most of us do not speak as clear as Obama.&lt;br /&gt;&lt;br /&gt;So as a user, there are several options for getting a transcription to your voice mail. SpinVox, Nuance and all other small players (SimulScribe, Jott etc.) who rely on human review and Google who is claiming that there maybe errors but its the user decision whether to rely on the provided transcripts.&lt;br /&gt;&lt;br /&gt;I anticipate that the frustration from the quality of Google Voice transcription is going to be a source for bad attitude towards speech recognition technology (see the &lt;a href="http://speechanalytics.blogspot.com/2008/03/congratulations-to-spinvox-for-their.html"&gt;Dilbert cartoon&lt;/a&gt; I put in this blog in the past). While there were mechanisms to "hide" the embracing mistakes in other systems, here all the transcript will be visible and may relate to mission critical information.&lt;br /&gt;&lt;br /&gt;My recommendation to Google is to open an interface for providing a manual transcripts. Either this can be connected to the SpinVox API (announced recently) or alternatively open an interface that allows sending a proposed transcript + Audio and receiving back a corrected transcript. This can open a new market for people who will be willing to send their voice communication to remote secretaries  (probably in India) who will transcribe the audio and return it to Google Voice. This can also be a perfect fit to Amazon’s &lt;a href="http://www.mturk.com/" target="_blank"&gt;Mechanical Turk&lt;/a&gt; service where you can get people to perform simple tasks. If you want further info, contact me directly about the way it should be constructed including adaptive language id etc.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-7570773065778461402?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/A46uu2EM8Sw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/7570773065778461402/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=7570773065778461402" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7570773065778461402?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/7570773065778461402?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/A46uu2EM8Sw/google-voice-new-threat-to-reputation.html" title="Google Voice - the new threat to the reputation of speech recognition system" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2009/03/google-voice-new-threat-to-reputation.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEcNQXw-fCp7ImA9WxRRGEQ.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-8271163283502823479</id><published>2008-10-01T12:34:00.003-04:00</published><updated>2008-10-01T17:21:30.254-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-10-01T17:21:30.254-04:00</app:edited><title>Nuance buys speech recognition business from Philips</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/D8vOYytmLypc1b3sOPDxA4jBfFI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/D8vOYytmLypc1b3sOPDxA4jBfFI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/D8vOYytmLypc1b3sOPDxA4jBfFI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/D8vOYytmLypc1b3sOPDxA4jBfFI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Another step in speech recognition consolidation: Nuance Communications buys Philips speech recognition unit&lt;br /&gt;&lt;br /&gt;More than five years after acquiring Philips Speech Processing Business Units, Nuance completes acquiring the Philips speech recognition business. While the older acquisition was for telephony and voice control, the recent acquisition is mainly around the healthcare speech dictation market. It seems more of a business driven acquisition (the EU market) rather than technology. Will the future of Speech Magic be the same as Speech Pearl? Nuance was very aggressive and I assume efficient with its acquisitions making sure that no multiple engines will coexist for the long term.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="t"&gt;&lt;a href="http://biz.yahoo.com/ap/081001/nuance_communications_acquisition.html?.v=1"&gt;Nuance Communications buys speech recognition unit&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;Wednesday October 1, 8:40 am ET&lt;br /&gt;Nuance Communications pays $96.1 million for speech recognition unit in Europe&lt;br /&gt;&lt;br /&gt;BURLINGTON, Mass. (AP) -- Nuance Communications Inc., which makes speech recognition software, said Wednesday it bought Royal Philips Electronics' speech recognition unit for about 66 million euros, or US$96.1 million.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Nuance bought the unit in order to expand its presence in the European health care market.&lt;br /&gt;&lt;br /&gt;Under the deal, Netherlands-based Royal Philips received a payment of 21.7 million euros on Sept. 26 and will receive 44.3 million euros in a deferred payment on Sept. 21, 2009. Nuance said the deal will add between $36 million and $39 million in revenue in 2009, and now expects revenue to reach $410 million. The buyout will add up to 1-cent per share in profit.&lt;br /&gt;&lt;br /&gt;The addition of the speech recognition unit, Nuance said, enhances the company's ability to provide documentation and communication services to health care organizations throughout Europe. It said about $2 billion is spent annually in Europe for health care companies to manually process clinical information.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://australia.nuance.com/news/20030103_philips.asp"&gt;ScanSoft Completes Acquisition of Philips Speech Processing Business Units&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Leading Telephony and Voice Control Technologies and Applications Expand ScanSoft's Presence in Telephony, Automotive and Embedded Markets&lt;h2&gt;&lt;/h2&gt;   &lt;p&gt;&lt;b&gt;PEABODY, Mass., 3rd February, 2003&lt;/b&gt; - ScanSoft, Inc. (Nasdaq: SSFT), a leading provider of imaging, speech and language solutions, today announced that is has closed the acquisition of the Speech Processing Telephony and Voice Control business units and related intellectual property from Royal Philips Electronics (NYSE: PHG, AEX: PHI).&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;"ScanSoft's acquisition of the Telephony and Voice Control business units from Philips Speech Processing further enhances our market share in key markets and gives the company additional competitive momentum in our target markets," said Paul Ricci, ScanSoft's chairman and CEO. "With a broader set of technologies and an enhanced distribution channel, ScanSoft is well positioned to capitalise on growth opportunities in the telephony, automotive and embedded markets. In addition, we expect the relationship we have forged with Philips to contribute to the development of new speech technologies in the future."&lt;/p&gt;  &lt;p&gt;The businesses acquired by ScanSoft include: &lt;/p&gt; &lt;p&gt; &lt;/p&gt;&lt;ul&gt;&lt;li&gt;Telephony - The Philips Speech Processing Telephony business allows enterprise customers, telephony vendors and carriers to speech-enable a range of services, including directory assistance, interactive voice response and voice portal applications. Philips automatic speech recognition (ASR) engine, SpeechPearl®, supports more than 40 languages and can process a vocabulary of more than one million words, making it the solution of choice for telephony applications that target global and broad regional markets. Philips has also leveraged its expertise in telephony ASR to develop VoiceRequest™, an enterprise auto-attendant solution, and automated directory assistance implementations that have been deployed by Telia in Sweden, the Japan Multimedia Service and Telefónica de Argentina, among others. &lt;/li&gt;&lt;br /&gt;&lt;br /&gt;&lt;li&gt;Voice Control - The Philips Speech Processing Voice Control business is addressing growing consumer demand for speech-enabled automotive, mobile and consumer electronics products. Philips' SpeechWave™ and VoCon® small-footprint speech recognition engines are ideal for embedded applications including voice-control of climate and entertainment features within cars. These solutions are also used within navigation systems and to enable automated voice dialing within mobile phones, including those from Philips.&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;The consideration for the transaction comprises a $27.5 million three-year, zero-interest convertible subordinated debenture, convertible at any time into common shares of ScanSoft at $6.00 per share; 4.1 million euros in cash, of which 3.1 million euros were paid at closing and 1 million euros are payable by December 31, 2003; and a 5 million euro 5% interest note due December 31, 2003. The cash payable is subject to adjustment in accordance with the provisions of the agreement as amended.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-8271163283502823479?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/DGPZVTwwcDU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/8271163283502823479/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=8271163283502823479" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/8271163283502823479?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/8271163283502823479?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/DGPZVTwwcDU/nuance-buys-speech-recognition-business.html" title="Nuance buys speech recognition business from Philips" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2008/10/nuance-buys-speech-recognition-business.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0EEQ306eSp7ImA9WxRTFEk.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-4340947297967561423</id><published>2008-09-02T15:35:00.001-04:00</published><updated>2008-09-03T09:46:42.311-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-09-03T09:46:42.311-04:00</app:edited><title>iPhone speech recognition - status</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/AmOR5XqrWpsq7wE6fmdTThuaSwg/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/AmOR5XqrWpsq7wE6fmdTThuaSwg/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/AmOR5XqrWpsq7wE6fmdTThuaSwg/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/AmOR5XqrWpsq7wE6fmdTThuaSwg/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;The iPhone mania is pushing many vendors to offer speech recognition on this platform. As there are many offerings, I enclose a summary to ensure a simple presentation of the current status. I am not certain it covers all iPhone speech offering and will be glad to receive comments about additional speech recognition solutions for the iPhone.&lt;br /&gt;&lt;br /&gt;There are many posts on the net about speech recognition around iPhone, some for command and control and some for perfroming free speech web search. Many people are disappointed that there is no speech recognition for iPhone while other are discussing working or semi-working applications and few names are mentioned:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;AT&amp;amp;T&lt;/li&gt;&lt;li&gt;Nuance&lt;/li&gt;&lt;li&gt;VoiceSignal&lt;/li&gt;&lt;li&gt;VoiceDialer&lt;br /&gt;&lt;/li&gt;&lt;li&gt;VOICE DIAL&lt;/li&gt;&lt;li&gt;VoiceThis&lt;/li&gt;&lt;li&gt;Fonix&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:180%;"&gt;AT&amp;amp;T&lt;/span&gt; exposed recently a research project - &lt;a href="http://www.research.att.com/viewProject.cfm?prjID=355"&gt;Watson Speech Mashups Architecture&lt;/a&gt; - a new software framework that casts AT&amp;amp;T’s WATSON speech recognition as a web service to economically bring speech processing technologies to the larger web and mobile developer community. This new capability provides network-hosted speech technologies for multimedia devices with broadband access (iPhone, BlackBerry®, IPTV set-top box, SmartPhones, etc.) without the need to install, configure, and manage speech recognition software and equipment. This enables easy and rapid development of new speech and multimodal mobile services as well as new web-based services. The software implementation is based on well-established web programming models, such as SOA, REST, AJAX, JavaScript and JSON.&lt;br /&gt;&lt;br /&gt;AT&amp;amp;T provides a &lt;a href="http://www.research.att.com/mediaplayer/iphone.html"&gt;video&lt;/a&gt; of the YellowPages.com application with speech recognition on an iPhone.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:180%;"&gt;&lt;a href="http://www.nuance.com/"&gt;Nuance &lt;/a&gt;&amp;amp; &lt;a href="http://www.voicesignal.com/"&gt;VoiceSignal&lt;/a&gt;&lt;/span&gt;&lt;a href="http://www.voicesignal.com/"&gt; &lt;/a&gt;are promoting separate solutions or maybe the same one? VoiceSignal is part of Nuance but still exposed its own iPhone speech recognition solution.  I am confused and I bet many others inside Nuance are confused.&lt;br /&gt;&lt;br /&gt;I wrote in the past about the &lt;a href="http://speechanalytics.blogspot.com/2008/06/iphone-speech-recognition.html"&gt;Nuance iPhone offering&lt;/a&gt;. What I managed to understand from the various announcements is that Nuance named its solution OVS for open voice search (in some cases it is referred to as open vsearch). It is also known as the Mobile Solution Suite. Surprisingly on the VoiceSignal website, there is a demo video of vSearch which is a speech recognition based search application for iPhone. The application logo displayed on the iPhone in this video is the VoiceSignal logo (unlike the logo on the Nuance vsearch demo which is of course the Nuance logo).&lt;br /&gt;&lt;span style="font-weight: normal;"&gt;If you are still not confused, take a look at the Nuance announcement from June 10th and compare to the VoiceSignal announcement from August 24th:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;p style="color: rgb(102, 51, 0);"&gt;Nuance Unveils Voice Search Prototype On iPhone&lt;/p&gt;&lt;/span&gt;APPLE WWDC, SAN FRANCISCO. June 10, 2008 — Nuance Communications, Inc. (NASDAQ: NUAN), a leading provider of speech solutions, today unveiled a ground-breaking prototype for voice search capabilities shown on the Apple iPhone (NASDAQ: AAPL).     &lt;p style="color: rgb(102, 51, 0);"&gt;&lt;br /&gt;The newly designed application introduces a new, more compelling consumer and search experience. Through Nuance speech recognition servers, mobile consumers — with no training required — can simply speak requests into their phone like “Find the Apple store in Boston, Massachusetts,” “Score of the Boston Celtics game,” or “Play Hannah Montana Best of Both Worlds” to quickly and accurately search the mobile web or in the future dictate an IM, SMS or e-mail message. The prototype, code-named “OVS” for open voice search, will allow mobile operators to offer simple ‘say anything’ search capabilities and is search engine agnostic, able to link to any search engine of an operator’s choosing. A video demonstration of the new application can be found at &lt;a href="http://www.nuance.com/mobilesuite"&gt;www.nuance.com/mobilesuite&lt;/a&gt;.  &lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 51, 0);"&gt;VoiceSignal Voice Enables iPhone in Proof of Concept Development&lt;/span&gt; &lt;p style="color: rgb(102, 51, 0);" align="left"&gt;WOBURN, Mass., August 24, 2007– VoiceSignal Technologies, Inc., a leading supplier of speech recognition solutions, today announced that VoiceSignal engineers have ported several of VoiceSignal’s applications to the iPhone. These initial proof-of-concept applications include VSearch (mobile local search by voice) and VTunes (voice enabled music player). &lt;/p&gt;&lt;p style="color: rgb(102, 51, 0);"&gt;The video demonstrations can be found either on the VoiceSignal website  (&lt;a href="http://www.voicesignal.com/"&gt;www.voicesignal.com&lt;/a&gt;) or on the  following YouTube links:&lt;/p&gt;&lt;p style="color: rgb(102, 51, 0);"&gt;VTunes: &lt;a href="http://www.youtube.com/watch?v=zne4rwCCmAc"&gt;http://www.youtube.com/watch?v=zne4rwCCmAc&lt;/a&gt;&lt;br /&gt;VSearch: &lt;a href="http://youtube.com/watch?v=ayrCCw5xWug"&gt;http://youtube.com/watch?v=ayrCCw5xWug&lt;/a&gt;&lt;/p&gt;&lt;br /&gt;I am surprised by the Nuance marketing performance on this issue. I hope that someone from Nuance will wake up to clear this issue and maybe add some comments to enable us understand their offering.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;SpeechCloud's &lt;span style="font-size:180%;"&gt;VoiceDialer &lt;/span&gt;(free) was the first iPhone application to try to offer speech dialing on the iPhone. VoiceDialer takes advantage of the iPhone's always-on internet connection to record your voice and send it to SpeechCloud's servers to perform the actual recognition. Similar to the AT&amp;amp;T and Nuance approach. Once recognized, the application pulls up the contact's name and allows you to select which number to dial. Some of the criticism of the application is that it requires too much manual interaction (tapping on buttons) to actually dial a number, and slow response time due to the transferring of data across wireless networks.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.makayama.com/iphonevoicedial.html"&gt;&lt;span style="font-size:180%;"&gt;VoiceDial  &lt;/span&gt;&lt;/a&gt;( by Makayama) ($ 14.99 on apple store) avoids actual speech recognition and instead perform audio comparison. VoiceDial requires you to actually record your own voice for each contact which can then later be used to match your voice command.  If you are willing to pay the $15 and willing to record yourself saying your contacts, MercuryNews claims the product "works as advertised" and "had no problems recognizing the contact I wanted to call, even when it was similar to other names I'd recorded."&lt;br /&gt;&lt;br /&gt;HRL Technologies' &lt;a href="http://www.voicethis.com/"&gt;&lt;span style="font-size:180%;"&gt;VoiceThis &lt;/span&gt;&lt;/a&gt;Dialer , ( $9.99) is an application that actually tries to perform speech recognition within the iPhone itself. No wireless connection required. Instead, the application runs within the iPhone. VoiceThis Dialer promises to offer completely hands free activity with the ability to dial contacts and even quit the application with your voice.&lt;br /&gt;&lt;span style="font-size:180%;"&gt;&lt;br /&gt;&lt;a href="http://www.fonix.com/"&gt;Fonix &lt;/a&gt;&lt;/span&gt;Speech is currently developing &lt;a href="http://www.fonix.com/pr/20080707.pdf"&gt;iSpeak&lt;/a&gt;, which includes a run-time engine that sits on the phone allowing users to interact with the personal contents of their Apple iPhone™. Unlike other voice applets that enable voice search of the Internet by sending commands over the airwaves, this client-side application gives users the power of voice interaction with their personal content and eliminates network latency. Fonix iSpeak™ connects the user by just saying the phone number or by saying the name of a person in the contacts database. Additionally, users will be able to navigate their music libraries and launch a song or playlist simply by saying the name of the artist, song, or playlist.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;To summarize, we are facing a proliferation of speech recognition applications on the iPhone.&lt;br /&gt;A Key for the evolution of speech recognition on the iPhone is the 3G capability (which provides a fast channel to server side conmputing) and the platform openness - both released recently. As this two criteria are fulfilled, we should expect a quick growth in speech recognition applications availability for iPhone.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-4340947297967561423?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/grlniTbAzeU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/4340947297967561423/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=4340947297967561423" title="3 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/4340947297967561423?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/4340947297967561423?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/grlniTbAzeU/iphone-speech-recongiotion-status.html" title="iPhone speech recognition - status" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>3</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2008/09/iphone-speech-recongiotion-status.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0YERHk_cCp7ImA9WxdaEUo.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-1163965592264662775</id><published>2008-08-19T14:09:00.000-04:00</published><updated>2008-08-19T14:38:25.748-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-08-19T14:38:25.748-04:00</app:edited><title>IBM is behind Vlingo's technology - will Nuance sue IBM?</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/UOocVMSFUWU9E1uTwQVQU-ql7oc/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/UOocVMSFUWU9E1uTwQVQU-ql7oc/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/UOocVMSFUWU9E1uTwQVQU-ql7oc/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/UOocVMSFUWU9E1uTwQVQU-ql7oc/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;Recently Nuance decided to play aggressively against &lt;a href="http://www.vlingo.com"&gt;Vlingo&lt;/a&gt;. &lt;a href="http://speechanalytics.blogspot.com/2008/06/nuance-vlingo-if-you-are-not-sued-you.html"&gt;While filing a lawsuit&lt;/a&gt; against Vlingo seems reasonable for Nuance who has a long history for using the lawsuit weapon against small companies (usually pre acquisition attempts). However the recent &lt;a href="http://www.businessweek.com/innovate/content/aug2008/id20080818_745252.htm"&gt;Business Week article&lt;/a&gt; - IBM's Speech Recognition, expose the fact that Vlingo is using the IBM technology as the basis for their solution.&lt;br /&gt;&lt;br /&gt;I wonder if Nuance will continue the legal actions given this information. IBM speech recognition patents is one of the broadest in the industry and in general nowone should mess with IBM about patents.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-1163965592264662775?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/A5zUVpXwhFs" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/1163965592264662775/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=1163965592264662775" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1163965592264662775?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/1163965592264662775?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/A5zUVpXwhFs/ibm-is-behind-vlingo-technology-will.html" title="IBM is behind Vlingo's technology - will Nuance sue IBM?" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2008/08/ibm-is-behind-vlingo-technology-will.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkUCQHY8fCp7ImA9WxdVEk8.&quot;"><id>tag:blogger.com,1999:blog-1030722385446106708.post-6265717091797731845</id><published>2008-07-16T04:59:00.000-04:00</published><updated>2008-07-16T10:57:41.874-04:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-07-16T10:57:41.874-04:00</app:edited><title>Google Video Search via Speech Recognition</title><content type="html">
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/OurVSOVUc6y1ytoiIjqYJEMEVgk/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/OurVSOVUc6y1ytoiIjqYJEMEVgk/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/OurVSOVUc6y1ytoiIjqYJEMEVgk/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/OurVSOVUc6y1ytoiIjqYJEMEVgk/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp1.blogger.com/_7ZYqYi4xigk/SHvUEbOjgyI/AAAAAAAAAv4/K2R8ubzaPwQ/s400/screen-shot-final-final.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 200px;" src="http://bp1.blogger.com/_7ZYqYi4xigk/SHvUEbOjgyI/AAAAAAAAAv4/K2R8ubzaPwQ/s400/screen-shot-final-final.jpg" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Finally a hint on the expected Google move to the speech recognition arena.&lt;br /&gt;&lt;br /&gt;Google announced at  the &lt;a href="http://googleblog.blogspot.com/2008/07/in-their-own-words-political-videos.html"&gt;Official Google Blog&lt;/a&gt;, the availability of a new video search capability based on speech recognition. &lt;br /&gt;&lt;br /&gt;It was release as a gadget you can embed on your iGoogle homepage and is a good preview of things to come. &lt;br /&gt;The gadget only searches videos uploaded to YouTube's Politicians channels, which include videos from Senator Obama's and Senator McCain's campaigns, as well as those from dozens of other candidates and politicians. It usually takes less than a few hours for a video to appear in the index after it has been published on YouTube.&lt;br /&gt;&lt;br /&gt;So apart from congratulations to the google team who are exposed to the public for the first time, how are they compared to other speech recognition engines aimed for broadcast quality? The google team refer to their precision: "While some of the transcript snippets you see may not be 100% accurate, we hope that you'll find the product useful for most purposes." While I do not understand what are the purposes for just searching within the YouTube political channel, people should be aware of much more mature solutions developed in the past years. From the pioneering work of BBN and IBM to the existing online solutions like &lt;a href="http://search.everyzing.com/"&gt;everyzing&lt;/a&gt;, &lt;a href="http://www.tveyes.com/"&gt;tveyes&lt;/a&gt;, &lt;a href="http://www.blinkx.com/"&gt;blinx&lt;/a&gt;, &lt;a href="http://www.snipp.tv/"&gt;snipp.tv&lt;/a&gt; by NSC and more. Based on the perceived quality, the google team has a long way to go in order to get to the first league and to be able to analyze data which is not at broadcast quality. The good news as users is that the YouTube data is easier to process relative to telephony calls speech recognition performed widely today at contact centers by companies like Verint, Nice, Autonomy, Utopy, Nexidia, CallMiner and other players.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1030722385446106708-6265717091797731845?l=speechanalytics.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/OzfbY/~4/RlqNsUC6ncU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://speechanalytics.blogspot.com/feeds/6265717091797731845/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=1030722385446106708&amp;postID=6265717091797731845" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6265717091797731845?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/1030722385446106708/posts/default/6265717091797731845?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/OzfbY/~3/RlqNsUC6ncU/google-video-search-via-speech.html" title="Google Video Search via Speech Recognition" /><author><name>Ofer</name><uri>http://www.blogger.com/profile/04077145148331478519</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://bp1.blogger.com/_7ZYqYi4xigk/SHvUEbOjgyI/AAAAAAAAAv4/K2R8ubzaPwQ/s72-c/screen-shot-final-final.jpg" height="72" width="72" /><thr:total>1</thr:total><feedburner:origLink>http://speechanalytics.blogspot.com/2008/07/google-video-search-via-speech.html</feedburner:origLink></entry></feed>

