<?xml version='1.0' encoding='UTF-8'?><rss xmlns:atom="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearchrss/1.0/" xmlns:blogger="http://schemas.google.com/blogger/2008" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" version="2.0"><channel><atom:id>tag:blogger.com,1999:blog-1268266350438382990</atom:id><lastBuildDate>Mon, 30 Jun 2025 14:17:56 +0000</lastBuildDate><category>conference recap</category><category>interspeech</category><category>autobi</category><category>evaluation</category><category>machine learning</category><category>speech prosody</category><category>teaching</category><category>academia</category><category>google</category><category>hlt-naacl</category><category>icassp</category><category>java</category><category>mac os x</category><category>major themes</category><category>negative results</category><category>prominence</category><category>writing</category><title>Spoken Language Processing</title><description>Some thoughts on Spoken Language Processing, with tangents on Natural Language Processing, Machine Learning, and Signal Processing thrown in for good measure.</description><link>http://spokenlanguageprocessing.blogspot.com/</link><managingEditor>noreply@blogger.com (Anonymous)</managingEditor><generator>Blogger</generator><openSearch:totalResults>41</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-796287467075104820</guid><pubDate>Tue, 25 Oct 2016 19:26:00 +0000</pubDate><atom:updated>2016-10-25T12:26:06.735-07:00</atom:updated><title>States of the Arts</title><description>&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;Fall 2016 has seen important improvements to the state of the art in both speech synthesis and speech recognition.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;In September, Google DeepMind unveiled &lt;/span&gt;&lt;a href=&quot;https://deepmind.com/blog/wavenet-generative-model-raw-audio/&quot; style=&quot;text-decoration: none;&quot;&gt;&lt;span style=&quot;background-color: transparent; color: #1155cc; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: underline; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;WaveNet&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;, a speech synthesis system, that generates exceptionally natural sounding speech. &amp;nbsp;In October, Microsoft Research &lt;/span&gt;&lt;a href=&quot;https://blogs.microsoft.com/next/2016/10/18/historic-achievement-microsoft-researchers-reach-human-parity-conversational-speech-recognition/#sm.000000wu6huvewud2trt2dd7y1cl0&quot; style=&quot;text-decoration: none;&quot;&gt;&lt;span style=&quot;background-color: transparent; color: #1155cc; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: underline; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;announced&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt; that they had developed a speech recognition system that matches the word error rate of human transcribers.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;A few observations.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 700; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;Life moves pretty fast.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;It’s a time of rapid progress in speech and spoken language processing. &amp;nbsp;Microsoft, IBM and Baidu have all posted better and better speech recognition numbers in the last few years.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 700; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;Deep Learning has the goods.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;It’s very easy to be dismissive of “deep learning” as being over-hyped. &amp;nbsp;However, both of these advances rely heavily on deep neural networks. &amp;nbsp;So far, they continue to deliver on their promise. &amp;nbsp;&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 700; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;Arxiv.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;One of the &lt;/span&gt;&lt;a href=&quot;https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/CD-DNN-HMM-SWB-Interspeech2011-Pub.pdf&quot; style=&quot;text-decoration: none;&quot;&gt;&lt;span style=&quot;background-color: transparent; color: #1155cc; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: underline; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;first important ASR papers&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt; showing that DNNs can outperform traditional GMM acoustic models on a hard task (i.e. Switchboard) was presented at Interspeech 2011. &amp;nbsp;This means the work was done at least 6 months earlier. &amp;nbsp;Both of these advances are described not only by press releases and glossy webpages, but also technical papers [&lt;/span&gt;&lt;a href=&quot;https://arxiv.org/pdf/1609.03499.pdf&quot; style=&quot;text-decoration: none;&quot;&gt;&lt;span style=&quot;background-color: transparent; color: #1155cc; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: underline; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;WaveNet paper&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;, &lt;/span&gt;&lt;a href=&quot;https://arxiv.org/pdf/1610.05256v1.pdf&quot; style=&quot;text-decoration: none;&quot;&gt;&lt;span style=&quot;background-color: transparent; color: #1155cc; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: underline; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;MSR paper&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;]. &amp;nbsp;Both were posted to arxiv. &amp;nbsp;There’s no doubt that immediate, self-publication is flooding the scientific engine with oxygen. &amp;nbsp;Progress is rapid because we’re still learning the limits of neural networks, but also groups are able to compete and learn from each other much more quickly than semi-annual conferences enable.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 700; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;WaveNet is a new way of doing things.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 400; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;WaveNet synthesizes speech in a novel way. &amp;nbsp;The resulting waveform is generated one sample at a time, conditioned on the previous sample. &amp;nbsp;This is essentially doing parametric speech synthesis without a vocoder. &amp;nbsp;Not only does this approach work surprisingly well, it’s exciting in its newness as well. &amp;nbsp;There’s other novelty in this work too (1. Using a classification approach to predict discretized mu-law values instead of predicting a continuous value 2. The dilating convolution layer.) but this work is most important for showing the promise of this approach to generating audio data.&lt;/span&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;line-height: 1.38; margin-bottom: 0pt; margin-top: 0pt;&quot;&gt;
&lt;span style=&quot;background-color: transparent; color: black; font-family: Arial; font-size: 14.666666666666666px; font-style: normal; font-variant: normal; font-weight: 700; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;The Microsoft ASR work is not a new way of doing things.&lt;/span&gt;&lt;/div&gt;
&lt;span id=&quot;docs-internal-guid-013c8b0e-fd4e-4296-4277-9a7e88210a88&quot;&gt;&lt;span style=&quot;font-family: Arial; font-size: 14.6667px; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;The work is of the highest quality, without a doubt. &amp;nbsp;This paper represents a exeptionally well-engineered collection of effective speech recognition tools that have hit an interesting milestone. &amp;nbsp;However, the individual pieces will seem familiar to anyone up to date on the current state-of-the-art. &amp;nbsp;The major improvements come improvements to language modeling (an LSTM LM specifically), LACE units which MSR showed at Interspeech this year, lattice free MMI, and spatial smoothing (which is similar to Cambridge’s “stimulated training”). &amp;nbsp;The Microsoft team has put these parts together more effectively than anyone else, and it’s an important achievement. &amp;nbsp;But compared to the WaveNet development, it’s a more incremental step.&lt;/span&gt;&lt;/span&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2016/10/states-of-arts.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-1907971980030341782</guid><pubDate>Mon, 13 Oct 2014 20:56:00 +0000</pubDate><atom:updated>2014-10-13T14:20:49.071-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">conference recap</category><category domain="http://www.blogger.com/atom/ns#">interspeech</category><title>Interspeech 2014 Recap</title><description>This year&#39;s Interspeech was in Singapore. &amp;nbsp; &amp;nbsp;Singapore is, in some ways, a very easy venue to travel to. &amp;nbsp;It&#39;s a modern, cosmopolitan city. &amp;nbsp;They speak English. &amp;nbsp;It&#39;s tropical, but you&#39;re never more than a hundred meters from air conditioning. &amp;nbsp; In other ways, it&#39;s so far. &amp;nbsp;Over 20 hours each way. &amp;nbsp;I like airplanes. &amp;nbsp;They&#39;re as magical as any technology we&#39;ve got. &amp;nbsp;But 20 hours is a long time to sit still. &amp;nbsp;Think about how many steps you take in 20 hours. &amp;nbsp;How many different faces you see. &amp;nbsp;Then reduce that to about 200 steps, and 10 people. &lt;br /&gt;
&lt;br /&gt;
&lt;h1 class=&quot;quoteText&quot; style=&quot;background-color: white; color: #181818; font-family: georgia, serif; font-size: 14px; font-weight: normal; line-height: 18px; margin: 0px 0px 15px; padding: 0px; text-align: right;&quot;&gt;
&quot;Because we are all poets or babies in the middle of the night, struggling with being.&quot; - Martin Amis &quot;London Fields&quot;&lt;/h1&gt;
Interspeech 2014 was a well run conference. &amp;nbsp;The quality of papers was generally quite high. The venue easily handled the size of the event and the wifi was steady. &amp;nbsp;It was difficult to find enough food at the Welcome Reception, but easy to find enough beer. &amp;nbsp;The banquet was flawed -- segregating vegetarians is pretty rude -- but they all are, and at least there was enough food to go around, and everyone ate promptly. &amp;nbsp;And of course, there was Mambo and Jambo. &amp;nbsp;I&#39;m not going to go into it here, but find someone who attended the opening ceremony and ask them to describe it. &amp;nbsp;Then don&#39;t believe them, and ask someone else to do the same. &amp;nbsp;It was &quot;odd&quot; at best. &amp;nbsp;But be sure, I&#39;ll be attending the Dresden opening ceremony to see how they one-up it.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Deep Learning&lt;/b&gt;&lt;br /&gt;
A few years ago, DNNs invaded speech conferences. &amp;nbsp;Deep Learning is still a significant buzz word, and a hot topic. &amp;nbsp;But for, the better of everyone involved, the intensity has cooled. &amp;nbsp;Now the interest in DNNs seems to have shifted into 1) understanding how they work, and how to best train them to a task and 2) Long Short-Term Memory units to model sequential data. &amp;nbsp;The latter really broke out at this years conference. &amp;nbsp;There were a number of papers finding that them to be an effective alternative to traditional recurrent nets trained with back-propagation through time.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;BABEL&lt;/b&gt;&lt;br /&gt;
I&#39;ve been involved with the &lt;a href=&quot;http://www.iarpa.gov/index.php/research-programs/babel&quot;&gt;IARPA-BABEL&lt;/a&gt; program, so my view is pretty biased on this front, but I felt like the presence of BABEL in this year&#39;s Interspeech was particularly large. &amp;nbsp;The program&#39;s central task is performing keyword search on low-resource languages. It has an aggressive evaluation schedule with an increased number of *new* languages involved each year. &amp;nbsp;There were at least two sessions devoted to keyword search, and papers evaluated by either BABEL-proper or the NIST OpenKWS challenge seemed to be all over the conference. &amp;nbsp;(Searching the paper index suggests that there are between 30 and 40 BABEL papers, and another 10 or so OpenKWS papers.) &amp;nbsp;It seems clear that this program has had a large impact on ASR and KWS research. &amp;nbsp;2014 was certainly the high-water mark here, as the program shrunk by 50% last year, but it&#39;s worth noting its effect on the field.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;Some standout papers&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;I don&#39;t mean to suggest that these are &quot;the best&quot; papers, but they&#39;re ones that caught my eye for one reason or another.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;div class=&quot;p1&quot;&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;a href=&quot;http://www.isca-speech.org/archive/interspeech_2014/i14_0890.html&quot;&gt;Acoustic Modeling with Deep Neural Networks Using Raw Time Signal for LVCSR&lt;/a&gt;&amp;nbsp;by&amp;nbsp;Zoltán Tüske, Pavel Golik, Ralf Schlüter and&amp;nbsp;Hermann Ney. &amp;nbsp; Part of the promise of Deep Learning is the ability to &quot;learn feature representations&quot; directly from data. &amp;nbsp;This is frequently touted as a description of what is happening in the first hidden layer of a deep net. &amp;nbsp;So, the logic goes, do we need MFCC/PLP/etc. features, or can we do speech recognition directly on a raw acoustic signal? &amp;nbsp;This is the first paper I&#39;m aware of to affirmatively show that &quot;yes, yes we can&quot;. &amp;nbsp;It requires a good amount of training data, and rectified linear (ReLU) neurons work better for this, but 1) it works competitively with traditional features and 2) many of the first hidden layer neurons can be shown to be learning a filterbank. &amp;nbsp;Very cool.&lt;/span&gt;&lt;/div&gt;
&lt;div class=&quot;p2&quot;&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class=&quot;p2&quot;&gt;
&lt;/div&gt;
&lt;div class=&quot;p1&quot;&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;u&gt;&lt;a href=&quot;http://www.isca-speech.org/archive/interspeech_2014/i14_2645.html&quot;&gt;Backoff Inspired Features for Maximum Entropy Language Models&lt;/a&gt;&lt;/u&gt;&amp;nbsp;by Fadi Biadsy, Keith Hall, Pedro Moreno and Brian Roark. &amp;nbsp;In n-gram language modeling, when a sequence of words A, B, C have never been observed, its n-gram probability P(C|A,B) can be approximated by the probability P(C|B). &amp;nbsp;But to be sensible about it, you&#39;ve got to apply a backoff penalty. &amp;nbsp;Discriminative language models can seamlessly incorporate backoff features, F(A,B,C), F(B,C), etc. and learn appropriate weights. &amp;nbsp;The key insight in this paper is that when a discriminative model uses these backoff estimates, it incurs no penalty. &amp;nbsp;It&#39;s essentially overestimating the probability of uncommon n-grams in the context of common (n-k)-grams. &amp;nbsp;This paper seeks to fix this, and does.&lt;/span&gt;&lt;/div&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;Additional favorites some from my students:&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;a href=&quot;http://www.isca-speech.org/archive/interspeech_2014/i14_1053.html&quot;&gt;Word Embeddings for Speech Recognition&lt;/a&gt;&amp;nbsp;by Samy Bengio and Georg Heigold. &amp;nbsp;Far and away the most popular poster at the conference. &amp;nbsp;This is high on my list to read closely. &amp;nbsp;It promises to learn a euclidean space into which word decoding can happen, so that words that sound similar are closer in space. &amp;nbsp;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;a href=&quot;http://www.isca-speech.org/archive/interspeech_2014/i14_1312.html&quot;&gt;The Obligatory Contour Principle in African and European Varieties of French&lt;/a&gt;&amp;nbsp;by Mathieu Avanzi, Guri Bordal and Gélase Nimbona. &amp;nbsp;An investigation of prosodic differences in dialects of French. &amp;nbsp;Very consistent with one of my student&#39;s dissertation projects.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;a href=&quot;http://www.isca-speech.org/archive/interspeech_2014/i14_1430.html&quot;&gt;Improving Spoken Document Retrieval by Unsupervised Language Model Adaptation Using Utterance-Based Web Search&lt;/a&gt; by&amp;nbsp;Robert Herms, Marc Ritter, Thomas Wilhelm-Stein, Maximilian Eibl. &amp;nbsp;A clever way of handling OOVs in spoken document retrieval. &amp;nbsp;Essentially the idea is if I recognize W[i-2], W[i-1], UNK, W[i+1], W[i+2], go do a web search for the context, find some matching documents, and augment the language model with them, then re-decode. &amp;nbsp; Kind of like distant supervision for spoken document IR.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;span id=&quot;docs-internal-guid-821c4d4b-0b45-4ff2-41fc-ae19fcf8ded0&quot;&gt;&lt;span style=&quot;vertical-align: baseline; white-space: pre-wrap;&quot;&gt;&lt;a href=&quot;http://www.isca-speech.org/archive/interspeech_2014/i14_1910.html&quot;&gt;Learning Small-Size DNN with Output-Distribution-Based Criteria&lt;/a&gt; by &lt;/span&gt;&lt;/span&gt;Jinyu Li, Rui Zhao, Jui-Ting Huang, Yifan Gong. &amp;nbsp;How do you effectively train a small DNN without ruining its performance? &amp;nbsp;This paper out of MSR suggests training a large DNN, then using its output distribution to train the small one. &amp;nbsp;It&#39;ll take me some closer reading to fully understand why this works, but I&#39;m intrigued.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;a href=&quot;http://www.cmpe.boun.edu.tr/~kaya/IS14_compare_paper_final.pdf&quot;&gt;Canonical Correlation Analysis and Local Fisher Discriminant Analysis based&amp;nbsp;Multi-View Acoustic Feature Reduction for Physical Load Prediction&lt;/a&gt;&amp;nbsp;by&amp;nbsp;Heysem Kaya, Tuğçe Özkaptan, Albert Ali Salah, Sadık Fikret Gürgen. We tried a number of dimensionality reduction approaches for this year&#39;s Paralinguistic Challenge, but didn&#39;t get particularly good results. &amp;nbsp;These guys did using CCA and LFDA. &amp;nbsp;Looking forward to reading this one as a more general feature reduction approach when the number of features are larger than the number of training instances.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;div style=&quot;text-align: -webkit-center;&quot;&gt;
&lt;/div&gt;
&lt;div style=&quot;text-align: left;&quot;&gt;
&lt;span style=&quot;font-family: Times, &#39;Times New Roman&#39;, serif;&quot;&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style=&quot;text-align: left;&quot;&gt;
&lt;span style=&quot;font-family: Times, &#39;Times New Roman&#39;, serif;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2014/10/interspeech-2014-recap.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-2263835387434447240</guid><pubDate>Tue, 12 Aug 2014 20:55:00 +0000</pubDate><atom:updated>2014-08-12T18:57:10.642-07:00</atom:updated><title>Things I didn&#39;t know before becoming a professor: #5 How to manage my time</title><description>&lt;div dir=&quot;ltr&quot;&gt;
Grad school is an exercise in working independently.&amp;nbsp; &lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
What I mean by working independently is this:&amp;nbsp; Your daily activities are barely supervised.&amp;nbsp; Your long term activities are heavily scrutinized and harshly judged.&amp;nbsp;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
You must master this skill in order to finish a PhD.&amp;nbsp; You must have a PhD to be a professor.&amp;nbsp; So.... You&#39;ll be all set.&amp;nbsp; Right?&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;div dir=&quot;ltr&quot; style=&quot;-webkit-text-stroke-width: 0px; color: black; font-family: Times; font-size: medium; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px;&quot;&gt;
&lt;/div&gt;
&lt;br /&gt;
&lt;div dir=&quot;ltr&quot;&gt;
This covers a lot of careers, not just academics, but being a professor provides an extremely&amp;nbsp;lightly constrained schedule. &amp;nbsp; &amp;nbsp;On one hand, in this, grad school prepared me for the challenge of controlling my own time.&amp;nbsp; On the other, it gave me a false sense of confidence.&amp;nbsp; I was driving so well in the parking lot, and then I pulled into midtown traffic at rush hour.&amp;nbsp; And I&#39;m driving a go-kart.&lt;/div&gt;
&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
In grad school you learn how to motivate yourself. You can read another paper, or you can watch another episode of The Sopranos.&amp;nbsp; You can write up an experiment or you can play one more hour of Halo.&amp;nbsp; Or you can just go to the beach. &amp;nbsp;Once you&#39;re done with classes, there&#39;re no midterms coming up in a month.&amp;nbsp; No papers due in two weeks.&amp;nbsp; Just an uncomfortable meeting with your advisor, where you can make small talk and rehash previous discussions and results until the meeting is over and you think you&#39;ve played it off. (You haven&#39;t, btw.) &amp;nbsp;Most people figure this out.&amp;nbsp; There is a wealth of advice about how to get off your rear and get work done.&amp;nbsp; There are &lt;a href=&quot;http://psychology.about.com/od/psychologytopics/tp/theories-of-motivation.htm&quot;&gt;theories&lt;/a&gt; and &lt;a href=&quot;http://www.lifehack.org/articles/productivity/the-lifehack-big-list-50-top-productivity-apps-for-iphone.html&quot;&gt;apps&lt;/a&gt; and &lt;a href=&quot;http://procrastinators-anonymous.org/&quot;&gt;support groups&lt;/a&gt; and &lt;a href=&quot;http://zenhabits.net/the-top-50-productivity-blogs-most-of-which-you-havent-heard-about/&quot;&gt;better blogs&lt;/a&gt; than this devoted to the subject.&amp;nbsp; So I&#39;m going to assume that even the most procrastinative of professors (&lt;a href=&quot;https://twitter.com/RaBergstein&quot;&gt;my wife&lt;/a&gt; lovingly refers to me as an &quot;epic time waster&quot;) have a set of tools at their disposal to handle the inevitable distractions and failings of motivation. &amp;nbsp;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
But here&#39;s where it goes haywire. The breadth and depth of things that need to be handled on a given day is far greater than what was expected in grad school.&amp;nbsp; This isn&#39;t revelatory. Its to be expected; its much more responsibility.&amp;nbsp; The diversity of responsibility is as surprising as anything else.&amp;nbsp; Being a professor requires you to teach, write grants, mentor grad students, handle bureaucracies (internal to the university and externally), manage budgets, foster relationships with a broader scientific community, attend conferences, project meetings and site visits, and do some research.&amp;nbsp; Its not too much; its very doable. But its much more faceted than grad student responsibilities ready you for.&amp;nbsp; As a grad student, your responsibility is typically to do your research, and occasionally teach or take classes.&amp;nbsp; There is relatively little necessary by way of prioritizing which of many projects and tasks need your attention.&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
I see the challenge in transitioning from student to professor like this: &amp;nbsp;The motivational chops you honed during grad school have made your job as entertaining and rewarding as TV, video games and the beach.&amp;nbsp; Now, you need to repurpose them so that the frustrating parts of the job are as entertaining and rewarding as fun parts.&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
Everyday I make two lists. One: things I want to do. Two: things I don&#39;t want to do. Things I want to do usually includes reading papers, and writing code, some errands, grant ideas, sometimes course planning.&amp;nbsp; Things I don&#39;t want to do are typically bureaucratic. Plus I&#39;ve got a rule that if I&#39;ve put the same task on the &quot;want&quot; list for three days running, my attitude has outed itself and its actually a &quot;don&#39;t want&quot; kind of thing.&amp;nbsp; Through the day, I go one for me, one for them.&amp;nbsp; One from column a and one from column b.&amp;nbsp; And everyday there are a couple of things that I don&#39;t want to do, that I&#39;ve found some way to avoid doing. &lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div dir=&quot;ltr&quot;&gt;
I&#39;ve tried a lot of approaches to bring order to the responsibilities that being a professor entails, the most basic lessons I&#39;ve learned are these.&lt;br /&gt;
&lt;br /&gt;
1. Write everything down. &amp;nbsp;I&#39;ve got a decent memory, but there&#39;s no way that I can keep my schedule and the status of each students project and other milestones and what I was thinking about yesterday at hand without a lot of notes. &lt;br /&gt;
&lt;br /&gt;
2. Let go. &amp;nbsp;Days are not long enough. &amp;nbsp;When I go to bed, there is always something left undone. &amp;nbsp;Make sure that it&#39;s not something truly important.&lt;br /&gt;
&lt;br /&gt;
3. Do a little bit of soul-sucking work first thing in the morning every day. &amp;nbsp;This keeps it from building up leading to dreaded days that are totally eaten up by it. &amp;nbsp;Spending a full day on bureaucratic maintenance to dig out is something that (I sincerely hope) no grad student ever has to grapple with it. &amp;nbsp;I treat this like the nuclear option; it&#39;s a clear sign that I&#39;ve made a bunch of time management mistakes in the previous week/month.&lt;br /&gt;
&lt;br /&gt;
4. Keep on top of email. &amp;nbsp;Even when traveling or facing down a deadline. &amp;nbsp;(See above.)&lt;br /&gt;
&lt;br /&gt;
One thing I&#39;m still trying to get my arms around is managing my time for priorities of different durations. &amp;nbsp;Handling tasks for the day or week is pretty easy. &amp;nbsp;Pegging against milestones like paper and grant deadlines, presentations at conferences or grant meetings, is pretty easy. It&#39;s harder for me too keep a longer perspective and give enough weight to the task that&#39;s not needed in 2 weeks, but one that might have impact in 2 months or 2 years. &amp;nbsp;This medium- to long-view is harder to maintain and harder to integrate into daily priorities.&lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
Get a little better every day.&lt;/div&gt;
</description><link>http://spokenlanguageprocessing.blogspot.com/2014/08/things-i-didnt-know-before-becoming.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-597132368452390154</guid><pubDate>Tue, 08 Jul 2014 05:28:00 +0000</pubDate><atom:updated>2014-07-08T17:38:39.054-07:00</atom:updated><title>Things I didn&#39;t know before becoming a professor: #4 Different types of collaboration</title><description>There&#39;s a romantic and resilient fantasy of the scientist toiling away in isolation. &amp;nbsp;He (sad to say, it&#39;s almost always he) emerges from the lab, bleary-eyed and triumphant and shares his genius with the world. &lt;br /&gt;
&lt;br /&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
Da Vinci, Frankenstein, Edison, Einstein, Every Evil Genius, Zuckerberg&lt;/div&gt;
&lt;br /&gt;
Our science narrative has expanded to pairs of geeks in garages&lt;br /&gt;
&lt;br /&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
Hewlett &amp;amp; Packard, Gates &amp;amp; Allen, Jobs &amp;amp; Wozniak, Brin &amp;amp; Page&lt;/div&gt;
&lt;br /&gt;
but hasn&#39;t yet embraced the fundamental reality of modern science: most work is done in collaboration. &amp;nbsp;Most great big ideas are the product of many great small ideas. &lt;br /&gt;
&lt;br /&gt;
This is why there&#39;s no satisfying answer to questions like &quot;who invented the atomic bomb&quot;. &amp;nbsp;Was it Fermi? Einstein? Oppenheimer? &amp;nbsp; Of course Al Gore didn&#39;t invent The Internet, but who did?&lt;br /&gt;
&lt;br /&gt;
As a graduate student, you are, at worst, Igor, the hunchbacked assistant doing all the work and getting none of the credit. &amp;nbsp;At best, you are a near-equal collaborator with your advisor. &amp;nbsp; &amp;nbsp;But there&#39;s a sea of collaboration waiting once you&#39;re out of the nest and on to the next step. &lt;br /&gt;
&lt;br /&gt;
Since becoming a professor I&#39;ve noticed a handful of types of collaboration. From intimate to remote, they each have a role to play.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Peer Collaboration&lt;/b&gt;&lt;br /&gt;
This can be the most satisfying and most productive way to work with a partner or team. &amp;nbsp;You sit in the same room, or with a chat window open. &amp;nbsp;You talk through ideas, big and small. You tackle the parts of the work you&#39;re most suited to, or most excited by. &amp;nbsp;You debug code together. You run experiments together. Try to make sense of your shared results. You iterate over writing papers together. &lt;br /&gt;
&lt;br /&gt;
It can also be the most frustrating. &amp;nbsp;A group only walks as fast as its slowest member. &amp;nbsp;If you don&#39;t work well with your partners, for any reason, the stress is visible almost immediately. &amp;nbsp;If the partnership is strong, you figure it out, and get back to work. &amp;nbsp;If it&#39;s not, you&#39;ll know pretty quickly.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Mentor-Student Collaboration&lt;/b&gt;&lt;br /&gt;
There&#39;s a similar intimacy in this collaboration, but there are some very clear differences. &amp;nbsp;First the pace of collaboration is typically slower. &amp;nbsp;At slowest, you may only work with your student only once a week. &amp;nbsp;Even if you&#39;re in regular email contact it&#39;s only a couple of times a day. &amp;nbsp;Typically it&#39;s only during the most intense periods of an evaluation or paper deadline does the collaboration reach the level of immediacy and proximity of peer collaboration.&lt;br /&gt;
&lt;br /&gt;
The division of labor here is more driven by the hierarchical nature of the relationship. &amp;nbsp;The mentor typically has a broader vision of the work, while the student has a better grasp of the details. &amp;nbsp;(Or if they don&#39;t, it&#39;s part of the responsibility and training to learn about the details.) &amp;nbsp;The mentor has the responsibility of contextualizing the work, either into a broader research agenda that will include the students thesis as an offshoot, or into a broader project or publication. &amp;nbsp; At the start of the relationship, the mentor is typically responsible for guiding the research, determining which research questions are most fruitful and&lt;br /&gt;
&lt;br /&gt;
Over the course of graduate study, a good mentor-student collaboration will take on more characteristics of a peer collaboration, with the student ultimately initiating most of the research ideas. &amp;nbsp;But there&#39;s always going to be a level of deference that the mentor receives.&lt;br /&gt;
&lt;br /&gt;
-----&lt;br /&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
As a graduate student, I had a lot of experience with the previous two types of collaboration. I even had some exposure to the next three, but so much as to be able to identify the differences, or how they impact the work.&lt;/div&gt;
&lt;div&gt;
-----&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;Project Collaboration&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
On medium-to-large multifaceted projects there are frequently multiple PIs (principal investigators) each with independent research groups. &amp;nbsp;Collaboration across sites towards a common goal is different and challenging. &amp;nbsp;The PIs coordinate with some frequency. &amp;nbsp;There are bureaucratic reasons for this -- mostly around budgets and report writing -- but coordination between students is much less common. &amp;nbsp; &amp;nbsp;Each of the individual research groups have their own broad responsibilities, timeframes and agendas. &amp;nbsp;Each are balancing a number of projects, student milestones (qualifying exams, etc.), and specific personnel decisions. &amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
The biggest difference in this kind of collaboration, in my experience, is the pace. &amp;nbsp;The interaction between groups when it comes to research is infrequent. &amp;nbsp;Sometimes just a few times a month. &amp;nbsp;Another difference is in the level of detail. &amp;nbsp;When the groups share their work, it is usually at a relatively high level -- far removed from unsuccessful approaches and interesting failures. &amp;nbsp; &amp;nbsp;There are opportunities to get advice at a high level about directions and plans, but close collaboration about specific decisions and approaches is much less common. &amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
This is completely understandable. The nature of this collaboration demands it. &amp;nbsp;These projects are big enough to demand a large number of people working on them. &amp;nbsp;This task oriented partitioning into research groups is probably the best way to manage the effort. &amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
However, there&#39;s the time when the rubber meets the road. &amp;nbsp;This is usually in the form of deliverables, where you have to send a report or code to a funder, evaluations, where all of the moving parts that the groups have developed have to work together, or presentations at site visits or PI meetings, where you have to show that you&#39;re all working towards a common goal.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Often, these moments demand closer coordination and collaboration. &amp;nbsp; &amp;nbsp;Having made a few mistakes. The lesson I&#39;ve learned is to stop and drill down. &amp;nbsp;Take meetings offline to deal with discrete questions and issues. &amp;nbsp;Get the students involved in these meetings. &amp;nbsp;If at all possible, visit your partners. With your students. &amp;nbsp;One hour face to face can take the place of a few days worth of emails.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;b&gt;Social Collaboration&lt;/b&gt;&lt;br /&gt;
Still another step more removed is the kind of collaboration you have with the scientific community which you&#39;re a part of. &amp;nbsp;This is the sort of collaboration where you have a conversation with someone about an idea that one of you is chewing on. &amp;nbsp;You bat the idea around for a little while, maybe hear about a technique you didn&#39;t know about or hadn&#39;t thought of, and then the conversation moves on. &amp;nbsp;I&#39;ve found that this kind of collaboration occurs with someone you know. &amp;nbsp;Either you know each other&#39;s work, or you have a mutual colleague to bring you together.&lt;br /&gt;
&lt;br /&gt;
Most of the time this type one-off micro-collaboration ends with a single conversation. &amp;nbsp;Sometimes it results in shared data or tools. &amp;nbsp;But given the right alignment of circumstances it can foster a more serious coming together. Usually just an extended conversation over email, or a phone/skype conversation, but if fate is really on your side, this can grow into a paper or a grant proposal. But by this point, you&#39;ve switched categories and you&#39;re working much more closely.&lt;br /&gt;
&lt;br /&gt;
I find that this collaboration is most available during visits to other institutions and at conferences. Frankly, this kind of collaboration is often the most valuable thing I take away from conferences.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;&quot;Public&quot; Collaboration&lt;/b&gt;&lt;br /&gt;
If you&#39;re doing it right, people are going to know about your work. &amp;nbsp;People will email you with questions and comments. &amp;nbsp;You will get queries about how to use a tool you wrote, or (maybe more frequently) a bug report or feature request. &amp;nbsp; Through this communication you are supporting other people&#39;s research. &amp;nbsp;This rarely turns into a more substantial collaboration. &amp;nbsp; But if it means that people will cite your work.&lt;br /&gt;
&lt;br /&gt;
Here, the collaboration is somewhat lopsided. &amp;nbsp;Essentially your prior effort of writing a paper or tool are supporting someone&#39;s current research. &amp;nbsp;But this needs some additional support from your current self, either in clarification or amendment. &amp;nbsp; The upside is that if the system is working, you can garner this support as much as you give it out.&lt;br /&gt;
&lt;br /&gt;
Here, I find the challenge is to appropriately prioritize these interactions. &amp;nbsp;More frequently than I&#39;d like to admit, these are the emails that filter to the bottom of my inbox. &lt;br /&gt;
&lt;br /&gt;
If you&#39;re reading this, and I still haven&#39;t gotten back to you, I&#39;m sorry. &amp;nbsp;I know these interactions are important. &amp;nbsp;I&#39;m working on it.</description><link>http://spokenlanguageprocessing.blogspot.com/2014/07/things-i-didnt-know-before-becoming.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-6128109424970860132</guid><pubDate>Wed, 21 May 2014 17:03:00 +0000</pubDate><atom:updated>2014-05-21T10:10:48.034-07:00</atom:updated><title>Things I didn&#39;t know before becoming a professor: #3 How to run a research group</title><description>At this point, I&#39;ve &lt;a href=&quot;http://spokenlanguageprocessing.blogspot.com/2014/04/things-i-didnt-know-before-becoming_27.html&quot;&gt;figured out my classes&lt;/a&gt;&amp;nbsp;and&amp;nbsp;&lt;a href=&quot;http://spokenlanguageprocessing.blogspot.com/2014/05/things-i-didnt-know-before-becoming.html&quot;&gt;secured a little bit of funding&lt;/a&gt;. &amp;nbsp;Now it&#39;s time to staff up the group and get to the work of research. &lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;
Every researcher I know keeps a list of open questions and tasks that they want to get to next. &amp;nbsp;This list is a best friend, and a worst enemy. &amp;nbsp;This list is a hydra. &amp;nbsp;Every time an element is taken off, two more appear in its place. &amp;nbsp;To tame the beast, it takes a group effort.&lt;br /&gt;
&lt;br /&gt;
It&#39;s no revelation to recognize that management is different from labor. &amp;nbsp;I&#39;m sure if I had an MBA instead of (in addition to?!) a PhD, some of these issues would be more intuitive. &amp;nbsp;But I didn&#39;t, and most professors don&#39;t. &amp;nbsp;We&#39;re self-taught.&lt;br /&gt;
&lt;br /&gt;
When the research group is firing on all cylinders, it feels like my work is having a multiplicative effect, that I&#39;m lifting more than I possibly could alone. &amp;nbsp;When it&#39;s not, it&#39;s demoralizing. I try not to get sucked into this, but there are days when it feels like I could do more in a cave with a laptop, an internet connection and coffee.&lt;br /&gt;
&lt;br /&gt;
These are some of the things about running a group that I didn&#39;t know before becoming a professor.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;How to organize a research agenda&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
At my thesis proposal, &lt;a href=&quot;http://www1.cs.columbia.edu/~kathy/&quot;&gt;Kathy McKeown&lt;/a&gt; asked me the hardest question: &quot;What do you want to be famous for?&quot; &amp;nbsp; It totally blindsided me. &amp;nbsp;I laughed uncomfortably, and looked to my other committee members for acknowledgement that this was an outrageous proposition. &amp;nbsp;No help was coming. &amp;nbsp;I have no idea what my answer was, but it&#39;s the only question I remember from my candidacy exam, proposal or defense.&lt;br /&gt;
&lt;br /&gt;
A clear answer for this question, an organizing principle for the research that your lab will do, brings clarity and identity to the group. &amp;nbsp;It represents a research agenda that guides the work. &amp;nbsp;A clear research agenda serves a similar role as a mission statement and business plan. &amp;nbsp;It communicates your goals internally and externally.&lt;br /&gt;
&lt;br /&gt;
This also helps with recruitment -- if students and postdocs know exactly what kinds of questions you are interested in, you will attract people who are interested in these questions, and people who are capable of working on them. &amp;nbsp;The same principle applies to students who are looking to do independent studies, undergraduate or master&#39;s theses with you.&lt;br /&gt;
&lt;br /&gt;
I&#39;ve generally under-appreciated the value of this focus. &amp;nbsp;Personally, I&#39;ve benefitted from broad research interests. &amp;nbsp;I get a lot of satisfaction by working on a lot of different things. My &lt;a href=&quot;http://scholar.google.com/citations?user=40bq19cAAAAJ&quot;&gt;most cited paper&lt;/a&gt; wasn&#39;t even cited in my &lt;a href=&quot;http://www1.cs.columbia.edu/~amaxwell/amaxwell-thesis-final.pdf&quot;&gt;dissertation&lt;/a&gt;. But I think I&#39;ve also suffered from this generalist approach. &amp;nbsp;I may have had greater impact on the questions I find most interesting had I limited my activities on the second tier. &amp;nbsp;I may have made more progress in getting &quot;famous&quot; for something.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;How to engender community and collaboration&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Being a graduate student can be isolating. &amp;nbsp;There are some classes. &amp;nbsp;A weekly meeting or two that you need to show up for. &amp;nbsp;But most of your work is alone. &lt;br /&gt;
&lt;br /&gt;
How do you take a group of students who are organized only by you, the professor, and help them communicate, support each other, and become colleagues? &amp;nbsp;I certainly had no idea.&lt;br /&gt;
&lt;br /&gt;
Some things I&#39;ve tried:&lt;br /&gt;
&lt;br /&gt;
Taking everyone to lunch. &amp;nbsp;this didn&#39;t work so well. &amp;nbsp;it probably would work now, but at the time the students didn&#39;t know each other well enough and it was just pretty awkward.&lt;br /&gt;
&lt;br /&gt;
Shared tasks. &amp;nbsp;Inviting a group of students to work together on a short term project is the single best thing I&#39;ve done to get students to trust each other and exercise their basic research skills. &amp;nbsp;I use the interspeech paralinguistics challenges for these. &amp;nbsp;I also invite other computational linguistics students to participate from the linguistics and cs programs. &amp;nbsp;It&#39;s a really fun exercise and the students get a lot out of it.&lt;br /&gt;
&lt;br /&gt;
Weekly lab meetings. &amp;nbsp;One student gives a talk every week. It gives a safe space for students to practice giving presentations (and giving feedback on presentations), and it makes sure that everyone knows what everyone else is working on.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;How to make and manage a budget&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
This is my least favorite part of the job. Making a budget is fine. &amp;nbsp;It&#39;s just making puzzle pieces fit. &amp;nbsp;But inevitably there is some wrinkle to sticking to it. &amp;nbsp;Travel costs are more or less than budgeted. &amp;nbsp;At CUNY tuition is highly variable based on what year a student is and how many courses they are taking.&lt;br /&gt;
&lt;br /&gt;
And then there is interacting with admins, grants, financial officers and contract lawyers. &amp;nbsp;These people are invaluable. &amp;nbsp;When the relationship is good, your life is easy. &amp;nbsp;When there are delays it can make sticking to the timing of a budget very very very difficult.&lt;br /&gt;
&lt;br /&gt;
Having an eye for detail and the patience to use it really comes into play here.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Not everyone knows what I know&lt;/b&gt;&lt;br /&gt;
&lt;div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
The next three observations are less about specific skills, and more about the personal effects of leading a group.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
As a new professor with new graduate students, you know more than any other research group member. &amp;nbsp;This isn&#39;t a &quot;smartest person in the room&quot; thing (though sometimes it is), but an experience thing. &amp;nbsp;The professor has read more papers, seen more talks, (may have executed more experiments) and knows more about the topic.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
The list of things that you want to work on next are based on your previous work, your thinking and your instincts.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
In order for a student to pick up and work on a task, they need to get up to speed, and fast.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
It is too easy to forget that the people working with me don&#39;t know what I know.&amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;How to delegate&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
It became clear to me early on that not all CS professors still program. &amp;nbsp;This struck me as a scary idea. &amp;nbsp;First of all, I like to program, and I&#39;m reasonably good at it. &amp;nbsp;Secondly, it seems like distance between a professor and the nuts and bolts of research is a dangerous thing.&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
I was talking with &lt;a href=&quot;http://www.ee.columbia.edu/~dpwe/&quot;&gt;Dan Ellis&lt;/a&gt; shortly after I had realized this. &amp;nbsp;(Nevermind that Dan is a EE professor.) &amp;nbsp;I asked him if he still coded himself. &amp;nbsp;He had just finished a project over the summer and was happy with it and the process, but then he described it as &quot;indulgent&quot;. &amp;nbsp;why &quot;indulgent&quot;? &amp;nbsp;because, according to Dan, he should be working on writing grants, papers and reports, executing the broad vision of the lab, while the students benefit from the programming experience. (Dan, if you read this, please forgive mistakes by paraphrase, and the fog of memory.)&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
The responsibility to delegate work to students is one that I haven&#39;t quite internalized. &amp;nbsp;I still run a lot of experiments myself. I write a good deal of code still. &amp;nbsp;And when we&#39;re under close deadlines, I have a tendency to tell students to walk away, and handle the final push myself. &amp;nbsp;I&#39;m not yet prepared to call this &quot;indulgent&quot;, but it I&#39;m ready to acknowledge it as a bad habit.&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;
&lt;b&gt;How to be a mentor&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Internalizing the role of &quot;mentor&quot; is much more difficult than &quot;teacher&quot;. &amp;nbsp;Teaching is something that you do a few times a week, but it&#39;s a hat you take off. &amp;nbsp;Mentoring, or advising graduate students, is an ongoing process.&lt;br /&gt;
&lt;br /&gt;
I&#39;m the most visible example of what a professor is for my students. &amp;nbsp;How I do this job impacts how they will. &amp;nbsp;(Just as &lt;a href=&quot;http://www.cs.columbia.edu/~julia/&quot;&gt;my advisor&lt;/a&gt; is my first and best example that I draw from when advising students.)&lt;br /&gt;
&lt;br /&gt;
My behavior as a mentor impacts how they&#39;ll behave as students. If I respond to emails at 1am, I will get emails at 1am. &amp;nbsp; &amp;nbsp;If I don&#39;t communicate what I expect from them, they won&#39;t know.&lt;br /&gt;
&lt;br /&gt;
In addition to my advisor, I&#39;ve had a number of important mentors. &amp;nbsp;There are two lessons I&#39;ve taken from those relationships.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Protect your advisees. &lt;/i&gt;&amp;nbsp;There are plenty of external challenges -- defenses, negative paper reviews, job postings, evaluations -- that a student will participate in. &amp;nbsp;They do most of the work. &amp;nbsp;I feel like it&#39;s my job to make sure they&#39;re set up for success. &amp;nbsp;(Only submit papers that you would accept. Don&#39;t volunteer them for too many responsibilities.) &amp;nbsp;When things blow up, absorb the blow. &amp;nbsp;Outwardly, take responsibility for mistakes, figure it out with the student internally. &amp;nbsp;If the students feel safe, they&#39;ll be enabled to do good work.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Be a happy warrior.&lt;/i&gt; &amp;nbsp;There are parts of the job that are difficult. &amp;nbsp; It takes long hours and drive. &amp;nbsp; &amp;nbsp;And you&#39;re in this job for a reason. &amp;nbsp;You want to discover, and share those discoveries. You want to contribute to Knowledge. &amp;nbsp; As a student, I fed on the enthusiasm of my mentors. &amp;nbsp;While it&#39;s easy and appropriate to feel a kinship with students, I feel a responsibility to set a tone that &lt;i&gt;this is valuable, &lt;/i&gt;but also &lt;i&gt;this is fun.&lt;/i&gt;&amp;nbsp; And it is.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2014/05/things-i-didnt-know-before-becoming_21.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-7002643312541677785</guid><pubDate>Mon, 05 May 2014 19:38:00 +0000</pubDate><atom:updated>2014-05-05T15:23:21.019-07:00</atom:updated><title>Things I didn&#39;t know before becoming a professor: #2 How to Write a Grant</title><description>To get tenure in computer science, you have to secure external funding. &lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
I don&#39;t know how important this is in other disciplines. &amp;nbsp;I&#39;m sure there are administrations that will say that external funding is only one of a number of criteria that determine tenure decisions. &amp;nbsp;I&#39;m sure they&#39;re right. &amp;nbsp; But without funding, you&#39;re at a severe disadvantage. You can&#39;t work with grad students (as easily -- some programs provide student funding through other means). &amp;nbsp;Travel to conferences -- where you present your work and solidify relationships with important people in the field (who may end up writing letters in support of your promotion) -- is expensive. &amp;nbsp;And then there&#39;s the brass tacks. &amp;nbsp;Depending on institution and some uninteresting details, the administration receives somewhere in the ballpark of 1/3 to 1/2 of the funds that you secure. &amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Before becoming a professor, I hadn&#39;t written a grant proposal. I had read a couple of successful proposals that I had worked under as a student, but I hadn&#39;t been a part of the preparation of any.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
A little bit of context before I share what little I&#39;ve learned about writing grants.&lt;/div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
I&#39;ve been pretty successful in securing &lt;a href=&quot;http://speech.cs.qc.cuny.edu/grants.html&quot;&gt;funding&lt;/a&gt; over the last five years.&lt;/div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
I don&#39;t expect this trend to continue forever.&lt;/div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
I recently received an NSF CAREER award.&lt;/div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
But my previous two proposals were not funded.&lt;/div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
I&#39;m hardly an expert, but I&#39;ve gotten better since starting.&lt;/div&gt;
&lt;br /&gt;
Here are a few things I didn&#39;t know&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;How many funding opportunities there are?&amp;nbsp;&lt;/b&gt;&lt;br /&gt;
I knew about NSF and NIH. I knew about DARPA because I worked on a grant as a student. &amp;nbsp;I didn&#39;t know how many programs NSF has (there&#39;s no way I know about all of them). &amp;nbsp;I didn&#39;t know about the other DoD agencies that fund basic research. &amp;nbsp;I still don&#39;t know about all of the foundations and industrial grants and awards that are available. &amp;nbsp; I&#39;m sure I&#39;m still missing some, but each of IBM, Google, Microsoft offer programs to support faculty. &amp;nbsp;On top of that, there are institutional awards -- CUNY has a handful of them, and if we do, I&#39;m sure everyone else does. &amp;nbsp;This doesn&#39;t mean that it&#39;s easy to find funding. &amp;nbsp;But by keeping an open eye, asking around, and being lucky enough to be asked to join grant writing efforts, I&#39;ve been surprised by how many opportunities are available (in this field). &lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;How to tell the difference between a good proposal and a not-good-enough proposal&lt;/b&gt;&lt;br /&gt;
Friends and colleagues will offer to share successful grants with you. &amp;nbsp;Take them up on it. &amp;nbsp;But learning from only positive examples is challenging. &amp;nbsp;It can be challenging to figure out how to translate structure and presentation cues from one grant to another. &amp;nbsp;Also, it&#39;s not always clear what made these examples impress the reviewers. &amp;nbsp; If you&#39;re so bold, ask these people to share the reviews along with the grant. &amp;nbsp;I haven&#39;t tried this, and I&#39;d have to trust a person quite a bit to share my highlighted flaws with them, but give it a shot.&lt;br /&gt;
&lt;br /&gt;
Still better is getting to look at unsuccessful proposals and their associated reviews. &amp;nbsp;Everyone you know has one or two of these kicking around. &amp;nbsp;Funding rates are pretty low, say 20% or so, so most people will have a stack of failures for each success they have. &amp;nbsp;But again, it takes moxie to ask someone for this, and a lot of trust to offer it. &amp;nbsp;(I don&#39;t know if this happens ever, but I&#39;d love to hear if other junior faculty were able to read unfunded proposals as they started writing themselves.)&lt;br /&gt;
&lt;br /&gt;
The best way to see a lot of examples of grants, both positive and negative, and their reviews is to get on a grant reviewing panel. &amp;nbsp;NSF Program Managers like to include junior faculty in their panels. &amp;nbsp;Call one who funds work like yours, and have a conversation. &amp;nbsp;I was invited to one in Spring 2010. &amp;nbsp;It was an opportunity to closely read about 5-6 proposals, and more casually review another 12 or so. &amp;nbsp;Seeing all the ways a basically good idea could not get funded was eye-opening. &amp;nbsp;Generally, the quality is high, so the differentiation between successful and unsuccessful proposals can be the depth of their weaknesses rather than the height of their strengths.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;That writing a grant is writing science fiction&lt;/b&gt;&lt;br /&gt;
It&#39;s believable, near-future science fiction, but still in the genre. &amp;nbsp; You&#39;re describing something that doesn&#39;t exist, but will in the next few years.&lt;br /&gt;
&lt;br /&gt;
This perspective has made the process of grant a &lt;i&gt;lot&lt;/i&gt; more entertaining. &lt;br /&gt;
&lt;br /&gt;
Science writing has a narrative arc. &amp;nbsp;The introduction and motivation of your work should be exciting. &amp;nbsp;The feeling I want to leave a reader with is somewhere between, &quot;Of course that&#39;s what will happen&quot;, and &quot;Wouldn&#39;t it be cool if that&#39;s what the world was like.&quot; &amp;nbsp;Balancing these two extremes is the difference between the Scylla and Charybdis of incrementality (&quot;boring&quot;) and overreaching (&quot;i don&#39;t believe you can do this&quot;).&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;How to keep it together after getting a grant rejected&lt;/b&gt;&lt;br /&gt;
It&#39;s only natural to be offended, embarrassed, insulted, frustrated, and sad when a proposal that you spent months of your life on is not funded. &amp;nbsp; The panel was clearly full of idiots who didn&#39;t understand your genius. &amp;nbsp;There was some mistake, if only they had read more closely they would understand how important this work is.&lt;br /&gt;
&lt;br /&gt;
Here&#39;s what rejection has taught me:&lt;br /&gt;
&lt;br /&gt;
Take a deep breath.&lt;br /&gt;
&lt;br /&gt;
Most proposals aren&#39;t funded. &lt;br /&gt;
&lt;br /&gt;
The folks who review your grants are almost universally qualified to do so.&lt;br /&gt;
&lt;br /&gt;
If a proposal is funded, you are not a genius. &amp;nbsp;If a proposal is not funded, you are not an idiot.&lt;br /&gt;
&lt;br /&gt;
The sooner you get over it, the better. Remember, it&#39;s the work that&#39;s being reviewed, those crammed 15 pages of science fiction, not your identity. &lt;br /&gt;
&lt;br /&gt;
And there&#39;s a silver lining: almost always they will include a stack of written reviews. These reviews are a gold mine. &amp;nbsp;Treat them as a genuinely helpful consolation prize. &amp;nbsp;If you don&#39;t understand what they&#39;re saying, you can probably talk to a program manager about them.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;
When revising a proposal, I take all of the reviews, and cut out everything positive that they had to say. &amp;nbsp;It&#39;s too easy to be comforted by that. &amp;nbsp;What I really need to know is what didn&#39;t work. &amp;nbsp;That&#39;s what needs work.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;How to find writing advice&lt;/b&gt;&lt;br /&gt;
I still don&#39;t have a lot of confidence in my grant writing. &amp;nbsp;But there are a lot of people who do, who will share their advice with you. &amp;nbsp;Almost all institutions have grant writing workshops. &amp;nbsp;NSF hosts workshops. &amp;nbsp;There are posts like &lt;a href=&quot;http://www4.ncsu.edu/unity/lockers/users/f/felder/public/Columns/Career-Award.html&quot;&gt;this&lt;/a&gt;, and &lt;a href=&quot;http://www.cs.cmu.edu/~sfinger/advice/advice.html&quot;&gt;this&lt;/a&gt;, and &lt;a href=&quot;http://xsrv.mm.cs.sunysb.edu/300/lectures/proposal.pdf&quot;&gt;this&lt;/a&gt;, written by people who are truly qualified about the process to really steer you right. &amp;nbsp;Writing with a group of people helps. &amp;nbsp;Collaborative proposals come with their own logistical challenges, but it can be helpful for getting feedback on the writing.&lt;br /&gt;
&lt;br /&gt;
Some graduate students write applications for fellowships, internships and other sources of funding. &amp;nbsp;This is great practice, but this is different from the grant process. &amp;nbsp;Maybe the best way for students to get exposure and experience at grant writing is through their advisor&#39;s proposals. &amp;nbsp;I think some professors work with their students on their proposals. &amp;nbsp;If the student is a strong enough writer, and far along in their dissertation, I could see this making a lot of sense for both the professor and student.&lt;/div&gt;
&lt;div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;/div&gt;
</description><link>http://spokenlanguageprocessing.blogspot.com/2014/05/things-i-didnt-know-before-becoming.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-7830850003176188192</guid><pubDate>Mon, 28 Apr 2014 04:02:00 +0000</pubDate><atom:updated>2014-04-27T21:02:29.526-07:00</atom:updated><title>Things I didn&#39;t know before becoming a professor: #1 How to Teach</title><description>In order to be a professor* you need a PhD. &amp;nbsp;The two responsibilities you have as a professor are to do good research and to teach well.&lt;br /&gt;
&lt;br /&gt;
To get a PhD, you need to do good research, so you&#39;re well equipped to handle this.&lt;br /&gt;
&lt;br /&gt;
To get my PhD, I did not have to teach. &amp;nbsp;I had TA&#39;d a few times. &amp;nbsp;But I had put in 10 years in college/grad school. &amp;nbsp;I had taken countless classes, some with great teachers, some less great. &amp;nbsp;So I figured no big deal; I can teach.&lt;br /&gt;
&lt;br /&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
I&#39;ve seen good movies. &amp;nbsp;I&#39;ve got a good story. &amp;nbsp;I can make a good movie.&lt;/div&gt;
&lt;div style=&quot;text-align: right;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
These are some things I didn&#39;t know about teaching before becoming a professor. &lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;You are television.&lt;/b&gt;&lt;br /&gt;
I learned this from Michael Cirino in&amp;nbsp;&lt;a href=&quot;http://arazorashinyknife.com/&quot;&gt;a very different context&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
For the time that you are in front of students, you are television. &amp;nbsp;You are putting on a show. If your students don&#39;t engage with you, your students won&#39;t learn anything beyond what they get in a book. &amp;nbsp; &amp;nbsp; That&#39;s not to say that you&#39;re entertainment, but you are performing.&lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
My act isn&#39;t strong yet, but it&#39;s getting better.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;b&gt;Don&#39;t skimp on the basics.&lt;/b&gt;&lt;/div&gt;
&lt;div&gt;
In an early version of a Machine Learning class, I decided that I wanted a lecture on spectral clustering. It was a topic I was interested in, but didn&#39;t have a ton of experience with. I read a lot. I worked out math. &amp;nbsp;By the time I had a lecture ready, I was feeling good about it. &amp;nbsp;I went to class, delivered my A material....blank stares. &amp;nbsp;It was a total dud.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
It wasn&#39;t that the lecture was bad, but I hadn&#39;t earned it. &amp;nbsp;Because I wanted to get to a topic that was exciting to me, I had rushed through or omitted a lot of background material. &amp;nbsp;I had completely set myself up for failure. &amp;nbsp;When I went back to figure out where I had went wrong, I realized it was months earlier, when I was planning the syllabus.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Every time I teach a new class, I tell myself that I&#39;m going to have half a dozen or so lectures ready to go by the first day of class. &amp;nbsp;Sometimes I hit this number, usually I don&#39;t. &amp;nbsp;But I&#39;ve come to realize that lectures are a lot easier if you have a clear structure ready for the class. &amp;nbsp;Lately, I&#39;ve become less concerned about having complete lectures prepared. &amp;nbsp;Instead of writing a few complete lectures, I try to have a fairly detailed outline of every class. &amp;nbsp;This helps me focus on the big picture.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
When I have a clear direction for how the pieces fit together, the course is better. &amp;nbsp;Even if that means I don&#39;t get to the most exciting material, or have an excuse to learn something new.&amp;nbsp;&lt;/div&gt;
&lt;br /&gt;
&lt;b&gt;Preparing a lecture takes an unbelievable amount of time.&lt;/b&gt;&lt;br /&gt;
Truly unbelievable. &amp;nbsp;In my first semester, it took 8-10 hours to prepare each 75 minute lecture. &amp;nbsp;Add in writing assignments and exams, grading, office hours, and 2.5 hours in front of students. &amp;nbsp;Thankfully, I was only teaching one class. &amp;nbsp;But at CUNY a full load is 3 classes in a semester. &amp;nbsp; &lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Most of your time is spent with students who are struggling.&lt;/b&gt;&lt;br /&gt;
&quot;Do you learn from your students?&quot; No. I teach them. &amp;nbsp;&quot;Are you inspired by your students?&quot;&lt;br /&gt;
&lt;br /&gt;
My PhD students are great. &amp;nbsp;They bring some exciting ideas and papers, and there&#39;s a collaborative learning that happens there. &amp;nbsp;They inspire me and drive me. &amp;nbsp;Absolutely.&lt;br /&gt;
&lt;br /&gt;
Students in class, I have a very limited and lopsided relationship with. &amp;nbsp;I don&#39;t think I&#39;ve ever had an office hours meeting with a student that took material from class and took it a step further. &amp;nbsp;It&#39;s almost always something to the effect of going over material from the previous lecture or exercise in more detail. &amp;nbsp;There&#39;s immense satisfaction from guiding someone to understanding challenging material, but it takes a lot of time.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Students cheat. &amp;nbsp;A lot.&lt;/b&gt;&lt;br /&gt;
I have had a student cheat in almost every course I&#39;ve taught. &amp;nbsp;(This isn&#39;t unique to CUNY. Ask around.) &lt;br /&gt;
&lt;br /&gt;
The cheating meeting is the most emotional human experience, I&#39;ve had with anyone other than a family member or romantic partner.&lt;br /&gt;
&lt;br /&gt;
The student usually cries.&lt;br /&gt;
&lt;br /&gt;
The student usually tries to negotiate out of the repercussions.&lt;br /&gt;
&lt;br /&gt;
Denial? &amp;nbsp;Not so much. &amp;nbsp;Most people own up to it pretty quickly. &amp;nbsp;Forceful denial is a red flag for me. Be open to the possibility that you made a mistake. Maybe one party knew about the cheating and the other didn&#39;t. &amp;nbsp;Maybe the similarities between two assignments really were random. &lt;br /&gt;
&lt;br /&gt;
I&#39;ve had over a dozen of these conversations. &amp;nbsp;Here&#39;s my best advice: &amp;nbsp;Be prepared. Be able to clearly explain how you know cheating happened. &amp;nbsp;Be able to point to your syllabus and university student handbook about the penalties for cheating. Absolutely document everything you can about the exchange. &amp;nbsp;Send the student an email after the fact, recapping the major points. &amp;nbsp; Expect that the student will scramble for an out -- some way to lessen the impact -- don&#39;t let them. &amp;nbsp;At this point, I have a loose script. It&#39;s almost as formulaic as a five-paragraph essay. &lt;br /&gt;
&lt;br /&gt;
A. It&#39;s clear to me that you cheated on this assignment/exam.&lt;br /&gt;
&lt;br /&gt;
B. Here&#39;s how I know. &amp;nbsp;(it&#39;s about here where they usually admit to it.)&lt;br /&gt;
&lt;br /&gt;
C. Because of this you will be getting a zero on the assignment/failing the class/getting expelled, and I will be send a letter with this information to the department. &amp;nbsp;(Or whatever your policy is.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Put it in writing. &amp;nbsp;Get it in writing.&lt;/b&gt;&lt;br /&gt;
Almost all I knew about teaching I learned from a video (VHS!) I was shown while at Columbia. &amp;nbsp;This was an impromptu sharing, it seemed like it was a tape that was passed around the CS department and shown by professors to their grad students. &amp;nbsp;It was a lecture by John Kender called something like &quot;How to Teach&quot;. &amp;nbsp;It was fantastic. &amp;nbsp;It&#39;s similar to &lt;a href=&quot;https://itunes.apple.com/us/itunes-u/teaching-professor-john-kender/id412490101?mt=10&quot;&gt;this video on iTunes&lt;/a&gt;. &amp;nbsp;Before you teach, watch it.&lt;br /&gt;
&lt;br /&gt;
The most significant lesson I remember from this lecture was that a syllabus is a contract, an assignment is a contract, an exam is a contract. &amp;nbsp;It is your responsibility to outline the terms of this contract as clearly as you can. &amp;nbsp;This is what you will learn from this class. This is what to expect from this course, assignment, exam. &amp;nbsp;If you do this, you will get this grade. &lt;br /&gt;
&lt;br /&gt;
Most of the difficulty I have had with students and disputes can be traced back to not being rock solid in the language used in a syllabus, or on an assignment, or (and this was a surprise) not putting things that were discussed in a meeting, in writing. &lt;br /&gt;
&lt;br /&gt;
If you make an arrangement outside your syllabus with a student around any element of your course, shoot them an email after the meeting recapping what was discussed. &amp;nbsp;I didn&#39;t know this before teaching, but wish I had.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;What now?&lt;/b&gt;&lt;br /&gt;
I&#39;ve definitely become a better teacher over the last five years. I&#39;ve made a lot of mistakes and I still do.&lt;br /&gt;
&lt;br /&gt;
Each time I repeat a course, I toy with its structure. My boiler plate syllabus has gotten tighter. &lt;br /&gt;
&lt;br /&gt;
The broader point is when I started, I was starting cold.&lt;br /&gt;
&lt;br /&gt;
There must be ways, programs, seminars, etc. that aim to teach people how to teach. &amp;nbsp;I was barely exposed to any as a graduate student; I know many of my peers weren&#39;t either. &amp;nbsp;Very little was available before I was in front of students for the first time. &lt;br /&gt;
&lt;br /&gt;
It&#39;s easy to gripe about a system that left me unprepared, but now i&#39;m on the other side of the equation. I have graduate students, some of whom will go on to be professors. &amp;nbsp;What can I (and my institution) do to make sure they have the skills to be good teachers when they land tenure-track jobs (which of course they all will).&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Practice. Practice. Practice.&lt;/b&gt;&lt;br /&gt;
For me, becoming a better teacher has taken practice. &amp;nbsp;I think that&#39;s the only way to learn how you are going to teach. &amp;nbsp;And there aren&#39;t enough opportunities to practice this. &amp;nbsp;(I&#39;ve taught Algorithms 4 times, and Machine Learning 3. &amp;nbsp;That may sound like a lot, but it&#39;s only 2 or 3 opportunities to revise structure, lectures and graded material.)&lt;br /&gt;
&lt;br /&gt;
At CUNY, graduate students teach a lot**, so many of my students will have experience in front of students. &amp;nbsp;The downside to this is 1) teaching takes a lot of time. &amp;nbsp;This means that they&#39;re not focusing on their research. &amp;nbsp;and 2) Often they&#39;re not given the responsibility/opportunity to design the class themselves. &amp;nbsp;Instead they teach a section of a larger course with a fixed set of assignments and exams. &amp;nbsp;This leaves students with plenty of experience lecturing, and leading discussions, but less experience with the mechanics of running a course (which is where your teaching lives or dies).&lt;br /&gt;
&lt;br /&gt;
I think one solution might be to have graduate students prepare and teach mini-courses, complete with syllabus, and graded assignments. &amp;nbsp;These should be short, maybe 6 or fewer meetings over a month or so. &amp;nbsp;This keeps the workload more manageable compared to teaching a full course. &amp;nbsp;But it would allow students to practice structuring material, and writing homeworks and exams. &amp;nbsp;They shouldn&#39;t be offered during regular course periods, but in summer or between terms.&lt;br /&gt;
&lt;br /&gt;
I think the best approach would be for this mini-course to be on the student&#39;s dissertation topic. &amp;nbsp;First of all, they&#39;ll already know a ton about it. &amp;nbsp;Second, if they go on to a tenure-track job, chances are they&#39;ll have an opportunity to reuse some of these lectures, either in a(nother) course of their own or at a conference tutorial. &amp;nbsp;Third, lecturing on a topic, and fielding questions can bring to light all the things you don&#39;t know or are unsure of. &amp;nbsp;But the practice would be useful even if it was on some other topic.&lt;br /&gt;
&lt;br /&gt;
The biggest problem I see with this idea is getting the incentives right. &amp;nbsp;To teach something like this takes a lot of work, and there&#39;s little reward. &amp;nbsp;Moreover, there&#39;s little incentive for other students to take one of these mini-courses (and to do the homeworks/assignments). &amp;nbsp; MIT has a thriving IAP program with a ton of activities and minicourses ranging from the &lt;a href=&quot;http://student.mit.edu/iap/fc22.html&quot;&gt;technical&lt;/a&gt; (some for credit) to the &lt;a href=&quot;http://student.mit.edu/iap/nc10.html&quot;&gt;slightly absurd&lt;/a&gt; to &lt;a href=&quot;http://www.mit.edu/~puzzle/&quot;&gt;one of my favorite things&lt;/a&gt;. &amp;nbsp;The IAP is well established in the MIT culture. &amp;nbsp;Can something similar be started up elsewhere?&lt;br /&gt;
&lt;br /&gt;
There&#39;s no way to do this through the university registrar without a lot of bureaucracy. &amp;nbsp;However, if a department, or division, were to unofficially &quot;bless&quot; this kind of activity by 1) including a list of course offerings, 2) document who taught what when, and 3) conferring completion &quot;certificates&quot; (and 4) finding teaching space), the publicity of a program like this could encourage students to participate on both sides.&lt;br /&gt;
&lt;br /&gt;
There are a lot of reasons that a program like this would fail to get off the ground, but if there were a mechanism for graduate students to get practice running courses in a relatively low-risk environment, I am confident that they would be better prepared for tenure-track positions.&lt;br /&gt;
&lt;br /&gt;
I would have been.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* tenure-track&lt;br /&gt;
** maybe too much, but that&#39;s a different discussion</description><link>http://spokenlanguageprocessing.blogspot.com/2014/04/things-i-didnt-know-before-becoming_27.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-8806600867139243185</guid><pubDate>Thu, 24 Apr 2014 21:44:00 +0000</pubDate><atom:updated>2014-07-09T19:34:54.419-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">academia</category><title>Things I didn&#39;t know before becoming a professor (and that i&#39;m still not very good at)</title><description>&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
July 2009. I deposited my dissertation.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
September 2009. I started a tenure-track position at CUNY.&lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Coming up on the close of my fifth year, I&#39;m convinced of a simple proposition.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div style=&quot;text-align: center;&quot;&gt;
I was unprepared to be a professor. &amp;nbsp;&amp;nbsp;&lt;/div&gt;
&lt;div style=&quot;text-align: center;&quot;&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
This is not a reflection on the academic preparation I had, or that my weaknesses went unnoticed by the search committee that hired me. &amp;nbsp;Neither do I think I&#39;ve done a particularly bad job over the last five years. &amp;nbsp;Rather, there is a disconnect between the skills that are required to become a professor and the skills that are needed to be a good professor.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
To land a tenure-track position you must:&lt;/div&gt;
&lt;div&gt;
&lt;ol&gt;
&lt;li&gt;&lt;b&gt;get a PhD&lt;/b&gt;&lt;/li&gt;
&lt;li&gt;get a solid publication record&lt;/li&gt;
&lt;li&gt;get good recommendations from important people&lt;/li&gt;
&lt;li&gt;give a good job talk&lt;/li&gt;
&lt;li&gt;be personable enough to not ruin your visit to campus&lt;/li&gt;
&lt;li&gt;get lucky (&lt;a href=&quot;http://agb.org/trusteeship/2013/5/changing-academic-workforce&quot;&gt;there are fewer tenure-track positions than in the past&lt;/a&gt;)&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;
(If you&#39;re very lucky, you&#39;ve got some funding when you walk in the door. &amp;nbsp;But this is a catch-22. &amp;nbsp;It&#39;s really hard to get funding until you&#39;re already a professor.) &amp;nbsp;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
The only one of these that is non-negotiable is having a PhD. &amp;nbsp;&lt;/div&gt;
&lt;div&gt;
In order to get a PhD* you must:&lt;/div&gt;
&lt;div&gt;
&lt;ol&gt;
&lt;li&gt;do good research.&lt;/li&gt;
&lt;li&gt;survive on little money and less sleep&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;
That&#39;s it.&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Over the last five years, I&#39;ve repeatedly found myself in situations where I have no idea how to do things that I am expected to do well. &amp;nbsp;I was a good candidate for a tenure-track position, but a mediocre professor.&amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Here&#39;s an incomplete list of things I didn&#39;t know before becoming a professor (and that I&#39;m still not very good** at).&lt;/div&gt;
&lt;div&gt;
&lt;ol&gt;
&lt;li&gt;how to teach&lt;/li&gt;
&lt;li&gt;how to write a (successful) grant&lt;/li&gt;
&lt;li&gt;how to head up a research group&lt;/li&gt;
&lt;li&gt;that project collaboration is different from research collaboration&lt;/li&gt;
&lt;li&gt;how to manage my time&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;
In all new jobs, there are things that you have to learn how to do, skills that get developed through practice. &amp;nbsp;But this isn&#39;t figuring out where the closest printer is, or how to fill out a TPS Report. &amp;nbsp;Most of these skills are central to the job. &amp;nbsp;There is a disconnect between the requirements to get a tenure-track job and the skills needed to do it well. &amp;nbsp;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
Over the next few weeks, I&#39;ll drill down on each of these. &amp;nbsp;Hopefully, this will be some comfort to other pre-tenure faculty members, and a preview for graduate students. &amp;nbsp;It&#39;s helpful to acknowledge these challenges and the gap between what we expect from graduate students and professors. &amp;nbsp;In a perfect world, this points to opportunity for graduate programs (including mine) to provide more support to better prepare good graduate students to become good professors. &amp;nbsp;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
* specific requirements vary by institution&lt;/div&gt;
&lt;div&gt;
** I&#39;ve gotten better... But mostly through missteps and course corrections.&lt;/div&gt;
</description><link>http://spokenlanguageprocessing.blogspot.com/2014/04/things-i-didnt-know-before-becoming.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-3122391606048355580</guid><pubDate>Wed, 05 Jun 2013 20:58:00 +0000</pubDate><atom:updated>2013-06-05T20:16:46.159-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">conference recap</category><category domain="http://www.blogger.com/atom/ns#">icassp</category><title>Deep Thoughts on ICASSP 2013</title><description>ICASSP 2013 is wrapping up today in Vancouver. &amp;nbsp;Unfortunately, I missed the last day (and sessions on speech synthesis and prosody that I would have enjoyed). &amp;nbsp;But a wedding on Saturday brought me back a day early.&lt;br /&gt;
&lt;br /&gt;
I hadn&#39;t been to ICASSP before, mostly due to timing oddities and writing grants over summers rather than writing papers that would hit the deadline. &amp;nbsp;It is a very large conference. &amp;nbsp;About twice as large as Interspeech. &amp;nbsp;But the scope is also much broader. &amp;nbsp;Speech and Language work made up at most 30% of the work at the conference. &amp;nbsp;And even this is generous, including machine learning, and other work on audio. &lt;br /&gt;
&lt;br /&gt;
So take this recap with a grain of salt. &amp;nbsp;I missed the last day of the conference, and my impressions are speech focused. &amp;nbsp;(I think I&#39;ve described all conference recaps as blind-men-and-the-elephant problems and this one is no exception.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Deep Learning.&lt;/b&gt;&lt;br /&gt;
OK, I pointed out that Deep Neural Nets were a &quot;hot topic&quot; at &lt;a href=&quot;http://spokenlanguageprocessing.blogspot.com/2012/09/interspeech-2012-recap.html&quot;&gt;last years Interspeech&lt;/a&gt;. &amp;nbsp;It&#39;s hard to believe it&#39;s possible, but they&#39;re even hotter now. &amp;nbsp;Geoffrey Hinton gave the first plenary talk. &amp;nbsp;This was followed by an oral session called &quot;Automatic Speech Recognition using Neural Networks&quot;, which was followed by a Special Session titled &quot;New Types of Deep Neural Network Learning for Speech Recognition and Related Applications&quot;. &amp;nbsp;The next morning, you could attend &quot;Acoustic Modeling with Neural Networks&quot;. &amp;nbsp;And this is just at the session level. &amp;nbsp;Even more applications of multilayer neural networks were scattered around other oral and poster sessions. &amp;nbsp;Some of these oral sessions were so crowded that people were standing along the walls and sitting in the aisles. &amp;nbsp;Nothing else that I saw received nearly so much attention.&lt;br /&gt;
&lt;br /&gt;
It&#39;s easy to view &quot;deep&quot; learning as a silver bullet -- the next great machine learning that will solve all of our problems. &amp;nbsp;It&#39;s almost certainly not. &amp;nbsp;However, a wide array of research groups are seeing similar impressive performance gains by using deep network models for a broad spectrum of spoken language processing tasks. &amp;nbsp;This is especially true for acoustic modeling in speech recognition. &amp;nbsp;Given this, deep learning shouldn&#39;t be ignored.&lt;br /&gt;
&lt;br /&gt;
Hinton&#39;s&amp;nbsp;&lt;a href=&quot;https://www.coursera.org/course/neuralnets&quot;&gt;coursera&lt;/a&gt;&amp;nbsp;course is a solid place to start. (Though resist drinking the kool-aid. &amp;nbsp;To my mind, perceptrons are bad approximations of neurons and worse approximations of the brain, and do little to advance our understanding of human intelligence.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Another highlight&lt;/b&gt;&lt;br /&gt;
One paper which caught my attention for its simplicity came out of Google: &quot;&lt;a href=&quot;http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/41158.pdf&quot;&gt;Language Model Verbalization for Automatic Speech Recognition&lt;/a&gt;&quot;. &amp;nbsp;Essentially &quot;verbalization&quot; is defined as a sort of inverse text-normalization. In text normalization for speech synthesis we have to translate &quot;10&quot; to &quot;TEN&quot;, and &quot;7:11&quot; to &quot;SEVEN ELEVEN&quot; or &quot;ELEVEN PAST SEVEN&quot;. &amp;nbsp;For ASR, the idea of verbalization is to convert decoding output of &quot;SEVEN ELEVEN&quot; into &quot;7:11&quot; or &quot;7-11&quot;. &amp;nbsp;Why bother? &amp;nbsp;Well, Google (and everyone else) has big language models based on text data. You could run a text normalizer over all of this data, but the proposition here is to convert the ASR output into a form that looks more like the source material in your language model.&lt;br /&gt;
&lt;br /&gt;
The Verbalizer solution to this problem is remarkably elegant. &amp;nbsp;A traditional WFST decoder can be expressed as&amp;nbsp;D = C  • L •  G, where C comes off the acoustic model mapping context dependent to independent phones, L is the pronunciation model and G the language model. &amp;nbsp;The &quot;Verbalized&quot; WFST model includes a WFST V which maps ASR realizations like &quot;SEVEN ELEVEN&quot; to text-like realizations like &quot;7-11&quot; or &quot;7:11&quot; (and since it&#39;s a WFST it can do both simultaneously). &amp;nbsp;The new decoder looks like D = C  • L •  V • G. &amp;nbsp;No fuss, no muss. &amp;nbsp;Except that you have to write Verbalizer rules by hand.&lt;br /&gt;
&lt;br /&gt;
The paper focused on terms involving numbers, but the framework is very extensible. &amp;nbsp;And it&#39;s great to see work coming out of Google that doesn&#39;t have Google-scale data as a prerequisite.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Meta-comment&lt;/b&gt;&lt;br /&gt;
The acceptance rate at this years ICASSP was 52%. &amp;nbsp;This means that the ICASSP and Interspeech acceptance rates are identical for the first time. &amp;nbsp;I know that Interspeech organizers have been working to lower the acceptance rate, while it sounds like there has been pressure to keep the size of ICASSP large, even at the expense of a higher acceptance rate. &amp;nbsp;IEEE (the ICASSP parent organization) is a &lt;b&gt;much&lt;/b&gt;&amp;nbsp;larger bureaucracy than ISCA (Interspeech). &amp;nbsp;There are clear expectations from IEEE about the expected revenue from hosting a conference, which translates to expectations on attendance and therefore the number of accepted papers regardless of the number of submissions.&lt;br /&gt;
&lt;br /&gt;
Despite the near constant rain, I genuinely enjoyed Vancouver and ICASSP 2013. &amp;nbsp;I&#39;m looking forward to the next.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2013/06/deep-thoughts-on-icassp-2013.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-5232187466319381526</guid><pubDate>Mon, 22 Oct 2012 17:27:00 +0000</pubDate><atom:updated>2012-10-22T19:41:57.856-07:00</atom:updated><title>Reading and Reconnecting</title><description>With travel finally settling down for me, but ramping up for &lt;a href=&quot;http://rachellebergstein.com/&quot;&gt;my wife&#39;s book tour&lt;/a&gt;, I&#39;m able to settle in to some long overdue reading, thinking and planning. &lt;br /&gt;
&lt;br /&gt;
Also, after an exciting conversation with &lt;a href=&quot;http://www-scf.usc.edu/~dogancan/&quot;&gt;Dogan Can&lt;/a&gt;, during a trip to USC&#39;s&amp;nbsp;&lt;a href=&quot;http://sail.usc.edu/&quot;&gt;SAIL lab&lt;/a&gt;, I&#39;m trying to get more on top of sharing ideas, progress and information here.&lt;br /&gt;
&lt;br /&gt;
First up: some drill-down reading from Paul Mineiro&#39;s blog post on &lt;a href=&quot;http://www.machinedlearnings.com/2012/10/bagging.html&quot;&gt;Bagging!&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
Ensemble methods work too well for me to understand them so poorly, so:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;How &lt;a href=&quot;http://www.aaai.org/Papers/AAAI/1999/AAAI99-047.pdf&quot;&gt;out-of-bag&lt;/a&gt; estimates can be used to get at generalization error (better than cross-validation can). &amp;nbsp;&lt;/li&gt;
&lt;li&gt;The relationship between the bias-variance tradeoff and ensemble methods from &lt;a href=&quot;http://www.cs.umd.edu/class/spring2006/cmsc726/Lectures/EnsembleMethods.pdf&quot;&gt;this lecture&lt;/a&gt;. &amp;nbsp;This is a nicely framed discussion of ensemble methods that I hadn&#39;t seen before.&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;div&gt;
&lt;a href=&quot;http://homepage.psy.utexas.edu/HomePage/Faculty/Pennebaker/Reprints/Deception.pdf&quot;&gt;Lying Words: Predicting Deception&amp;nbsp;From Linguistic Styles&lt;/a&gt;. &amp;nbsp;This paper describes a common pattern of language use in deceptive story-telling: &amp;nbsp;Less self-reference. More negative emotion words. Less cognitive complexity. &amp;nbsp;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
I&#39;m looking forward to verifying these claims on some old deception data. And taking a look at debate transcripts through this lens.&amp;nbsp;&lt;/div&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2012/10/reading-and-reconnecting.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-2816087617616770497</guid><pubDate>Tue, 18 Sep 2012 02:49:00 +0000</pubDate><atom:updated>2012-09-18T12:04:12.214-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">conference recap</category><category domain="http://www.blogger.com/atom/ns#">interspeech</category><title>Interspeech 2012 Recap</title><description>&lt;span style=&quot;font-family: inherit;&quot;&gt;Portland proved to be a great venue for this year&#39;s Interspeech. &amp;nbsp;(Though people who attended ACL 2011 probably already could have guessed that.)&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;Setting up three simultaneous poster sessions in the parking garage may not sound like the mark of a good conference, but it was perfect. &amp;nbsp;There was loads of space between each presenter. &amp;nbsp;It allowed for all three sessions to be in the same place. &amp;nbsp;And the folks at the Hilton did a great job of making it fairly unrecognizable as a parking lot. &amp;nbsp;(In fact, Alejna Brugos didn&#39;t realize it until they were removing the carpets and &quot;walls&quot; on Thursday afternoon.)&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;b&gt;&lt;span style=&quot;font-family: inherit;&quot;&gt;Deep Neural Networks.&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;For &quot;trends&quot;, there&#39;s really nothing hotter right now than Deep Neural Networks or Deep Belief Nets. &amp;nbsp;This isn&#39;t an area that I do research in, but the story goes more or less this. &amp;nbsp;Neural Networks with more than a few hidden layers don&#39;t train very well with back-propagation. Geoff Hinton and his group figured out how to overcome this limitation not too long ago. &amp;nbsp;(I think &lt;a href=&quot;http://www.cs.toronto.edu/~hinton/absps/ncfast.pdf&quot;&gt;this&lt;/a&gt; 2006 paper explains it, but I can&#39;t be 100% sure.) Then at ASRU 2012 and ICASSP 2011 and 2012, the folks at Microsoft showed that you can use Deep Neural Networks to generate *very* useful front end features. &amp;nbsp;(Tara Sainath has a nice recap of ICASSP 2012 &lt;a href=&quot;http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2012-05/AMatICASSP2012/&quot;&gt;here&lt;/a&gt;.) Now, everyone wants a piece.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;The field has expanded from Microsoft to include IBM and Stanford/Berkeley/Google and RWTH Aachen. &amp;nbsp;Joining them with posters on Deep Neural Nets for ASR are Tsinghua, CMU, Karlsruhe, NTT, INESC-ID, UWashington, and Georgia Tech. &amp;nbsp;At this point, there&#39;s no way to deny that this approach is receiving significant research attention. &amp;nbsp;The results seem to be holding up. &amp;nbsp;If only they didn&#39;t take so long to train...&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;b&gt;&lt;span style=&quot;font-family: inherit;&quot;&gt;Prominence Special Session.&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;I was particularly looking forward to the Special Session on Prominence. &amp;nbsp;On balance I was happy about the session. &amp;nbsp;It attracted work and discussion of prosody in a way that can sometimes feel diffuse and unfocused at a large conference like Interspeech. &lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;I found this session to be surprising in a few ways.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;It&#39;s been my understanding that &quot;prominence&quot; was used as a catch-all term to cover diverse prosodic phenomena including stress, emphasis, and pitch accenting. &amp;nbsp;The first surprising element of this Session came in a review of the paper I submitted to it. &amp;nbsp;The paper is on the use of automatically predicted pitch accents and intonational phrase boundaries to improve pronunciation modeling. &amp;nbsp;The review, while generally positive, found the paper to not be appropriate for a prominence session because it explored the use of &quot;pitch accents&quot; rather than &quot;prominence&quot;. &amp;nbsp;I still haven&#39;t gotten a good explanation of the difference, and the reviews are blind.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;A second surprise is that there seems to be a movement away from a phonological theory of prosody. Mark Hasegawa-Johnson and Jennifer Cole have been doing work over the last few years investigating how naive listeners perceive prominence. &amp;nbsp;They&#39;ve consistently found that listeners respond to different qualities sometimes at different thresholds when assessing prominence. &amp;nbsp;I&#39;ve found this line of research to be interesting and generally informative, but not a clear indictment of the theory that there perceptual and productive prosodic categories exist. &amp;nbsp; &amp;nbsp;The panel (which I was a part of) on balance seemed comfortable with the idea that prominence is a continuous rather than categorical phenomenon. &amp;nbsp;This view was most directly expressed Denis Arnold who said approximately: focus can be categorical, stress can be categorical, while prominence is still continuous. I didn&#39;t understand this statement then, and still don&#39;t. &amp;nbsp;But again, this may be due to a different definition of prominence than I use.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;The last surprise comes from finding out that there is a direction of pursuing&amp;nbsp;language universals in prominence and prosody more broadly. Petra Wagner and Fabio Tamburini (the session organizers) are planning a workshop to investigate this. &amp;nbsp;In my experience, while the dimensions of prosodic variation may be used in multiple languages and some of these (e.g. increased intensity or duration)&amp;nbsp;&lt;i&gt;may&lt;/i&gt; be used to indicate prominence in all languages, it is extremely unlikely that either the communicative impacts of prosodic variation or its realization and perception are language-universal. &amp;nbsp; From that perspective, I&#39;m not quite clear about what this line of research hopes to accomplish, but I&#39;m curious about where it ends up.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;b&gt;&lt;span style=&quot;font-family: inherit;&quot;&gt;Dynamic Decoding.&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;It appears that every year, I find myself sitting in on an oral session on a topic that I know very little about. &amp;nbsp;Last year it was the language identification session. &amp;nbsp;This year it was Dynamic Decoding. &amp;nbsp;I was most intrigued by this because I hadn&#39;t heard the term before. &amp;nbsp;When I asked someone what it was, they said &quot;I don&#39;t know, Viterbi?&quot;. &lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;I&#39;m not quite sure this is a good enough distillation of the topic, but the papers in this session were about how to make on-the-fly (or post-training) modifications to language or pronunciation models. &amp;nbsp;This is a cool idea with clear practical importance -- how do you add words to a recognizer on a mobile device and have this appropriately incorporated into the LM and pronunciation model? &amp;nbsp;&lt;a href=&quot;http://interspeech2012.org/accepted-abstract.html?id=1272&quot;&gt;These&lt;/a&gt; &lt;a href=&quot;http://interspeech2012.org/accepted-abstract.html?id=194&quot;&gt;two&lt;/a&gt; papers have some interesting WFST based approaches on this task. &amp;nbsp;I&#39;ll be curious to see learn more about this. &amp;nbsp;Also, if anyone has a more precise definition of this research area, I&#39;d love to hear it.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;b&gt;&lt;span style=&quot;font-family: inherit;&quot;&gt;Finally, some comments on two of the keynotes.&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;
There were four keynotes at this year&#39;s Interspeech, two were about interesting inter/multi-disciplinary questions about how speech processing intersects with music and animal vocalization, respectively.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;i&gt;&lt;span style=&quot;font-family: inherit;&quot;&gt;Chin-Hui Lee: An Information-Extraction Approach to Speech Analysis and Processing&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;A third was delivered by this years ISCA medalist, Chin-Hui Lee. &amp;nbsp;Prof. Lee&#39;s most famous accomplishment is &lt;a href=&quot;http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=279278&quot;&gt;MAP adaptation&lt;/a&gt; in acoustic modeling. &amp;nbsp;This is a researcher who spent a career treating speech recognition as a pattern matching problem. &amp;nbsp;This is a view embodied by the Fred Jelenik quote: &quot;&lt;span style=&quot;background-color: white; line-height: 19.200000762939453px;&quot;&gt;Every time I fire a linguist, the performance of the speech recognizer goes up&quot;. &amp;nbsp;What struck me, is that despite this view, in a talk summarizing a successful career, Prof. Lee presented a view of speech recognition that says that linguistic knowledge and speech science should be incorporated into the task. &amp;nbsp;This is an alternate perspective that has been investigated by a lot of talented researchers, including Hynek Hermansky, Jennifer Cole, Mark Hasegawa-Johnson, Alex Waibel, Hermann Ney&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;background-color: white; line-height: 19.200000762939453px;&quot;&gt;(via speech-to-speech translation)&lt;/span&gt;&lt;span style=&quot;background-color: white; line-height: 19.200000762939453px;&quot;&gt;, Mari Ostendorf, Elizabeth Shriberg, Andreas Stolcke, Rene Beutler, Karen Livescu and many more (my apologies to anyone I missed).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;span style=&quot;background-color: white; line-height: 19.200000762939453px;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;background-color: white; line-height: 19.200000762939453px;&quot;&gt;I was struck by the evolution of perspective from someone who represents the statistical pattern matching approach to recognizing the potential importance of linguistic knowledge.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;span style=&quot;background-color: white; line-height: 19.200000762939453px;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;line-height: 19.200000762939453px;&quot;&gt;However, this talk was not so well received by some members of the audience for fairly obvious reasons. &amp;nbsp;Firstly, it over-played the importance of Prof. Lee&#39;s own contributions. &amp;nbsp;In a slide on &quot;My contributions&quot;, virtually all major improvements to ASR over the last 20 years were mentioned including most styles of adaptation (including MAP), and virtually all major forms of discriminative training. &amp;nbsp;Secondly, it failed to recognize that the linguistic inspired approach that he was advocating for the future had been extensively researched by other talented peers.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;line-height: 19.200000762939453px;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;
&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;line-height: 19.200000762939453px;&quot;&gt;On balance, I found it a compelling message. &amp;nbsp;In principle, it understandably rubbed some people the wrong way.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;i&gt;&lt;span style=&quot;font-family: inherit;&quot;&gt;Michael Riley: Weighted Transducers in Speech and Language Processing&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;I should preface my comments about Michael Riley&#39;s keynote by saying that we worked together while I was interning at Google. &amp;nbsp;I&#39;m a fan. &amp;nbsp;Michael has the rare quality of being the smartest guy in the room without letting anyone know until its genuinely useful.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;The best part of this keynote was the history of the Weighted Finite State Transducer. &amp;nbsp;This was a great story that takes place largely at Bell Labs in the 90s and features Fernando Pereira, Mehryar Mohri and, naturally, Michael Riley. &amp;nbsp;This section was appropriately personal, while presenting this relevant recent history. &amp;nbsp;The WFST is so ubiquitous in speech and NLP applications that it&#39;s easy to forget that it&#39;s has a human context.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;Much of the rest of the keynote felt like a 3 hour tutorial compressed into 40 minutes. &amp;nbsp;This involved showing algorithms, and example WFSTs and describing all of the things that they can be used for. &amp;nbsp;While a successful demonstration of the breadth of application, it was presented at such a pace that it was difficult to get anything out of it, if you didn&#39;t know it already. &amp;nbsp;I&#39;d point the interested to the references found on the &lt;a href=&quot;http://www.openfst.org/twiki/bin/view/FST/FstBackground&quot;&gt;OpenFST page&lt;/a&gt; for more thorough tutorials that can be digested at your own pace.&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;Interspeech 2012 was successful and fun. &amp;nbsp;Portland and the Hilton (and it&#39;s solid wifi) were excellent hosts. &amp;nbsp;There was good work and as ever more than I could see. &amp;nbsp;If you have great or favorite papers that I missed, please let me know!&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;span style=&quot;font-family: inherit;&quot;&gt;&lt;br /&gt;&lt;/span&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2012/09/interspeech-2012-recap.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-1335211775508736063</guid><pubDate>Wed, 29 Aug 2012 20:55:00 +0000</pubDate><atom:updated>2012-08-29T13:55:30.734-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">teaching</category><title>Overview of Speech and Spoken Language Processing</title><description>Here&#39;s the premise: I was invited to give a guest lecture in &lt;a href=&quot;http://nlp.cs.qc.cuny.edu/advancednlp.html&quot;&gt;Advanced Natural Language Processing&lt;/a&gt;. &amp;nbsp;&amp;nbsp;The students will get one week out of 14 focusing on speech and spoken language processing. But it&#39;s early in the semester, so there&#39;s an opportunity to give a perspective about how speech fits in to the lessons that they&#39;ll be learning in more detail later in the semester.&lt;br /&gt;
&lt;br /&gt;
Here&#39;s the question: how do you spend 75 minutes to provide a useful survey of speech and spoken language processing?&lt;br /&gt;
&lt;br /&gt;
My answer, in powerpoint form, can be found &lt;a href=&quot;http://eniac.cs.qc.cuny.edu/andrew/ANLP-IntroToSpeech.pptx&quot;&gt;here&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
I spent about 2/3 or so of the material on speech recognition. &amp;nbsp;I figured most students are fascinated by the idea of a machine being able to get words from speech, so let&#39;s go through the fundamentals of the technology behind it. &lt;br /&gt;
&lt;br /&gt;
The remaining 1/3rd or so, I focus on the notion that speech recognition is not sufficient for speech understanding. &amp;nbsp;This a lot of other information in speech that is either 1) unavailable in text, or 2) unavailable in ASR transcripts. &amp;nbsp;The premise in this section is to convince students that speech isn&#39;t just a noisy string of unadorned words, but that there&#39;s a lot of information about structure, and intention that is available from the speech signal. What&#39;s more, we can use it in spoken language processing.&lt;br /&gt;
&lt;br /&gt;
There are an outrageous amount of important concepts that get almost no attention here including but not limited to: Digital signal processing, human speech production and perception, speech synthesis, multimodal speech processing, speaker identification, language identification, building speech corpora, linguistic annotation, discourse and dialog, and conversational agents.&lt;br /&gt;
&lt;br /&gt;
Would you do it differently? &amp;nbsp;I&#39;m curious what some other takes on this problem might look like.&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2012/08/overview-of-speech-and-spoken-language.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-3043351229942442899</guid><pubDate>Thu, 16 Aug 2012 18:19:00 +0000</pubDate><atom:updated>2012-08-16T11:19:34.195-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">autobi</category><title>AuToBI v1.3</title><description>This release to &lt;a href=&quot;http://speech.cs.qc.cuny.edu/autobi/&quot;&gt;AuToBI&lt;/a&gt; is a more traditional milestone release than v1.2 was. &amp;nbsp;Trained models and a new .jar file will be available on the AuToBI site shortly.&lt;br /&gt;
&lt;br /&gt;
There are improvements to performance that are thoroughly documented in a submission to &lt;a href=&quot;http://www.slt2012.org/&quot;&gt;IEEE SLT 2012&lt;/a&gt;. &amp;nbsp;These improvements were achieved from two sources.&lt;br /&gt;
&lt;br /&gt;
First, AuToBI uses importance weighting to improve classification performance on skewed distributions. &amp;nbsp;I found this to be a more useful approach than the standard under- or over-sampling. &amp;nbsp;This is discussed in a paper that will appear at Interspeech next month. &lt;br /&gt;
&lt;br /&gt;
Second, inspired by features that Taniya Mishra, Vivek Sridhar and Aliaster Conkie developed at AT&amp;amp;T, I included some new features which had a big payoff. &amp;nbsp;(They described these features in an upcoming Interspeech 2012 paper). &amp;nbsp;One of the most significant was to calculate the area under a normalized intensity curve. &amp;nbsp;This has a strong correlation with duration, but is more robust. &amp;nbsp;You could make an argument that it approximates &quot;loudness&quot; by incorporating duration and intensity. &amp;nbsp;This is a pretty poor psycholinguistic or perceptual argument so I wouldn&#39;t make it too strongly, but it could be part of the story.&lt;br /&gt;
&lt;br /&gt;
Here is a recap of speaker-independent acoustic-only performance on the six ToBI classification tasks on BURNC speaker f2b.&lt;br /&gt;
&lt;br /&gt;
&lt;table border=&quot;2&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;th&gt;Task&lt;/th&gt;
&lt;th&gt;Version 1.2&lt;/th&gt;
&lt;th&gt;Version 1.3&lt;/th&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Pitch Accent Detection&lt;/td&gt;
&lt;td&gt;81.01% F1:83.28&lt;/td&gt;
&lt;td&gt;84.83% F1:86.58&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Intermediate Phrase Detection&lt;/td&gt;
&lt;td&gt;75.41% F1:43.15&lt;/td&gt;
&lt;td&gt;77.97% F1:44.43&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Intonational Phrase Detection&lt;/td&gt;
&lt;td&gt;86.91% F1:74.50&lt;/td&gt;
&lt;td&gt;90.36% F1:76.49&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Pitch Accent Classification&lt;/td&gt;
&lt;td&gt;18.46% Average Recall:18.97&lt;/td&gt;
&lt;td&gt;16.33% Average Recall:21.06&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Phrase Accent Classification&lt;/td&gt;
&lt;td&gt;48.34% Average Recall:47.99&lt;/td&gt;
&lt;td&gt;47.44% Average Recall:48.31&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Phrase Accent/Boundary Tone Classification&lt;/td&gt;
&lt;td&gt;73.18% Average Recall:25.92&lt;/td&gt;
&lt;td&gt;74.47% Average Recall:26.02&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;br /&gt;
There are also a number of improvements to AuToBI from a technical side and as a piece of code.&lt;br /&gt;
&lt;br /&gt;
First of all, unit test coverage has increased from ~11% to ~73% between v1.2 and v1.3. &lt;br /&gt;
&lt;br /&gt;
Second, there was a bug in the PitchExtractor code causing a pretty serious under prediction of unvoiced grames. &amp;nbsp;(A big thanks to Victor Soto for finding this bug.)&lt;br /&gt;
&lt;br /&gt;
Third, memory use is much lower by a more aggressive deletion of prediction attributes, and through a modification of how WavReader works.&lt;br /&gt;
&lt;br /&gt;
I&#39;d like to thank&amp;nbsp;Victor Soto,&amp;nbsp;Fabio Tesser, Samuel Sanchez, Jay Liang, Ian Kaplan, Erica Cooper and, as ever, Julia Hirschberg and anyone else who has been using AuToBI, for their patience and feedback.&lt;br /&gt;
&lt;br /&gt;
I&#39;ve been pretty lax about posting here. &amp;nbsp;I&#39;ll try to get better about it in the coming academic year.&lt;br /&gt;
&lt;br /&gt;
This fall is full of travel which will lead to a lot of ideas and not enough time to work on them.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2012/08/autobi-v13.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-9038968343412439089</guid><pubDate>Wed, 11 Jan 2012 04:21:00 +0000</pubDate><atom:updated>2012-01-10T20:22:24.288-08:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">autobi</category><title>AuToBI Version 1.2</title><description>I hadn&#39;t really planned for this current improvement to &lt;a href=&quot;http://eniac.cs.qc.cuny.edu/andrew/autobi/&quot;&gt;AuToBI&lt;/a&gt; be a milestone release. &lt;br /&gt;
&lt;br /&gt;
I&#39;m about halfway through an effort to get test coverage up to 90-95% of lines and 100% of classes. &amp;nbsp;I promise it&#39;ll get there eventually.&lt;br /&gt;
&lt;br /&gt;
But in the mean time, I was playing with an improvement to how attributes are associated to data points. &amp;nbsp;I knew this was a significant source of inefficiency, but didn&#39;t quite expect this much.&lt;br /&gt;
&lt;br /&gt;
Here are memory usage graphs for training a Pitch Accent Detection model on the Boston University Radio News Corpus -- about 22k data points and 136 features. &amp;nbsp;The first one is on my MacBookPro Laptop with 4G RAM (and a lot of other nonsense running).&lt;br /&gt;
&lt;br /&gt;
&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;
&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjr9yrQpFLNyOI4VIXrRAMPmtaXQLCGHCGeU96Fqzi0zj9LHJbD9CKt6EJtm7-pXqvC8WI9yTSkY9eU1H4E-U2bp5m6ncgTJ_IAm1cMt0kIaX38AZm-3FB7NDO3PZLY3pL1zKR9Gj3kr70/s1600/osx.png&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; height=&quot;141&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjr9yrQpFLNyOI4VIXrRAMPmtaXQLCGHCGeU96Fqzi0zj9LHJbD9CKt6EJtm7-pXqvC8WI9yTSkY9eU1H4E-U2bp5m6ncgTJ_IAm1cMt0kIaX38AZm-3FB7NDO3PZLY3pL1zKR9Gj3kr70/s320/osx.png&quot; width=&quot;320&quot; /&gt;&lt;/a&gt;&lt;/div&gt;
The max memory usage of Version 1.1 was 1914Mb, with this improvement it tops out at 1049Mb. An improvement of about 45%. &amp;nbsp;(You&#39;ll notice it also ends a little bit quicker too, but this is probably because of fewer or quicker garbage collection calls.)&lt;br /&gt;
&lt;br /&gt;
I figured I&#39;d check on a compute server too, one of the Spe&lt;span style=&quot;font-family: inherit;&quot;&gt;ech Lab @ Queens College&#39;s&amp;nbsp;&lt;span style=&quot;background-color: rgba(255, 255, 255, 0.917969); color: #222222;&quot;&gt;Quad Core Intel Xeon Processor E5450 (3.0GHz,2X6M&lt;/span&gt;&lt;span style=&quot;background-color: rgba(255, 255, 255, 0.917969); color: #222222;&quot;&gt;L2,1333) with 4Gb RAM. &amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;
&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgWLQFHuE5KjjT3dROj1iTRLpSqDFvT1dkxZgGfOFNAJoWXeIsKUNz5kuqCOJy7jTStfsgBEz28Q8pLnqKNQL_etd5D2Id02XC8RH1QdoOm_nXRoi27FHujONGhgz4uHu-JY7f-47gRwkI/s1600/linux.png&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; height=&quot;132&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgWLQFHuE5KjjT3dROj1iTRLpSqDFvT1dkxZgGfOFNAJoWXeIsKUNz5kuqCOJy7jTStfsgBEz28Q8pLnqKNQL_etd5D2Id02XC8RH1QdoOm_nXRoi27FHujONGhgz4uHu-JY7f-47gRwkI/s320/linux.png&quot; width=&quot;320&quot; /&gt;&lt;/a&gt;&lt;/div&gt;
Similar results here. &amp;nbsp;Max memory usage of version 1.1 was 2343Mb and with the improvement 1392Mb. Improving by 40%. (And the speed improvement is here too.) I don&#39;t have a good explanation for why the linux version is taking more memory to run, but for now I&#39;ll assume it has something to do with the difference to the JVM.&lt;br /&gt;
&lt;br /&gt;
There are some other bugfixes in this version, but this is the big reason to upgrade.&lt;br /&gt;
&lt;br /&gt;
The version 1.2 is available from github&lt;br /&gt;
&lt;span style=&quot;background-color: #eeeeee; color: #555555; font-family: monospace; font-size: 12px; white-space: pre-wrap;&quot;&gt;git clone git@github.com:AndrewRosenberg/AuToBI.git&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2012/01/autobi-version-12.html</link><author>noreply@blogger.com (Anonymous)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjr9yrQpFLNyOI4VIXrRAMPmtaXQLCGHCGeU96Fqzi0zj9LHJbD9CKt6EJtm7-pXqvC8WI9yTSkY9eU1H4E-U2bp5m6ncgTJ_IAm1cMt0kIaX38AZm-3FB7NDO3PZLY3pL1zKR9Gj3kr70/s72-c/osx.png" height="72" width="72"/><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-6271765655859700880</guid><pubDate>Wed, 04 Jan 2012 00:58:00 +0000</pubDate><atom:updated>2012-01-03T16:58:13.386-08:00</atom:updated><title>English Pronunciation by G. Nolst Trenité</title><description>This is a repost of a poem posted on &lt;a href=&quot;http://spelling.wordpress.com/2007/09/05/english-pronunciation/&quot;&gt;spelling.wordpress.com&lt;/a&gt;&amp;nbsp;that&#39;s been going around facebook today.&lt;br /&gt;
&lt;br /&gt;
It&#39;s an incredibly elegant set of examples about why grapheme-to-phoneme (letter-to-sound) conversion is so difficult in English. &amp;nbsp;(Maybe this should be a required regression test for any TTS frontend...)&lt;br /&gt;
&lt;br /&gt;
Please enjoy.&lt;br /&gt;
&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;&lt;b&gt;English Pronunciation&amp;nbsp;&lt;/b&gt;&lt;/span&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;by G. Nolst Trenité &amp;nbsp;&lt;/span&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;(after the break)&lt;/span&gt;&lt;br /&gt;
&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;&lt;/span&gt;&lt;br /&gt;
&lt;a name=&#39;more&#39;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Dearest creature in creation,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Study English pronunciation.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;I will teach you in my verse&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Sounds like corpse, corps, horse, and worse.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;I will keep you, Suzy, busy,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Make your head with heat grow dizzy.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Tear in eye, your dress will tear.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;So shall I! Oh hear my prayer.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Just compare heart, beard, and heard,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Dies and diet, lord and word,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Sword and sward, retain and Britain.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;(Mind the latter, how it’s written.)&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Now I surely will not plague you&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;With such words as plaque and ague.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;But be careful how you speak:&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Say break and steak, but bleak and streak;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Cloven, oven, how and low,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Script, receipt, show, poem, and toe.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Hear me say, devoid of trickery,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Daughter, laughter, and Terpsichore,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Typhoid, measles, topsails, aisles,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Exiles, similes, and reviles;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Scholar, vicar, and cigar,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Solar, mica, war and far;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;One, anemone, Balmoral,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Kitchen, lichen, laundry, laurel;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Gertrude, German, wind and mind,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Scene, Melpomene, mankind.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Billet does not rhyme with ballet,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Bouquet, wallet, mallet, chalet.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Blood and flood are not like food,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Nor is mould like should and would.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Viscous, viscount, load and broad,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Toward, to forward, to reward.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;And your pronunciation’s OK&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;When you correctly say croquet,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Rounded, wounded, grieve and sieve,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Friend and fiend, alive and live.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Ivy, privy, famous; clamour&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;And enamour rhyme with hammer.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;River, rival, tomb, bomb, comb,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Doll and roll and some and home.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Stranger does not rhyme with anger,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Neither does devour with clangour.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Souls but foul, haunt but aunt,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Font, front, wont, want, grand, and grant,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Shoes, goes, does. Now first say finger,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;And then singer, ginger, linger,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Real, zeal, mauve, gauze, gouge and gauge,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Marriage, foliage, mirage, and age.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Query does not rhyme with very,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Nor does fury sound like bury.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Dost, lost, post and doth, cloth, loth.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Job, nob, bosom, transom, oath.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Though the differences seem little,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;We say actual but victual.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Refer does not rhyme with deafer.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Foeffer does, and zephyr, heifer.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Mint, pint, senate and sedate;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Dull, bull, and George ate late.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Scenic, Arabic, Pacific,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Science, conscience, scientific.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Liberty, library, heave and heaven,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Rachel, ache, moustache, eleven.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;We say hallowed, but allowed,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;People, leopard, towed, but vowed.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Mark the differences, moreover,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Between mover, cover, clover;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Leeches, breeches, wise, precise,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Chalice, but police and lice;&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Camel, constable, unstable,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Principle, disciple, label.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Petal, panel, and canal,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Wait, surprise, plait, promise, pal.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Worm and storm, chaise, chaos, chair,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Senator, spectator, mayor.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Tour, but our and succour, four.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Gas, alas, and Arkansas.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Sea, idea, Korea, area,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Psalm, Maria, but malaria.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Youth, south, southern, cleanse and clean.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Doctrine, turpentine, marine.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Compare alien with Italian,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Dandelion and battalion.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Sally with ally, yea, ye,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Eye, I, ay, aye, whey, and key.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Say aver, but ever, fever,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Neither, leisure, skein, deceiver.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Heron, granary, canary.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Crevice and device and aerie.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Face, but preface, not efface.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Phlegm, phlegmatic, ass, glass, bass.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Large, but target, gin, give, verging,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Ought, out, joust and scour, scourging.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Ear, but earn and wear and tear&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Do not rhyme with here but ere.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Seven is right, but so is even,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Hyphen, roughen, nephew Stephen,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Monkey, donkey, Turk and jerk,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Ask, grasp, wasp, and cork and work.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Pronunciation (think of Psyche!)&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Is a paling stout and spikey?&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Won’t it make you lose your wits,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Writing groats and saying grits?&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;It’s a dark abyss or tunnel:&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Strewn with stones, stowed, solace, gunwale,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Islington and Isle of Wight,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Housewife, verdict and indict.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Finally, which rhymes with enough,&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Though, through, plough, or dough, or cough?&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;Hiccough has the sound of cup.&lt;/span&gt;&lt;br style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: left;&quot; /&gt;&lt;span style=&quot;background-color: white; color: #333333; font-family: verdana, tahoma, arial, sans-serif; font-size: 12px; line-height: 19px; text-align: left;&quot;&gt;My advice is to give up!!!&lt;/span&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2012/01/english-pronunciation-by-g-nolst.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-8869706279734313167</guid><pubDate>Wed, 21 Dec 2011 18:35:00 +0000</pubDate><atom:updated>2011-12-21T10:46:45.025-08:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">evaluation</category><title>Evaluating multi-class classification performance</title><description>Classification performance is generally easy to evaluate. &amp;nbsp;Just use accuracy. &amp;nbsp;It&#39;s the most intuitive evaluation measure you can think of.&lt;br /&gt;
&lt;br /&gt;
$\frac{Correct Predictions}{Number of Data Points}$&lt;br /&gt;
&lt;br /&gt;
We can also represent classification results as a contingency matrix A, with $A_{ij}$ for $i, j \in \vec{t} = \{t_1,\ldots, t_k\}$ where $k$ is the number of possible target values, and A_{ij} is the number of times that a data point from with true label $t_i$ was classified as label $t_j$. &amp;nbsp;Then accuracy is defined as follows:&lt;br /&gt;
&lt;br /&gt;
$Accuracy = \frac{\sum_i A_{ii}}{\sum_i \sum_j A_{ij}}$&lt;br /&gt;
&lt;br /&gt;
This is great until the balance of classes gets out of whack.&lt;br /&gt;
&lt;br /&gt;
This is most commonly seen in cases, where you&#39;re not really classifying but detecting something. &amp;nbsp;Say, a classification task where 95% of data points are non-cancerous, and 5% are cancerous. &amp;nbsp; A degenerate classifier which assigns the majority class label to all points will lead to 95% accuracy (seemingly very good) but is completely uninformative.&lt;br /&gt;
&lt;br /&gt;
When the distribution of class labels is skewed, like this, accuracy becomes a poor evaluation measure. &amp;nbsp;When there are only 2 labels, there are a variety of choices for evaluation that have been developed through investigation of detection problems, including &lt;a href=&quot;http://en.wikipedia.org/wiki/F1_score&quot;&gt;F-measure&lt;/a&gt;, and&amp;nbsp;&lt;a href=&quot;http://en.wikipedia.org/wiki/Receiver_operating_characteristic&quot;&gt;ROC&lt;/a&gt; curves.&lt;br /&gt;
&lt;br /&gt;
There are fewer available choices with more than 2 labels, and the community (at least in natural language processing and speech) has not yet settled on a consistent evaluation measure for these tasks.&lt;br /&gt;
&lt;br /&gt;
Here are some options:&lt;br /&gt;
&lt;br /&gt;
* Ian Read and Stephen Cox proposed the use of &quot;Balanced Error Rate&quot; in &quot;&lt;a href=&quot;http://www.uea.ac.uk/polopoly_fs/1.77246!/read-cox-interspeech-07.pdf&quot;&gt;Automatic Pitch Accent Prediction for Text-To-Speech Synthesis&lt;/a&gt;&quot;, Interspeech 2007.&lt;br /&gt;
&lt;br /&gt;
\[&lt;br /&gt;
BER = 1 - \frac{1}{k}\sum_i\frac{A_{ii}}{\sum_j A_{ij}}&lt;br /&gt;
\]&lt;br /&gt;
&lt;br /&gt;
This is one minus the average recall (correct predictions of a class / true instances) treating each class evenly, regardless of its class membership. &amp;nbsp;It&#39;s worth noting that this measure -- average recall -- was also used in the Interspeech Challenges in &lt;a href=&quot;http://emotion-research.net/sigs/speech-sig/paralinguistic-challenge&quot;&gt;2009&lt;/a&gt;, &lt;a href=&quot;http://emotion-research.net/sigs/speech-sig/paralinguistic-challenge&quot;&gt;2010&lt;/a&gt;, and &lt;a href=&quot;http://emotion-research.net/sigs/speech-sig/is11-speaker-state-challenge&quot;&gt;2011&lt;/a&gt;. &amp;nbsp;The organizers of this challenge, for reasons that escape explanation, chose to call the evaluation measure &quot;unweighted accuracy&quot; rather than &quot;average recall&quot; or &quot;balanced error rate&quot;.&lt;br /&gt;
&lt;br /&gt;
* In &lt;a href=&quot;http://www1.cs.columbia.edu/~amaxwell/amaxwell-thesis-final.pdf&quot;&gt;my thesis&lt;/a&gt;, I put a spin on this measure and defined, &quot;Combined Error Rate&quot;. &amp;nbsp;This was the average of the total false positive and false negative rates taken across each class.&lt;br /&gt;
&lt;br /&gt;
\[&lt;br /&gt;
CER = \frac{\sum_i \frac{n_i}{n} * FP_i + \sum_i\frac{n_i}{n} * FN_i}{2}&lt;br /&gt;
\]&lt;br /&gt;
&lt;br /&gt;
This was fairly ad hoc, and not all that well motivated. &lt;br /&gt;
&lt;br /&gt;
* There is also micro and macro-averaged F-measure, which extend the idea of precision and recall from a single class to a set of classes.&lt;br /&gt;
&lt;br /&gt;
* &lt;a href=&quot;http://en.wikipedia.org/wiki/Mutual_information&quot;&gt;Mutual Information&lt;/a&gt; treats the class and hypothesis distributions as random multinomial variables, and calculates their, well,...mutual information. &amp;nbsp;The downside to this technique is that perfectly bad predictions will have perfect MI, just as a set of perfectly good predictions will. &amp;nbsp;This is a serious problem, making it significantly less useful. &lt;br /&gt;
&lt;br /&gt;
With the exception of Mutual Information, each of these have the form, of examining the contingency table, and weighting the contribution of a correct prediction (or error) based on its true or hypothesized class membership.&lt;br /&gt;
&lt;br /&gt;
This allows us to generalize (many of) these measures into something like&lt;br /&gt;
&lt;br /&gt;
\[&lt;br /&gt;
Measure = \frac{1}{Z(A)}\sum_i \sum_j w(i,j) A_{ij}&lt;br /&gt;
\]&lt;br /&gt;
&lt;br /&gt;
where w(i,j) is a function that determines how much a classification of a point of class $i$ as class $j$ contributes to the measure, and Z(A) is a normalizing function.&lt;br /&gt;
&lt;br /&gt;
Essentially we&#39;re trying to determine how important it is for each point to be classified correctly, on the basis of its class membership, and how damaging an incorrect classification.&lt;br /&gt;
&lt;br /&gt;
Information theory can give us a values for how important each point is. It seems as though w(i,j) := $\delta(i = j)*(-p_i \log p_i)$ would be a valuable evaluation measure. &amp;nbsp;Where $\delta(i = j)$ is a Kronecker delta function, which is 1 if $i = j$ and 0 otherwise. This says that correct classifications contribute the amount of information in its true class to the measure function, while incorrect classifications contribute nothing. &amp;nbsp;This instantiation of an accuracy-like measure weights correct classifications by the amount of information they transmit to the user.&lt;br /&gt;
&lt;br /&gt;
I don&#39;t know of anyone who has done this, but it seems like someone should have. &amp;nbsp;I&#39;ll keep looking in the literature until I find something, or I give up and write a little paper on this.&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2011/12/evaluating-multi-class-classification.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-4254289860454603693</guid><pubDate>Tue, 06 Sep 2011 18:22:00 +0000</pubDate><atom:updated>2011-09-06T16:30:51.518-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">conference recap</category><category domain="http://www.blogger.com/atom/ns#">interspeech</category><title>Interspeech 2011 Recap</title><description>Interspeech 2011 was held in Florence Italy about a week ago, August 28-August 31. &amp;nbsp;A vacation in Italy was too good to pass up on, so The Lady joined me, and we stayed until Labor Day.&lt;br /&gt;
&lt;br /&gt;
I ended up sending a bulk of work to Interspeech, so spent more time than usual in sessions that I was presenting in rather than seeing a lot of papers. &lt;br /&gt;
&lt;br /&gt;
Two interesting themes stood out to me this year. &amp;nbsp;Not for nothing, but these represent some novel ideas about speech science through understanding dialog and the speech engineering.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Entrainment&lt;/b&gt;&lt;br /&gt;
Julia Hirschberg, my former advisor, received the ISCA medal for her years of work in speech. &amp;nbsp;Her talk was on current work with Agustín Gravano and Ani Nenkova on entrainment. &amp;nbsp;Entrainment is the phenomenon by which when people are speaking to each other, their speech becomes more similar. &amp;nbsp;This can be realized in terms of the words that are used to describe a concept, as well as speaking rate, pitch, intensity. &amp;nbsp;How this happens isn&#39;t totally understood, and measures of entrainment are still being developed. &amp;nbsp;This research theme is still in its early phases, but I haven&#39;t seen an idea spread around a conference as quickly or as thoroughly as this did. &amp;nbsp;There were questions and discussions all over the place (like Tom Mitchell&#39;s keynote about fMRI data and word meaning) about this phenomenon. &amp;nbsp;The more engineering folks weren&#39;t as compelled by the utility of this in helping speech processing, but within the speech perception and production communities, and specifically the dialog and IVR folks, it was all the rage. &amp;nbsp;It&#39;ll be something to see how this develops. &lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;I-vectors&lt;/b&gt;&lt;br /&gt;
I-vectors are a topic that I need to spend some time learning. &amp;nbsp;I hadn&#39;t heard of it before this conference, where there were no less than a dozen papers that used this approach. &amp;nbsp;Essentially the idea is this: &amp;nbsp;The location of mixture components in a GMM model are composed of (in at least one form) a UBM, a channel component, and a &quot;interesting&quot; component. &amp;nbsp;This &quot;interesting&quot; component can be the contribution of a particular speaker, or a language/dialect, or anything else you&#39;re trying to model. &amp;nbsp;Joint Factor Analysis is used to decompose the observation into these components in an unsupervised fashion. &amp;nbsp;It&#39;s in this part where my understanding of the math is still limited. &amp;nbsp;The crux is that the &quot;interesting&quot; component can be represented by a \[Dx\] transformation, where the dimensionality of x can be set by the user. &amp;nbsp;In comparison to a supervector representation, where the dimensionality of the supervector is constrained to be equal to the number of parameters (or means) of the GMM, i-vectors can be significantly smaller leading to better estimation and smaller models. &amp;nbsp;I&#39;ll be reading &lt;a href=&quot;http://www.icsi.berkeley.edu/Speech/presentations/AFRL_ICSI_visit2_JFA_tutorial_icsitalk.pdf&quot;&gt;this tutorial&lt;/a&gt;&amp;nbsp;by Howard Lei over the next few weeks to get up to speed. &lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
There were few specific papers that stood out to me this conference. &amp;nbsp;I&#39;m intrigued by Functional Data Analysis as a way to model continuous time-value observations. Michele Gubian gave a tutorial on this that I sadly missed, and included it in at least one paper,&amp;nbsp;&lt;b&gt;&lt;a href=&quot;http://lands.let.ru.nl/FDA/papers/Chierh_Interspeech2011_camera_ready.pdf&quot;&gt;Predicting Taiwan Mandarin tone shapes from their duration&lt;/a&gt; by Chierh Chung and Michele Gubian&lt;/b&gt;. &amp;nbsp;This paper wasn&#39;t totally convincing in the utility of the technique, but there may be more appropriate applications.&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;background-color: #fafafa; color: #404040; font-family: Verdana, sans-serif, Arial; font-size: 12px; line-height: 15px;&quot;&gt;&lt;/span&gt;&lt;br /&gt;
&lt;h4 style=&quot;color: black; font-size: 12px; font-weight: bold; margin-bottom: 3pt; margin-top: 0pt; text-align: left;&quot;&gt;


&lt;span class=&quot;Apple-style-span&quot; style=&quot;background-color: #fafafa; color: #404040; font-family: Verdana, sans-serif, Arial; font-size: 12px; line-height: 15px;&quot;&gt;
&lt;/span&gt;&lt;/h4&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;background-color: #fafafa; color: #404040; font-family: Verdana, sans-serif, Arial; font-size: 12px; line-height: 15px;&quot;&gt;
&lt;/span&gt;&lt;br /&gt;
It was a satisfying and inspiring conference, to be sure. &amp;nbsp;I think I was more interested in talking to people than in papers in particular this time. &amp;nbsp;If anyone has particular favorites that I missed, please use the comments to share or just email me.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2011/09/interspeech-2011-recap.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-3727938517694645034</guid><pubDate>Thu, 25 Aug 2011 02:34:00 +0000</pubDate><atom:updated>2011-08-24T19:34:09.168-07:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">interspeech</category><title>Interspeech Tutorial</title><description>I&#39;m giving my first tutorial at Interspeech on Saturday in Florence.&lt;br /&gt;
&lt;br /&gt;
It&#39;s titled &quot;More than Words can Say: Prosodic Analysis Techniques and Applications&quot;.&lt;br /&gt;
&lt;br /&gt;
The (very likely) final version of the slides are available&amp;nbsp;&lt;a href=&quot;http://eniac.cs.qc.cuny.edu/andrew/presentations/interspeech11-tutorial-rosenberg.pptx&quot;&gt;here&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Now off to Italy. &amp;nbsp;Interspeech Recap to come.&lt;br /&gt;
&lt;br /&gt;
</description><link>http://spokenlanguageprocessing.blogspot.com/2011/08/interspeech-tutorial.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-1066806389212755744</guid><pubDate>Thu, 12 May 2011 12:12:00 +0000</pubDate><atom:updated>2011-05-13T13:37:28.539-07:00</atom:updated><title>Because Research is a Creative Act.</title><description>This quote by &lt;a href=&quot;http://www.thisamericanlife.org/about/staff&quot;&gt;Ira Glass&lt;/a&gt;&amp;nbsp;has been going around design and writing blogs and tumblrs. &amp;nbsp;It&#39;s equally applicable to research.&lt;br /&gt;
&lt;br /&gt;
&lt;blockquote&gt;Nobody tells this to people who are beginners, I wish someone told me. All of us who do creative work, we get into it because we have good taste. But there is this gap. For the first couple years you make stuff, it’s just not that good. It’s trying to be good, it has potential, but it’s not. But your taste, the thing that got you into the game, is still killer. And your taste is why your work disappoints you. A lot of people never get past this phase, they quit. Most people I know who do interesting, creative work went through years of this. We know our work doesn’t have this special thing that we want it to have. We all go through this. And if you are just starting out or you are still in this phase, you gotta know its normal and the most important thing you can do is do a lot of work. Put yourself on a deadline so that every week you will finish one story. It is only by going through a volume of work that you will close that gap, and your work will be as good as your ambitions. And I took longer to figure out how to do this than anyone I’ve ever met. It’s gonna take awhile. It’s normal to take awhile. You’ve just gotta fight your way through.&lt;/blockquote&gt;&lt;br /&gt;
Do good work. &amp;nbsp;But maybe more importantly, work.</description><link>http://spokenlanguageprocessing.blogspot.com/2011/05/because-research-is-creative-act.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-3077306123520529980</guid><pubDate>Sat, 19 Feb 2011 06:42:00 +0000</pubDate><atom:updated>2011-02-19T07:44:48.382-08:00</atom:updated><title>I&#39;ll take &quot;The Humans&quot; for $1,000,000, Alex. (The obligatory Watson post.)</title><description>Full disclosure: I worked one day a week for a year on improving Watson&#39;s speech synthesis. &amp;nbsp;While I think the voice added flair and personality to the exhibition, everyone knows that question-answering is what Watson is all about. And I didn&#39;t have any hand in that.&lt;br /&gt;
&lt;br /&gt;
Wednesday marked the end of the three day exhibition Jeopardy! match between Watson, IBM&#39;s question answering system, Ken Jennings and Brad Rutter. &amp;nbsp;Watson decisively beat the human competitors.&lt;br /&gt;
&lt;br /&gt;
There is no shortage of commentary on the implications of Watson&#39;s victory. &amp;nbsp; One of my &lt;a href=&quot;http://www.slate.com/id/2284721/&quot;&gt;favorite pieces&lt;/a&gt; was by Ken Jennings himself. &amp;nbsp;He&#39;s played more games of Jeopardy! than anyone else, and describes the experience in a very measured way.&lt;br /&gt;
&lt;br /&gt;
One of his observations sticks out&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: inherit;&quot;&gt;: &quot;&lt;span class=&quot;Apple-style-span&quot; style=&quot;line-height: 18px;&quot;&gt;I expected Watson&#39;s bag of cognitive tricks to be fairly shallow, but I felt an uneasy sense of familiarity as its programmers briefed us before the big match: The computer&#39;s techniques for unraveling&lt;/span&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;line-height: 18px;&quot;&gt;&lt;em&gt;&amp;nbsp;Jeopardy!&amp;nbsp;&lt;/em&gt;&lt;/span&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;line-height: 18px;&quot;&gt;clues sounded just like mine.&quot;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Verdana; font-size: 12px; line-height: 18px;&quot;&gt;&lt;/span&gt;&amp;nbsp;It is in this that Watson represents a step forward for Artificial Intelligence.&lt;br /&gt;
&lt;br /&gt;
There is a human exceptionalism that most people have when discussing &quot;intelligence&quot;. &amp;nbsp;This exceptionalism leads to a dismissal of the success of Deep Blue&#39;s defeat of Kasparov as computation not intelligence. &amp;nbsp;It leads to descriptions of Watson as a &quot;big database&quot;. &amp;nbsp;The same exceptionalism leads to the Turing Test as the final arbiter of intelligence.&lt;br /&gt;
&lt;br /&gt;
Prediction: No machine will ever pass the Turing Test.&lt;br /&gt;
&lt;br /&gt;
The test is explicitly defined to defend the exceptionalism of human intelligence. &amp;nbsp;It all but explicitly says, &quot;humans are intelligent. To be intelligent, you must be indistinguishable from a human.&quot; &lt;br /&gt;
&lt;br /&gt;
Before this test is passed, machines will continue to perform tasks which we consider to demonstrate intelligence better than humans can. &amp;nbsp;Chess: done. &amp;nbsp;Quiz shows: done. Mathematical theorem proving: done. &amp;nbsp;What&#39;s next? Perfect SAT scores. Perfect GRE scores? &amp;nbsp;Writing a five paragraph essay? &amp;nbsp;All conceivable in the next, say, five years. &amp;nbsp;Performing basic research &lt;b&gt;and&lt;/b&gt;&amp;nbsp;writing an accepted peer-reviewed publication of its findings. &amp;nbsp;Certainly further off, but not inconceivable. &amp;nbsp;But none of these systems would pass the Turing Test; people would still ask, &quot;can it make me coffee?&quot; or &quot;can it tell me &lt;a href=&quot;http://rachellebergstein.blogspot.com/2011/02/addicted-to-love.html&quot;&gt;why Ronnie and Sam are still together&lt;/a&gt;?&quot; &amp;nbsp;And yet, progress in artificial intelligence will be undeterred.&lt;br /&gt;
&lt;br /&gt;
Ray Kurzweil&#39;s prediction of a self-aware system will almost certainly predate one that can convince us that it is human. &amp;nbsp;Despite its ability to describe its &quot;thought&quot; process, and outperform humans on tasks that were formerly considered representative behavior, there will be those that won&#39;t consider this system &quot;intelligent&quot; until it passes the Turing Test.&lt;br /&gt;
&lt;br /&gt;
Watson&#39;s success comes from some very impressive machine learning and natural language processing, not only in its ability to generate answers quickly, but in its collection of knowledge. &amp;nbsp;The individual ml and nlp components used in Watson may individually be incremental improvements on previously existing approaches. &amp;nbsp;Yet this doesn&#39;t mean that Watson is itself an incremental milestone. &amp;nbsp;Watson bested a human at a task that we believed, until Wednesday night, required intelligence. &amp;nbsp;But more impressively, it won in a way that was similar (in spirit if not in hardware) to the way humans play Jeopardy!&lt;br /&gt;
&lt;br /&gt;
Requisite CYA: As always, this post doesn&#39;t represent anyone&#39;s views other than my own.</description><link>http://spokenlanguageprocessing.blogspot.com/2011/02/ill-take-humans-for-1000000-alex.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-7893777640278308212</guid><pubDate>Thu, 20 Jan 2011 22:55:00 +0000</pubDate><atom:updated>2011-01-20T14:55:35.557-08:00</atom:updated><title>P == NP.  The battle rages on.</title><description>Last summer, Vinay Deolalikar claimed to have proved that &lt;a href=&quot;http://spokenlanguageprocessing.blogspot.com/2010/08/p-np.html&quot;&gt;P!=NP&lt;/a&gt;. &amp;nbsp;This kicked up some terrific math and computer science dust, and a lot of people got excited, myself included. &amp;nbsp;But this wasn&#39;t exactly a surprising result...most people suspect that P != NP after all, but a proof has been elusive.&lt;br /&gt;
&lt;br /&gt;
Well, evening out the playing field, Vladimir Romanov claims to have come up with a polynomial time algorithm for 3-SAT -- one of the foundational NP-hard problems. &lt;a href=&quot;http://arxiv.org/abs/1011.3944&quot;&gt;Here&#39;s&lt;/a&gt; the paper describing the work. &amp;nbsp;But what&#39;s more...there&#39;s &lt;a href=&quot;https://github.com/anjlab/sat3&quot;&gt;source code&lt;/a&gt; for it. &amp;nbsp;If he&#39;s right, then P==NP.&lt;br /&gt;
&lt;br /&gt;
I fully expect that people will be picking this work apart pretty rigorously over the next few weeks, months and years. &amp;nbsp;But if it holds up, it&#39;s a really big deal. &amp;nbsp;Much bigger than P!=NP, most specifically because it implies that large factorization can be solved in polynomial time, which means all of our encryption is broken.</description><link>http://spokenlanguageprocessing.blogspot.com/2011/01/p-np-battle-rages-on.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-291216400294549604</guid><pubDate>Wed, 19 Jan 2011 00:34:00 +0000</pubDate><atom:updated>2011-01-18T16:35:49.666-08:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">autobi</category><title>AuToBI Version 1.1</title><description>I&#39;ve made enough improvements to AuToBI to consider the toolkit a milestone more mature.&lt;br /&gt;
&lt;br /&gt;
In addition to some uninteresting bug fixes, and refactoring, the version 1.1 is made up by 3 significant changes. &amp;nbsp;As ever, the&amp;nbsp;&lt;a href=&quot;http://eniac.cs.qc.cuny.edu/andrew/autobi/&quot;&gt;AuToBI homepage&lt;/a&gt;&amp;nbsp;includes milestone releases, and the project itself is hosted on &lt;a href=&quot;https://github.com/AndrewRosenberg/AuToBI&quot;&gt;github&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
1) Package restructuring.&lt;br /&gt;
&lt;br /&gt;
The internal structure makes more sense now. &amp;nbsp;Classes are divided into &#39;core&#39;, &#39;feature extractors&#39;, &#39;classifiers&#39;, &#39;feature sets&#39;, &#39;io&#39; and &#39;utilities&#39;. &amp;nbsp;With over 100 classes, the code had outgrown a flat package structure. &amp;nbsp;Unfortunately this restructuring changed the serialization signatures of class names. &amp;nbsp;This means that old models won&#39;t work with the version 1.1 release. &amp;nbsp;But I have new models trained on the Boston Directions Corpus and Boston Radio News Corpus which will be available from the &lt;a href=&quot;http://eniac.cs.qc.cuny.edu/andrew/autobi/&quot;&gt;AuToBI homepage&lt;/a&gt; shortly.&lt;br /&gt;
&lt;br /&gt;
2) Implemented reference counting for Feature maintenance.&lt;br /&gt;
&lt;br /&gt;
The feature extraction process allows a user to specify the features they want without explicitly going through all the intermediate steps to extract them. &amp;nbsp;For example, extracting the mean speaker normalized pitch from the second half of a word requires pitch to be extracted, speaker normalization parameters to be calculated or retrieved, the pitch to be normalized, the second half of the word to be identified, and finally the mean calculated. &amp;nbsp;In AuToBI each of these steps are treated as features that are required by the feature extraction of the user-desired feature. &amp;nbsp;In version 1.0, AuToBI was able to identify which features were required, but never recognized when a feature wasn&#39;t needed any longer. &amp;nbsp;Version 1.1 includes functionality that maintains a reference count for each feature based on how many features that still need to be extracted are going to need it -- speaker normalization parameters may be required by many requested features. &amp;nbsp;This allows for tidier memory management, which should, in turn, allow AuToBI to operate on more material.&lt;br /&gt;
&lt;br /&gt;
3) Reduced storage for acoustic contours.&lt;br /&gt;
&lt;br /&gt;
In version 1.0 pitch and intensity contours were stored as lists of time-value pairs -- a storage class containing 2 doubles. &amp;nbsp;Assuming 10ms samples, and 16bytes per sample, this is about 1600bytes per second. &amp;nbsp;Not unacceptably inefficient, but definitely could be improved. &amp;nbsp;The other approach would be to only store values, and let the time of each point be specified by a start time (t0) and step size (dt). &amp;nbsp;This allows storage with 8bytes per sample plus 16bytes for the parameters. &amp;nbsp;The problem with this approach is that pitch points are invalid when there is no periodic material detected. &amp;nbsp;The time-value pair approach handled this simply and easily. &amp;nbsp;The new solution uses an array of one-bit booleans for each sample in the contour, describing whether it was a &#39;valid&#39; point or not. &amp;nbsp;If the rate of periodic to aperiodic material is &amp;nbsp;less than 32:1, the new approach will reduce the memory requirements. &amp;nbsp;(As I&#39;m writing this, I realize that this ratio could be explicitly tested during pitch extraction, selecting the most efficient storage solution at runtime. &amp;nbsp;Keep an eye out in a new release.)&lt;br /&gt;
&lt;br /&gt;
The changes are mostly inside baseball kind of things, but possibly useful for AuToBI users who get their hands dirty in the code -- whoever you are. &amp;nbsp; For everyone else, the most obvious changes you&#39;ll notice is that version 1.0 models don&#39;t work any more, and a speedup of ~6%.</description><link>http://spokenlanguageprocessing.blogspot.com/2011/01/autobi-version-11.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-3289972161364871194</guid><pubDate>Fri, 03 Dec 2010 22:30:00 +0000</pubDate><atom:updated>2010-12-03T14:30:50.660-08:00</atom:updated><title>Personal Paper Writing Month - Recap</title><description>Well, no one wanted to join with me for Paper Writing Month, so I can only report on my own progress for November.&lt;br /&gt;
&lt;br /&gt;
I got one full draft written, and the experiments for two others are more or less done running.&lt;br /&gt;
&lt;br /&gt;
I can point out a number of factors that kept me from being more productive, none of which make me disappointed in myself.&amp;nbsp;In all say this was a good exercise, and one I&#39;ll do again. &amp;nbsp;Maybe next time, I can find a buddy or two to join with me on it.&lt;br /&gt;
&lt;br /&gt;
Now that I&#39;ve written four (now five) sentences including a first-person personal pronoun, here&#39;s some more general thoughts about research.&lt;br /&gt;
&lt;br /&gt;
Alex and Aki at Ideas in Food have a blog post about about &lt;a href=&quot;http://blog.ideasinfood.com/ideas_in_food/2010/11/5-factors-shaping-creativity-in-the-kitchen.html&quot;&gt;Creativity in the Kitchen&lt;/a&gt;. &amp;nbsp;They loosely break this down into Inspiration, Flexibility, Motivation, Adaptation and Refinement. &amp;nbsp;Both these top-level categories and the specifics they address apply almost as well to creativity in research. &amp;nbsp;So check it out and cross-pollenate a little.</description><link>http://spokenlanguageprocessing.blogspot.com/2010/12/personal-paper-writing-month-recap.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>2</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-1346096307926760927</guid><pubDate>Thu, 11 Nov 2010 21:19:00 +0000</pubDate><atom:updated>2010-11-11T13:19:54.963-08:00</atom:updated><title>Cross-validation with one model</title><description>&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;This is essentially a repost of &lt;a href=&quot;http://robjhyndman.com/researchtips/crossvalidation/&quot;&gt;Rob J Hyndman&#39;s blog post&lt;/a&gt; on the relevance of cross-validation for statisticians.&lt;/span&gt;&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;Within this very nice piece, Rob drops this bomb of mathematical knowledge:&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div style=&quot;text-align: center;&quot;&gt;&lt;b&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;I&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: 16px; line-height: 24px;&quot;&gt;t is not necessary to actually fit&amp;nbsp;&lt;/span&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: 16px; line-height: 24px;&quot;&gt;&lt;img alt=&quot;n&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-b9953f5805678502c3149a3a96fb5ad3_l2.gif&quot; style=&quot;border-color: initial; border-color: initial; border-style: initial; border-top-style: none; border-width: initial; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: 16px; line-height: 24px;&quot;&gt;&lt;b&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&amp;nbsp;separate models when computing the CV statistic for linear models.&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; font-size: 16px; line-height: 24px;&quot;&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;Say what?&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class=&quot;Apple-style-span&quot; style=&quot;line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;Here is a broader excerpt and the method itself (after the jump).&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;a name=&#39;more&#39;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;hr /&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; font-size: 16px; line-height: 24px;&quot;&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;While cross-validation can be computationally expensive in general, it is very easy and fast to compute LOOCV for linear models. A linear model can be written as&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: center;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;&lt;img alt=&quot;\[&amp;lt;br /&amp;gt;
\mathbf{Y} = \mathbf{X}\mbox{\boldmath$\beta$} + \mathbf{e}.&amp;lt;br /&amp;gt;
\]&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-7e50600b231371a08c582b7d46308497_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;Then&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: center;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;&lt;img alt=&quot;\[&amp;lt;br /&amp;gt;
\hat{\mbox{\boldmath$\beta$}} = (\mathbf{X}&#39;\mathbf{X})^{-1}\mathbf{X}&#39;\mathbf{Y}&amp;lt;br /&amp;gt;
\]&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-8b37f88a8ed5d515da0e7efe8975d10b_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;and the fitted values can be calculated using&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: center;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;&lt;img alt=&quot;\[&amp;lt;br /&amp;gt;
\mathbf{\hat{Y}} = \mathbf{X}\hat{\mbox{\boldmath$\beta$}} = \mathbf{X}(\mathbf{X}&#39;\mathbf{X})^{-1}\mathbf{X}&#39;\mathbf{Y} = \mathbf{H}\mathbf{Y},&amp;lt;br /&amp;gt;
\]&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-8fe5c63e9e6182338919d9e74d6697ab_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;where&amp;nbsp;&lt;img alt=&quot;\mathbf{H} =  \mathbf{X}(\mathbf{X}’\mathbf{X})^{-1}\mathbf{X}’&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-4deb176029efb1ce146ad95195fd3b48_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: -4px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&amp;nbsp;is known as the “hat-matrix” because it is used to compute&amp;nbsp;&lt;img alt=&quot;\mathbf{\hat{Y}}&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-99e32de32dcce704cb078a4396fa8e96_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: -1px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&amp;nbsp;(“Y-hat”).&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;If the diagonal values of&amp;nbsp;&lt;img alt=&quot;\mathbf{H}&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-fe27ea8970f697e4b951fabca5ff653c_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: -1px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&amp;nbsp;are denoted by&amp;nbsp;&lt;img alt=&quot;h_{1},\dots,h_{n}&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-679927b092a013154c88ec022c5e5d2a_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: -4px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;, then the cross-validation statistic can be computed using&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; text-align: center;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;&lt;img alt=&quot;\[&amp;lt;br /&amp;gt;
\text{CV} = \frac{1}{n}\sum_{i=1}^n [e_{i}/(1-h_{i})]^2,&amp;lt;br /&amp;gt;
\]&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-9c139a93f634723776c3ee3c8d538c89_l2.gif&quot; style=&quot;border-color: initial; border-color: initial; border-style: initial; border-top-style: none; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;/div&gt;&lt;div style=&quot;font-family: Georgia, serif; line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Georgia, serif; line-height: 24px;&quot;&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-size: small;&quot;&gt;where&amp;nbsp;&lt;span class=&quot;Apple-style-span&quot; style=&quot;border-color: initial; border-color: initial; border-style: initial; border-width: initial;&quot;&gt;&lt;img alt=&quot;e_{i}&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-d90076703c21e5547e04a0571c7e44cb_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: -3px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&amp;nbsp;is the residual obtained from fitting the model to all&amp;nbsp;&lt;span class=&quot;Apple-style-span&quot; style=&quot;border-color: initial; border-color: initial; border-style: initial; border-width: initial;&quot;&gt;&lt;img alt=&quot;n&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-b9953f5805678502c3149a3a96fb5ad3_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&amp;nbsp;observations. See Christensen’s book&amp;nbsp;&lt;a class=&quot;vt-p&quot; href=&quot;http://www.amazon.com/gp/product/0387953612?ie=UTF8&amp;amp;tag=prorobjhyn-20&amp;amp;linkCode=as2&amp;amp;camp=1789&amp;amp;creative=390957&amp;amp;creativeASIN=0387953612&quot; style=&quot;color: #0071bb; font-family: Georgia, serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; outline-color: initial; outline-style: none; outline-width: initial; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;Plane Answers to Complex Questions&lt;/a&gt;&amp;nbsp;for a proof. Thus, it is not necessary to actually fit&amp;nbsp;&lt;span class=&quot;Apple-style-span&quot; style=&quot;border-color: initial; border-color: initial; border-style: initial; border-width: initial;&quot;&gt;&lt;img alt=&quot;n&quot; src=&quot;http://robjhyndman.com/researchtips/wp-content/ql-cache/quicklatex.com-b9953f5805678502c3149a3a96fb5ad3_l2.gif&quot; style=&quot;border-bottom-style: none; border-bottom-width: 0px; border-color: initial; border-color: initial; border-left-style: none; border-left-width: 0px; border-right-style: none; border-right-width: 0px; border-style: initial; border-top-style: none; border-top-width: 0px; border-width: initial; font-family: &#39;Segoe UI&#39;, Calibri, &#39;Myriad Pro&#39;, Myriad, &#39;Trebuchet MS&#39;, Helvetica, Arial, sans-serif; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: 0px;&quot; title=&quot;Rendered by QuickLaTeX.com&quot; /&gt;&lt;/span&gt;&amp;nbsp;separate models when computing the CV statistic for linear models. This remarkable result allows cross-validation to be used while only fitting the model once to all available observations.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;hr /&gt;&lt;div style=&quot;line-height: 24px; margin-bottom: 0.8em; margin-left: 0px; margin-right: 0px; margin-top: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px;&quot;&gt;&lt;div style=&quot;font-family: Georgia, serif;&quot;&gt;&lt;br /&gt;
&lt;/div&gt;&lt;span class=&quot;Apple-style-span&quot; style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt;Very cool.&lt;/span&gt;&lt;/div&gt;</description><link>http://spokenlanguageprocessing.blogspot.com/2010/11/cross-validation-with-one-model_11.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-1268266350438382990.post-76926786295971938</guid><pubDate>Sat, 06 Nov 2010 15:02:00 +0000</pubDate><atom:updated>2010-11-06T08:02:44.343-07:00</atom:updated><title>Semantically Related Term Challenge</title><description>Joseph Turian over at &lt;a href=&quot;http://metaoptimize.com/&quot;&gt;MetaOptimize.com&lt;/a&gt; has posted a fun NLP &lt;a href=&quot;http://metaoptimize.com/blog/2010/11/05/nlp-challenge-find-semantically-related-terms-over-a-large-vocabulary-1m/&quot;&gt;challenge&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
The task is to identify semantically related words from a shared corpus. &lt;br /&gt;
&lt;br /&gt;
So you&#39;re thinking, sure, no problem. &amp;nbsp;I&#39;ll look for common concurrences. &amp;nbsp;Maybe I&#39;ll start with some seed pairs and do some bootstrapping. &amp;nbsp; Or you do LSA, if you&#39;re into that sort of thing.&lt;br /&gt;
&lt;br /&gt;
But here&#39;s the rub, there are a few million documents, so you&#39;ve got to get clever if you&#39;re going to use LSA (cause that would require SVD of an impossibly large and sparse matrix).&lt;br /&gt;
&lt;br /&gt;
As if that weren&#39;t challenging enough, these &quot;documents&quot; are only a word or two long, so the concurrences you find are going to be pretty sparse.&lt;br /&gt;
&lt;br /&gt;
So, that&#39;s it. &amp;nbsp;Have at it.</description><link>http://spokenlanguageprocessing.blogspot.com/2010/11/semantically-related-term-challenge.html</link><author>noreply@blogger.com (Anonymous)</author><thr:total>0</thr:total></item></channel></rss>