<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:atom="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" version="2.0"><channel><atom:id>tag:blogger.com,1999:blog-232777626311457607</atom:id><lastBuildDate>Fri, 27 Jan 2012 20:53:18 +0000</lastBuildDate><category>observed</category><category>pin pen</category><category>media</category><category>animals</category><category>science journalism</category><category>Etc.</category><category>vocal fry</category><category>phonology</category><category>south</category><category>display</category><category>AAVE</category><category>monophthongization</category><category>phonetics</category><category>southwest</category><category>Chris Matthews</category><category>ay</category><category>spoonerism</category><category>n-word</category><category>tonogenesis</category><category>perception</category><category>lost tv media</category><category>l-vocalization</category><category>Louisiana</category><category>first post</category><category>oy</category><category>intelligence</category><category>irene</category><category>mystery</category><category>video</category><category>relaunch</category><category>natural misunderstanding</category><category>Canadian Raising</category><category>Hauser</category><category>probability</category><category>cognition</category><category>overheard</category><category>linguists</category><category>science</category><category>humor</category><category>animal cognition</category><category>individuals</category><category>visualization</category><category>l</category><category>TV</category><category>sound change</category><category>radio</category><category>linguistics</category><category>fill-feel</category><category>observations</category><category>tool</category><category>peeving</category><category>graphics</category><category>morphology</category><category>raising</category><category>language</category><category>Chomsky</category><category>nature-nurture</category><category>dialect</category><category>language change</category><category>portmanteau</category><category>praat</category><category>misc.</category><category>positive anymore</category><category>philadelphia</category><category>vowels</category><category>plotting</category><category>design</category><category>language attitudes</category><category>race</category><category>nyc</category><category>data</category><category>short-a</category><category>merger</category><title>Val Systems</title><description /><link>http://val-systems.blogspot.com/</link><managingEditor>noreply@blogger.com (Josef Fruehwald)</managingEditor><generator>Blogger</generator><openSearch:totalResults>73</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/ValSystems" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="valsystems" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-7988062161342709908</guid><pubDate>Fri, 27 Jan 2012 19:36:00 +0000</pubDate><atom:updated>2012-01-27T14:36:23.033-05:00</atom:updated><title>Distressing Numbers for Women</title><description>Sometimes I play with non-linguistic data sets recreationally. It's a totally valid hobby! I tend to gravitate towards data on the disparities between men and women, because gender equality is something that matters to me.&lt;br /&gt;
&lt;br /&gt;
I've had this one data set for a while which I got from the &lt;a href="http://www.guardian.co.uk/news/datablog/2009/mar/10/gender-educationsgendergap"&gt;Guardian Data Blog&lt;/a&gt;. It's 2006 data compiled by Unesco on men and women across a number of indicators. The ones of particular interest to me were student enrollment and estimated earned income. The student enrollment data is the percentage of potential students who are currently enrolled as students.&lt;br /&gt;
&lt;br /&gt;
So, for each country for these two indicators, I calculated the ratio of Female/Male, to have one comparable measure. And then I took the log of the ratio, cause that's a good thing to do.&lt;br /&gt;
&lt;br /&gt;
Before you look at the graph, make a guess. In countries with more gender equality in student enrollment, what do you think happens to gender equality in income?&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-QjdavZ4p1iU/TyL50SwQrmI/AAAAAAAAA_M/-9BvoAsvhF8/s1600/edu_income.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="353" src="http://4.bp.blogspot.com/-QjdavZ4p1iU/TyL50SwQrmI/AAAAAAAAA_M/-9BvoAsvhF8/s400/edu_income.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
The answer is &lt;i&gt;nothing&lt;/i&gt;. And these are not all high income, high education countries either. These are global estimates, not just OECD countries.

&lt;br /&gt;
&lt;br /&gt;
On this graph, the red lines indicate total equality, a 1:1 ratio. What's especially striking about this graph is how many countries are cluster on the right of the red line. There are &lt;i&gt;a lot&lt;/i&gt; of countries where more women are enrolled as students than men. But those countries have no better income equality on average than those countries with extreme education inequality!&lt;br /&gt;
&lt;br /&gt;
This figure plots the density function (an estimate of how many countries are located at each point along the education dimension) and the cumulative density function (what percent of countries have at least that much equality or less).&lt;br /&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-fixkYHJ3bWI/TyL7Xpsi2UI/AAAAAAAAA_U/pSCpCZfR-gc/s1600/densities.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="353" src="http://2.bp.blogspot.com/-fixkYHJ3bWI/TyL7Xpsi2UI/AAAAAAAAA_U/pSCpCZfR-gc/s400/densities.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
In about 60% of the countries in the world, more women are students than men! The US is one of these. Maybe you've heard about it. They're calling it the "crisis of boys". Quite a crisis for boys, that on average they have about 90% the education, but&amp;nbsp;156% of the money.&lt;br /&gt;
&lt;br /&gt;
I wonder what this means for the education panacea for world problems.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-7988062161342709908?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2012/01/distressing-numbers-for-women.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-QjdavZ4p1iU/TyL50SwQrmI/AAAAAAAAA_M/-9BvoAsvhF8/s72-c/edu_income.png" height="72" width="72" /><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-7795150418056826230</guid><pubDate>Mon, 19 Dec 2011 19:18:00 +0000</pubDate><atom:updated>2011-12-20T22:53:16.347-05:00</atom:updated><title>I don't think it's linguists' fault.</title><description>Whenever media coverage of a linguistic phenomenon goes &lt;a href="http://val-systems.blogspot.com/2011/12/on-vocal-fry.html"&gt;as far off the rails as the recent vocal fry fiasco&lt;/a&gt;, linguists blame themselves. To quote some commentary that &lt;a href="http://dsbigham.net/"&gt;Doug Bigham&lt;/a&gt; posted to Facebook:&lt;br /&gt;
&lt;blockquote&gt;
It's not the journalists' fault; it's ours. We've failed miserably at public outreach because the "leaders of our field" don't believe the public will ever understand what we do and don't care to try and explain it at a level people will understand. [...] The culture of irrelevance we've created for ourselves can't be dismissed with a hand wave...&lt;/blockquote&gt;
That's some tough love, but I'm inclined to disagree. Perhaps linguists, like all academics, have some isolationist tendencies. Doug himself had a lot of trouble drumming up contributions to &lt;a href="http://popularlinguisticsonline.org/"&gt;the Popular Linguistics Magazine&lt;/a&gt;. But I think a more severe problem is that linguists' point of view is actively unwanted.&lt;br /&gt;
&lt;br /&gt;
To flesh out what I mean, I think it's worth speculating about why this particular piece of research on vocal fry captured the collective media imagination. The research itself was very modest in its scope, and there is a vast universe of research out there that media outlets could have chosen to report on. Putting aside the academic press, you could fill hours of television with just the postings to Science Now, where the vocal fry piece first got some play. So why did this particular piece of research get reported on TV, and all over the internet?&lt;br /&gt;
&lt;br /&gt;
The answer lies, I think, in the supposed culprits: young women. This is a very simple case of language shaming. The Today Show clip described vocal fry as "animal-like," and buffered the piece with iconic images of female frivolity: shopping, gossiping, talking about boys, and watching &lt;i&gt;Sex and the City&lt;/i&gt;. The original MSNBC blog post was updated with the "best comment so far" from Facebook, which said&lt;br /&gt;
&lt;blockquote class="tr_bq"&gt;
"These girls sound like a bunch of neurotic dolphins who do not make sense."&lt;/blockquote&gt;
"Brilliant," says the MSNBC blogger, "can you top that?" Vocal fry has thus been successfully framed as a negative behavior. &lt;br /&gt;
&lt;br /&gt;
Why is vocal fry framed so negatively? Well, it's almost a tautology to say that young women do something, and it is undesirable.&amp;nbsp;Vocal fry is an especially striking case. Before all of this media coverage, no one, except people who work on speech, even knew what it was, or commented on it. Once it was defined and explained, and associated with young women, suddenly it fit snugly into a classic declinism frame, and a linguistic inferiority of women frame.&lt;br /&gt;
&lt;br /&gt;
The supposed motives of young women for doing vocal fry are also a key element in the media coverage. They want to 1) emulate pop artists and 2) fit in with their friends. That is, they are shallow, frivolous, and thoughtless.&amp;nbsp;Really, the tone of the story is only a slightly refined version of &lt;a href="http://www.youtube.com/watch?v=jbhnRuJBHLs"&gt;this&lt;/a&gt;&amp;nbsp;or &lt;a href="http://www.youtube.com/watch?v=v5KmIXZM-V8"&gt;this&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Perhaps the coverage of vocal fry could be understood as being part of a larger trend of policing the behavior of women. In a lot of ways (dietarily, sexually, physically, professionally, etc.), there is a razor thin range of acceptability for young women, which now apparently includes their pitch contours. If you end your utterances with a final pitch rise, you're doing uptalk (a.k.a. &lt;a href="http://www.youtube.com/watch?v=SCNIBV87wV4"&gt;ending all your sentences with question marks&lt;/a&gt;), and if you end them with falling pitches, you're doing vocal fry.
&lt;br /&gt;
&lt;br /&gt;
&lt;hr /&gt;
&lt;br /&gt;
So where does the work of a linguist fit in here? Could we have provided higher quality research and better facts, in an equally&amp;nbsp;digestible&amp;nbsp;manner? Probably, but I submit that media interest in vocal fry has nothing to do with facts, or the quality of the research. The commentary of a linguist would not add grist to the mill of female inferiority, and would therefore just be ignored. In fact, that's exactly what happened with Janet Pierrehumbert's contribution to the Today Show story. What she said was completely lucid, and contained no technical mumbo jumbo, but the point of the coverage was not to educate, but to shame.&lt;br /&gt;
&lt;br /&gt;
The problem is that most people want to be able to use language as a device to separate the inferior from the superior. This kind of desire surfaces in almost every conversation I have about language with a non-expert. It becomes amplified in the media, and it operates at all levels of the social&amp;nbsp;hierarchy. There is &lt;a href="http://val-systems.blogspot.com/2011/01/grammar-phobia-or-judging-book-by-its.html"&gt;the denigration of people who speak non-standard Englishes&lt;/a&gt;.&amp;nbsp;Then, there is the denigration of women's and youth's speech. At the higher levels of the cultural elite, self-worth can be determined by your choice of &lt;i&gt;&lt;a href="http://val-systems.blogspot.com/2011/12/ignorant-slobs.html"&gt;octopuses, octopi, octopodes&lt;/a&gt;&lt;/i&gt;, or by whether you agree that by saying "&lt;a href="http://val-systems.blogspot.com/2010/08/on-bagel-lady.html"&gt;A whole wheat bagel, please&lt;/a&gt;," you should not have to be asked to specify that you don't want cream cheese.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;This&lt;/i&gt;&amp;nbsp;is the kind of social work that people want to use language for, and it is a frustrating cultural juggernaut to be at cross purposes with. And that is exactly why, in my opinion, most linguistic research does not gain traction in popular discourse. Before we can get to the interesting stuff, we first have to turn &lt;i&gt;everyone's&lt;/i&gt;&amp;nbsp;moral universe upside down.&lt;br /&gt;
&lt;br /&gt;
And that kind of task requires something more than just scientists being open to popularizing their research. We really have to be more agressive in a way that other sciences don't have to be. Really, it's necessary to be politicized, and I can fully understand that step being a difficult one to take for a researcher.&lt;br /&gt;
&lt;br /&gt;
I see this tension being the biggest roadblock to developing larger social relevance for linguistics. Are we scientists, or are we politicians? Can we be both, effectively?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-7795150418056826230?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/12/i-dont-think-its-linguists-fault.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>4</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-1359992710569971014</guid><pubDate>Sat, 17 Dec 2011 06:40:00 +0000</pubDate><atom:updated>2011-12-20T17:44:02.959-05:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">vocal fry</category><title>On Vocal Fry</title><description>"Vocal fry" has been a trending topic for about a week now. It began with &lt;a href="http://news.sciencemag.org/sciencenow/2011/12/vocal-fry-creeping-into-us-speec.html"&gt;a Science Now post&lt;/a&gt; that starts&amp;nbsp;out ominously&lt;br /&gt;
&lt;blockquote class="tr_bq"&gt;
A curious vocal pattern has crept into the speech of young adult women who speak American English...&lt;/blockquote&gt;
And then, it exploded. I've seen it posted all over the web, and have largely tried to ignore it. For me, when it comes to reading pieces like these, ignorance is bliss.&lt;br /&gt;
&lt;br /&gt;
But then, &lt;a href="https://twitter.com/#!/dialect"&gt;Lauren Hall-Lew&lt;/a&gt;&amp;nbsp;shared&amp;nbsp;an MSNBC blog post on the topic, entitled "&lt;a href="http://bodyodd.msnbc.msn.com/_news/2011/12/12/9393348-more-college-women-speak-in-creaks-thanks-to-pop-stars"&gt;More college women speak in creaks, thanks to pop stars&lt;/a&gt;." If I were religious, this would call for the serenity prayer. The post comes along with video from the Today Show, with Matt Lauer discussing the phenomenon.
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;center&gt;
&lt;object classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=10,0,0,0" height="245" id="msnbc7d46fb" width="420"&gt;&lt;param name="movie" value="http://www.msnbc.msn.com/id/32545640" /&gt;

&lt;param name="FlashVars" value="launch=45681253&amp;amp;width=420&amp;amp;height=245" /&gt;

&lt;param name="allowScriptAccess" value="always" /&gt;

&lt;param name="allowFullScreen" value="true" /&gt;

&lt;param name="wmode" value="transparent" /&gt;

&lt;embed name="msnbc7d46fb" src="http://www.msnbc.msn.com/id/32545640" width="420" height="245" FlashVars="launch=45681253&amp;amp;width=420&amp;amp;height=245" allowscriptaccess="always" allowFullScreen="true" wmode="transparent" type="application/x-shockwave-flash" pluginspage="http://www.adobe.com/shockwave/download/download.cgi?P1_Prod_Version=ShockwaveFlash"&gt;&lt;/embed&gt;&lt;/object&gt;
&lt;/center&gt;
&lt;br /&gt;
&lt;br /&gt;
What is wrong with this video is everything. There is a brief snippet where they interview a real linguist (Janet Pierrehumbert) who says (I paraphrase) "This isn't a new phenomenon, and it's not caused by pop-stars" (see also, &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=3626"&gt;the related Language Log post&lt;/a&gt;). But see how much air time that gets! The whole premise of the piece is wrong, and she says so, and they power right along like it's irrelevant. If you were to, say, introduce a political figure on air with the incorrect party or state affiliation, you'd have to apologize on air moments later. If you report that the jury found a defendant guilty when they were actually acquitted, you'd be ripped to shreds.&amp;nbsp;You state a bunch of garbage about language, and an expert tells you you've got it all wrong, oh, whatever, it's more fun this way. On this topic, and most others about language, the media coverage is of the same journalistic quality as "&lt;a href="http://en.wikipedia.org/wiki/Dewey_Defeats_Truman"&gt;Dewey Defeats Truman.&lt;/a&gt;"&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;

What do I know about vocal fry?&lt;/h2&gt;
Frankly, I'm not much of an expert on voice quality or register. I'm especially not too familiar with sociolinguistic work on voice quality, and that kind of knowledge seems to be necessary to evaluate the claims of this story.&lt;br /&gt;
&lt;br /&gt;
However, I have had quite a bit of experience dealing with vocal fry. Vowels and their acoustics are my thing, if you didn't know, and a vowel pronounced with vocal fry can be difficult to measure. I've looked at a lot of vowels, which means I've seen a lot of vocal fry, and have my own impressions about where it occurs. Basically, it happens most often when a speaker's pitch drops, like at a phrase boundary, or sometimes when a voiceless consonant follows the vowel.&lt;br /&gt;
&lt;br /&gt;
I'd agree that there is something more than simple mechanics of articulation going on with the use of vocal fry. There is definitely a stylistic component. I'd also agree, impressionistically, that women tend to do a bit more vocal fry than men, or at least it's more noticeable when they do.&lt;br /&gt;
&lt;br /&gt;
But vocal fry is by no means an exclusively female quality. Arguing from anecdotes is poor form, but here is an example of a relatively high profile male doing a lot of vocal fry.&lt;br /&gt;
&lt;iframe allowfullscreen="" frameborder="0" height="315" src="http://www.youtube.com/embed/loxJ3FtCJJA?rel=0" width="560"&gt;&lt;/iframe&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;

I read the paper.&lt;/h2&gt;
When watching science reporting like this, there's always the possibility that the researchers' work is being misconstrued, either by the media outlet, or by their institution's press office. So, I made good use of my institutional access to academic journals, and read the original paper (even &lt;a href="https://twitter.com/#!/search/realtime/jofrhwld%20%23vocalfry"&gt;livetweeted&lt;/a&gt; the process) by  Wolk, Abdelli-Beruh &amp;amp; Slavin (2011), which was published in the Journal of Voice. Here are the claims that rubbed me so wrong about the Today Show clip.&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;Use of vocal fry is a new phenomenon.&lt;/li&gt;
&lt;li&gt;Vocal fry is exclusively a female phenomenon.&lt;/li&gt;
&lt;li&gt;Vocal fry is created and spread by figures in popular media (e.g. Ke$ha, Kim Kardashian).&lt;/li&gt;
&lt;/ul&gt;
I read the original paper with the aim of determining whether
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;there is evidence in the paper supporting these claims,&lt;/li&gt;
&lt;li&gt;the researchers themselves made these claims.&lt;/li&gt;
&lt;/ul&gt;
Wolk et al. recorded 34 women between the ages of 18 and 25, both producing a sustained vowel sound, and reading a short passage. Then, three carefully selected sentences from the reading passage were evaluated by trained speech pathologists for whether the speaker was using vocal fry. About 2/3 of the speakers were judged to use vocal fry. They also did some acoustic analysis of the vocal fry.&lt;br /&gt;
&lt;br /&gt;
That is all the evidence that Wolk et al. collected, analyzed, and presented. Needless to say, it provides no support for any of the three points. On the first, they only analyzed one age group, so there is no way to tell if young people do it more or less than older people. Their discussion of background literature actually cites a number of papers from the mid 60s which argue that vocal fry is part of normal speech. So much for it being a new phenomenon. In the discussion, the authors don't outright claim that vocal fry is a new phenomenon, but they do frame the interesting research question as figuring out how much college students do it. They deserve a pass on this point, I think, but they should perhaps consider reframing their research questions as pertaining to a larger cultural pattern.&lt;br /&gt;
&lt;br /&gt;
On vocal fry as an exclusively female phenomenon, I think the structure of this study presupposes that outcome, rather than investigating it. Why study only female college students if you didn't already think that only women did vocal fry? Part of the answer to that seems to be that male subjects are hard to come by for speech pathologists. Wolk et al. cite a previous study of vocal fry that looked at first year speech pathology graduate students. The sample turned out to be 94% female. Abdelli-Beruh, the second author, told the Today Show reporter that 99% of her students are female. Regardless, without a male sample, it's really impossible to draw any hard conclusions about the gender difference. At any rate, Wolk et al. don't outright say that "men don't do it," so I'll give them a pass there.&lt;br /&gt;
&lt;br /&gt;
Now, for the worst part: the all important influence of popular media figures. There is less than zero evidence presented by Wolk et al. for causal influence of any variety. In fact, they cannot even claim that the patterns they found are primarily social rather than being primarily anatomical, or automatic. However, on page 4, they say
&lt;br /&gt;
&lt;blockquote&gt;
It is possible that these college students have either practiced or observed this vocal register and modeled it to match popular figures.&lt;/blockquote&gt;
They said it. On the basis of zero evidence, they went ahead and said it. This is not a case of the big bad media twisting an earnest researcher's words. These researchers went ahead and speculated in an unsubstantiated and, I think, irresponsible manner.&amp;nbsp;Claims require evidence, and on this point, they have none.
&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;

Vocal Hygiene
&lt;/h2&gt;
This paper also introduced me to a new range of concepts: "vocal abuse," "vocal misuse", "vocal hygiene." I have to admit, this was all news to me. They sound vaguely familiar as something a professional singer or actor worries about. &lt;br /&gt;
&lt;br /&gt;
But in this paper, there was some speculation that the common use of vocal fry might be detrimental to these speakers' vocal health. This aspect was picked up on in &lt;a href="http://www.npr.org/2011/12/17/143865090/limericks"&gt;the Limericks section of NPR's Wait Wait Don't Tell Me&lt;/a&gt;
&lt;br /&gt;
&lt;blockquote&gt;
That low crack when I sing is my choice, &lt;br /&gt;
but my E.N.T. (Ear, Nose &amp;amp; Throat Specialist) doesn't rejoice.&lt;br /&gt;
I end phrases real low, &lt;br /&gt;
where my cords shouldn't go.&lt;br /&gt;
I'm so cool that I'm hurting my voice.
&lt;/blockquote&gt;
I'm not a speech pathologist, but I'd be surprised that even speakers who use vocal fry at a high rate could do so to an extent that injures them. Wolk et al. actually don't report how &lt;i&gt;often&lt;/i&gt; their speakers used vocal fry, just now many used vocal fry at all (one time out of three sentences). But let's go extreme and say some speakers do it once per sentence with a falling final pitch. This would exclude questions, for instance, or sentences produced with a final rise for some other reason, like uptalk (women just can't win, can they?). That's still not a lot.&lt;br /&gt;
&lt;br /&gt;
I mean, there are languages out there with contrastive creaky voice. That means that in order to say the word you intend to, you &lt;i&gt;have&lt;/i&gt; to use vocal fry.
&lt;br /&gt;
&lt;hr /&gt;
Stay tuned for next time, where I will talk more about the media's coverage, and why &lt;a href="http://val-systems.blogspot.com/2011/12/i-dont-think-its-linguists-fault.html"&gt;I don't think train wrecks like this one are linguists' fault&lt;/a&gt;, which I think is a controversial position among linguists.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-1359992710569971014?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/12/on-vocal-fry.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://img.youtube.com/vi/loxJ3FtCJJA/default.jpg" height="72" width="72" /><thr:total>13</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-6994663922045645454</guid><pubDate>Wed, 07 Dec 2011 18:50:00 +0000</pubDate><atom:updated>2011-12-07T18:17:34.826-05:00</atom:updated><title>Ignorant Slobs!</title><description>Following up on &lt;a href="http://val-systems.blogspot.com/2011/12/adventures-in-plurality.html"&gt;my plurality post&lt;/a&gt;, Jon Stevens showed me &lt;a href="http://www.youtube.com/watch?v=wFyY2mK8pxk"&gt;this video&lt;/a&gt; done by &lt;a href="http://twitter.com/korystamper"&gt;Kory Stamper&lt;/a&gt;, an associate editor at Merriam-Webster. Based on the comments, it looks like it's gone a little bit viral.
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;center&gt;
&lt;iframe allowfullscreen="" frameborder="0" height="315" src="http://www.youtube.com/embed/wFyY2mK8pxk?rel=0" width="420"&gt;&lt;/iframe&gt;
&lt;/center&gt;&lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
What is so striking to me is the fictional dialogue she presents at the beginning.

&lt;br /&gt;
&lt;blockquote&gt;
So let's say you're swimming in the ocean, and you see some eight legged cephalopods. You say to your friend, "Hey! I saw a group of octopuses." And your friend says, "Hey! &lt;b&gt;You're an ignorant slob!&lt;/b&gt;  You saw a group of octopi."
&lt;/blockquote&gt;
I'm sure that Kory Stamper herself doesn't believe that &lt;a href="http://val-systems.blogspot.com/2010/08/language-use-and-morality.html"&gt;a person's moral fiber is assayable from &lt;i&gt;how&lt;/i&gt; they speak&lt;/a&gt;. Instead, I think she is simply, and accurately, representing the attitude of a great many people who some us have to deal with quite regularly.&lt;br /&gt;
&lt;br /&gt;
And, I think that the trigger of the "ignorant slob" judgment here is very telling. We're not talking about a non-standard dialect which may, for instance, employ negative concord (a.k.a. double negatives), or feature different verb agreement patterns. &lt;i&gt;Those&lt;/i&gt; people are too far gone to even begin engaging with. We're not even talking about misguided prescriptive proclamations, like "don't end a sentence with a preposition," or "don't use the passive voice." That's high school English class material, unworthy of debate.&lt;br /&gt;
&lt;br /&gt;
No, we are talking about the plural form of &lt;i&gt;octopus&lt;/i&gt;. &lt;br /&gt;
&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-YTFtWviouh4/Tt_I0GzOVKI/AAAAAAAAA-0/nvYuZ6fCJio/s1600/Octopus2.jpg" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="243" src="http://4.bp.blogspot.com/-YTFtWviouh4/Tt_I0GzOVKI/AAAAAAAAA-0/nvYuZ6fCJio/s320/Octopus2.jpg" width="320" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;You are unworthy.&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
Only performance on a task as esoteric and irrelevant to every day life as forming the plural of &lt;i&gt;octopus&lt;/i&gt; is adequate to separate the elect from the damned. Woe unto you who accepts the heresy of &lt;i&gt;octopi&lt;/i&gt;. You must accept the Truth of &lt;i&gt;octopodes&lt;/i&gt; into your heart if you don't want to sound like a fucking idiot.&lt;br /&gt;
&lt;br /&gt;
&lt;hr /&gt;
&lt;br /&gt;
On a related note, no matter what their origins were, I suspect prescriptive proclamations like "don't end a sentence in a preposition" and "don't use the passive voice" only continue to be considered virtuous because they are nearly impossible to adhere to. (Hey! A twofer!)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-6994663922045645454?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/12/ignorant-slobs.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://img.youtube.com/vi/wFyY2mK8pxk/default.jpg" height="72" width="72" /><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-3042245426054476640</guid><pubDate>Tue, 06 Dec 2011 02:57:00 +0000</pubDate><atom:updated>2011-12-08T10:49:16.085-05:00</atom:updated><title>Adventures in Plurality</title><description>Update: December 8, 2011&lt;br /&gt;
I'm going to use this post as a running list of examples of over-latinate plurals.&lt;br /&gt;
&lt;br /&gt;
&lt;hr /&gt;
&lt;br /&gt;
Almost everyone is familiar with the uncertainty surrounding the plural words like &lt;i&gt;&lt;a href="http://en.wikipedia.org/wiki/Platypus#Taxonomy_and_etymology"&gt;platypus&lt;/a&gt;&lt;/i&gt;, &lt;i&gt;&lt;a href="http://en.wikipedia.org/wiki/Octopus#Etymology_and_pluralization"&gt;octopus&lt;/a&gt;, &lt;/i&gt;and &lt;i&gt;&lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2684"&gt;syllabus&lt;/a&gt;&lt;/i&gt;. They look kind of Latin, and a lot of high profile words with this kind of shape form their ending by changing the last syllable to&lt;i&gt;&amp;nbsp;&lt;/i&gt;"&lt;i&gt;i&lt;/i&gt;" (&lt;i&gt;alumni, foci, fungi&lt;/i&gt;). But in these uncertain cases, prescriptivists tell us we are hypercorrecting, and engaging in pseudo-Latin.&lt;br /&gt;
&lt;br /&gt;
But, I'm not so sure if this is simply a case where people are well educated enough to know the &lt;i&gt;-us&amp;nbsp;&lt;/i&gt;→&amp;nbsp;&lt;i&gt;-i&lt;/i&gt;, rule, but not enough to know a Greek word when they see it. For instance, I've seen it overapplied to words which aren't even spelled &lt;i&gt;-us&lt;/i&gt;. At 1:10 in this video, John Stewart says&lt;br /&gt;
&lt;blockquote&gt;
"We cannot allow ourselves, to get complacent, for the face of&amp;nbsp;tyranny&amp;nbsp;has many... orifi."&lt;/blockquote&gt;
&lt;div style="background-color: black; width: 520px;"&gt;
&lt;div style="padding: 4px;"&gt;
&lt;embed allowfullscreen="true" allowscriptaccess="always" base="." flashvars="" height="288" src="http://media.mtvnservices.com/mgid:cms:video:thedailyshow.com:250297" type="application/x-shockwave-flash" width="512"&gt;&lt;/embed&gt;&lt;br /&gt;
&lt;div style="background-color: white; font-family: Arial, Helvetica, sans-serif; font-size: 12px; margin-bottom: 0px; margin-top: 4px; padding: 4px; text-align: left;"&gt;
&lt;b&gt;&lt;a href="http://www.thedailyshow.com/watch/mon-september-28-2009/america--target-america"&gt;The Daily Show with Jon Stewart&lt;/a&gt;&lt;/b&gt;&lt;br /&gt;
Get More: &lt;a href="http://www.thedailyshow.com/full-episodes/"&gt;Daily Show Full Episodes&lt;/a&gt;,&lt;a href="http://www.indecisionforever.com/"&gt;Political Humor &amp;amp; Satire Blog&lt;/a&gt;,&lt;a href="http://www.facebook.com/thedailyshow"&gt;The Daily Show on Facebook&lt;/a&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;br /&gt;
Ok, clearly this was done for comedic effect, but I think it's only funny because we recognize &lt;i&gt;"orifi"&lt;/i&gt;&amp;nbsp;as well formed, but prescriptively incorrect.&lt;br /&gt;
&lt;br /&gt;
Even stranger, I recently had an experience where I wasn't quite sure how to form the plural of &lt;i&gt;danish&lt;/i&gt;&amp;nbsp;(as in pastry). I was telling a dinner party that I wasn't very hungry because I'd eaten a few at a coffee shop earlier. I said "I had a few..." and paused, because the first thing that came to my mind was "&lt;i&gt;dani". &amp;nbsp;&lt;/i&gt;Even stranger, my sister, who had seen me eat the offending pastries, offered "Dani?" And we are not alone! check out this &lt;a href="http://answers.yahoo.com/question/index?qid=20080628202250AAGFVIR"&gt;Yahoo! Question&lt;/a&gt;.
&lt;br /&gt;
&lt;blockquote&gt;
Whats the plural for danish?
Like if you have two danish(es?) is it dani?
Or just danishes?&lt;/blockquote&gt;
So for some people, the semi-productive latinate plural rule doesn't care if it's dealing with &lt;i&gt;s&lt;/i&gt; or &lt;i&gt;sh&lt;/i&gt;.&lt;br /&gt;
&lt;br /&gt;
In some ways, it makes total sense. I'd argue that the the sequence [ɨsɨs] isn't the greatest one in the world. Once you've got a rule which would let you avoid it, why not use that all the time?&lt;br /&gt;
&lt;br /&gt;
&lt;hr /&gt;
&lt;br /&gt;
In a note related to irregular plurals, I was once asked in a question period about what kind of "&lt;i&gt;metrices"&lt;/i&gt;&amp;nbsp;I use. This is way more interesting than it initially seems. "Oh, that's just analogy from &lt;i&gt;matrix&lt;/i&gt;," you say, but it isn't quite. The singular form is just &lt;i&gt;metric. &lt;/i&gt;The word doesn't have the appropriate shape to undergo the irregular pluralization until &lt;u&gt;after you've already added the regular plural suffix&lt;/u&gt;! So you wind up with &lt;i&gt;metric&lt;/i&gt; → &lt;i&gt;metrics&lt;/i&gt; → &lt;i&gt;metrices.&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;hr /&gt;
UPDATE: December 7, 2011
&lt;br /&gt;
&lt;a href="http://www.ling.upenn.edu/~hilaryp/"&gt;Hilary Prichard&lt;/a&gt; has pointed me to this (rather depressing) example from Donald Trump, &lt;a href="http://www.washingtonpost.com/opinions/republicans-color-the-abortion-debate/2011/12/06/gIQAbNvpaO_story.html"&gt;discussing his plans on creating a version of the Apprentice for children&lt;/a&gt;
&lt;br /&gt;
&lt;blockquote&gt;
“We’re going to be picking 10 young wonderful children, and we’re going to make them apprenti,” Trump said. “We’re going to have a little fun with it.”&lt;/blockquote&gt;
&lt;br /&gt;
&lt;hr /&gt;
UPDATE: December 8, 2011&lt;br /&gt;
Jon Stevens pointed me to &lt;a href="http://ac360.blogs.cnn.com/2011/12/07/video-stephen-colbert-on-the-ridiculist/"&gt;this segment of Anderson Cooper's show called the RidicuList&lt;/a&gt; (originally broadcast December 7, 2011). At 2:35, Cooper says
&lt;blockquote&gt;
I did this story three different times six months ago on the RidicuList, and some of the video from the Colbert Report that-- Some of the video they used, came from the Third Eagle's video responses to my &lt;b&gt;RidicuLists&lt;/b&gt;. I like to call them &lt;b&gt;Ridiculi&lt;/b&gt;, but you get the point.
&lt;/blockquote&gt;

&lt;object width="416" height="374" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" id="ep"&gt;&lt;param name="allowfullscreen" value="true" /&gt;&lt;param name="allowscriptaccess" value="always" /&gt;&lt;param name="wmode" value="transparent" /&gt;&lt;param name="movie" value="http://i.cdn.turner.com/cnn/.element/apps/cvp/3.0/swf/cnn_416x234_embed.swf?context=embed&amp;videoId=bestoftv/2011/12/07/exp-ac-stephen-colbert-ridiculist.cnn" /&gt;&lt;param name="bgcolor" value="#000000" /&gt;&lt;embed src="http://i.cdn.turner.com/cnn/.element/apps/cvp/3.0/swf/cnn_416x234_embed.swf?context=embed&amp;videoId=bestoftv/2011/12/07/exp-ac-stephen-colbert-ridiculist.cnn" type="application/x-shockwave-flash" bgcolor="#000000" allowfullscreen="true" allowscriptaccess="always" width="416" wmode="transparent" height="374"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-3042245426054476640?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/12/adventures-in-plurality.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>4</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-6732246615986915413</guid><pubDate>Wed, 23 Nov 2011 21:44:00 +0000</pubDate><atom:updated>2011-11-23T17:24:26.839-05:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">linguistics</category><category domain="http://www.blogger.com/atom/ns#">phonology</category><category domain="http://www.blogger.com/atom/ns#">phonetics</category><title>Siri's strange phonotactics</title><description>&lt;a href="http://cogsci.jhu.edu/people/wilson.html"&gt;Colin Wilson&lt;/a&gt; recently gave a talk here at Penn about why speakers don't&amp;nbsp;necessarily&amp;nbsp;say words in a foreign language the way foreign language speakers do. For example, the capital of Georgia (the country) is Tbilisi, which an initial [tb] onset cluster. Here, listen to pronunciation on Wikipedia: &lt;a href="http://upload.wikimedia.org/wikipedia/commons/6/64/Tbilisi.ogg"&gt;Tbilisi&lt;/a&gt;, then say it back out loud. That's basically the experiment Colin was talking about.&lt;br /&gt;
&lt;br /&gt;
So, I'm guessing that if you didn't manage to say Tbilisi exactly like the recording did, you probably said something like [tɨbilisi], adding in an extra vowel between the [t] and [b]. There are a few different explanations for why you might have added in that extra sound. &lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;You hallucinated, and thought you heard&amp;nbsp;[tɨbilisi].&lt;/li&gt;
&lt;li&gt;You accurately heard&amp;nbsp;[tbilisi], but then when you tried to say it, it came out&amp;nbsp;[tɨbilisi].&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;
Colin is pursuing another kind of analysis, where the way a Georgian speaker says /tbilisi/ sounds more like the way you would say /tɨbilisi/ in English, than the way you would say /tbilisi/ in English (if you were ever to say such a thing).&lt;br /&gt;
&lt;br /&gt;
It's pretty cool stuff, and strangely reminded me of a similar repetition experiment I inadvertently performed with my iPhone. Here's a video re-enactment:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://2.gvt0.com/vi/qsgbAFsbAio/0.jpg" height="266" width="320"&gt;&lt;param name="movie" value="http://www.youtube.com/v/qsgbAFsbAio&amp;fs=1&amp;source=uds" /&gt;


&lt;param name="bgcolor" value="#FFFFFF" /&gt;


&lt;embed width="320" height="266"  src="http://www.youtube.com/v/qsgbAFsbAio&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/div&gt;
&lt;br /&gt;&lt;/div&gt;
How weird is that! Siri heard me say [ʃəvan], but for some reason repeated it back [sajobən]!&lt;br /&gt;
&lt;br /&gt;
Ok, I guess I &lt;i&gt;really&lt;/i&gt;&amp;nbsp;know what's going on here, and it's not phonotactics, but it's fun to pretend. Clearly, the transcription with the highest probability given my speech was the Irish spelling "Siobhan": P(transcription | audio). &amp;nbsp;But, given the text, the text to speech (P(audio | transcription)) produces&amp;nbsp;[sajobən].&lt;br /&gt;
&lt;br /&gt;
It still strikes me weird that Siri has some kind of dictionary lookup to give me "Siobhan" for&amp;nbsp;[ʃəvan], but then does a procedural text-to-speech.&lt;br /&gt;
&lt;br /&gt;
P.S. I think that I have an intrusive /l/ after "how" the second time I say "How do you spell Siobhan?".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-6732246615986915413?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/11/siris-strange-phonotactics.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-8664450658777621438</guid><pubDate>Fri, 21 Oct 2011 14:26:00 +0000</pubDate><atom:updated>2011-10-21T10:29:55.061-04:00</atom:updated><title>Academia and Innovation?</title><description>Quick post today (the academic year is here, hence my recent silence).&lt;br /&gt;
&lt;br /&gt;
Robert A. Muenchen is maintaining a report &lt;a href="https://sites.google.com/site/r4statistics/popularity"&gt;here&lt;/a&gt; on the popularity of R, a programming environment for statistics.&lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
He's got a bunch of measures, but these really caught my eye. A site called Rexter Analytics did a survey in 2010 asking respondents which pieces of software they used in 2009. These were the results:&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-tlIzFUq4WzU/TqF-rIqNjEI/AAAAAAAAA-Q/fbjp34Fs12M/s1600/RexerSurvey.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/-tlIzFUq4WzU/TqF-rIqNjEI/AAAAAAAAA-Q/fbjp34Fs12M/s400/RexerSurvey.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
So, R is at the top of the list. KDnuggets did a similar poll,  and returned very similar results.&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-MAGdgVFYdxI/TqF-qvs-2aI/AAAAAAAAA-A/AwilOBo4FmQ/s1600/Fig_6_KDnuggetsPollLanguages.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="229" src="http://3.bp.blogspot.com/-MAGdgVFYdxI/TqF-qvs-2aI/AAAAAAAAA-A/AwilOBo4FmQ/s320/Fig_6_KDnuggetsPollLanguages.PNG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
The take away message so far is that a lot of people who do data analysis use R. The plurality even. That is the zeitgeist. 

&lt;br /&gt;
&lt;br /&gt;
Now we come the the results that worry me.&amp;nbsp;Muenchen also did an analysis of Google Scholar citations of software packages, and produced this graph.&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/--jQkFLOAKuc/TqF-qgKhd_I/AAAAAAAAA-I/K8_TQpds-bs/s1600/Fig_7_ScholarlyImpact.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://1.bp.blogspot.com/--jQkFLOAKuc/TqF-qgKhd_I/AAAAAAAAA-I/K8_TQpds-bs/s400/Fig_7_ScholarlyImpact.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
Clearly R has a pretty sharply rising slope, but it still comes in fourth after a bunch of software that, frankly, only academics can use because they get institutional licenses.&lt;br /&gt;
&lt;br /&gt;
I'm not worried because I think academics should be using R (even though I do). It has more to do with the fact that people in academia like to think of themselves as the forward thinkers, and the innovators of new ideas. But in this regard they are clearly &lt;i&gt;following behind&lt;/i&gt;&amp;nbsp;the trend that everyone else is setting. Maybe it's fitting that the SPSS curve looks not unlike what I'd imagine an ivory tower to be.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-8664450658777621438?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/10/academia-and-innovation.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-tlIzFUq4WzU/TqF-rIqNjEI/AAAAAAAAA-Q/fbjp34Fs12M/s72-c/RexerSurvey.png" height="72" width="72" /><thr:total>4</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-4816656999269236968</guid><pubDate>Thu, 01 Sep 2011 15:15:00 +0000</pubDate><atom:updated>2011-09-01T11:15:41.180-04:00</atom:updated><title>Battlestar Galactica: InvoVis</title><description>I recently re-watched &lt;a href="http://en.wikipedia.org/wiki/Battlestar_Galactica"&gt;Battlestar Galactica&lt;/a&gt; (the re-imagined series). I had never watched the end after the season 4 mid-season break. Over all, I liked the series a lot, but wasn't a big fan of the decidedly anti-modernity finale. Do you know what is great? Medicine, and good odds of not dying in your 40s. You know what's even better? Space ships and faster-than-light travel.&lt;br /&gt;
&lt;br /&gt;
Anyway, I don't want to give away spoilers (even thought that wouldn't ruin it for you, &lt;a href="http://ucsdnews.ucsd.edu/newsrel/soc/2011_08spoilers.asp"&gt;so says science&lt;/a&gt;). My point of posting is this cool medical display from season 4 (and maybe earlier, I just noticed it in season 4).&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-X8D90R34l4k/Tl-YbEvMtMI/AAAAAAAAA80/-pIv3pfzkwo/s1600/Screen+Shot+2011-08-24+at+3.20.08+PM.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="246" src="http://2.bp.blogspot.com/-X8D90R34l4k/Tl-YbEvMtMI/AAAAAAAAA80/-pIv3pfzkwo/s400/Screen+Shot+2011-08-24+at+3.20.08+PM.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
I like this display a lot, because it fits in with the general BSG style of keeping things close to current reality, ish. Sure, they have humanoid robots, but they also still use nukes, not photon torpedoes.&lt;br /&gt;
&lt;br /&gt;
I could almost imagine seeing this display today, maybe in a tech company's speculative design video. It appears to incorporate some contemporary data display ideas, like &lt;a href="http://en.wikipedia.org/wiki/Sparkline"&gt;sparklines&lt;/a&gt;. My feeling is that in a lot of sci-fi, data displays like this are a lot more cryptic, and hardly seem practical from the view of an analyst. This display, while definitely looking futuristic, also looks like it's all about practicality.&lt;br /&gt;
&lt;br /&gt;
The element that gets the most screen space is the EKG, which animates and bleeps just like in any medical drama.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-O1QAy1rM3f0/Tl-cgQ4i0MI/AAAAAAAAA84/nGRF3DPTK8U/s1600/ekg.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="85" src="http://4.bp.blogspot.com/-O1QAy1rM3f0/Tl-cgQ4i0MI/AAAAAAAAA84/nGRF3DPTK8U/s400/ekg.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Then, there are these little widgets.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-90FQR8Q9GVQ/Tl-cx8496KI/AAAAAAAAA88/nawKu-YJCps/s1600/heart_rate.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/-90FQR8Q9GVQ/Tl-cx8496KI/AAAAAAAAA88/nawKu-YJCps/s1600/heart_rate.png" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
I believe the larger number is the current heart rate. It updates fairly regularly, going up or down a few beats-per-minute (I missed a whole bunch of dialogue staring at this in the background). The little blue light above the heart rate blinks with every heart beat, or at least every time the display beeps. I don't know what the smaller number represents. I didn't see it update, so it might not represent dynamic data.&lt;br /&gt;
&lt;br /&gt;
Then, there's these three panels, probably small-multiples of some kind.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-TlfHUxibHI4/Tl-dzyM_9zI/AAAAAAAAA9A/CIqBu405yqI/s1600/frequency.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="45" src="http://2.bp.blogspot.com/-TlfHUxibHI4/Tl-dzyM_9zI/AAAAAAAAA9A/CIqBu405yqI/s400/frequency.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
They're largely static, except they redraw themselves every few seconds. So maybe they could be density distributions over a time interval, or maybe frequency analyses.&lt;br /&gt;
&lt;br /&gt;
Then there are these bars.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-jey5TibAV7M/Tl-eck_cP5I/AAAAAAAAA9E/CMO0WiDCbxk/s1600/bars.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="61" src="http://3.bp.blogspot.com/-jey5TibAV7M/Tl-eck_cP5I/AAAAAAAAA9E/CMO0WiDCbxk/s400/bars.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
This is maybe the most vexing element on the display for me. At first I thought that they might display blood sugar or oxygen relative to some baseline, but you'll notice that at some points, there are bars that go both above and below the baseline. So, they have to be two kinds of measures that are usually in a complementary distribution, but not always. Either way, it seems to clearly be a time series at a relatively large granularity, since it never redraws itself during a scene.&lt;br /&gt;
&lt;br /&gt;
Lastly, there's this strip at the bottom.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-WWc-at2YXBc/Tl-fXtKO7SI/AAAAAAAAA9I/OwwBOLu8haU/s1600/spectral.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="16" src="http://1.bp.blogspot.com/-WWc-at2YXBc/Tl-fXtKO7SI/AAAAAAAAA9I/OwwBOLu8haU/s400/spectral.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
It's relatively understated compared to everything else in the display, meaning it can't be any sort of really vital statistic. It looks like maybe a spectral analysis of some kind, or maybe another time series (sleeping and waking time?). This also remains static during scenes.&lt;br /&gt;
&lt;br /&gt;
There are also a lot of elements of the user interface which are very contemporary. Take these boxes for instance.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-aQgevwOp8SQ/Tl-gFs8CRyI/AAAAAAAAA9M/-FnUQIqHGms/s1600/ui.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="18" src="http://3.bp.blogspot.com/-aQgevwOp8SQ/Tl-gFs8CRyI/AAAAAAAAA9M/-FnUQIqHGms/s400/ui.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
I think we all know that if you were to press on the screen on one of those triangles, these little boxes would expand to show more information, or contract and hide the information they're currently displaying. This is definitely something that wouldn't have been incorporated into speculative UI designs 20 years ago.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-4816656999269236968?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/09/battlestar-galactica-invovis.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-X8D90R34l4k/Tl-YbEvMtMI/AAAAAAAAA80/-pIv3pfzkwo/s72-c/Screen+Shot+2011-08-24+at+3.20.08+PM.png" height="72" width="72" /><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-367451739889802429</guid><pubDate>Sun, 28 Aug 2011 22:05:00 +0000</pubDate><atom:updated>2011-08-28T18:08:14.683-04:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">irene</category><title>Irene!</title><description>Well, here in Philadelphia, we've just braved Hurricane Irene. From what I've heard, damage here was relatively minimal, and we haven't lost power. My friends further north in NYC are in my thoughts, cause it looks like they got really hammered. &lt;br /&gt;
&lt;br /&gt;
The silver lining here for me is that I was able to go collect data from &lt;a href="http://www.wunderground.com/weatherstation/WXDailyHistory.asp?ID=KPAPHILA21"&gt;the Weather Underground station&lt;/a&gt; about six blocks away from where I live. Here are the numbers.&lt;br /&gt;
&lt;br /&gt;
We got 5.68 inches of rain, which fell most steadily between 6PM and midnight last night.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-m-gEG7_dmGI/Tlq590b4zDI/AAAAAAAAA8k/OlE9YQmfb14/s1600/rain.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="250" src="http://1.bp.blogspot.com/-m-gEG7_dmGI/Tlq590b4zDI/AAAAAAAAA8k/OlE9YQmfb14/s400/rain.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Barometric pressure, on the other hand, hit the floor at 6AM today.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-lDN3DJ7QWFk/Tlq5_uzHIsI/AAAAAAAAA8o/w6cr7geQ6QI/s1600/pressure.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="250" src="http://3.bp.blogspot.com/-lDN3DJ7QWFk/Tlq5_uzHIsI/AAAAAAAAA8o/w6cr7geQ6QI/s400/pressure.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
As for wind speeds, there are two measures from the weather station. Speed is, I believe, average wind speed over the reporting time bin (which varies between 1 and 7 minutes...), and Gust is, I believe, the maximum speed during that time bin. Either way, our max wind speeds were around 11PM last night, and they've stayed pretty high into this afternoon.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-NMQRDJUcR44/Tlq6AItegqI/AAAAAAAAA8s/znJHAYwjG7M/s1600/wind.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="250" src="http://2.bp.blogspot.com/-NMQRDJUcR44/Tlq6AItegqI/AAAAAAAAA8s/znJHAYwjG7M/s400/wind.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-367451739889802429?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/08/cumulative-rain-function.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-m-gEG7_dmGI/Tlq590b4zDI/AAAAAAAAA8k/OlE9YQmfb14/s72-c/rain.png" height="72" width="72" /><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-2566426274273001635</guid><pubDate>Tue, 23 Aug 2011 19:48:00 +0000</pubDate><atom:updated>2011-08-23T17:02:12.007-04:00</atom:updated><title>Earthquake: Do your part for data collection!</title><description>An earthquake just happened on the East Coast, my first! It turns out the US Geological survey has an online survey for earthquakes called "Did you feel it?" and the data is freely available! So&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://earthquake.usgs.gov/earthquakes/dyfi/events/se/082311a/us/form.en.disabled.html"&gt;Go take the survey!&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
As of now, it looks like survey response has really petered out.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://earthquake.usgs.gov/earthquakes/dyfi/events/se/082311a/us/form.en.disabled.html" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="329" src="http://earthquake.usgs.gov/earthquakes/dyfi/events/us/c0005ild/us/usc0005ild_plot_numresp.jpg" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
But you can download the data and some graphs here, &lt;a href="http://earthquake.usgs.gov/earthquakes/dyfi/events/se/082311a/us/index.html"&gt;in the downloads tab&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
I whipped up this quick visualization of the responses.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://earthquake.usgs.gov/earthquakes/dyfi/events/se/082311a/us/form.en.disabled.html" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://2.bp.blogspot.com/-REcaY7SOPJU/TlQDPVGqLjI/AAAAAAAAA8Y/IP8DJMtgN3o/s640/quake.png" width="533" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Look at that big depressing gap in the response data, right where the epicenter was! And all across Pennsylvania.&lt;br /&gt;
&lt;br /&gt;
If you're from those areas, you really ought to &lt;a href="http://earthquake.usgs.gov/earthquakes/dyfi/events/se/082311a/us/form.en.disabled.html"&gt;go take the survey!&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Update&lt;/h2&gt;Well, I feel a little stupid. It looks like there are two locations on the USGS site for this earthquake, and the one I was looking at is not up-to-date... Maybe I don't feel so stupid, it's not the best kind of design.&lt;br /&gt;
&lt;br /&gt;
The real data to download is &lt;a href="http://earthquake.usgs.gov/earthquakes/dyfi/events/se/082311a/us/index.html"&gt;here&lt;/a&gt;. I've already updated the links above.&lt;br /&gt;
&lt;br /&gt;
And here's the real visualizations. Here's the raw data:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-zL7ryVNaVc0/TlQU47oGufI/AAAAAAAAA8c/NgU3fujBuLU/s1600/quake2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://2.bp.blogspot.com/-zL7ryVNaVc0/TlQU47oGufI/AAAAAAAAA8c/NgU3fujBuLU/s640/quake2.png" width="564" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
And here's mean values across a grid.&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-u_6Hd23M7dE/TlQU5NPnSbI/AAAAAAAAA8g/9QshFFlFeCA/s1600/quake3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://1.bp.blogspot.com/-u_6Hd23M7dE/TlQU5NPnSbI/AAAAAAAAA8g/9QshFFlFeCA/s640/quake3.png" width="565" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-2566426274273001635?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/08/earthquake-do-your-part-for-data.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-REcaY7SOPJU/TlQDPVGqLjI/AAAAAAAAA8Y/IP8DJMtgN3o/s72-c/quake.png" height="72" width="72" /><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-1313818700794973058</guid><pubDate>Wed, 17 Aug 2011 18:58:00 +0000</pubDate><atom:updated>2011-08-31T22:31:22.120-04:00</atom:updated><title>Does blogging do me any good? A quantitative analysis.</title><description>I've been wondering if blogging does me any good. I don't mean for the heart and soul. I enjoy blogging and am going to keep it up (except for those end-of-semester hiatuses). But I've been wondering if blogging does me any good professionally, or whatever. Obviously, "a professional or whatever good" is hard to define, so I'll define it according to the data that I have.&lt;br /&gt;
&lt;br /&gt;
I maintain, along with this blog, &lt;a href="http://www.ling.upenn.edu/~joseff/"&gt;an academic website&amp;nbsp;&lt;/a&gt;where I have all of my more serious research stuff. I've got Google analytics set up on both my blog, and my academic site, keeping track of page views. So, if I can detect that page views of my blog drive some page views to my academic website, then I'll conclude that blogging is doing me some professional good. This makes a certain kind of sense, since what matters to me at this particular stage of my professional life is getting my ideas out there, and my ideas are catalogued on my academic site.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;
The raw data&lt;/h2&gt;
Here is one year's worth of traffic to Val Systems. Those two huge spikes are thanks to Mark Liberman, who reblogged &lt;a href="http://val-systems.blogspot.com/2010/08/britney-spears-tongue.html"&gt;my post about Brittany Spears' tongue&lt;/a&gt;, and to&amp;nbsp;&lt;strike&gt;the Car Talk Guys, who linked to &lt;a href="http://val-systems.blogspot.com/2010/09/pretentious-hole.html"&gt;my post about their short-a system&lt;/a&gt; on the Car Talk site for a bit&lt;/strike&gt; Sociological images, where &lt;a href="http://val-systems.blogspot.com/2011/01/grammar-phobia-or-judging-book-by-its.html"&gt;I guest posted about a "grammar" book&lt;/a&gt;.&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-NK_fmn6YQ7U/Tkvt5lU7o0I/AAAAAAAAA7c/h4AZxqCTVZU/s1600/blog.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="205" src="http://3.bp.blogspot.com/-NK_fmn6YQ7U/Tkvt5lU7o0I/AAAAAAAAA7c/h4AZxqCTVZU/s400/blog.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Now here is the traffic from my academic site, and my research page on that site from the same time period.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-_BP550GSMAY/Tkvt6PrscDI/AAAAAAAAA7k/2d6zmN3yDN0/s1600/site.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://3.bp.blogspot.com/-_BP550GSMAY/Tkvt6PrscDI/AAAAAAAAA7k/2d6zmN3yDN0/s400/site.png" width="400" /&gt;&lt;/a&gt;&lt;a href="http://2.bp.blogspot.com/-C4kdems0OV4/Tkvt51xYseI/AAAAAAAAA7g/PneDuLU1rKQ/s1600/research.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://2.bp.blogspot.com/-C4kdems0OV4/Tkvt51xYseI/AAAAAAAAA7g/PneDuLU1rKQ/s400/research.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
As you can see, my academic site gets a &lt;i&gt;lot&lt;/i&gt;&amp;nbsp;less page views than my blog. Prospects are not very bright.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;
Autocorrelation&lt;/h2&gt;
My first step of analysis was to figure out how correlated page views of each site were within each site. That is, how correlated are page views on my blog with page views from one day later on my blog, or two days later, etc. To calculate this, I used the &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;acf()&lt;/span&gt; function in R. Here's the autocorrelation function from my blog. The x-axis represents how many days into the future you're comparing page views, and the y-axis represents the correlation between page views separated by that many days.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-kIcEycQglfE/Tkv3SJDVivI/AAAAAAAAA7o/zDj-_Kj6XYA/s1600/blog.acf.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="308" src="http://3.bp.blogspot.com/-kIcEycQglfE/Tkv3SJDVivI/AAAAAAAAA7o/zDj-_Kj6XYA/s400/blog.acf.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
It looks like page views on my blog are pretty well correlated with the pages views from one day before (0.45). After that, there is a correlation drop off, which I'll interpret as new-post-decay. It seems like influence that a single new post has on my blog traffic is fairly minimal after five days.&lt;br /&gt;
&lt;br /&gt;
Here's the autocorrelation function for my academic site.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-qEZWX-7Se20/TkwC6i_3DeI/AAAAAAAAA7s/Mp-PpoWyuvI/s1600/site.acf.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="308" src="http://2.bp.blogspot.com/-qEZWX-7Se20/TkwC6i_3DeI/AAAAAAAAA7s/Mp-PpoWyuvI/s400/site.acf.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
As you can see, the over-all size of the correlations are much smaller than for the blog. This is most likely because each new post is a new event that happens on my blog, which can have an effect which lasts for a few days, whereas nothing &lt;i&gt;happens&lt;/i&gt; on my academic site in the same way. However, there is an apparently cyclic pattern, where page views are most positively correlated at 7 day intervals, and most negatively correlated at 3 to 4 day intervals.&lt;br /&gt;
&lt;br /&gt;
Duh! Who does work on the weekends?&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-pnPW62PXIDc/TkwF_u71D2I/AAAAAAAAA7w/09-NDE5pSW0/s1600/cycle.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="262" src="http://1.bp.blogspot.com/-pnPW62PXIDc/TkwF_u71D2I/AAAAAAAAA7w/09-NDE5pSW0/s400/cycle.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
To factor out this cyclic pattern, I fit a linear regression of page views for my academic site and research page with weekday as a categorical predictor. I'll use the residuals from these regressions for doing the cross-correlation.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;
Cross-correlation&lt;/h2&gt;
Next, I checked the cross-correlation of (residualized) page views. This checks to see how correlated page views are between any two of the sites at different time lags. First, here's the cross correlation of my main academic site and my research page. I knew these would have to be highly correlated, since my research page is the most clicked link on my main page. &lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-RKzJO0PRIQk/TkwLNtUe7WI/AAAAAAAAA74/wksX2SYY0CM/s1600/s.r.acf.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="308" src="http://4.bp.blogspot.com/-RKzJO0PRIQk/TkwLNtUe7WI/AAAAAAAAA74/wksX2SYY0CM/s400/s.r.acf.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;br /&gt;
Correlations with negative lag indicate that visits to my research page were correlated with visits to my main academic site a few days later. Positive lags mean visits to my academic page indicate that visits to my academic site were correlated with visits to my research page a few days later. The correlation at 0 indicates how correlated visits to my academic page and my research page were on the same day.&lt;br /&gt;
&lt;br /&gt;
Unsurprisingly, the only strong correlation between visits to my main academic site and my research page are on the same day. That spike around 10 days makes no sense, so it's probably just noise.&lt;br /&gt;
&lt;br /&gt;
So, drum-roll please, how correlated are visits to my blog and my main academic site?&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-Ic9ujFrgiQQ/TkwL3Z5163I/AAAAAAAAA78/DFY_6G2UwNs/s1600/b.s.acf.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="308" src="http://3.bp.blogspot.com/-Ic9ujFrgiQQ/TkwL3Z5163I/AAAAAAAAA78/DFY_6G2UwNs/s400/b.s.acf.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
I would analyze this as bupkis. Likewise for my research page.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-Cl_Q8yeMdm4/TkwMk6s75YI/AAAAAAAAA8A/2bEZayTPLBE/s1600/b.r.acf.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="308" src="http://1.bp.blogspot.com/-Cl_Q8yeMdm4/TkwMk6s75YI/AAAAAAAAA8A/2bEZayTPLBE/s400/b.r.acf.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;h2&gt;
To sum up&lt;/h2&gt;
It looks like blogging is just a fun diversion for me right now. Even though it would have been a lot of fun to come to my advisor or department chair with strong results that blogging is professionally fruitful, I'm fine with the way things turned out.&lt;br /&gt;
&lt;br /&gt;
However, I shouldn't have been surprised. If I &lt;i&gt;was&lt;/i&gt;&amp;nbsp;trying to use blogging as a platform for promoting my professional work, I wasn't doing it very well. If you're looking at my blog now (vs an RSS subscription), you may notice that I've added some links to the right, which lead to my academic site, and to my github site.&amp;nbsp;Why not try to make blogging work for me a little bit?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-1313818700794973058?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/08/does-blogging-do-me-any-good.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-NK_fmn6YQ7U/Tkvt5lU7o0I/AAAAAAAAA7c/h4AZxqCTVZU/s72-c/blog.png" height="72" width="72" /><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-7473814351441658969</guid><pubDate>Mon, 15 Aug 2011 03:05:00 +0000</pubDate><atom:updated>2011-08-14T23:05:44.087-04:00</atom:updated><title>Max Weber on why there is no decision process for research</title><description>In the process of moving, I've come across a bunch of books from my undergrad Sociology minor days, including a book of collected works by Max Weber. You may know him best for the notion of &lt;a href="http://en.wikipedia.org/wiki/Protestant_work_ethic"&gt;the Protestant work ethic&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
At any rate, the volume includes text from a lecture called &lt;i&gt;Science as a Vocation &lt;/i&gt;(&lt;a href="http://www.ne.jp/asahi/moriyuki/abukuma/weber/lecture/science_frame.html"&gt;available free online here&lt;/a&gt;), which I've decided to read through because of its personal relevancy, and I've come across this wonderful paragraph.&lt;br /&gt;
&lt;blockquote&gt;"Nowadays in circles of youth there is a widespread notion that science has become a problem in calculation, fabricated in laboratories or statistical filing systems just as 'in a factory,' a calculation involving only the cool intellect and not one's 'heart and soul.' First of all, one must say that such comments lack all clarity about what goes on in a factory or in a laboratory. In both, some idea has to occur to someone's mind, and it has to be a correct idea, if one is to accomplish anything worthwhile. And such intuition cannot be forced. It has nothing to do with any cold calculation. Certainly calculation is also an indispensable prerequisite. No sociologist, for instance, should think himself too good, even in his old age, to make tens of thousands of quite trivial computations in his head and perhaps for months at a time. One cannot with impunity try to transfer this task entirely to mechanical assistants if one wishes to figure something, even though the final result is often small indeed. But if no 'idea' occurs to his mind about the direction of his computations and, during his computations, about the bearing of the emergent single results, then even this small result will not be yielded."&lt;/blockquote&gt;&lt;br /&gt;
This seems to me to be a nice enough refutation, 90 years prescient, of &lt;a href="http://www.wired.com/science/discoveries/magazine/16-07/pb_theory"&gt;that strange Wired article&lt;/a&gt; from a few years ago which claimed that big-data is going to kill the scientific method.&lt;br /&gt;
&lt;br /&gt;
It also resonates with an issue near and dear to my heart: promoting statistical literacy within linguistics. And that takes a two pronged approach. The first is developing statistical competency to be able to run and analyze your own statistics, without relying on semi-automated techniques, like stepwise regression, or put slightly differently, transferring the task entirely to mechanical assistants. The second is to be sure to treat statistical methods as tools for investigation, not to reify them as the objects if inquiry themselves, nor their results as god's truth, spoken by its R-acle.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-7473814351441658969?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/08/max-weber-on-why-there-is-no-decision.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-631612988211521443</guid><pubDate>Tue, 09 Aug 2011 17:09:00 +0000</pubDate><atom:updated>2011-08-09T13:09:51.672-04:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">language</category><title>Miraculous Thought Transference</title><description>&lt;a href="http://www.linuxkungfu.org/images/fun/geek/project.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" src="http://rookery9.aviary.com.s3.amazonaws.com/9323500/9323921_fb3c.png" /&gt;&lt;/a&gt;I've &lt;a href="http://val-systems.blogspot.com/2011/08/language-communication-and-iphone.html"&gt;already blogged&lt;/a&gt; about what I didn't like about &lt;a href="http://www.ted.com/talks/mark_pagel_how_language_transformed_humanity.html"&gt;Mark Pagel's TED talk&lt;/a&gt;. I'm not going to beat up on it more, specifically. Rather, I'd like to problematize the meme that he kicked it off with.&lt;br /&gt;
&lt;blockquote&gt;"Each of you possesses the most powerful, dangerous and subversive trait that natural selection has ever devised. It's a piece of neural audio technology for rewiring other people's minds. I'm talking about your &lt;b&gt;language&lt;/b&gt;, of course, because it &lt;b&gt;allows you to implant a thought from your mind directly into someone else's mind&lt;/b&gt;, and they can attempt to do the same to you, without either of you having to perform surgery." [emphasis added]&lt;/blockquote&gt;Hopefully by now, you've caught on to my own subversive juxtaposition. Briefly, I think this meme is cuter than it is true.&lt;br /&gt;
&lt;br /&gt;
I call it a meme, because I seem to recall it showing up in Steven Pinker's &lt;i&gt;The Language Instinct&lt;/i&gt;, and I'm sure it's popped up other places too. Obviously, this meme brushes right up against other issues regarding language and thought. For instance, is language the structure of thought, and does language somehow constrain our thoughts? I'm not well versed enough in these issues to comment, and I only mention them here in order to say that I won't be saying anything about them, except for what I have already said.&lt;br /&gt;
&lt;br /&gt;
Did that make sense? If so, I have succeeded in externalized telepathy. If not, that's sort of my point. Unsuccessful thought implants are a pervasive fact. Just ask the customer and the project leader, or the teacher and the student. If it were so easy to implant thoughts in others' minds, would schooling really take so long?&amp;nbsp;Perhaps thought implant rejection can be blamed on external factors, like inattention on the hearer's part, or the complexity of the thought being transmitted, but I'd be surprised if that was all there was to it.&lt;br /&gt;
&lt;br /&gt;
I'd guess, and this is where I enter into purest speculation, that successful communication between a speaker and hearer has a lot more to do with the fact that people are willing to attribute minds and intentional stances to just about anything, including other people, than with the design specifications of language.&lt;br /&gt;
&lt;br /&gt;
In fact, the ability to implant (false) beliefs in someone else's mind is most definitely not only possible within the domain of language. Just ask Marcel Marceau.&lt;br /&gt;
&lt;br /&gt;
&lt;iframe allowfullscreen="" frameborder="0" height="349" src="http://www.youtube.com/embed/i99k7nCnVwM?rel=0" width="425"&gt;&lt;/iframe&gt;&lt;br /&gt;
&lt;br /&gt;
Or, puzzle over this interesting item.&lt;br /&gt;
&lt;br /&gt;
&lt;img border="0" src="http://i.imgur.com/ARV2K.jpg" /&gt;&lt;br /&gt;
&lt;br /&gt;
Perhaps language is better&amp;nbsp;&amp;nbsp;than other natural forms of communication&amp;nbsp;at transmitting propositional content, but it's certainly not ideal for it either. If it were, then there wouldn't have been any need to develop&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/Logic"&gt;formal logic&lt;/a&gt;, or&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/Propositional_calculus"&gt;propositional calculus&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
So there is the problem that I want to create for this meme. Language does not really "implant a thought from your mind directly into someone else's mind," and insofar as it does, it doesn't do so uniquely above all other forms of communication. It's a pretty meme though, sort of like a poem about linguistics, and it's attention grabbing. But if it matters whether it's true and accurate, I don't think it stands up.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-631612988211521443?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/08/miraculous-thought-transference.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://img.youtube.com/vi/i99k7nCnVwM/default.jpg" height="72" width="72" /><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-9118162415430306856</guid><pubDate>Wed, 03 Aug 2011 17:36:00 +0000</pubDate><atom:updated>2011-08-03T18:37:12.905-04:00</atom:updated><title>Language, Communication, and iPhone</title><description>I'm a bit of a caffeine junky. Every day, regardless of where I am, I need to get my fix. I've also been very lucky to do some international traveling, which has put me in the situation where I need a coffee, but I don't speak the local language. And you know what? I've &lt;i&gt;always&lt;/i&gt; successfully ordered and paid for my coffee, and even gotten what I intended to order.&lt;br /&gt;
&lt;br /&gt;
Ok, enough speaking in parables. My point is that communication is not the same thing as language, and even complex economic transactions can be successfully carried out with only communication and no language.&lt;br /&gt;
&lt;br /&gt;
And that's why I'm not a big fan of this TED Talk by Mark Pagel, called &lt;i&gt;&lt;a href="http://www.ted.com/talks/mark_pagel_how_language_transformed_humanity.html"&gt;How language transformed humanity&lt;/a&gt;&lt;/i&gt;.&lt;br /&gt;
&lt;!--copy and paste--&gt;&lt;object width="526" height="374"&gt; &lt;param name="movie" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true" /&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/param&gt;&lt;param name="bgColor" value="#ffffff"&gt;&lt;/param&gt;&lt;param name="flashvars" value="vu=http://video.ted.com/talk/stream/2011G/Blank/MarkPagel_2011G-320k.mp4&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/MarkPagel_2011G-embed.jpg&amp;vw=512&amp;vh=288&amp;ap=0&amp;ti=1203&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=mark_pagel_how_language_transformed_humanity;year=2011;theme=new_on_ted_com;theme=words_about_words;theme=a_taste_of_tedglobal_2011;theme=evolution_s_genius;event=TEDGlobal+2011;tag=Culture;tag=Science;tag=biology;tag=communication;tag=evolution;tag=language;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" /&gt;&lt;embed src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" pluginspace="http://www.macromedia.com/go/getflashplayer" type="application/x-shockwave-flash" wmode="transparent" bgColor="#ffffff" width="526" height="374" allowFullScreen="true" allowScriptAccess="always" flashvars="vu=http://video.ted.com/talk/stream/2011G/Blank/MarkPagel_2011G-320k.mp4&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/MarkPagel_2011G-embed.jpg&amp;vw=512&amp;vh=288&amp;ap=0&amp;ti=1203&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=mark_pagel_how_language_transformed_humanity;year=2011;theme=new_on_ted_com;theme=words_about_words;theme=a_taste_of_tedglobal_2011;theme=evolution_s_genius;event=TEDGlobal+2011;tag=Culture;tag=Science;tag=biology;tag=communication;tag=evolution;tag=language;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;"&gt;&lt;/embed&gt; &lt;/object&gt; &lt;br /&gt;
&lt;br /&gt;
I think his introduction is far too simplistic, especially with regards to his passing comments about language acquisition. He says &lt;blockquote&gt;"Just imagine the sense of wonder in a baby when it first discovers that merely by uttering a sound, it can get objects to move across a room, as if by magic, and maybe into its mouth."&lt;/blockquote&gt;It is obvious that there must be more to the secret sauce of language acquisition than that. Even Nim Chimpsky was able to work out that by merely waving his hands around, he could get things into his mouth. Just read his quotations: &lt;a href="http://en.wikipedia.org/wiki/Nim_Chimpsky#Quotations"&gt;Wikipedia/Nim Chimpsky/Quotations&lt;/a&gt;. But Nim never acquired language.&lt;br /&gt;
&lt;br /&gt;
There's also something strangely self defeating about his entire evolutionary argument. He seems to say that humans evolved language as a means to the end of creating large, modern societies. I'm sure he doesn't &lt;i&gt;really&lt;/i&gt; think it worked like that. Evolution isn't goal oriented, and he's a biologist. Anyway, the last part of his talk is devoted to the "problem" of language diversity, and how we use it to build barriers between populations. The whole talk, laid out in one sentence, becomes: &lt;blockquote&gt;Humans evolved language in order to encourage cooperation and to build large societies, but then, we actually used it to build divisions between population groups, and that's a problem because of globalization.&lt;/blockquote&gt;How on earth could language be failing at the very goal for which it was apparently evolved?&lt;br /&gt;
&lt;br /&gt;
Now, I'm not saying the world would be exactly the same if there was no language. We probably wouldn't have an iPhone, as Pagel playfully illustrated in his talk. But how much language do we really need to achieve the goal of a large society, and arrive at iPhone? Does language really need to be recursive? If we couldn't say &lt;br /&gt;
&lt;ul&gt;&lt;li&gt;I know [that you hate me].&lt;/li&gt;
&lt;/ul&gt;could we still have arrived at iPhone? Who really needs relative clauses anyway? On the flip side, what if language were more "permissive," and we &lt;i&gt;could&lt;/i&gt; say&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;What&lt;sub&gt;i&lt;/sub&gt; did you see the man who bought &lt;i&gt;t&lt;sub&gt;i&lt;/sub&gt;&lt;/i&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;br /&gt;
These are technical properties of language I'm talking about. They may seem like little details, but they're actually very fundamental to very nature of language. And it's almost impossible to connect them directly to the evolutionary story Mark Pagel is telling. All that story needs is &lt;i&gt;some&lt;/i&gt; means of communication, but says nothing about why we have the specific system of language that we do, out of all the possible systems that could have existed.&lt;br /&gt;
&lt;br /&gt;
Needless to say, linguists never concern themselves with questions like "is the evolutionary consequence of high applicatives an iPhone?" and good thing too.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;* * *&lt;/h2&gt;One thing that I did like was that he said "Tower of B[ei]bel." That's the way I say it.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Update&lt;/h2&gt;Apparently Pagel has a habit of saying strange things in public places: &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=1186"&gt;LanguageLog/Scrabble tips for time travelers?&lt;/a&gt;. Hat tip to Charles Yang.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-9118162415430306856?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/08/language-communication-and-iphone.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>4</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-8086741355897772856</guid><pubDate>Sun, 31 Jul 2011 18:30:00 +0000</pubDate><atom:updated>2011-07-31T14:30:32.193-04:00</atom:updated><title>A Review of Project Nim</title><description>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.the-numbers.com/video/Project-Nim/Project-Nim-poster.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="179" src="http://www.the-numbers.com/video/Project-Nim/Project-Nim-poster.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;i&gt;Project Nim&lt;/i&gt; is a new documentary out about the life of Nim Chimpsky, the chimpanzee that a group of researchers at Columbia tried to teach sign language. Here's a brief synopsis.&lt;br /&gt;
&lt;blockquote&gt;"Let's take a chimpanzee, put it in a house in the upper west side with a psychoanalyst who doesn't know anything about chimpanzees, language, language acquisition, or sign language. Also, she has 7 other children in that house. &lt;i&gt;What could go wrong?"&lt;/i&gt;&lt;/blockquote&gt;&lt;br /&gt;
To put Project Nim in some perspective, Nim Chimpsky was born in 1973, which is two years after &lt;a href="http://en.wikipedia.org/wiki/Stanford_prison_experiment"&gt;the Stanford Prison Experiment&lt;/a&gt;, and one year before the first legislation requiring Institutional Review Boards for institutions carrying out human subjects research. This is not to say that most social science research was so by-the-seat-of-their-pants back then, but it &lt;i&gt;was&lt;/i&gt; a different time.&lt;br /&gt;
&lt;br /&gt;
I came away from this film with a few different lessons.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Don't sleep with your advis(or/ee).&lt;/h2&gt;Just don't do it. Twice in the film, two different interviewees said about two different sexual entanglements, "I don't think it affected the science." But, as I heard Christopher Hitchens once say about interview subjects, a guilty mind wants to confess. &lt;br /&gt;
&lt;br /&gt;
The movie starts out with Nim being placed in the home of Stephanie LaFarge to be raised as a human child. Stephanie had 3 children of her own, and her husband had 4, bringing the total residency of her Manhattan brownstone to 7 human children, 2 adults, and 1 baby chimp. This frankly sounds a lot more like a reality TV show than a scientific experiment. Add to that the fact that they gave baby Nim alcohol and pot, and that Stephanie breast fed Nim, I'm not sure MTV could even air it.&lt;br /&gt;
&lt;br /&gt;
Why on earth was Stephanie LaFarge recruited to be Nim's mother? As far as I can tell, her only qualification was her sexual history with Project Nim PI, Herb Terrace. Her graduate degree was in psychoanalysis. She had no experience with chimpanzee research, or language research of any kind, and in fact, she was hostile to the scientific goals. She wouldn't keep logs, didn't have a project plan, and eventually tried to restrict the other researchers' access to Nim.&lt;br /&gt;
&lt;br /&gt;
The second affair which came up was, again, between the PI, Herb Terrace, and the head teacher on the project, who was only an undergrad at the time. The fallout of this brief relationship led to the head teacher leaving the project.&lt;br /&gt;
&lt;br /&gt;
First of all, I just don't think it's possible to pursue a relationship between a professor and an advisee (especially an undergraduate) in an ethical way. Given the power dynamic, some form of coercion is nearly impossible to avoid. I feel a little uneasy saying so in a public forum, which I think goes to say that this is not a problem that academia has left behind in the 70's. &lt;br /&gt;
&lt;br /&gt;
Secondly, all sorts of strange and bad things happened to the science because of the sex aspect. Nim would have never had such a strange early childhood, and would have had greater constancy with the project if the PI had not pursued inappropriate relationships.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Beware those with media savvy.&lt;/h2&gt;One frequently hears that scientists in general, and linguists in particular, don't do enough to popularize their research. Occasionally, we are scolded for holing up in our ivory towers, since we are too arrogant to try to share our love of science broadly.&lt;br /&gt;
&lt;br /&gt;
However, I think &lt;i&gt;Project Nim&lt;/i&gt; has a lot to say about the perils of researchers who are a little too keen to popularize their research. One of the ASL teachers on the project described Herb Terrace as an "absentee landlord," who only showed up for photo-ops and media interviews. All in all, the project appears to have been planned far better from a media perspective than from a research perspective. &lt;br /&gt;
&lt;br /&gt;
In case you were unaware, research, even really cool and good research, doesn't just show up on TV out of nowhere. It takes deliberate attempts on the part of the researcher or the university to drum up attention. And everything about this project seems perfectly constructed to be media fodder.&lt;br /&gt;
&lt;br /&gt;
In the meantime, there were &lt;i&gt;serious&lt;/i&gt; problems with the project, mostly having to do with Nim mauling research assistants, which Herb Terrace didn't really address, and had a hard time recollecting in the documentary interviews. The most serious incident, where Nim nearly bit through an interpreter's face, Terrace's reported reaction was that he was worried she would sue him, or that "it would get out."&lt;br /&gt;
&lt;br /&gt;
It was a little hard for me not to think of Marc Hauser during the movie, another high profile non-human primate researcher who has recently fallen on hard times due to questionable ethics. The connection between Terrace and Hauser is tenuous, but they run together in my mind, I guess, because they both worked hard to popularize their research.&lt;br /&gt;
&lt;br /&gt;
And this is why I, at least, am frequently wary of active researchers who are also active popularizers of their own research. It seems almost synonymous with sloppy research and compromised ethics in my mind.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Humans are not socialized chimpanzees&lt;/h2&gt;This certainly isn't a new lesson for me, because I've never really thought that humans are just socialized chimpanzees. However, I really like how this point was hammered home in a real way.&lt;br /&gt;
&lt;br /&gt;
In discussions about "human nature," the notion that our "true" nature is somehow more brutish and violent seems to come up a lot. In this conception, society is merely a veneer over top our inner chimp. &lt;br /&gt;
&lt;br /&gt;
Well, society didn't do too much to cover over Nim's external chimp. Our "true" human nature is manifest in the activity of all humans, meaning it must be very broad, and non-uniform, but non-arbitrary at the same time.&lt;br /&gt;
&lt;br /&gt;
Interestingly, I've also heard of research trying to figure out if dogs are just socialized wolves. A bunch of researchers tried to raise wolf pups as if they were dogs, a much more achievable task, I think, than raising a chimp as a human. The results were much the same as for Nim. After infancy, the wolves went nuts and tore the place apart, and the experiment had to be abandoned.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Conclusion&lt;/h2&gt;I really liked the movie, and would suggest it to anyone who appreciates a good documentary.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-8086741355897772856?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/07/review-of-project-nim.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-4077100928230596024</guid><pubDate>Tue, 26 Jul 2011 23:10:00 +0000</pubDate><atom:updated>2011-07-26T19:10:04.675-04:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">philadelphia</category><category domain="http://www.blogger.com/atom/ns#">linguistics</category><category domain="http://www.blogger.com/atom/ns#">dialect</category><title>The Philadelphian Dialect is Punk Rock</title><description>Before there was Youtube and the accent meme, there was, I guess, punk rock.&lt;br /&gt;
&lt;br /&gt;
In this music video from 1988, the Dead Milk Men, a Philadelphia area punk band, give a rather hyper-Philadelphian performance. For the most part, Philadelphians aren't that aware of what marks their dialect as distinct from other regions, nor are most non-Philadelphias aware that there is a unique Philadelphia dialect.&lt;br /&gt;
&lt;br /&gt;
Now, I say hyper-Philadelphian for a few reasons. The lead singer for this song, Joe Genaro, definitely Philadelphia dialect speaker, born about an hour outside of the city in Wagontown, PA.&lt;br /&gt;
&lt;iframe width="300" height="300" frameborder="0" scrolling="no" marginheight="0" marginwidth="0" src="http://maps.google.com/maps?f=d&amp;amp;source=s_d&amp;amp;saddr=Wagontown,+PA&amp;amp;daddr=Philadelphia,+PA&amp;amp;hl=en&amp;amp;geocode=FaSDYgIdXLt6-yldT60delzGiTEeAUEWju7veQ%3BFc-fYQIdcxeF-ynrS7XU2LfGiTHBWD6M2BT1iQ&amp;amp;gl=us&amp;amp;mra=ls&amp;amp;sll=40.01066,-75.842724&amp;amp;sspn=0.141209,0.307274&amp;amp;ie=UTF8&amp;amp;ll=40.017098,-75.50354&amp;amp;spn=0.631042,0.823975&amp;amp;z=9&amp;amp;output=embed"&gt;&lt;/iframe&gt;&lt;br /&gt;
&lt;small&gt;&lt;a href="http://maps.google.com/maps?f=d&amp;amp;source=embed&amp;amp;saddr=Wagontown,+PA&amp;amp;daddr=Philadelphia,+PA&amp;amp;hl=en&amp;amp;geocode=FaSDYgIdXLt6-yldT60delzGiTEeAUEWju7veQ%3BFc-fYQIdcxeF-ynrS7XU2LfGiTHBWD6M2BT1iQ&amp;amp;gl=us&amp;amp;mra=ls&amp;amp;sll=40.01066,-75.842724&amp;amp;sspn=0.141209,0.307274&amp;amp;ie=UTF8&amp;amp;ll=40.017098,-75.50354&amp;amp;spn=0.631042,0.823975&amp;amp;z=9" style="color:#0000FF;text-align:left"&gt;View Larger Map&lt;/a&gt;&lt;/small&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
But, local dialect features are one of those things that tend to get leveled a little when singing, and there is no hint of that in this performance. Some things even seem exaggerated to me, which is fitting with the song itself, which was shot in Philadelphia, and makes references to culturally relevant locations in the lyrics.&lt;br /&gt;
&lt;br /&gt;
So here is &lt;i&gt;Punk Rock Girl&lt;/i&gt;. Dialectal analysis immediately follows.&lt;br /&gt;
&lt;br /&gt;
&lt;div style="background-color:#000000;width:520px;"&gt;&lt;div style="padding:4px;"&gt;&lt;embed src="http://media.mtvnservices.com/mgid:uma:video:mtv.com:101968/cp~id%3D1535931%26vid%3D101968%26uri%3Dmgid%3Auma%3Avideo%3Amtv.com%3A101968" width="512" height="288" type="application/x-shockwave-flash" allowFullScreen="true" allowScriptAccess="always" base="." flashVars=""&gt;&lt;/embed&gt;&lt;p style="text-align:left;background-color:#FFFFFF;padding:4px;margin-top:4px;margin-bottom:0px;font-family:Arial, Helvetica, sans-serif;font-size:12px;"&gt;Get More: &lt;a href="http://www.mtv.com/music/artist/dead_milkmen/artist.jhtml" style="color:#439CD8;" target="_blank"&gt;The Dead Milkmen&lt;/a&gt;, &lt;a href="http://www.mtv.com/music/" style="color:#439CD8;" target="_blank"&gt;Music&lt;/a&gt;, &lt;a href="http://www.mtv.com/music/video/" style="color:#439CD8;" target="_blank"&gt;More Music Videos&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;
&lt;h2&gt;/ow/ fronting&lt;/h2&gt;/ow/ fronting is, perhaps, the most salient dialect feature on display in this song. It's certainly not unique to Philadelphia. In fact, it's what qualifies Philadelphia as the Northern-most Southern city. While Philadelphia has many other Northern features, like a very raised /ɔ/, stereotyped in &lt;i&gt;coffee talk&lt;/i&gt;, we depart from the rest of the North by fronting /ow/, and Joe Genaro does this to an extreme degree in this song. Right off the bat at 0:28, he says&lt;br /&gt;
&lt;blockquote&gt;And she al&lt;b&gt;most&lt;/b&gt; knocked me dead.&lt;br /&gt;
&lt;/blockquote&gt;Then he immediately follows this up with&lt;br /&gt;
&lt;blockquote&gt;I tapped her on the shoulder&lt;br /&gt;
And said do you have a &lt;b&gt;beau&lt;/b&gt;?&lt;br /&gt;
She looked at me and smiled and said she did not &lt;b&gt;know&lt;/b&gt;&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
In fact, all of his /ow/s in this song are incredibly fronted, except for the two tokens in &lt;i&gt;rollin&lt;/i&gt; and &lt;i&gt;stolen&lt;/i&gt; which, of course, are effected by the following /l/.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Canadian Raising&lt;/h2&gt;The song isn't filled with Canadian Raising tokens. In fact, there are only two, but the one is so stressed and clear and wonderful. At 1:01, the waitress says&lt;br /&gt;
&lt;blockquote&gt;Well no, we only have it &lt;b&gt;iced&lt;/b&gt;.&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
Canadian Raising continues to be a favorite variable of mine.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Short-a pattern&lt;/h2&gt;Philadelphia is known for its complicated pattern of tensing /æ/, which is similar to New York City. The tense version pops up expectedly in &lt;br /&gt;
&lt;blockquote&gt;0:46&lt;br /&gt;
Punk rock girl&lt;br /&gt;
Give me a &lt;b&gt;chance&lt;/b&gt;&lt;br /&gt;
Punk rock girl&lt;br /&gt;
Let's go &lt;b&gt;slam dance&lt;/b&gt;&lt;br /&gt;
&lt;/blockquote&gt;and&lt;br /&gt;
&lt;blockquote&gt;1:54&lt;br /&gt;
We went to a shopping mall&lt;br /&gt;
And &lt;b&gt;laughed&lt;/b&gt; at all the shoppers&lt;br /&gt;
&lt;/blockquote&gt;and&lt;br /&gt;
&lt;blockquote&gt;2:01&lt;br /&gt;
We &lt;b&gt;asked&lt;/b&gt; for Mojo Nixon&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
Unfortunately, &lt;i&gt;mad, bad&lt;/i&gt; and &lt;i&gt;glad&lt;/i&gt;, which are exceptionally tense,  don't appear anywhere in the song. However, at 1:29, he says &lt;i&gt;dad&lt;/i&gt;, which is definitely lax as expected.&lt;br /&gt;
&lt;br /&gt;
Tokens of /æ/ which are lax in Philadelphia where they are tense in many other dialects show up in&lt;br /&gt;
&lt;blockquote&gt;1:03&lt;br /&gt;
So we jumped up on the table and shouted &lt;b&gt;anarchy&lt;/b&gt;&lt;br /&gt;
&lt;/blockquote&gt;and&lt;br /&gt;
&lt;blockquote&gt;1:24&lt;br /&gt;
Her father took one look at me and he &lt;b&gt;began&lt;/b&gt; to squeal&lt;br /&gt;
&lt;/blockquote&gt;and&lt;br /&gt;
&lt;blockquote&gt;2:26&lt;br /&gt;
Eat fudge &lt;b&gt;banana&lt;/b&gt; swirl&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
&lt;h2&gt;/ey/ split&lt;/h2&gt;This one is pretty subtle. Most of his tokens of /ey/ don't sound very different from standard, but one word final token at 1:15 is pretty low, almost [æɪ].&lt;br /&gt;
&lt;blockquote&gt;On such a winter's &lt;b&gt;day&lt;/b&gt;.&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
Data suggests that all /ey/ used to have this quality in Philadelphia, which is another reason why it's related to the Southern and Midland dialects. A sound change has been raising /ey/ higher and higher, but not in word final position.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;on = dawn&lt;/h2&gt;Philadelphia maintains the distinction between &lt;i&gt;cot&lt;/i&gt; and &lt;i&gt;caught&lt;/i&gt; by raising the vowel in &lt;i&gt;caught&lt;/i&gt;, similar to New York City. One way in which Philadelphia differs from New York City is in the vowel class of the word &lt;i&gt;on&lt;/i&gt;. In most locations North of Philly, &lt;i&gt;on&lt;/i&gt; rhymes with the man's name &lt;i&gt;Don&lt;/i&gt;. But in most locations South of Philly, at least where a contrast is maintained, &lt;i&gt;on&lt;/i&gt; rhymes with the woman's name &lt;i&gt;Dawn&lt;/i&gt;. You can hear this in&lt;br /&gt;
&lt;blockquote&gt;0:38&lt;br /&gt;
I tapped her &lt;b&gt;on&lt;/b&gt; the shoulder&lt;br /&gt;
&lt;/blockquote&gt;&lt;blockquote&gt;1:03&lt;br /&gt;
So we jumped up &lt;b&gt;on&lt;/b&gt; the table and shouted anarchy&lt;br /&gt;
And someone played a Beach Boys song &lt;b&gt;on&lt;/b&gt; the jukebox&lt;br /&gt;
It it was "California Dreamin"&lt;br /&gt;
So we started screamin&lt;br /&gt;
&lt;b&gt;On&lt;/b&gt; such a winter's day&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
&lt;h2&gt;l-vocalization/darkening&lt;/h2&gt;Now, if you think you can reliably code l-vocalization embedded in a punk rock song, god bless you. But, there are a few tokens that are pretty clear. For instance, I don't think there's any /l/ in&lt;br /&gt;
&lt;blockquote&gt;0:38&lt;br /&gt;
I tapped her on the &lt;b&gt;shoulder&lt;/b&gt;&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
The thing that makes Philadelphia pretty unique is our tendency to darken and vocalize /l/ intervocalically (so &lt;i&gt;balance&lt;/i&gt; is pretty confusable with &lt;i&gt;bounce&lt;/i&gt;) and in initial clusters (like &lt;i&gt;cluster&lt;/i&gt;). I don't want to make any strong claim about being able to reliably hear it in this song, but listen to&lt;br /&gt;
&lt;blockquote&gt;2:12&lt;br /&gt;
We got into her car away we started &lt;b&gt;rollin&lt;/b&gt;&lt;br /&gt;
I said how much you pay for this&lt;br /&gt;
Said nothin man it's &lt;b&gt;stolen&lt;/b&gt;&lt;br /&gt;
&lt;/blockquote&gt;and compare it to&lt;br /&gt;
&lt;blockquote&gt;0:49&lt;br /&gt;
&lt;b&gt;Let's&lt;/b&gt; go slam dance&lt;br /&gt;
&lt;/blockquote&gt;&lt;br /&gt;
There is definitely not as much /l/ in &lt;i&gt;rollin&lt;/i&gt; and &lt;i&gt;stolen&lt;/i&gt; as there is in &lt;i&gt;let's&lt;/i&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;* * *&lt;/h2&gt;So, do you think I missed anything important? As a side note, I think I have the same shirt as the drummer.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-4077100928230596024?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/07/philadelphian-dialect-is-punk-rock_26.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>3</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-9200476722063691865</guid><pubDate>Tue, 19 Jul 2011 23:50:00 +0000</pubDate><atom:updated>2011-07-19T19:51:38.002-04:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">linguistics</category><category domain="http://www.blogger.com/atom/ns#">sound change</category><title>Language Change, Animated</title><description>This is &lt;i&gt;the&lt;/i&gt; visualization of language change that I've always wanted to produce! And now that I've made it, there are all sorts of aesthetic things I'd like to change, but &lt;i&gt;c'est la out-of-the-box-tools-from-google&lt;/i&gt;!&lt;br /&gt;
&lt;br /&gt;
I should note that the data underlying this graph would not exist but for the sweat, blood and tears of Bill Labov, Ingrid Rosenfelder, a team of undergraduate transcribers, the NSF, and 3 decades' worth of graduate research teams.&lt;br /&gt;
&lt;br /&gt;
Depicted below is data from the in-development Philadelphia Neighborhood Corpus. We have analyzed 235 speakers who were interviewed as part of the Researching the Speech Community course between 1973 and 2010. That gives us dates of birth between 1889 and 1991, a 102 year timespan! Actually, raw data isn't depicted. Rather, it's the smoothing curve that I fit to F1 and F2.&lt;br /&gt;
&lt;br /&gt;
Hit play to watch it go. You can select particular vowels, and toggle on and off trails. You can also adjust how the bubbles are colored in the top right corner.&lt;br /&gt;
&lt;br /&gt;
&lt;script src="https://spreadsheets0.google.com/gpub?url=http%3A%2F%2Foj0ijfii34kccq3ioto7mdspc7r2s7o9-ss-opensocial.googleusercontent.com%2Fgadgets%2Fifr%3Fup_initialstate%26up_title%3D%252Fay%252F%2520change%26up__table_query_url%3Dhttps%253A%252F%252Fspreadsheets0.google.com%252Fspreadsheet%252Ftq%253Frange%253DA1%25253AF589%2526key%253D0Akf8SrCgg_xNdEctZVZoQXlscVZlb0EyRTQzOU8tZFE%2526gid%253D0%2526pub%253D1%26url%3Dhttp%253A%252F%252Fwww.google.com%252Fig%252Fmodules%252Fmotionchart.xml%26spreadsheets%3Dspreadsheets&amp;height=433&amp;width=524"&gt;&lt;/script&gt;&lt;br /&gt;
&lt;br /&gt;
The particular vowels on display are /ay/ and /ay0/. /ay0/ is the pre-voiceless allophone, a personal favorite, and look at that thing go! I've also split up men and women, since that has been an important factor in this particular change. The other vowels are there just for context, and are held at fixed points.&lt;br /&gt;
&lt;br /&gt;
Not displayed is the extreme uniformity of this change across speakers. This thing is changing fast, and everyone in our corpus is marching along in surprising uniformity. Can you say "speech community" anyone?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-9200476722063691865?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/07/language-change-animated.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>1</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-8478605629768099220</guid><pubDate>Sat, 16 Jul 2011 23:26:00 +0000</pubDate><atom:updated>2011-07-16T19:26:02.704-04:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">linguistics</category><category domain="http://www.blogger.com/atom/ns#">dialect</category><title>More Dialects and Communication Density</title><description>I'm not sure if it was there before, but there's a tab on the &lt;a href="http://senseable.mit.edu/csa/"&gt;Senseable Cities lab's Connected States of America page&lt;/a&gt; with &lt;a href="http://senseable.mit.edu/csa/downloads.html"&gt;some of their data&lt;/a&gt;. Specifically, they provide an .svg of the United States with ID numbers which are cross referenced to .csv files, which label the calling and sms-ing communities. Hopefully, they'll also publish rawer data eventually.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Data munging&lt;/h2&gt;So, I took some of the Atlas of North American English data which labels cities and their dialect classification. I don't think I'll look at finer grained ANAE data, like particular vowels' quality, because I don't think that would be too great with the the granularity of the data available from Senseable. I had to associate city names with counties to merge the data with the .svg, and thankfully &lt;a href="http://code.google.com/p/google-refine/"&gt;Google Refine + Freebase&lt;/a&gt; was able to get me 2/3 of the way there. There are a few strange errors in the .svg file that no amount of automation was going to get around ("Orandge County, FL" Really?). I also pulled the coordinate data out of the .svg so that I could do this all in R, which is where I feel the most comfortable.&lt;br /&gt;
&lt;br /&gt;
For the ANAE data, I collapsed some sub-dialects together, like Inland North and North, and Inland South and South.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Mis-match Measure &lt;/h2&gt;So, I have counties with dialect classification, and counties with calling and sms-ing classifications. I want to come up with a way of evaluating the mis-match between these. Here's a sketch of how I did that.&lt;br /&gt;
&lt;br /&gt;
for D in Dialects:&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;for C in Calling_Communities:&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Within =  D ∪ C&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Outside = C - D&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;ratio&lt;sub&gt;d,c&lt;/sub&gt; = |Outside|/|Within|&lt;br /&gt;
&lt;br /&gt;
So, "Within" is the set of counties that are both in dialect D and calling community C. "Outside" is the set of counties that are in calling community C and in some other dialect than D. You might have thought that I'd also include the set of counties that are in dialect D and in some other calling community than C, but that's actually not so important. As I said before, these dialect regions are rather large, so I'd expect there to be many calling communities within one dialect. What's stranger is calling communities which &lt;i&gt;span&lt;/i&gt; dialects.&lt;br /&gt;
&lt;br /&gt;
So, for interpreting the ratio, as it reaches 0 or ∞, the fit between dialects and calling communities is pretty good. At 0, a calling community is contained entirely within a dialect. As it approaches ∞, a dialect is more and more marginally part of a calling community. &lt;br /&gt;
&lt;br /&gt;
Next step, I took abs(log(ratio&lt;sub&gt;d,c&lt;/sub&gt;)). Now I have a measure that runs from 0 to ∞, and the closer it is to 0, the bigger the mismatch. I also wanted to boost the match score of smaller dialect regions. I forget why, but it made sense at the time. So, I weighted these absolute log-odds by 1/|D|.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Results&lt;/h2&gt;Here are the median results per dialect compared to calling communities, from best to worst match:&lt;br /&gt;
&lt;br /&gt;
&lt;ol&gt;&lt;li&gt;West -&amp;nbsp;∞&lt;/li&gt;
&lt;li&gt;St. Louis Corridor - 0.45&lt;/li&gt;
&lt;li&gt;Florida - 0.35&lt;/li&gt;
&lt;li&gt;Western New England - 0.19&lt;/li&gt;
&lt;li&gt;Eastern New England - 0.08&lt;/li&gt;
&lt;li&gt;Western PA - 0.07&lt;/li&gt;
&lt;li&gt;Texas - 0.06&lt;/li&gt;
&lt;li&gt;South - 0.03&lt;/li&gt;
&lt;li&gt;North - 0.02&lt;/li&gt;
&lt;li&gt;Midland - 0.01&lt;/li&gt;
&lt;li&gt;Mid-Atlantic - 0&lt;/li&gt;
&lt;li&gt;NYC - 0&lt;/li&gt;
&lt;/ol&gt;&lt;div&gt;And for the sms data:&lt;/div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;West &amp;nbsp;-&amp;nbsp;∞&lt;/li&gt;
&lt;li&gt;South -&amp;nbsp;∞&lt;/li&gt;
&lt;li&gt;St. Louis Corridor- 0.5&lt;/li&gt;
&lt;li&gt;Florida - 0.34&lt;/li&gt;
&lt;li&gt;Eastern New England - 0.17&lt;/li&gt;
&lt;li&gt;Western New England - 0.15&lt;/li&gt;
&lt;li&gt;Western PA - 0.07&lt;/li&gt;
&lt;li&gt;Texas - 0.06&lt;/li&gt;
&lt;li&gt;Midland - 0.05&lt;/li&gt;
&lt;li&gt;North - 0.02&lt;/li&gt;
&lt;li&gt;Mid-Atlantic - 0&lt;/li&gt;
&lt;li&gt;NYC - 0&lt;/li&gt;
&lt;/ol&gt;&lt;div&gt;I'd not put so much stock into the Mid-Atlantic and NYC scores. To a large degree this is due to them&amp;nbsp;cannibalizing&amp;nbsp;each other, and they're not that different dialectally anyway.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;What's really interesting is the poor Midland and Northern scores. While I haven't worked out a measurement for which dialects are most mixed within calling communities, I suspect their poor scores are related to each other.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;&lt;h2&gt;Graphs!&lt;/h2&gt;&lt;/div&gt;&lt;div&gt;In this first graph, each facet is for a calling community in which there is a Northern dialect county. The filled in bits are the counties which are within the calling community, and the colored counties are ones we have dialect data for.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-iY9RV9s_QrM/TiIa6I7We5I/AAAAAAAAA4Y/7BayvEN2gH4/s1600/north.call.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="395" src="http://4.bp.blogspot.com/-iY9RV9s_QrM/TiIa6I7We5I/AAAAAAAAA4Y/7BayvEN2gH4/s640/north.call.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;Calling data&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;div&gt;In 4 out of 7 calling communities in which there is a northern dialect county, there is also a Midland dialect county. That's basically along the entire border region between the two dialects.&lt;br /&gt;
&lt;br /&gt;
Here's the same graph for sms-ing communities.&lt;br /&gt;
&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-TfiYWSVHfw4/TiIb-LGEzEI/AAAAAAAAA4c/Qea5pVigHlc/s1600/north.sms.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="424" src="http://2.bp.blogspot.com/-TfiYWSVHfw4/TiIb-LGEzEI/AAAAAAAAA4c/Qea5pVigHlc/s640/north.sms.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;SMS data&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;br /&gt;
&lt;h2&gt;Conclusions&lt;/h2&gt;Yup, these communication communities don't line up with dialect boundaries like you'd expect.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-8478605629768099220?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/07/more-dialects-and-communication-density.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-iY9RV9s_QrM/TiIa6I7We5I/AAAAAAAAA4Y/7BayvEN2gH4/s72-c/north.call.png" height="72" width="72" /><thr:total>2</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-8826147144881389135</guid><pubDate>Tue, 12 Jul 2011 02:19:00 +0000</pubDate><atom:updated>2011-07-11T22:19:01.644-04:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">linguistics</category><title>Communication Density and Dialect Boundaries</title><description>One linguistics topic which non-specialists are almost always interested in is dialect geography, and I don't think that's strictly due to their desire to have regional biases confirmed. It seems like almost everybody has a genuine interest in where and how people speak differently from themselves. Granted, once you move away from fairly shallow lexical differences into phonetic and phonological ones, a lot of people's eyes glaze over.&lt;br /&gt;
&lt;br /&gt;
When it comes explaining why dialect boundaries are in one place, rather than another, dialect geographers tend to have two answers. First, different regions have different historical settlement patterns. Bill Labov frequently points out that the current phonological boundary between the North and the Midland in the United States coincides with boundary between where log cabins were built versus A-frame houses, which itself coincides with two different immigration streams with different points of origin on the East coast. &lt;br /&gt;
&lt;br /&gt;
Second, there are differential rates of communication between regions. Langauge appears to be transferred crucially by face-to-face communication. If two regions have stronger ties of communication between themselves than with other regions, then we think they're probably going to have more similar dialects. This was basically Keelan Evanini's argumentation about why Erie, PA basically has a Western Pennsylvania dialect, even though it had historically been part of the North. &lt;br /&gt;
&lt;br /&gt;
Given this second hypothesis about why dialect boundaries exist where they do, I was pretty excited to see &lt;a href="http://senseable.mit.edu/csa/"&gt;these results coming out of the Senseable City Lab&lt;/a&gt;, which in collaboration with AT&amp;amp;T and IBM Research, has produced maps illustrating how US counties cluster together in terms of cell phone traffic and sms traffic. &lt;br /&gt;
&lt;br /&gt;
The lines between communication clusters are exactly those that I would expect to define dialect boundaries. So, I took the call and sms community maps, and superimposed the major dialect boundaries from the Atlas of North American English. Here are the results.&lt;br /&gt;
&lt;br /&gt;
&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-9wIvdO2Rug8/ThugnFDm_-I/AAAAAAAAA2c/vNvAR-EYcJ8/s1600/isogloss_by_call.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="443" src="http://4.bp.blogspot.com/-9wIvdO2Rug8/ThugnFDm_-I/AAAAAAAAA2c/vNvAR-EYcJ8/s640/isogloss_by_call.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;Communication clustering by Calls&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;
&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-eAhu1uVyPWI/ThuhFhUEHDI/AAAAAAAAA2k/o1GpbgcRqtw/s1600/isogloss_by_sms.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="444" src="http://4.bp.blogspot.com/-eAhu1uVyPWI/ThuhFhUEHDI/AAAAAAAAA2k/o1GpbgcRqtw/s640/isogloss_by_sms.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;Communication Clustering by SMS&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;
Honestly, I'm a little disappointed with the outcome. I expected that for very large dialect regions, like the West and the South, they would would contain many different communication clusters, so that's fine. Where both a dialect boundary and a communication boundary line up with a state boundary, I don't think it should be counted as an alignment. If there's any tendency for people to be more likely to move within state lines than across state lines, then this alignment along state lines is probably better explained by the first factor, settlement history, than communication density.&lt;br /&gt;
&lt;br /&gt;
The crucial place to look for an alignment between communication and dialects seems to be the Ohio, West Virgina, Pennsylvania trifecta. In neither map does it look like communication density lines up quite right. Certainly, Pennsylvania is cut in half into a Western and Eastern region, but it seems like the Western PA dialect extends further East, almost to the threshold of Philadelphia.&lt;br /&gt;
&lt;br /&gt;
Ohio doesn't seem to be sliced up quite right either. In the calls data, Cleveland clusters with the rest of the state, while with the SMS data, it clusters with Western PA. Dialectally, Cleveland is neither like the rest of Ohio nor Western PA. Rather, it is more similar to Toledo and Detroit to the West, and Buffalo to the East.&lt;br /&gt;
&lt;br /&gt;
There are other unfortunate non-alignments, like how Baltimore is clustered with Virginia, while dialectally it's more similar to Philadelphia, and New England isn't chopped up communicationally the way it is dialectally.&lt;br /&gt;
&lt;br /&gt;
I'll conclude by saying that first, pat answers to explain natural phenomena don't always work out, and second, these communication clusters make some dialect boundaries pretty mysterious. If everyone in Ohio is clustered together into a cell phone calling community, then why don't they all talk the same? The answer to this probably has to do with a third factor: meaningful social divisions which are distinct from communication divisions, but remember what I said about pat answers?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-8826147144881389135?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/07/communication-density-and-dialect.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-9wIvdO2Rug8/ThugnFDm_-I/AAAAAAAAA2c/vNvAR-EYcJ8/s72-c/isogloss_by_call.png" height="72" width="72" /><thr:total>7</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-3019666343477521575</guid><pubDate>Sun, 10 Jul 2011 16:23:00 +0000</pubDate><atom:updated>2011-07-10T12:23:30.069-04:00</atom:updated><title>Estimated international population of gay men</title><description>I recently learned about the "fraternal birth order effect," where apparently for every older brother a man has, his probability of being gay as an adult increases. &lt;a href="http://en.wikipedia.org/wiki/Fraternal_birth_order_and_male_sexual_orientation"&gt;Here's a wikipedia entry&lt;/a&gt;. &lt;br /&gt;
&lt;br /&gt;
Now, apparently there's some debate over how real or how strong this effect really is, so I'm almost certainly taking some numerical result a little too seriously. But, it occurred to me that data such as total fertility rate, and birth sex ratios are attainable international statistics. If this fraternal birth order effect is pretty strong and reliable, you should be able to estimate what percent of the male population of a country is gay.&lt;br /&gt;
&lt;br /&gt;
So, I grabbed some data on &lt;a href="http://data.un.org/Data.aspx?d=GenderStat&amp;amp;f=inID%3A14"&gt;international total fertility rate from here&lt;/a&gt;, and data on &lt;a href="http://data.un.org/Data.aspx?d=PopDiv&amp;amp;f=variableID%3A52"&gt;birth sex ratios here&lt;/a&gt;. Now, I have to make some assumptions. First, all of these calculations take the average total fertility rate as a country level descriptor, but there's almost certainly a unique probability distribution for different fertility rates for every country. Second, I have to treat the probability of having a male baby as being independent from the sex of the prior babies a woman has had. Third, and most importantly, I'm treating fraternal birth order as the &lt;i&gt;only&lt;/i&gt; determinant of sexual orientation.&lt;br /&gt;
&lt;br /&gt;
These are all pretty drastic assumptions. For instance, there's some evidence that my second assumption (birth sex of babies from the same mother are independent processes) is false. From the UN data I have, here's the total fertility rate of the country by the sex ratio:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-dRgXR5ZYdpQ/ThnEEhzqjXI/AAAAAAAAAzk/iRFT3UZolAM/s1600/tfr.sex.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="242" src="http://4.bp.blogspot.com/-dRgXR5ZYdpQ/ThnEEhzqjXI/AAAAAAAAAzk/iRFT3UZolAM/s320/tfr.sex.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
This seems to suggest that as women have more babies, they're more likely to have girls. Note: I've left out data from four countries with highly skewed birth sex ratios, since &lt;a href="http://familyinequality.wordpress.com/2011/07/06/global-womens-progress-report/"&gt;these countries apparently have high rates of abortion of female fetuses&lt;/a&gt;. &lt;br /&gt;
&lt;br /&gt;
So, I'm thinking about this as a very rough back of the envelope estimate, not to be taken too seriously, but maybe some sort of indicator of the shape of the world.&lt;br /&gt;
&lt;br /&gt;
Here's the math:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;babies = 1, 2, ... total.fertility.rate&lt;/li&gt;
&lt;li&gt;boy.probability = male.ratio/2&lt;/li&gt;
&lt;li&gt;boy.babies = boy.probability^(babies)&lt;/li&gt;
&lt;li&gt;prob.gay.first.born = 0.12 (more on this below)&lt;/li&gt;
&lt;li&gt;prob.gay.n.born = prob.gay.n-1.born * 1.3 (from wikipedia)&lt;/li&gt;
&lt;li&gt;prob.gay = sum(prob.gay.1-to-n.born * boy.babies)&lt;/li&gt;
&lt;/ul&gt;&lt;div&gt;I hope that makes some sense. I grabbed 1.3 from wikipedia, which says "each older brother increases a man's odds of developing a homosexual orientation by 28–48%." I basically made up the probability that a first born son is gay. This was the one number that I couldn't seem to find, so I adjusted and played with it until the predicted percent of gay men in the United States was about 10%.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;Here are my results for the top 10 countries for percent of gay men.&lt;/div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;Afghanistan (19%)&lt;/li&gt;
&lt;li&gt;Niger (18%)&lt;/li&gt;
&lt;li&gt;Liberia (18%)&lt;/li&gt;
&lt;li&gt;Mali (18%)&lt;/li&gt;
&lt;li&gt;Nigeria (18%)&lt;/li&gt;
&lt;li&gt;Burkina Faso (17%)&lt;/li&gt;
&lt;li&gt;Guinea (17%)&lt;/li&gt;
&lt;li&gt;Yemen (17%)&lt;/li&gt;
&lt;li&gt;Iraq (17%)&lt;/li&gt;
&lt;li&gt;Uganda (17%)&lt;/li&gt;
&lt;/ol&gt;&lt;div&gt;Unsurprisingly, the percent of gay men in a country is highly correlated with total fertility rate. I think this top 10 list highlights the importance of gay rights activism in Africa, especially in Uganda, which is considering making homosexuality a capital offense.&amp;nbsp;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;And for the self obsessed, the United States looked like this:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Smaller percent than 100 countries &amp;gt; tied with 17 countries &amp;gt; larger percent than 43.&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-3019666343477521575?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/07/estimated-international-population-of.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-dRgXR5ZYdpQ/ThnEEhzqjXI/AAAAAAAAAzk/iRFT3UZolAM/s72-c/tfr.sex.png" height="72" width="72" /><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-701779941350240487</guid><pubDate>Wed, 13 Apr 2011 21:23:00 +0000</pubDate><atom:updated>2011-04-13T17:23:27.187-04:00</atom:updated><title>Quantiative Reasoning Fail</title><description>Most of my research involves examining and reasoning about data. I'd say that in the course of my education as a linguist, I have developed some pretty ok quantitative and statistical reasoning skills. What's so great about having these reasoning skills is that they are very broadly applicable.&lt;br /&gt;
&lt;br /&gt;
Occasionally, I'll observe, or hear second hand, someone with &lt;i&gt;no&lt;/i&gt; quantitative reasoning skills discussing a topic that calls for them, and more often than not I'm blown away by the simple errors they make that lead to large confusions. For example, there was the time that George Will claimed that &lt;a href="http://www.washingtonpost.com/wp-dyn/content/article/2009/06/05/AR2009060502835.html"&gt;Obama is narcissistic because he used first person pronouns at a high rate&lt;/a&gt;.&lt;br /&gt;
&lt;blockquote&gt;"I," said the president, who is inordinately fond of the first-person singular pronoun, "want to disabuse people of this notion that somehow we enjoy meddling in the private sector."&lt;/blockquote&gt;Of course, George Will didn't exactly &lt;i&gt;count&lt;/i&gt; how often Obama used the first person pronoun. And crucially, he didn't compare Obama's usage to any other president. Mark Liberman, calling himself "one of those narrow-minded fundamentalists who believe that statements can be true or false" &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=1486"&gt;counted and compared&lt;/a&gt;, and found that Obama's usage rate of first person pronouns was actually less than the previous two presidents, not that it even really means anything.&lt;br /&gt;
&lt;br /&gt;
&lt;table border = "1"&gt; &lt;thead&gt;
    &lt;td&gt;&lt;b&gt;President&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;% of words which are first person pronouns&lt;/b&gt;&lt;/td&gt;
 &lt;/thead&gt;
 &lt;tr&gt;
   &lt;td&gt;Obama&lt;/td&gt;&lt;td&gt;2.65%&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
   &lt;td&gt;Bush II&lt;/td&gt;&lt;td&gt;4.49%&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
   &lt;td&gt;Clinton&lt;/td&gt;&lt;td&gt;3.87%&lt;/td&gt;
 &lt;/tr&gt;

&lt;/table&gt;&lt;br /&gt;
More recently, you have Senator Jon Kyl stating on the Senate floor that over 90% of what Planned Parenthood does is abortions. Of course, when fact checked, it turned out that about what 3% of what planned parenthood does is abortions. Kyl's defense? "It was never intended as a factual statement."  I think the Daily Show coverage says it best.&lt;br /&gt;
&lt;div style="background-color:#000000;width:520px;"&gt;&lt;div style="padding:4px;"&gt;&lt;embed src="http://media.mtvnservices.com/mgid:cms:video:thedailyshow.com:381267" width="512" height="288" type="application/x-shockwave-flash" allowFullScreen="true" allowScriptAccess="always" base="." flashVars=""&gt;&lt;/embed&gt;&lt;p style="text-align:left;background-color:#FFFFFF;padding:4px;margin-top:4px;margin-bottom:0px;font-family:Arial, Helvetica, sans-serif;font-size:12px;"&gt;&lt;b&gt;&lt;a href="http://www.thedailyshow.com/watch/mon-april-11-2011/countdown-to-the-next-countdown---jon-kyl-s-planned-parenthood-statistics"&gt;The Daily Show - Countdown to the Next Countdown - Jon Kyl's Planned Parenthood Statistics&lt;/a&gt;&lt;/b&gt;&lt;br/&gt;Tags: &lt;a href='http://www.thedailyshow.com/full-episodes/'&gt;Daily Show Full Episodes&lt;/a&gt;,&lt;a href='http://www.indecisionforever.com/'&gt;Political Humor &amp; Satire Blog&lt;/a&gt;,&lt;a href='http://www.facebook.com/thedailyshow'&gt;The Daily Show on Facebook&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;
Now, in the next bit the Daily Show did, they called what Senator Kyl did "lying." I have a different take. Worse than lying, I think Senator Kyl has no notion that numbers &lt;i&gt;mean&lt;/i&gt; anything. That "90%" is just an emphasis marker, like "so," or "extremely". Of course, this can't strictly be true, because there are some pretty important percentages that determine whether or not he keeps his job, and surely he attends to those.&lt;br /&gt;
&lt;br /&gt;
The question remains, how can Jon Kyl or George Will think they can just &lt;i&gt;say&lt;/i&gt; things without any regard to the actual facts of the world? I think the problem is not just isolated to these individuals, and the consequences are potentially severe.&lt;br /&gt;
&lt;br /&gt;
Take the debate that was raging before the passage of healthcare reform. One topic that really caught my eye was rescission, which is when heath insurance companies drop individual's coverage. Insurance Company representatives testified that rescission is very rare, only effecting one half of one percent of customers.&lt;br /&gt;
&lt;br /&gt;
And no one called them out on the uselessness of that number! 0.5% of all insurance customers is entirely uninformative! What really matters is what percent of people who &lt;i&gt;file claims&lt;/i&gt; are dropped. And even then, what really matters is how often people who have really severe, and expensive illnesses get dropped. &lt;a href="http://tauntermedia.com/2009/07/28/unconscionable-math/"&gt;This blog post&lt;/a&gt; estimated that it was close to 50% of people who file large claims get their health insurance dropped.&lt;br /&gt;
&lt;br /&gt;
And that's not rocket science! Yet, no one called these representatives out on the (probably intentional) uselessness of their data! I remember thinking to myself "What's wrong with all of you!?" &lt;br /&gt;
&lt;br /&gt;
Here's what it comes down to. As I see it, if you don't understand data, then you don't understand the world, and you will make bad decisions, and be taken advantage of.&lt;br /&gt;
&lt;br /&gt;
In conclusion, I think it would be a great idea to overhaul high school mathematics to make statistics the end game, instead of calculus, as proposed by Arthur Benjamin in &lt;a href="http://www.ted.com/talks/lang/eng/arthur_benjamin_s_formula_for_changing_math_education.html"&gt;this TED talk&lt;/a&gt;.&lt;br /&gt;
&lt;!--copy and paste--&gt;&lt;object width="446" height="326"&gt;&lt;param name="movie" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true" /&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/param&gt;&lt;param name="bgColor" value="#ffffff"&gt;&lt;/param&gt;&lt;param name="flashvars" value="vu=http://video.ted.com/talks/dynamic/ArthurBenjamin_2009-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/ArthurBenjamin-2009.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=587&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=arthur_benjamin_s_formula_for_changing_math_education;year=2009;theme=numbers_at_play;theme=ted_in_3_minutes;theme=bold_predictions_stern_warnings;theme=how_we_learn;event=How+We+Learn;tag=economics;tag=education;tag=math;tag=statistics;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" /&gt;&lt;embed src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" pluginspace="http://www.macromedia.com/go/getflashplayer" type="application/x-shockwave-flash" wmode="transparent" bgColor="#ffffff" width="446" height="326" allowFullScreen="true" allowScriptAccess="always" flashvars="vu=http://video.ted.com/talks/dynamic/ArthurBenjamin_2009-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/ArthurBenjamin-2009.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=587&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=arthur_benjamin_s_formula_for_changing_math_education;year=2009;theme=numbers_at_play;theme=ted_in_3_minutes;theme=bold_predictions_stern_warnings;theme=how_we_learn;event=How+We+Learn;tag=economics;tag=education;tag=math;tag=statistics;"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-701779941350240487?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/04/quantiative-reasoning-fail.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>2</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-8541219070675134992</guid><pubDate>Sun, 03 Apr 2011 17:47:00 +0000</pubDate><atom:updated>2011-04-03T13:49:51.193-04:00</atom:updated><title>Intertwound</title><description>I recently had a conversation with someone who repeatedly used "intertwound" for the past participle of &lt;i&gt;intertwine&lt;/i&gt;. If you &lt;a href="http://lmgtfy.com/?q=intertwound"&gt;Google&lt;/a&gt; &lt;i&gt;intertwound&lt;/i&gt;, you only get about 160 hits. Admittedly, trying to figure out why a small (160 hits out of the whole internet? Maybe "infinitesimal" is a better word.) number of people do something out of the ordinary isn't necessarily interesting or fruitful. However, these hits all seem to be unreflecting attempts at forming the past participle of &lt;i&gt;intertwine&lt;/i&gt;, and damnit, I'm intrigued.&lt;br /&gt;
&lt;br /&gt;
At first I thought this was a natural enough reanalysis to make, thinking that &lt;i&gt;twine&lt;/i&gt; formed its participle by changing the vowel from /ay/ to /aw/. BUT! As far as I can tell, all verbs which form their participle by changing /ay/ to /aw/ have the coda /aynd/ (&lt;i&gt;wind, find, bind, grind&lt;/i&gt;). &lt;i&gt;Twine&lt;/i&gt; ends in /ayn/.&lt;br /&gt;
&lt;br /&gt;
So how did anyone come to reanalyze &lt;i&gt;twine&lt;/i&gt;. There are a few possibilities. First, perhaps the /ay/ → /aw/ rule generalized to &lt;i&gt;twine&lt;/i&gt; despite its not exactly having the right phonological shape. Second, it might be the case that some people have misanalysed this structure:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-9fbCzrlpvjU/TZivMGD52zI/AAAAAAAAAks/zECfA7VBqGA/s1600/intertwined.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/-9fbCzrlpvjU/TZivMGD52zI/AAAAAAAAAks/zECfA7VBqGA/s1600/intertwined.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
for this structure&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-zNUMQf_WtS0/TZivVtL62LI/AAAAAAAAAkw/jBq17D9Dzaw/s1600/intertwind.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/-zNUMQf_WtS0/TZivVtL62LI/AAAAAAAAAkw/jBq17D9Dzaw/s1600/intertwind.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
That is, they've reanalyzed the final /d/ in &lt;i&gt;intertwined&lt;/i&gt; as actually being part of the stem. This way, the stem actually does have the /aynd/ coda, making it natural to extend the /ay/ → /aw/ rule to it.&lt;br /&gt;
&lt;br /&gt;
A third possibility, and the one that I think is closer to being the correct one, is that some people have reanalyzed the structure of &lt;i&gt;intertwined&lt;/i&gt; as&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-87gW4wSeXVo/TZiwCWGRi-I/AAAAAAAAAk0/oziSDBOfWbg/s1600/inter-t-Wind.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/-87gW4wSeXVo/TZiwCWGRi-I/AAAAAAAAAk0/oziSDBOfWbg/s1600/inter-t-Wind.png" /&gt;&lt;/a&gt;&lt;/div&gt;There is definitely a similarity between the meaning of &lt;i&gt;wind&lt;/i&gt; and &lt;i&gt;intertwined&lt;/i&gt;, so it might not be the craziest thing to think that &lt;i&gt;wind&lt;/i&gt; must be in &lt;i&gt;intertwined&lt;/i&gt; somewhere. Of course, that means there's this &lt;i&gt;-t-&lt;/i&gt; stuck in there which doesn't really mean anything at all.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-8541219070675134992?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/04/intertwound.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-9fbCzrlpvjU/TZivMGD52zI/AAAAAAAAAks/zECfA7VBqGA/s72-c/intertwined.png" height="72" width="72" /><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-7621187828792329687</guid><pubDate>Tue, 08 Mar 2011 17:51:00 +0000</pubDate><atom:updated>2011-03-08T12:51:53.862-05:00</atom:updated><title>You've got to do it.</title><description>This goes out to everyone to everyone I know, but especially my friends in academia. I feel like the message of this song is what most meetings with advisors are about, but who can say it better than Mr. Rogers? Do yourself a favor, follow this link:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://pbskids.org/rogers/songLyricsYouveGotToDoIt.html"&gt;This one.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;And listen to the song. Lyrics reposted here:&lt;br /&gt;
&lt;br /&gt;
&lt;blockquote&gt;&lt;h3&gt;You've got to do it&lt;/h3&gt;&lt;h2&gt;by Fred Rogers&lt;/h2&gt;You can make believe it happens,&lt;br /&gt;
Or pretend that something's true.&lt;br /&gt;
You can wish or hope or contemplate&lt;br /&gt;
A thing you'd like to do.&lt;br /&gt;
But until you start to do it,&lt;br /&gt;
You will never see it through.&lt;br /&gt;
'Cause the make-believe pretending&lt;br /&gt;
Just won't do it for you&lt;br /&gt;
&lt;br /&gt;
You've got to do it.&lt;br /&gt;
Every little bit&lt;br /&gt;
You've got to do it, do it, do it, do it&lt;br /&gt;
And when you're through,&lt;br /&gt;
You can know who did it,&lt;br /&gt;
For you did it, you did it, you did it.&lt;br /&gt;
&lt;br /&gt;
If you want to ride a bicycle&lt;br /&gt;
And ride it straight and tall.&lt;br /&gt;
You can't simply sit and look at it&lt;br /&gt;
'Cause it won't move at all.&lt;br /&gt;
But it's you who have to try it.&lt;br /&gt;
And it's you who have to fall (sometimes)&lt;br /&gt;
If you want to ride a bicycle&lt;br /&gt;
And ride it straight and tall.&lt;br /&gt;
&lt;br /&gt;
You've got to do it.&lt;br /&gt;
Every little bit&lt;br /&gt;
You've got to do it, do it, do it, do it&lt;br /&gt;
And when you're through,&lt;br /&gt;
You can know who did it,&lt;br /&gt;
For you did it, you did it, you did it.&lt;br /&gt;
&lt;br /&gt;
It's not easy to keep trying&lt;br /&gt;
But it's one good way to grow.&lt;br /&gt;
It's not easy to keep learning&lt;br /&gt;
But I know that this is so.&lt;br /&gt;
When you've tried and learned&lt;br /&gt;
You're bigger than you were a day ago.&lt;br /&gt;
It's not easy to keep trying&lt;br /&gt;
But it's one way to grow.&lt;br /&gt;
&lt;br /&gt;
You've got to do it.&lt;br /&gt;
Every little bit&lt;br /&gt;
You've got to do it, do it, do it, do it&lt;br /&gt;
And when you're through,&lt;br /&gt;
You can know who did it,&lt;br /&gt;
For you did it, you did it, you did it.&lt;br /&gt;
&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-7621187828792329687?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/03/youve-got-to-do-it.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>0</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-3331198920796323635</guid><pubDate>Mon, 07 Mar 2011 18:18:00 +0000</pubDate><atom:updated>2011-03-07T14:37:48.480-05:00</atom:updated><title>Most Typical Person</title><description>There has been an infographic video floating around which was put together by National Geographic called "7 Billion: Are You Typical?" Frankly, it bugs me a bunch on a few points.&lt;br /&gt;
&lt;br /&gt;
&lt;div&gt;&lt;object height="322" width="512"&gt;&lt;param name="movie" value="http://d.yimg.com/static.video.yahoo.com/yep/YV_YEP.swf?ver=2.2.46" /&gt;&lt;param name="allowFullScreen" value="true" /&gt;&lt;param name="AllowScriptAccess" VALUE="always" /&gt;&lt;param name="bgcolor" value="#000000" /&gt;&lt;param name="flashVars" value="id=24401530&amp;vid=8767738&amp;lang=en-us&amp;intl=us&amp;thumbUrl=http%3A//l.yimg.com/a/i/us/sch/cn/video08/8767738_rnd5671dda5_19.jpg&amp;embed=1&amp;ap=12135647" /&gt;&lt;embed src="http://d.yimg.com/static.video.yahoo.com/yep/YV_YEP.swf?ver=2.2.46" type="application/x-shockwave-flash" width="512" height="322" allowFullScreen="true" AllowScriptAccess="always" bgcolor="#000000" flashVars="id=24401530&amp;vid=8767738&amp;lang=en-us&amp;intl=us&amp;thumbUrl=http%3A//l.yimg.com/a/i/us/sch/cn/video08/8767738_rnd5671dda5_19.jpg&amp;embed=1&amp;ap=12135647" &gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;
&lt;a href="http://video.yahoo.com/watch/8767738/24401530"&gt;7 Billion: Are You Typical? — National Geographic Magazine&lt;/a&gt; @ &lt;a href="http://video.yahoo.com/"&gt;Yahoo! Video&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
I think this video plays too fast and loose with exceedingly simplistic statistical notions, resulting in simplistic results.&lt;br /&gt;
&lt;br /&gt;
To begin with, the video asserts that the most typical human is a 28 year old Han Chinese man. Now, I'm not sure what they mean by "typical human." My default assumption would be that the most typical human traits would be ones which are most evenly distributed across the world. In fact, National Geographic is treating the world as a large, uniformly mixed urn of people, and if you were to randomly draw from that urn, a 28 year old Han Chinese man has the highest expectation.&lt;br /&gt;
&lt;br /&gt;
Except, that's probably not true either. I've turned to Wolfram Alpha to check some global stats. And it is true that &lt;a href="http://www.wolframalpha.com/input/?i=World+median+age"&gt;the global median age is about 28&lt;/a&gt;, the largest ethnic group is Han Chinese, and that &lt;a href="http://www.wolframalpha.com/input/?i=world+male+population"&gt;50.3% of the world population is male&lt;/a&gt;. However, &lt;a href="http://www.wolframalpha.com/input/?i=China+male+median+age"&gt;the median age for men in China is 33.5 years&lt;/a&gt;. This "most typical" person is actually not especially typical for his country, based on this age pyramid.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-gUHiHWB4GYk/TXUSoL_wVkI/AAAAAAAAAkk/gc_atarf2Eo/s1600/wolframalpha-20110307111401054.gif" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="296" src="http://4.bp.blogspot.com/-gUHiHWB4GYk/TXUSoL_wVkI/AAAAAAAAAkk/gc_atarf2Eo/s400/wolframalpha-20110307111401054.gif" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
It looks like if you were to randomly sample China, someone in the 20-24 year old age bin would be more typical than a 28 year old.&lt;br /&gt;
&lt;br /&gt;
At some point, the video also says the most typical person in the world owns a cell phone. That doesn't appear to be true for China (&lt;a href="http://www.wolframalpha.com/input/?i=china+cell+phone+/+china+population"&gt;.48 cell phones per capita&lt;/a&gt;), nor does it appear to be true for the world at large (&lt;a href="http://www.wolframalpha.com/input/?i=world+cell+phone+/+world+population"&gt;.32 cell phones per capita&lt;/a&gt;). Maybe this is just an issue with differing data sources, or maybe National Geographic was playing some complicated word games. The actual line is "The most typical person has a cell phone, but not a bank account."  Maybe what they mean is that if you were to place the world population into this table:&lt;br /&gt;
&lt;br /&gt;
&lt;center&gt;&lt;br /&gt;
&lt;table border="1" frame="void" rules="all"&gt;&lt;tbody&gt;
&lt;tr&gt;       &lt;th&gt;&lt;/th&gt;       &lt;th&gt;Bank Account&lt;/th&gt;       &lt;th&gt;No Bank Account&lt;/th&gt;    &lt;/tr&gt;
&lt;tr&gt;    &lt;/tr&gt;
&lt;tr&gt;      &lt;td&gt;&lt;b&gt;Cell Phone&lt;/b&gt;&lt;/td&gt;      &lt;td align="center"&gt;A&lt;/td&gt;      &lt;td align="center"&gt;B&lt;/td&gt;    &lt;/tr&gt;
&lt;tr&gt;    &lt;/tr&gt;
&lt;tr&gt;       &lt;td&gt;&lt;b&gt;No Cell Phone&lt;/b&gt;&lt;/td&gt;       &lt;td align="center"&gt;C&lt;/td&gt;       &lt;td align="center"&gt;D&lt;/td&gt;    &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/center&gt;&lt;br /&gt;
cell B has the most people in it. But, that would be a strange thing to mean, especially because &lt;i&gt;most&lt;/i&gt; people would fall into cells A, C and D. In that case, it would probably be more accurate to say that the most typical person either has both a cell phone and a bank account, or neither.&lt;br /&gt;
&lt;br /&gt;
(&lt;span class="Apple-style-span" style="color: red;"&gt;Edit: Thinking about it now, if A+B = 32% according to Wolfram Alpha, then C + D = 68%, and there's no way to distributed 68% between those two cells so that they're both less common than B.&lt;/span&gt;)&lt;br /&gt;
&lt;br /&gt;
Maybe this is all quibbling over details, but what does &lt;i&gt;any&lt;/i&gt; of this video matter of the facts it gives are all a little off? I too could create an infographic video with flashy animation and an inspiring sound track (well, let's say I could, for the sake of argument), but with completely fabricated numbers. That video would not count for anything, because &lt;i&gt;it matters whether the facts are true, and accurate&lt;/i&gt;.&lt;br /&gt;
&lt;br /&gt;
So what is the point of this video? They tip their hand at the end. First, they say "typical is always relative," which is true, but they seem to be trying to deconstruct the notion that statistical generalizations are possible or useful. They end with talking about individual choices, and that "our choices make a big difference." Really, they seem to be pushing the idea that we are all individually the authors of global society, a notion that I find only slightly more plausible than the structure of the universe being moulded by our consciousness. &lt;br /&gt;
&lt;br /&gt;
I read this as being a very American conception, and looking at the video again, it seems to really be all about contemporary American anxiety about the economic development of China and India.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-3331198920796323635?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/03/most-typical-person.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-gUHiHWB4GYk/TXUSoL_wVkI/AAAAAAAAAkk/gc_atarf2Eo/s72-c/wolframalpha-20110307111401054.gif" height="72" width="72" /><thr:total>2</thr:total></item><item><guid isPermaLink="false">tag:blogger.com,1999:blog-232777626311457607.post-354320609555564800</guid><pubDate>Fri, 25 Feb 2011 21:45:00 +0000</pubDate><atom:updated>2011-02-25T16:55:47.667-05:00</atom:updated><category domain="http://www.blogger.com/atom/ns#">tool</category><category domain="http://www.blogger.com/atom/ns#">praat</category><title>Hand Coder Praat Script</title><description>I've written a Praat script for general hand coding of segmental variation, relying upon forced alignments produced by P2FA or FAAValign.&lt;br /&gt;
&lt;br /&gt;
The most recent version of the script is available here:&lt;br /&gt;
&lt;a href="https://github.com/JoFrhwld/FAAV/raw/master/praat/handCoder.praat"&gt;https://github.com/JoFrhwld/FAAV/raw/master/praat/handCoder.praat&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
Background and documentation below.&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Background&lt;/h2&gt;&lt;br /&gt;
Recently, we hosted here at Penn a workshop on &lt;a href="http://www.ling.upenn.edu/phonetics/workshop/"&gt;New Tools and Methods for Very-Large-Scale Phonetics Research&lt;/a&gt;. It was definitely my kind of workshop. The talks and the posters were all very high quality, and very interesting.&lt;br /&gt;
&lt;br /&gt;
One tool that was featured rather prominently was the &lt;a href="http://www.ling.upenn.edu/phonetics/p2fa/"&gt;Penn Phonetics Lab Forced Aligner (P2FA)&lt;/a&gt;. This tool takes as input a recording of speech, a transcription of the speech, and returns a word and phone level alignment of the transcription to the audio (please see the P2FA page for more details).&lt;br /&gt;
&lt;br /&gt;
Of course, once you have a large corpus of time aligned transcription, the ideal thing to do is an automated analysis of the acoustic data. In fact, this is the goal of the FAAV project, which focuses on analyzing vowel formant data. The most recent version of our code to automatically analyze vowels is hosted here: &lt;a href="https://github.com/JoFrhwld/FAAV/tree/master/extractFormants"&gt;https://github.com/JoFrhwld/FAAV/tree/master/extractFormants&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
However, for most purposes, there doesn't already exist an automated method for acoustic analysis. For example, if you wanted to study -ing ~ -in variation, or TD deletion, you would have to first build a classifier, which would require some hand coded data anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;h2&gt;Documentation&lt;/h2&gt;So, I've written an interactive Praat script that allows you to rather flexibly define segments to search for, narrow down the search context to specific word and segmental contexts, and define segmental contexts to exclude, as well as a list of stop words. Given an audio file, and the output of P2FA or FAAValign, the script will search for the specified contexts, play them, and allow you to enter a code. It will then write your code along with other important information about the token which can be used for analysis in and of itself, or as training data for a classifier.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Setup&lt;/b&gt;&lt;br /&gt;
Open a Long Sound file and a Text Grid into Praat. These two objects must have the same name. Next open handCoder.praat. To run the script, select Run&amp;gt;Run.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="441" src="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup1.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Defining the Search&lt;/b&gt;&lt;br /&gt;
A dialogue box will open, allowing you to define segments to search for, and refinements of the search context. The default settings are for coding TD deletion.&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup2_td.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup2_td.png" width="358" /&gt;&lt;/a&gt;&lt;/div&gt;You can understand these settings this way:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Search objects with these names.&lt;/li&gt;
&lt;li&gt;Send output to this file.&lt;/li&gt;
&lt;li&gt;Search for T and D.&lt;/li&gt;
&lt;li&gt;Restrict the search to word final contexts.&lt;/li&gt;
&lt;li&gt;The segment must be preceded by a consonant.&lt;/li&gt;
&lt;li&gt;No restriction on following context.&lt;/li&gt;
&lt;li&gt;Exclude segments preceded by R.&lt;/li&gt;
&lt;li&gt;Exclude segments followed by T, D, TH, DH, JH, and CH.&lt;/li&gt;
&lt;li&gt;Exclude AND.&lt;/li&gt;
&lt;li&gt;Play a window of 3 words preceding and following the word the segment is in.&lt;/li&gt;
&lt;li&gt;There is no default code&lt;/li&gt;
&lt;/ul&gt;These are what the settings from -ing, or str- coding would look like.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup2_ing.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup2_ing.png" width="287" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup2_str.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="https://github.com/JoFrhwld/FAAV/raw/master/praat/docs/handCoder/setup2_str.png" width="287" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;b&gt;Coding&lt;/b&gt;&lt;br /&gt;
As the script runs, it will play segments of the audio surrounding segments which meet the search criteria. Then, the coding window will open. It contains two fields: one for codes, and one for comments. After entering codes and comments, hitting enter, or clicking on Continue will move along to the next segment.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Output&lt;/b&gt;&lt;br /&gt;
The output of this script is a tab delimited file with the following pieces of data for each segment.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Object name&lt;/li&gt;
&lt;li&gt;Segment of focus&lt;/li&gt;
&lt;li&gt;Word position of the segment&lt;/li&gt;
&lt;li&gt;Code from the coding field&lt;/li&gt;
&lt;li&gt;Time of segment start&lt;/li&gt;
&lt;li&gt;Time of segment end&lt;/li&gt;
&lt;li&gt;Word of focus&lt;/li&gt;
&lt;li&gt;Word start&lt;/li&gt;
&lt;li&gt;Word end&lt;/li&gt;
&lt;li&gt;Preceding segment&lt;/li&gt;
&lt;li&gt;Preceding segment start&lt;/li&gt;
&lt;li&gt;Preceding segment end&lt;/li&gt;
&lt;li&gt;Window duration&lt;/li&gt;
&lt;li&gt;Vowels per second in the window&lt;/li&gt;
&lt;li&gt;Comments&lt;/li&gt;
&lt;/ul&gt;&lt;br /&gt;
&lt;h2&gt;Feedback&lt;/h2&gt;Please feel free to contact me with any comments or question. You can find my e-mail on my website: &lt;a href="http://www.ling.upenn.edu/~joseff/"&gt;http://www.ling.upenn.edu/~joseff/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/232777626311457607-354320609555564800?l=val-systems.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><link>http://val-systems.blogspot.com/2011/02/hand-coder-praat-script.html</link><author>noreply@blogger.com (Josef Fruehwald)</author><thr:total>0</thr:total></item></channel></rss>

