<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><!-- generator="wordpress/2.2" --><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Identity Resolution Daily</title>
	<link>http://identityresolutiondaily.com</link>
	<description>All About Identity and Entity Resolution</description>
	<pubDate>Fri, 06 Nov 2009 15:53:39 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2</generator>
	<language>en</language>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/identityresolutiondaily/VoRE" type="application/rss+xml" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
		<title>Identity Resolution Daily Links 2009-11-06</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/39QgKy0HKeA/</link>
		<comments>http://identityresolutiondaily.com/649/identity-resolution-daily-links-2009-11-06/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 15:53:38 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[EHR]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[EMPI]]></category>

		<category><![CDATA[Healthcare]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[EMR]]></category>

		<category><![CDATA[Product Information Management]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Homeland Security]]></category>

		<category><![CDATA[National Security]]></category>

		<category><![CDATA[Security]]></category>

		<category><![CDATA[Secure Flight]]></category>

		<category><![CDATA[Master Data Management]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Insurance Fraud]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>daily link posts</category><category>data quality</category><category>data quality pro</category><category>data quality software</category><category>Department of Homeland Security</category><category>DHS</category><category>EHR</category><category>electronic health records</category><category>electronic medical records</category><category>emr</category><category>entity analytics</category><category>entity matching</category><category>entity resolution</category><category>entity resolution and analysis</category><category>Gartner</category><category>Homeland Security</category><category>identity matching</category><category>identity resolution</category><category>identity resolution Identity Resolution Engine identity resolution and analytics</category><category>identity resolution and analytics</category><category>identity resolution daily</category><category>Infoglide</category><category>infoglide software</category><category>initiate systems</category><category>insurance claims fraud</category><category>insurance fraud</category><category>Jonathan McDonald</category><category>master data management</category><category>MDM</category><category>no fly list</category><category>non obvious relationship</category><category>non obvious relationship awareness</category><category>NORA</category><category>Paul Leyh</category><category>PIM</category><category>product information management</category><category>Robert Barker</category><category>secure flight</category><category>transportation security administration</category><category>TSA</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/649/identity-resolution-daily-links-2009-11-06/</guid>
		<description><![CDATA[[Post from Infoglide] The Other Half of Entity Resolution
&#8220;In a recent post, Jonathan McDonald quotes one definition of entity resolution: &#8216;According to Gartner, entity resolution is &#8216;the capability to resolve multiple labels for individuals, products or other noun classes of data into a single resolved entity when pseudonyms, alias names or other synonym-style constructs exist.&#8217; [...]]]></description>
			<content:encoded><![CDATA[<p>[Post from <a href="http://www.infoglide.com/">Infoglide</a>] <a href="http://identityresolutiondaily.com/648/the-other-half-of-entity-resolution/">The Other Half of Entity Resolution</a></p>
<blockquote><p>&#8220;In a recent post, Jonathan McDonald quotes one definition of entity resolution: &#8216;According to Gartner, entity resolution is &#8216;the capability to resolve multiple labels for individuals, products or other noun classes of data into a single resolved entity when pseudonyms, alias names or other synonym-style constructs exist.&#8217; &#8230;While the definition nicely captures the value of &#8216;first degree&#8217; entity resolution, it falls short by omitting non-obvious relationship detection.&#8221;</p></blockquote>
<p><a href="http://www.ihealthbeat.org/articles/2009/11/5/study-us-lags-behind-many-other-countries-in-ehr-use.aspx">iHealthBeat: Study: U.S. Lags Behind Many Other Countries in EHR Use</a></p>
<blockquote><p>&#8220;The study found that 46% of U.S. physicians use <a href="http://en.wikipedia.org/wiki/Electronic_health_record">electronic health records</a>, up from 28% in 2006. The researchers found that 99% of doctors in the Netherlands use EHRs. Australia, Italy, New Zealand, Norway, Sweden and the U.K. also reported EHR adoption rates of 94% or higher.&#8221;</p></blockquote>
<p><a href="http://www.dataqualitypro.com/data-quality-home/profit-by-data-quality-best-practices.html">data quality PRO: Profit by Data Quality Best Practices</a></p>
<blockquote><p> &#8220;Insurers use data to manage litigation, detect fraudulent claims and limit financial exposure to claims through reinsurance, but this practice works only when the data is credible. It is no overstatement that sound, profitable property / casualty operations begin – and end – with <a href="http://en.wikipedia.org/wiki/Data_quality">quality data</a>.&#8221;</p></blockquote>
<p><a href="http://www.federalnewsradio.com/index.php?nid=15&amp;sid=1801900">Federal News Radio: What airline passengers need to know about TSA&#8217;s Secure Flight program</a></p>
<blockquote><p> &#8220;The information is then used &#8216;behind the scenes&#8217; to match against the <a href="http://en.wikipedia.org/wiki/No_fly_list">No-Fly list</a>. &#8216;It&#8217;s a behind the scenes process,&#8217; said Leyh. &#8216;If you get to the airport and you have your boarding pass, the <a href="http://en.wikipedia.org/wiki/Secure_Flight">Secure Flight</a> part of it, and the watch list matching part of it, is over. It&#8217;s done with.&#8217;&#8221;</p></blockquote>
<p><a href="http://www.information-management.com/news/master_data_management_customer_relationship_pim_crm_mdm-10016425-1.html">information management: Inefficiency as a Standard in Product Information Management</a></p>
<blockquote><p>&#8220;<a href="http://en.wikipedia.org/wiki/Product_information_management">Managing product information</a> across a large organization consists of much more than making sure prices and descriptions are accurate and consistent. Large manufacturers and retailers employ teams of people tasked with the job of cross checking product data. While the deployment of these teams is a good idea in theory, the process is loaded with inefficiency and errors are all but guaranteed.&#8221;</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/649/identity-resolution-daily-links-2009-11-06/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/649/identity-resolution-daily-links-2009-11-06/</feedburner:origLink></item>
		<item>
		<title>The Other Half of Entity Resolution</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/BBqiDGekljc/</link>
		<comments>http://identityresolutiondaily.com/648/the-other-half-of-entity-resolution/#comments</comments>
		<pubDate>Wed, 04 Nov 2009 20:47:20 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[Product Information Management]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Workers Compensation Fraud]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Retail Security]]></category>

		<category><![CDATA[Insurance Fraud]]></category>

		<category><![CDATA[Returns Fraud]]></category>

		<category><![CDATA[Lottery Fraud]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Loss Prevention]]></category>
<category>initiate systems</category><category>insider trading</category><category>insurance fraud</category><category>Jonathan McDonald</category><category>lotteries</category><category>lottery</category><category>lottery fraud</category><category>lottery retailer fraud</category><category>lottery ticket theft</category><category>lotto</category><category>lotto ticket</category><category>non obvious relationship</category><category>non obvious relationship awareness</category><category>NORA</category><category>retail crime</category><category>retail fraud</category><category>retail returns fraud</category><category>retail security</category><category>retail shrinkage</category><category>retail theft</category><category>returns fraud</category><category>returns management</category><category>Robert Barker</category><category>workers comp</category><category>workers compensation fraud</category><category>workers comp fraud</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/648/the-other-half-of-entity-resolution/</guid>
		<description><![CDATA[By Robert Barker, Infoglide Senior VP &#38; Chief Marketing Officer
In a recent post, Jonathan McDonald quotes one definition of entity resolution:
According to Gartner, entity resolution is “the capability to resolve multiple labels for individuals, products or other noun classes of data into a single resolved entity when pseudonyms, alias names or other synonym-style constructs exist. [...]]]></description>
			<content:encoded><![CDATA[<p>By Robert Barker, <a href="http://www.infoglide.com/">Infoglide</a> Senior VP &amp; Chief Marketing Officer</p>
<p>In a <a href="http://blog.initiate.com/wordpress/index.php/2009/11/02/entity-resolution-to-combat-criminal-and-terrorist-activities/">recent post</a>, Jonathan McDonald quotes one definition of entity resolution:</p>
<blockquote><p>According to Gartner, entity resolution is “the capability to resolve multiple labels for individuals, products or other noun classes of data into a single resolved entity when pseudonyms, alias names or other synonym-style constructs exist. This is especially true in cases wherein there exists intentional falsification of information or the creation of false identities. While most prevalent in detecting perpetrators of criminal or illegal activity, more-commercial applications exist as well.”</p></blockquote>
<p>While the definition nicely captures the value of “first degree” <a href="http://en.wikipedia.org/wiki/Entity_resolution">entity resolution</a>, it falls short by omitting non-obvious relationship detection.</p>
<p>Basic entity resolution determines <em>“who’s who”</em> by sifting through massive amounts of noun/attribute data in multiple disparate data sources. Cutting through ambiguity caused by missing attributes, pseudonyms, aliases, and obvious efforts to deceive, it mines and resolves the essential elements of identity to form an unambiguous picture that greatly enhances business decisions and reduces risk.</p>
<p>However, in many application domains, pinpointing <em>“who knows whom”</em> is equally valuable. In <a href="http://identityresolutiondaily.com/423/leveraging-identity-resolution-data-sources/">detecting insider trading</a>, for example, it’s <em>important</em> to resolve identity information to achieve an unambiguous picture of a person of interest, but to expose fraudulent activity, it’s <em>critical</em> to identify second- and third-degree linkages between suspects and their friends, relatives, and business associates.</p>
<p>More examples abound. In insurance, fraudsters change roles each time they stage a car accident and also intentionally modify their identities in accident reports. <a href="http://www.infoglide.com/documents/whitepapers/PERMISSION%20TO%20DISTRIBUTE%20-%20Introducing%20Identity%20Resolution%20-%20Clendenen.pdf">Fraudulent employers who want to reduce their workers’ compensation premiums</a> will close their company and start a new one with modified identities of corporate officers. In retail, non-receipted returns of merchandise are often linked to store employees and the customers they enlist to act as their confederates. The list goes on and on.</p>
<p>In each case, entity resolution finds hidden connections by evaluating multiple ambiguous attributes with the same algorithms used to resolve identities. <a href="http://identityresolutiondaily.com/521/dateline-nbc-on-lottery-fraud/">A retail employee who takes a customer’s winning lottery ticket</a> (while telling the customer he didn’t win!) can be traced through address and phone information to other suspiciously connected people, e.g., frequent lottery winners and lottery commission employees.</p>
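<p>To make the idea of tracing people through shared address and phone information concrete, here is a minimal sketch in Python. It is an illustration only, not Infoglide’s algorithm: the records, names, and field values are invented, and exact matching on normalized values stands in for the similarity scoring a real identity resolution engine would apply to ambiguous attributes. The helper names (<code>normalize</code>, <code>build_links</code>, <code>degrees_from</code>) are hypothetical.</p>

```python
from collections import defaultdict, deque

# Hypothetical records: every name, address, and phone number is invented for illustration.
RECORDS = [
    {"id": "clerk", "name": "R. Smith", "address": "12 Oak St", "phone": "555-0101"},
    {"id": "winner_a", "name": "Ann Jones", "address": "12 Oak St", "phone": "555-0199"},
    {"id": "winner_b", "name": "Bob Lee", "address": "40 Elm Ave", "phone": "555-0199"},
    {"id": "official", "name": "C. Diaz", "address": "40 Elm Ave", "phone": "555-0142"},
    {"id": "bystander", "name": "Dee Park", "address": "7 Pine Rd", "phone": "555-0177"},
]


def normalize(value):
    """Lowercase and drop punctuation and whitespace so trivial spelling differences compare equal."""
    return "".join(ch for ch in value.lower() if ch.isalnum())


def build_links(records, fields=("address", "phone")):
    """Link two records whenever they share a normalized value in any of the given fields."""
    by_value = defaultdict(set)
    for rec in records:
        for field in fields:
            by_value[(field, normalize(rec[field]))].add(rec["id"])
    links = defaultdict(set)
    for ids in by_value.values():
        for a in ids:
            links[a].update(ids - {a})
    return links


def degrees_from(links, start):
    """Breadth-first search: map each reachable record id to its degree of separation from start."""
    degrees = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for neighbor in links[current]:
            if neighbor not in degrees:
                degrees[neighbor] = degrees[current] + 1
                queue.append(neighbor)
    return degrees


links = build_links(RECORDS)
degrees = degrees_from(links, "clerk")
print(degrees)
```

<p>In this toy data the clerk shares an address with one winner, who shares a phone number with a second winner, who in turn shares an address with a lottery official, so the search surfaces second- and third-degree connections that no single record reveals. The unrelated record is never reached.</p>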
<p>With apologies to the experts at <a href="http://www.gartner.com/technology/home.jsp">Gartner</a>, here’s a suggested addition to the definition that acknowledges the other half of entity resolution:</p>
<blockquote><p>The capability to (a) resolve multiple labels for individuals, products or other noun classes of data into a single resolved entity when ambiguity from pseudonyms, alias names or other synonym-style constructs exists, and <strong><em>(b) expose hidden connections between entities that are two or more degrees of separation apart</em></strong>. This is especially true in cases where there exists intentional falsification of information or the creation of false identities. While most prevalent in detecting perpetrators of criminal or illegal activity, more-commercial applications exist as well.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/648/the-other-half-of-entity-resolution/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/648/the-other-half-of-entity-resolution/</feedburner:origLink></item>
		<item>
		<title>Identity Resolution Daily Links 2009-11-02</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/m1dbAF1ZgRE/</link>
		<comments>http://identityresolutiondaily.com/647/identity-resolution-daily-links-2009-11-02/#comments</comments>
		<pubDate>Mon, 02 Nov 2009 17:19:15 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Data Warehousing]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Decision Management]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[EHR]]></category>

		<category><![CDATA[Data Integration]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[EMR]]></category>

		<category><![CDATA[Healthcare]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Fusion Center]]></category>

		<category><![CDATA[Privacy]]></category>

		<category><![CDATA[Homeland Security]]></category>

		<category><![CDATA[Federal Government]]></category>

		<category><![CDATA[National Security]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Master Data Management]]></category>

		<category><![CDATA[Business Intelligence]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>austin fusion center</category><category>Austin Regional Intelligence Center</category><category>Bart Johnson</category><category>BI</category><category>business intelligence</category><category>business intelligence software</category><category>business intelligence technology</category><category>data integration</category><category>Department of Homeland Security</category><category>Deven McGraw</category><category>DHS</category><category>EHR</category><category>electronic health records</category><category>electronic medical records</category><category>emr</category><category>fusion center</category><category>fusion centers</category><category>fusion center network</category><category>Homeland Security</category><category>Jim Ericson</category><category>master data management</category><category>MDM</category><category>patient identification</category><category>privacy</category><category>privacy legislation</category><category>Robert Barker</category><category>TDWI World</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/647/identity-resolution-daily-links-2009-11-02/</guid>
		<description><![CDATA[By the Infoglide Team
Come by and see us at TDWI World in Orlando Nov. 3 &#38; 4, Booth 405
The Emculturated World: Unmanage Master Data Management
&#8220;MDM breaks down in the moment it becomes divorced from a practical, immediate attempt to capture just what is needed today. The moment it attempts to “bank” standard symbols ahead of [...]]]></description>
			<content:encoded><![CDATA[<p>By the <a href="http://www.infoglide.com/">Infoglide</a> Team</p>
<p><em>Come by and see us at <a href="http://www.tdwi.org/Orlando2009/">TDWI World</a> in Orlando Nov. 3 &amp; 4, Booth 405</em></p>
<p><a href="http://emculturate.wordpress.com/2009/11/02/unmanage-master-data-management/">The Emculturated World: Unmanage Master Data Management</a></p>
<blockquote><p>&#8220;<a href="http://en.wikipedia.org/wiki/Master_data_management">MDM</a> breaks down in the moment it becomes divorced from a practical, immediate attempt to capture just what is needed today. The moment it attempts to “bank” standard symbols ahead of their usage, the MDM process becomes speculative, and proscriptive.&#8221;</p></blockquote>
<p><a href="http://www.governing.com/column/can-i-say-no-electronic-health-record">Governing: Can I Say No to an Electronic Health Record?</a></p>
<blockquote><p>&#8220;In some instances, patients don’t even know <a href="http://en.wikipedia.org/wiki/Electronic_health_records">their information</a> is being shared. For example, if consumers turn over prescription drug records when applying for life insurance, the insurer will sometimes hand off the information to business partners who then hand it off to data miners. To keep a tighter grip on privacy, Deven McGraw, director of health privacy at the Center for Democracy and Technology, would like a set of rules that all organizations in the health IT world would have to follow.&#8221;</p></blockquote>
<blockquote><p><em>Related post: <a href="http://identityresolutiondaily.com/605/applying-identity-resolution-to-patient-identification-integrity/">&#8220;Applying Identity Resolution to Patient Identification Integrity&#8221;</a></em></p></blockquote>
<p><a href="http://www.mysanantonio.com/news/local_news/McManus_recalls_9-11_at_GEOINT_summit.html">San Antonio Express-News: McManus recalls 9-11 at GEOINT summit</a></p>
<blockquote><p>&#8220;Bart Johnson, acting undersecretary for intelligence and analysis with the <a href="http://en.wikipedia.org/wiki/United_States_Department_of_Homeland_Security">Homeland Security Department</a>, said cooperation is improving, although problems remain with security clearances and interdepartmental connectivity. &#8216;The federal government can only do so much in getting it down to the street level,&#8217; Johnson said. Homeland security and Justice Department officials have formed 72 “<a href="http://en.wikipedia.org/wiki/Fusion_centers">fusion centers</a>” — terrorism prevention and response centers where federal agencies work with the military, local law enforcement and private partners. Three are in Texas: <a href="http://identityresolutiondaily.com/611/austin-fusion-center-privacy-security/">Austin</a>, Dallas and Collin County near Dallas.&#8221;</p></blockquote>
<p><a href="http://www.information-management.com/blogs/-10016437-1.html">information management: From Search to Explore</a></p>
<blockquote><p>&#8220;It&#8217;s no surprise that people are looking at more and more internal and external resources for informed decision-making. In the internal case, <a href="http://en.wikipedia.org/wiki/Data_integration">data integration</a> is a foundation of master data management as well. But integration for <a href="http://en.wikipedia.org/wiki/Business_intelligence">BI</a> to common visual tools is increasingly taking place in subsystems, relational databases and cubes, and the visualization layer itself.&#8221;</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/647/identity-resolution-daily-links-2009-11-02/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/647/identity-resolution-daily-links-2009-11-02/</feedburner:origLink></item>
		<item>
		<title>Identity Resolution Daily Links 2009-10-30</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/ooT57y0an_0/</link>
		<comments>http://identityresolutiondaily.com/646/identity-resolution-daily-links-2009-10-30/#comments</comments>
		<pubDate>Fri, 30 Oct 2009 18:37:05 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Fusion Center]]></category>

		<category><![CDATA[Law Enforcement]]></category>

		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Data Profiling]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Lottery Fraud]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Homeland Security]]></category>

		<category><![CDATA[National Security]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Master Data Management]]></category>

		<category><![CDATA[Data Management]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>BeyeNETWORK</category><category>daily link posts</category><category>Daily Texan</category><category>data quality</category><category>data quality software</category><category>David Loshin</category><category>Department of Homeland Security</category><category>DHS</category><category>e discovery</category><category>electronically stored information</category><category>electronic discovery</category><category>fusion centers</category><category>fusion center network</category><category>Homeland Security</category><category>identity matching</category><category>identity resolution</category><category>identity resolution and analytics</category><category>lotteries</category><category>lottery</category><category>lottery fraud</category><category>lottery retailer fraud</category><category>lottery ticket theft</category><category>lotto</category><category>lotto ticket</category><category>Pankaj Joshi</category><category>privacy</category><category>secure flight</category><category>security</category><category>StoredIQ</category><category>transportation safety administration</category><category>TSA</category><category>USA Players</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/646/identity-resolution-daily-links-2009-10-30/</guid>
		<description><![CDATA[[Post from Infoglide] Enriching E-discovery Results with Identity Resolution
&#8220;Civil lawsuits often result in discovery orders from the court to produce every shred of possibly relevant internal communication. The need to comprehend patterns across the resulting vast amount of aggregated data is critical. To help organizations respond to these demands, powerful e-discovery software systems (e.g., see [...]]]></description>
			<content:encoded><![CDATA[<p>[Post from <a href="http://www.infoglide.com/">Infoglide</a>] <a href="http://identityresolutiondaily.com/645/enriching-e-discovery-results-with-identity-resolution/">Enriching E-discovery Results with Identity Resolution</a></p>
<blockquote><p>&#8220;Civil lawsuits often result in discovery orders from the court to produce every shred of possibly relevant internal communication. The need to comprehend patterns across the resulting vast amount of aggregated data is critical. To help organizations respond to these demands, powerful e-discovery software systems (e.g., see StoredIQ) create data topology maps that identify the relationships between active sources of multiple forms of electronically stored information (ESI).&#8221;</p></blockquote>
<p><a href="http://www.usaplayers.com/news/2009/gambling/october/lottery-winner-demands-payment-after-crooked-clerk-pilfers-ticket-11719.html">USA Players: Lottery Winner Demands Payment After Crooked Clerk Pilfers Ticket </a></p>
<blockquote><p>&#8220;Pankaj Joshi, the accused, was an employee at the convenience store in which Willis purchased his tickets. Joshi had allegedly told Willis that the ticket that he presumed was worth millions was worth only $2 dollars, which Joshi presumably paid to Willis. Joshi was charged with lottery fraud, and it is suspected that he took the winnings and fled to his homeland of Nepal.&#8221;</p>
<p><em>For more, see &#8220;<a href="http://identityresolutiondaily.com/521/dateline-nbc-on-lottery-fraud/">Lottery Fraud by Retailers Is an Identity Resolution Problem</a>&#8221;</em></p></blockquote>
<p><a href="http://www.dailytexanonline.com/top-stories/civil-liberties-groups-voice-fusion-center-apprehension-1.2038861">The Daily Texan: Civil liberties groups voice &#8216;fusion center&#8217; apprehension</a></p>
<blockquote><p> &#8220;It will be funded initially by <a href="http://en.wikipedia.org/wiki/Department_of_homeland_security">U.S. Department of Homeland Security</a> grants and will then become self-sustaining, using personnel already within APD’s budget. &#8216;It is really important for <a href="http://en.wikipedia.org/wiki/Fusion_centers">law enforcement to be able to share information</a> in a timely fashion, because when you share information, you can solve crimes quicker and, in some cases, prevent another serial offense from happening,&#8217; Carter said. Carter said Central Texas agencies possess large amounts of lawfully collected information, but separate information systems hinder the sharing of information.&#8221;</p></blockquote>
<p><a href="http://www.b-eye-network.com/view/11726">BeyeNETWORK: Master Data Management Checklist #5: Data Quality Mechanics</a></p>
<blockquote><p> [<a href="http://www.b-eye-network.com/blogs/loshin/">David Loshin</a>] &#8220;The ability to use the traditional <a href="http://en.wikipedia.org/wiki/Data_quality">data quality</a> toolset of data parsing, standardization and matching enables the development of a “customer master,” “product master,” “security master,” etc. that becomes the master entity index to be used for ongoing <a href="http://en.wikipedia.org/wiki/Identity_resolution">identity resolution</a> and elimination of duplicate entries.&#8221;</p></blockquote>
<p><a href="http://www.airlinesanddestinations.com/airlines/passenger-info-required-at-booking-under-tsas-secure-flight-program/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=passenger-info-required-at-booking-under-tsas-secure-flight-program">Airlines and Destinations: Passenger Info Required at Booking under TSA’s Secure Flight Program</a></p>
<blockquote><p> &#8220;When making a flight booking, each passenger must declare their full name just as it appears in their passport, as well as their gender and date of birth. The airline sends the information to the <a href="http://en.wikipedia.org/wiki/Transportation_Security_Administration">TSA</a> 72 hours before the flight departure time. The TSA compares the information with watch lists with the purpose of identifying suspected terrorists, preventing access to flights by passengers prohibited from flying, and identifying individuals for whom an enhanced security check should be performed.&#8221;</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/646/identity-resolution-daily-links-2009-10-30/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/646/identity-resolution-daily-links-2009-10-30/</feedburner:origLink></item>
		<item>
		<title>Enriching E-discovery Results with Identity Resolution</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/t6sUsbwxKtQ/</link>
		<comments>http://identityresolutiondaily.com/645/enriching-e-discovery-results-with-identity-resolution/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 16:14:13 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Law Enforcement]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Secure Flight]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Lottery Fraud]]></category>

		<category><![CDATA[Workers Compensation Fraud]]></category>

		<category><![CDATA[Identity Resolution]]></category>
<category>airline passenger screening</category><category>airport security</category><category>airport security checkpoint</category><category>airport security lines</category><category>data matching</category><category>e discovery</category><category>electronic discovery</category><category>entity matching</category><category>entity resolution</category><category>entity resolution and analysis</category><category>identity resolution and analytics</category><category>identity matching</category><category>identity resolution</category><category>identity resolution daily</category><category>Infoglide</category><category>infoglide software</category><category>law enforcement</category><category>lotteries</category><category>lottery</category><category>lottery fraud</category><category>lottery retailer fraud</category><category>lottery ticket theft</category><category>lotto</category><category>lotto ticket</category><category>secure flight</category><category>StoredIQ</category><category>workers comp</category><category>workers compensation fraud</category><category>workers comp fraud</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/645/enriching-e-discovery-results-with-identity-resolution/</guid>
		<description><![CDATA[By Robert Barker, Infoglide Senior VP &#38; Chief Marketing Officer
Civil lawsuits often result in discovery orders from the court to produce every shred of possibly relevant internal communication. The need to comprehend patterns across the resulting vast amount of aggregated data is critical. To help organizations respond to these demands, powerful e-discovery software systems (e.g., [...]]]></description>
			<content:encoded><![CDATA[<p>By Robert Barker, <a href="http://www.infoglide.com/">Infoglide</a> Senior VP &amp; Chief Marketing Officer</p>
<p>Civil lawsuits often result in discovery orders from the court to produce every shred of possibly relevant internal communication. The need to comprehend patterns across the resulting vast amount of aggregated data is critical. To help organizations respond to these demands, powerful <a href="http://en.wikipedia.org/wiki/E-discovery">e-discovery</a> software systems (e.g., see <a href="http://www.storediq.com/index.aspx">StoredIQ</a>) create data topology maps that identify the relationships between active sources of multiple forms of electronically stored information (<a href="http://en.wikipedia.org/wiki/Electronically_Stored_Information">ESI</a>).</p>
<p>Reading about how it works made me speculate, “What value might identity resolution bring to e-discovery?” <a href="http://en.wikipedia.org/wiki/Identity_resolution">Identity resolution</a> (AKA “entity resolution”) technology has been used to create solutions for a wide range of problems. Most often, this involves creating an understanding about people and their hidden relationships with other people and organizations. <a href="http://identityresolutiondaily.com/521/dateline-nbc-on-lottery-fraud/">Lottery retailer fraud</a>, <a href="http://identityresolutiondaily.com/501/secure-flight-and-identity-resolution/">airline passenger screening</a>, and <a href="http://www.identityresolutiondaily.com/459/will-workers-comp-employer-fraud-keep-rising/">workers’ compensation</a> are just a few examples of areas that have benefited from applying <a href="http://identityresolutiondaily.com/500/tdwi-interview-identity-resolution-reveals/">this emerging technology</a>.</p>
<p>Since lawsuits revolve around people, it shouldn’t be surprising that technology capturing enriched information about the identities of the actors involved in the suit could greatly illuminate what’s known about the parties involved. For example, imagine if an augmentation of e-discovery with identity resolution could do two things for each person involved in the suit:</p>
<ol>
<li>Automate detection of hidden relationships between the participants and other relevant players by drawing from multiple public and private data sources; and</li>
<li>Generate link analyses, including graphical depictions involving the participants and other “entities” like conversation threads, that greatly enrich the litigants’ understanding.</li>
</ol>
<p>Caveat: I’m not an e-discovery expert. However, based on what we know about identity resolution, I wonder whether it could enhance the results of the e-discovery process.</p>
<p>If you have knowledge and experience in the space, I’d like to hear your thoughts.</p>
<p class="akst_link"><a href="http://identityresolutiondaily.com/?p=645&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_645" class="akst_share_link" rel="nofollow">Share This</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/645/enriching-e-discovery-results-with-identity-resolution/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/645/enriching-e-discovery-results-with-identity-resolution/</feedburner:origLink></item>
		<item>
		<title>Identity Resolution Daily Links 2009-10-26</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/Wyo-jz9CNVo/</link>
		<comments>http://identityresolutiondaily.com/644/identity-resolution-daily-links-2009-10-26/#comments</comments>
		<pubDate>Mon, 26 Oct 2009 18:56:28 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Law Enforcement]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Data Warehousing]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Cloud Computing]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Workers Compensation Fraud]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Lottery Fraud]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>amazon web services</category><category>cloud computing</category><category>daily link posts</category><category>Dan Woods</category><category>data quality pro</category><category>data quality software</category><category>data warehouse</category><category>data warehousing</category><category>entity analytics</category><category>entity matching</category><category>entity resolution</category><category>entity resolution and analysis</category><category>Evolved Technologist</category><category>Gartner</category><category>google app engine</category><category>identity resolution and analytics</category><category>identity matching</category><category>identity resolution</category><category>identity resolution and analytics</category><category>identity resolution daily</category><category>Infoglide</category><category>infoglide software</category><category>law enforcement</category><category>lotteries</category><category>lottery</category><category>lottery fraud</category><category>lottery retailer fraud</category><category>lottery ticket theft</category><category>lotto</category><category>lotto ticket</category><category>new york state insurance department</category><category>new york state insurance fund</category><category>NYSIF</category><category>salesforce.com</category><category>TDWI</category><category>the data warehousing institute</category><category>workers comp</category><category>workers compensation fraud</category><category>workers comp fraud</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/644/identity-resolution-daily-links-2009-10-26/</guid>
		<description><![CDATA[By the Infoglide Team
Come by and see us at TDWI World in Orlando Nov. 3 &#38; 4!
Forbes.com: Who Is In Charge Of Your Data?
[Dan Woods] &#8220;But in most companies, no single person is charged with the task of making sure that the right data is being captured in an efficient way that ensures data quality. [...]]]></description>
			<content:encoded><![CDATA[<p>By the <a href="http://www.infoglide.com/">Infoglide</a> Team</p>
<p><em>Come by and see us at <a href="http://www.tdwi.org/orlando2009/">TDWI World</a> in Orlando Nov. 3 &amp; 4!</em></p>
<p><a href="http://www.forbes.com/2009/10/19/software-control-enterprise-technology-cio-network-data.html">Forbes.com: Who Is In Charge Of Your Data?</a></p>
<blockquote><p>[<a href="http://www.evolvedtechnologist.com/about-dan-woods">Dan Woods</a>] &#8220;But in most companies, no single person is charged with the task of making sure that the right data is being captured in an efficient way that ensures <a href="http://en.wikipedia.org/wiki/Data_quality">data quality</a>. <a href="http://www.tdwi.org/">The Data Warehousing Institute</a> estimated the annual cost of poor data quality at $600 billion in 2002. Other studies have produced similar estimates.&#8221;</p></blockquote>
<p><a href="http://www.statesman.com/news/content/news/stories/local/2009/10/21/1021lottoscam.html">Austin American Statesman: Clerk accused of absconding with lottery cash</a></p>
<blockquote><p>&#8220;So when the 25-year-old quit his job at the convenience store and claimed a $1 million lottery jackpot in Austin, Joshi&#8217;s co-workers were suspicious and told investigators, the affidavit said. Those investigators now believe that in May, after a regular customer brought in his <a href="http://en.wikipedia.org/wiki/Texas_lottery">lottery</a> tickets and asked Joshi to check if they were winners, <a href="http://identityresolutiondaily.com/521/dateline-nbc-on-lottery-fraud/">Joshi kept the winning ticket</a>, did not tell the customer and claimed the prize for himself, according to the affidavit and Travis County Assistant District Attorney Patty Robertson.&#8221;</p></blockquote>
<p><a href="http://www.hartfordbusiness.com/news10616.html">Hartford Business: State Recommits To Fighting Shadow Labor</a></p>
<blockquote><p>&#8220;The <a href="http://wcc.state.ct.us/">state board</a> charged with cracking down on employers who fail to pay employee taxes and workers’ compensation premiums will meet on Nov. 5, following a 10-month hiatus.&#8221;</p></blockquote>
<p><a href="http://news.cnet.com/8301-30685_3-10378782-264.html">cnet news: Gartner: Brace yourself for cloud computing</a></p>
<blockquote><p> &#8220;<a href="http://en.wikipedia.org/wiki/Cloud_computing">Cloud computing</a> takes several forms, from the nuts and bolts of <a href="http://en.wikipedia.org/wiki/Amazon_Web_Services">Amazon Web Services</a> to the more finished foundation of <a href="http://en.wikipedia.org/wiki/Google_App_Engine">Google App Engine</a> to the full-on application of <a href="http://en.wikipedia.org/wiki/Salesforce.com">Salesforce.com</a>. Companies should figure out what if any of those approaches are most suited to their challenges, <a href="http://en.wikipedia.org/wiki/Gartner">Gartner</a> said.&#8221;</p></blockquote>
<p class="akst_link"><a href="http://identityresolutiondaily.com/?p=644&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_644" class="akst_share_link" rel="nofollow">Share This</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/644/identity-resolution-daily-links-2009-10-26/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/644/identity-resolution-daily-links-2009-10-26/</feedburner:origLink></item>
		<item>
		<title>Identity Resolution Daily Links 2009-10-23</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/RIWo21Kdsv4/</link>
		<comments>http://identityresolutiondaily.com/643/identity-resolution-daily-links-2009-10-23/#comments</comments>
		<pubDate>Fri, 23 Oct 2009 20:06:05 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[EHR]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[EMPI]]></category>

		<category><![CDATA[Healthcare]]></category>

		<category><![CDATA[EMR]]></category>

		<category><![CDATA[Cloud Computing]]></category>

		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Privacy]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Workers Compensation Fraud]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>cloud computing</category><category>daily link posts</category><category>david blumenthal</category><category>EHR</category><category>electronic health records</category><category>electronic medical records</category><category>EMPI</category><category>emr</category><category>entity analytics</category><category>entity matching</category><category>entity resolution</category><category>entity resolution and analysis</category><category>Gartner</category><category>healthcare</category><category>identity resolution</category><category>identity resolution Identity Resolution Engine identity resolution and analytics</category><category>identity resolution daily</category><category>Identity Resolution Engine</category><category>Infoglide</category><category>infoglide software</category><category>name matching</category><category>privacy</category><category>ramesh menon</category><category>Risk and Insurance</category><category>Steve Tuckey</category><category>workers comp</category><category>workers compensation fraud</category><category>workers comp fraud</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/643/identity-resolution-daily-links-2009-10-23/</guid>
		<description><![CDATA[[Post from Infoglide] Measuring Entity Resolution Accuracy
&#8220;In the last post we looked at the problem of comparing two entity resolution (ER) outcomes.  If S represents a list of entity references, then the effect of applying an ER process is to divide S into subsets where each subset comprises all of the references to the same [...]]]></description>
			<content:encoded><![CDATA[<p>[Post from <a href="http://www.infoglide.com/">Infoglide</a>] <a href="http://identityresolutiondaily.com/642/measuring-entity-resolution-accuracy/">Measuring Entity Resolution Accuracy</a></p>
<blockquote><p>&#8220;In the last post we looked at the problem of comparing two <a href="http://en.wikipedia.org/wiki/Identity_resolution">entity resolution</a> (ER) outcomes.  If S represents a list of entity references, then the effect of applying an ER process is to divide S into subsets where each subset comprises all of the references to the same entity.&#8221;</p></blockquote>
<p><a href="http://www.cloudave.com/link/gartner-says-cloud-computing-is-the-top-technology-trend-in-2010">Cloud Avenue: Gartner Says Cloud Computing Is The Top Technology Trend In 2010 </a></p>
<blockquote><p>&#8220;Compared to the beginning of 2009, the <a href="http://en.wikipedia.org/wiki/Cloud_computing">cloud computing</a> landscape now is very different with a huge potential to change the face of IT forever.&#8221;</p></blockquote>
<p><a href="http://www.ihealthbeat.org/Articles/2009/10/16/Blumenthal-Officials-Working-To-Boost-EHR-Connectivity-Security.aspx">iHealthBeat: Blumenthal: Officials Working To Boost EHR Connectivity, Security</a></p>
<blockquote><p>&#8220;<a href="http://en.wikipedia.org/wiki/Office_of_the_National_Coordinator_for_Health_Information_Technology">Blumenthal</a> also addressed concerns about whether <a href="http://en.wikipedia.org/wiki/Electronic_health_records">EHR</a> systems would compromise the privacy and security of personal health data. He said regulations are in place to ensure that any health data used for research purposes are stripped of all individually identifiable information.&#8221;</p></blockquote>
<p><a href="http://blogs.informatica.com/perspectives/index.php/2009/10/22/data-sharing-and-privacy-eternally-opposed/">Informatica Blog: Data Sharing and Privacy - Eternally Opposed?</a></p>
<blockquote><p>&#8220;Nevertheless, the risks to privacy from data breaches and concerns about government access to vast stores of private citizen information continue to be recurring themes in today&#8217;s security environment. But do the benefits of complete and actionable data always conflict with the desire to secure and maintain privacy?&#8221;</p></blockquote>
<p><a href="http://www.workerscompinsider.com/archives/001125.html">Workers&#8217; Comp Insider: Fraud is on the rise</a></p>
<blockquote><p>&#8220;Steve Tuckey is currently writing an in-depth series on fraud for <a href="http://www.riskandinsurance.com/">Risk and Insurance</a>. The first installment, Transparency of Evidence, deals with fraud by doctors, hospitals and other healthcare professionals. He notes that &#8216;grayer areas of so-called abuse or overutilization continue to vex payers, insurance companies and lawmakers eager to maintain the financial stability and integrity of <a href="http://en.wikipedia.org/wiki/Workers%27_compensation">the system</a> that has protected workers for nearly a century.&#8217;&#8221;</p></blockquote>
<p class="akst_link"><a href="http://identityresolutiondaily.com/?p=643&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_643" class="akst_share_link" rel="nofollow">Share This</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/643/identity-resolution-daily-links-2009-10-23/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/643/identity-resolution-daily-links-2009-10-23/</feedburner:origLink></item>
		<item>
		<title>Measuring Entity Resolution Accuracy</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/HzdjKgJJWrk/</link>
		<comments>http://identityresolutiondaily.com/642/measuring-entity-resolution-accuracy/#comments</comments>
		<pubDate>Wed, 21 Oct 2009 20:59:48 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[Deduplication]]></category>

		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Identity Resolution]]></category>
<category>data matching</category><category>Data Quality and Record Linkage Techniques</category><category>deduplication</category><category>entity analytics</category><category>entity matching</category><category>entity resolution</category><category>entity resolution and analysis</category><category>ERIQ</category><category>Fellegi Sunter Model</category><category>identity matching</category><category>identity resolution</category><category>identity resolution and analytics</category><category>identity resolution daily</category><category>Infoglide</category><category>infoglide software</category><category>john talburt</category><category>name matching</category><category>Rand Index</category><category>Talburt Wang Index</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/642/measuring-entity-resolution-accuracy/</guid>
		<description><![CDATA[By John Talburt, PhD, CDMP, Director, UALR Laboratory for Advanced Research in Entity Resolution and Information Quality (ERIQ)
In the last post we looked at the problem of comparing two entity resolution (ER) outcomes.  If S represents a list of entity references, then the effect of applying an ER process is to divide S into subsets [...]]]></description>
			<content:encoded><![CDATA[<p>By <a href="http://ifsc.ualr.edu/jrtalburt/">John Talburt</a>, PhD, CDMP, Director, UALR Laboratory for Advanced Research in Entity Resolution and Information Quality (<a href="http://technologize.ualr.edu/eriq/">ERIQ</a>)</p>
<p>In the last post we looked at the problem of comparing two entity resolution (<a href="http://en.wikipedia.org/wiki/Entity_resolution">ER</a>) outcomes.  If S represents a list of entity references, then the effect of applying an ER process is to divide S into subsets where each subset comprises all of the references to the same entity.  More formally this is called a “partition” of S.  A partition of a set S is simply a collection of non-empty, non-overlapping subsets of S that contain all of the elements of S.  In other words it is a way to divide S into subsets so that every element of S is in one, and only one, of the subsets.</p>
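<p>A minimal Python sketch of this definition follows. It is an illustration only: the function name <code>is_partition</code> and the reference IDs are hypothetical, and references are assumed to be hashable values such as record IDs.</p>

```python
from itertools import chain

def is_partition(subsets, s):
    """Return True if `subsets` is a partition of the set `s`."""
    subsets = [set(block) for block in subsets]
    # Every block must be non-empty.
    if any(len(block) == 0 for block in subsets):
        return False
    members = list(chain.from_iterable(subsets))
    # No overlap: the total member count equals the number of distinct members.
    # Full coverage: the distinct members are exactly the elements of s.
    return len(members) == len(set(members)) and set(members) == set(s)

refs = {"r1", "r2", "r3", "r4", "r5"}  # hypothetical reference IDs
print(is_partition([{"r1", "r2"}, {"r3"}, {"r4", "r5"}], refs))        # True
print(is_partition([{"r1", "r2"}, {"r2", "r3"}, {"r4", "r5"}], refs))  # False: r2 appears twice
```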
<p>By viewing ER outcomes as partitions of the underlying set of references, the problem of comparing outcomes translates into the problem of comparing two partitions of the same list S.  As pointed out in the last post, there are several methods for making these comparisons including the <a href="http://en.wikipedia.org/wiki/Rand_index">Rand Index</a> and the <a href="http://www.igi-global.com/downloads/excerpts/1599040263ch1.pdf">Talburt-Wang Index</a>.</p>
<p>This contrasts with the traditional view of evaluating record linking in terms of “merging” or “<a href="http://en.wikipedia.org/wiki/Data_deduplication">de-duplicating</a>” two lists of records.  The book <a href="http://www.amazon.com/Data-Quality-Record-Linkage-Techniques/dp/0387695028/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1256148810&amp;sr=8-1">Data Quality and Record Linkage Techniques</a> by Herzog, et al. provides a great overview of this treatment of resolution.  The list-versus-list approach focuses on analyzing the set of all possible pairs of records that can be formed between the two lists.  For example, if List A has 80 records and List B has 100 records, there would be 8,000 (80 x 100) possible record pairs in which the first record comes from List A, and the second from List B.  However, most of the analytical techniques based on this approach, such as the <a href="http://www.jstor.org/pss/2286061">Fellegi-Sunter Model</a>, start with the assumption that each one of the lists does not have any internal duplication.  This is a convenient, but often unrealistic assumption to make when working with large lists, especially those from external providers.</p>
<p>When the ER outcome problem is cast in terms of merging two lists, ER accuracy can be viewed in terms of precision and recall, measures borrowed from information retrieval.  Each record in List A can be thought of as a query into List B.  The precision of that query would be the ratio of the correct links it makes with records in List B to the total number of links it makes with records in List B.  Similarly its recall would be the ratio of its correct links with records in List B to the total number of records in B that it should be linked with.  By extending these measures over all the records in List A, it is possible to define an overall precision and recall measure for the linkage between the two lists.</p>
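<p>One possible way to extend these measures over all of List A is to pool every link into a single set of (A record, B record) pairs. The Python sketch below assumes that pooling, which is only one of several reasonable choices; the name <code>precision_recall</code>, the record IDs, and the convention of returning 0.0 when a denominator is empty are all hypothetical.</p>

```python
def precision_recall(predicted_links, true_links):
    """Pooled precision and recall over (a_id, b_id) link pairs."""
    predicted, truth = set(predicted_links), set(true_links)
    correct = len(predicted & truth)  # links that were made and are correct
    # Assumed convention: report 0.0 when there is nothing to divide by.
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(truth) if truth else 0.0
    return precision, recall

# Hypothetical links between records of List A (a*) and List B (b*).
predicted = {("a1", "b1"), ("a1", "b2"), ("a2", "b3")}
truth = {("a1", "b1"), ("a2", "b3"), ("a3", "b4")}
p, r = precision_recall(predicted, truth)
print(p, r)  # 2 of 3 predicted links are correct; 2 of 3 true links were found
```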
<p>My preference, however, is to simply view List A and List B as forming a combined list in which linking can take place, not only between the records of A and B, but also internally between records within A and within B.  In my opinion this is a better reflection of what is usually done in the processing of real list files.  The records from two or more lists, or at least the identifying attributes from the records, are standardized into a common file format and combined into a single list.  An ER process is then performed on the combined list leading to its partition into subsets as described above.</p>
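<p>To make the difference between the two views concrete, the short Python sketch below counts candidate record pairs for the 80-record and 100-record lists used in the earlier example. The function name <code>pair_counts</code> is hypothetical, and <code>math.comb</code> requires Python 3.8 or later.</p>

```python
from math import comb

def pair_counts(size_a, size_b):
    """Count candidate record pairs across two lists and within the combined list."""
    cross = size_a * size_b                     # one record from A, one from B
    within = comb(size_a, 2) + comb(size_b, 2)  # both records from the same list
    combined = comb(size_a + size_b, 2)         # any two records of the combined list
    return cross, within, combined

cross, within, combined = pair_counts(80, 100)
print(cross, within, combined)  # 8000 8110 16110
```

<p>The combined list adds 8,110 within-list pairs to the 8,000 cross-list pairs, which are exactly the pairs that a no-internal-duplication assumption leaves unexamined.</p>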
<p>If the correct partition of a list of references is known, then the accuracy of a given ER process acting on that list can be represented as the value of the similarity index (e.g., Rand or T-W) obtained by comparing the partition generated by the ER process to the correct partition.  Partition similarity indices are designed to take values from 0 to 1.  Values closer to 0 indicate less similarity, and values closer to 1 indicate closer similarity, with the value equal to 1 if and only if the two partitions are identical.</p>
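<p>As an illustration, the Rand Index can be sketched in a few lines of Python using its standard pair-counting definition: the fraction of reference pairs on which the two partitions agree, either by linking the pair in both or by keeping it apart in both. The function name <code>rand_index</code> and the sample partitions are hypothetical, and both inputs are assumed to be valid partitions of the same references.</p>

```python
from itertools import combinations

def rand_index(partition_a, partition_b):
    """Rand index of two partitions of the same set of references."""
    # Map each reference to the index of the block that contains it.
    label_a = {ref: i for i, block in enumerate(partition_a) for ref in block}
    label_b = {ref: i for i, block in enumerate(partition_b) for ref in block}
    if set(label_a) != set(label_b):
        raise ValueError("partitions must cover the same references")
    pairs = list(combinations(sorted(label_a), 2))
    if not pairs:
        return 1.0  # zero or one reference: the partitions are trivially identical
    # A pair "agrees" when both partitions link it or both keep it apart.
    agree = sum(
        (label_a[x] == label_a[y]) == (label_b[x] == label_b[y])
        for x, y in pairs
    )
    return agree / len(pairs)

correct = [{"r1", "r2"}, {"r3"}, {"r4", "r5"}]    # hypothetical correct partition
er_output = [{"r1", "r2", "r3"}, {"r4"}, {"r5"}]  # hypothetical ER outcome
print(rand_index(correct, er_output))  # 0.7 (7 of the 10 pairs agree)
```

<p>This brute-force version examines every pair of references, so it is suited to small benchmark samples rather than full production lists.</p>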
<p>Whether you are using precision and recall measures or a partition similarity measure of accuracy, both require knowing the correct links.  Of course, if we knew all of the correct links, we wouldn’t need the ER process to begin with.  In general, we only know the correct links for some sample of the references that we are dealing with.</p>
<p>When the entities are people, e.g. customers, obtaining even a relatively small sample of records with the correct links can be difficult.  My experience is that organizations generally do this in three ways: inspection by domain experts, information volunteered by employees, or telemarketing confirmation.</p>
<p>A random selection of records for inspection can be useful, but it is biased toward true positive linking and has little value in detecting false negatives.  An expert might determine that “Jaems Doe on Main St” should link to “James Doe on Main St”, but is unlikely to determine that “Mary Doe on Main St” should link to “Mary Smith on Elm St” without prior knowledge that these are the same customer (or the presence of additional attributes besides “name”).</p>
<p>Employees and their families are often called upon to volunteer benchmark data for linking.  Because this represents an internal view of identity, it can be very rich and replete with prior and alternate names and addresses, dates-of-birth, and other biographical information.  However, unless the company is very diverse, the benchmark will only address a very narrow population demographic.</p>
<p>Perhaps the most unbiased sample is that obtained by a third-party telemarketing firm.  While this approach can reach a broad sample of the population with varying demographics, it is the most expensive of the three options, and without some internal validation, may not be as accurate as the others.</p>
<p>Next time,  I will continue the discussion of entity resolution metrics. In the meantime, your thoughts are welcome.</p>
<p class="akst_link"><a href="http://identityresolutiondaily.com/?p=642&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_642" class="akst_share_link" rel="nofollow">Share This</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/642/measuring-entity-resolution-accuracy/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/642/measuring-entity-resolution-accuracy/</feedburner:origLink></item>
		<item>
		<title>Identity Resolution Daily Links 2009-10-19</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/yLtwCMdP_4M/</link>
		<comments>http://identityresolutiondaily.com/641/identity-resolution-daily-links-2009-10-19/#comments</comments>
		<pubDate>Mon, 19 Oct 2009 18:26:26 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[EHR]]></category>

		<category><![CDATA[EMPI]]></category>

		<category><![CDATA[Data Profiling]]></category>

		<category><![CDATA[EMR]]></category>

		<category><![CDATA[Healthcare]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Data Governance]]></category>

		<category><![CDATA[Master Data Management]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Data Management]]></category>

		<category><![CDATA[Fusion Center]]></category>

		<category><![CDATA[Data Matching]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>BAM INTEL</category><category>daily link posts</category><category>data governance</category><category>data management</category><category>data matching</category><category>data profiling</category><category>data quality</category><category>data quality software</category><category>Department of Homeland Security</category><category>DHS</category><category>EHR</category><category>electronic health records</category><category>electronic medical records</category><category>EMPI</category><category>emr</category><category>entity analytics</category><category>entity matching</category><category>entity resolution</category><category>entity resolution and analysis</category><category>fusion center</category><category>fusion centers</category><category>fusion center network</category><category>healthcare</category><category>health information exchange</category><category>identity resolution and analytics</category><category>identity matching</category><category>identity resolution</category><category>identity resolution daily</category><category>Infoglide</category><category>infoglide software</category><category>John Kalogirou</category><category>master data management</category><category>MDM</category><category>Michael Dowling</category><category>name matching</category><category>steve sarsfield</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/641/identity-resolution-daily-links-2009-10-19/</guid>
		<description><![CDATA[By the Infoglide Team
information management: Multi-Entity MDM Enablement
&#8220;Most efforts, however, are executed in surroundings inhibited by existing infrastructure (legacy applications, tools, hardware and integration), dispersed organizational structures and suboptimal processes. This reality introduces challenges in architecting and deploying efficient and effective multi-entity MDM solutions.&#8221;
BAM INTEL: BAM&#8217;s Thinking on the New DHS Standards
&#8220;Public Fusion Centers must [...]]]></description>
			<content:encoded><![CDATA[<p>By the <a href="http://www.infoglide.com/">Infoglide</a> Team</p>
<p><a href="http://www.information-management.com/specialreports/2009_166/master_data_management_mdm_pim_cdi_governance_enablement-10016215-1.html">information management: Multi-Entity MDM Enablement</a></p>
<blockquote><p>&#8220;Most efforts, however, are executed in surroundings inhibited by existing infrastructure (legacy applications, tools, hardware and integration), dispersed organizational structures and suboptimal processes. This reality introduces challenges in architecting and deploying efficient and effective multi-entity <a href="http://en.wikipedia.org/wiki/Master_data_management">MDM</a> solutions.&#8221;</p></blockquote>
<p><a href="http://bamintel.blogspot.com/2009/10/bam-thinking-on-new-dhs-standards.html">BAM INTEL: BAM&#8217;s Thinking on the New DHS Standards</a></p>
<blockquote><p>&#8220;Public <a href="http://en.wikipedia.org/wiki/Fusion_centers">Fusion Centers</a> must be seen by citizens and policy-makers to play a direct role in the response to disasters as well as intelligence gathering. They cannot remain in the intelligence-sharing role only and not take some of the spotlight when their good work prevents or lessens the impact of America’s next disaster.&#8221;</p></blockquote>
<p><a href="http://www.newsday.com/opinion/opinion-revolution-right-in-your-doctor-s-hand-1.1527199">newsday.com: OPINION: Revolution right in your doctor&#8217;s hand</a></p>
<blockquote><p>&#8220;For doctors and their patients (in other words, all of us), the <a href="http://en.wikipedia.org/wiki/Electronic_health_record">electronic health record</a> is a far more revolutionary idea than those that brought us the ability to download a song, post a video online or read and send e-mails when you&#8217;re on a camping trip. While those other innovations indirectly enhance the quality of life, they are designed for entertainment or business purposes. The EHR directly improves quality of life because the end result of its design is better health.&#8221;</p></blockquote>
<p><a href="http://smartdatacollective.com/Home/21892">SmartData Collective: Data May Require Unique Data Quality Processes</a></p>
<blockquote><p>&#8220;All <a href="http://en.wikipedia.org/wiki/Data_quality">data quality</a> projects can appear the same from afar but ultimately can be as different as stars and planets. One of the biggest ways they vary is in the data itself and whether it is chiefly made up of name and address data or some other type of data.&#8221;</p></blockquote>
<p class="akst_link"><a href="http://identityresolutiondaily.com/?p=641&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_641" class="akst_share_link" rel="nofollow">Share This</a>
</p>]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/641/identity-resolution-daily-links-2009-10-19/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/641/identity-resolution-daily-links-2009-10-19/</feedburner:origLink></item>
		<item>
		<title>Identity Resolution Daily Links 2009-10-16</title>
		<link>http://feedproxy.google.com/~r/identityresolutiondaily/VoRE/~3/UwCR2dfeGsY/</link>
		<comments>http://identityresolutiondaily.com/640/identity-resolution-daily-links-2009-10-16/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 19:16:23 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
		<category><![CDATA[Name Matching]]></category>

		<category><![CDATA[Law Enforcement]]></category>

		<category><![CDATA[Entity Analytics]]></category>

		<category><![CDATA[Infoglide]]></category>

		<category><![CDATA[Entity Resolution]]></category>

		<category><![CDATA[EHR]]></category>

		<category><![CDATA[Anonymous Identity Resolution]]></category>

		<category><![CDATA[Identity Management]]></category>

		<category><![CDATA[Identity Matching]]></category>

		<category><![CDATA[EMR]]></category>

		<category><![CDATA[Fusion Center]]></category>

		<category><![CDATA[Workers Compensation Fraud]]></category>

		<category><![CDATA[Privacy]]></category>

		<category><![CDATA[Homeland Security]]></category>

		<category><![CDATA[Federal Government]]></category>

		<category><![CDATA[National Security]]></category>

		<category><![CDATA[Identity Resolution]]></category>

		<category><![CDATA[Security]]></category>

		<category><![CDATA[Master Data Management]]></category>

		<category><![CDATA[Mistaken Identity Resolution]]></category>

		<category><![CDATA[Entity Resolution and Analysis]]></category>

		<category><![CDATA[Daily Link Posts]]></category>
<category>Department of Homeland Security</category><category>DHS</category><category>EHR</category><category>electronic health records</category><category>electronic medical records</category><category>emr</category><category>EU</category><category>European Union</category><category>fusion center</category><category>fusion center network</category><category>Larry Dubov</category><category>master data management</category><category>MDM</category><category>new york state insurance fund</category><category>NYSIF</category><category>Project Indect</category><category>workers comp</category><category>workers compensation fraud</category><category>workers comp fraud</category>
		<guid isPermaLink="false">http://identityresolutiondaily.com/640/identity-resolution-daily-links-2009-10-16/</guid>
		<description><![CDATA[[Post from Infoglide] Avoiding False Positives: Analytics or Humans?
&#8220;The European Union recently started a five-year research program in conjunction with its expanding role in fighting crime and terrorism. The purpose of Project Indect is to develop advanced analytics that help monitor human activity for &#8216;automatic detection of threats and abnormal behaviour and violence.&#8217; Naturally, the [...]]]></description>
			<content:encoded><![CDATA[<p>[Post from <a href="http://www.infoglide.com/">Infoglide</a>] <a href="http://identityresolutiondaily.com/639/avoiding-false-positives-analytics-or-humans/">Avoiding False Positives: Analytics or Humans?</a></p>
<blockquote><p>&#8220;The European Union recently started a five-year research program in conjunction with its expanding role in fighting crime and terrorism. The purpose of Project Indect is to develop advanced analytics that help monitor human activity for &#8216;automatic detection of threats and abnormal behaviour and violence.&#8217; Naturally, the project has drawn suspicion and criticism, both from those who oppose the growing power of the EU and from watchdog groups concerned about encroachments into privacy and civil liberty&#8230;&#8221;</p></blockquote>
<p><a href="http://sdtimes.com/GUEST_VIEW_OLD_THINKING_DOES_A_DISSERVICE_TO_NEW_DATA_HUBS/By_LARRY_DUBOV/About_DATABASES/33828">SDTimes: Old thinking does a disservice to new data hubs</a></p>
<blockquote><p>&#8220;The enterprise needs to be able to understand the origin, the time and possibly the reason for a change. These audit needs must be supported by the data hub at the attribute level. <a href="http://en.wikipedia.org/wiki/Master_data_management">MDM</a> solutions that maintain the golden record dynamically address this need by supporting the history of changes in the source systems record content.&#8221;</p></blockquote>
<p><a href="http://blog.accision.com/?p=95">Accision Health Blog: Surveys Show Importance of EHR</a></p>
<blockquote><p>&#8220;A new Rand study is one of the first to link the use of <a href="http://en.wikipedia.org/wiki/Electronic_health_records">electronic health records</a> in community-based medical practices with higher quality of care. <a href="http://en.wikipedia.org/wiki/Rand_corporation">Rand Corporation</a> researchers found in a study of 305 groups of primary care physicians that the routine use of multifunctional EHRs was more likely to be linked to higher quality care than other common strategies, such as structural changes used for improving care.&#8221;</p></blockquote>
<p><a href="http://ww3.nysif.com/EyebrowPages/AboutNYSIF/NYSIFNews/2009/Central%20NY%20Contractor%20Hit%20with%20Workers%20Comp%20Fraud%20Charges.aspx">NYSIF: Central NY Contractor Hit with Workers Comp Fraud Charges</a></p>
<blockquote><p>&#8220;Investigators said Mr. Decker previously had an insurance policy with <a href="http://ww3.nysif.com/EyebrowPages/AboutNYSIF.aspx">NYSIF</a> when he operated RD Builders in November 2005, a policy cancelled for non-payment a few months later. In 2008, he applied to NYSIF’s Syracuse office for <a href="http://en.wikipedia.org/wiki/Workers_compensation">workers’ compensation insurance</a> doing business as Bull Rock Development, Inc.&#8221;</p></blockquote>
<p><a href="http://www.publicintelligence.net/office-of-intelligence-and-analysis-dhs/">public intelligence: Office of Intelligence and Analysis (DHS)</a></p>
<blockquote><p>&#8220;These entities are unified under local <a href="http://en.wikipedia.org/wiki/Fusion_centers">fusion centers</a>, which provide state and local officials with intelligence products while simultaneously gathering information for federal sources. As of July 2009, there were 72 designated fusion centers around the country with 36 field representatives deployed. The Department has provided more than $254 million from FY 2004-2007 to state and local governments to support the centers.&#8221;</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://identityresolutiondaily.com/640/identity-resolution-daily-links-2009-10-16/feed/</wfw:commentRss>
		<feedburner:origLink>http://identityresolutiondaily.com/640/identity-resolution-daily-links-2009-10-16/</feedburner:origLink></item>
	</channel>
</rss>
