<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:blogger='http://schemas.google.com/blogger/2008' xmlns:georss='http://www.georss.org/georss' xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-6719829</id><updated>2026-04-04T19:56:15.090+00:00</updated><title type='text'>Rick On the Road</title><subtitle type='html'>Reflections on the monitoring and evaluation of development aid projects, programmes and policies, and development of organisation&#39;s capacity to do the same. This blog also functions as the &lt;b&gt;Editorial section of the MandE NEWS website&lt;/b&gt; - see the Links section below on the left.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default?alt=atom'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default?alt=atom&amp;start-index=26&amp;max-results=25'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>132</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-6719829.post-2266877393149368710</id><published>2026-03-28T12:48:00.005+00:00</published><updated>2026-03-28T16:20:52.949+00:00</updated><title type='text'>When rankings tell different stories: an introduction to the Rank Explorer</title><content type='html'>&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;em&gt;&lt;/em&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;em&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgZQDmJ3vlPJuUfFKNncxYkkWks_JYUYB-WOMZBSOSknVnZMNMhfUhKuc327gQgV4wsCHDKZQX8dRAWF3tlAvvg1gBEreENSqHbljocLoeJCBHUFr4ISu4SnrjG_b5Sc4axYjjuqdvN8TQSuWqbNPInZdvEuOns4ZfWzKXj4bhKq9D1vr_6QELt/s1400/Rank%20aggregatir.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;682&quot; data-original-width=&quot;1400&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgZQDmJ3vlPJuUfFKNncxYkkWks_JYUYB-WOMZBSOSknVnZMNMhfUhKuc327gQgV4wsCHDKZQX8dRAWF3tlAvvg1gBEreENSqHbljocLoeJCBHUFr4ISu4SnrjG_b5Sc4axYjjuqdvN8TQSuWqbNPInZdvEuOns4ZfWzKXj4bhKq9D1vr_6QELt/s16000/Rank%20aggregatir.png&quot; /&gt;&lt;/a&gt;&lt;/em&gt;&lt;/div&gt;&lt;em&gt;&lt;br /&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/em&gt;&lt;p&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;em&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Caveat Emptor: I delegated the writing of this blog posting to Claude AI (Sonnet 4.6), based on an extended prior dialogue on the subject below, then a summary prompt of what was wanted. My post-production edits were quite limited.&lt;/span&gt;&lt;/em&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;--o0o--&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There is a situation that turns up repeatedly in evaluation and research practice that is easy to overlook precisely because it looks like a data analysis problem rather than a methodological one. The situation is this: &lt;strong&gt;you have a set of cases that have been ranked on multiple factors, along with a ranking of their outcomes, and you want to understand what the relationship is between the factors and the outcome.&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This sounds straightforward. And to a degree it is — you can correlate each factor against the outcome, identify the strongest relationships, and build a composite ranking that aggregates them. These are useful things to do. But they share a common assumption that is not always warranted: that the factors combine additively, that each one contributes independently to the outcome, and that a case that scores well on several factors will therefore tend to score well overall.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;That assumption is often wrong. And when it is wrong, the gap between your composite ranking and the actual outcomes is telling you something important.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The problem with additive aggregation&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Consider a concrete example. You have 63 local authority areas. You have ranked them on ten factors thought to be associated with population-level physical activity — access to green space, deprivation levels, sports facility density, and others. You have also ranked them on an outcome measure. You build a composite ranking from the factors, correlate it with the outcome, and find it predicts reasonably well — perhaps a Spearman r of 0.75.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;That is a decent result. But it is hiding something. Some areas with strong factor rankings are performing poorly on the outcome; others with weak factor rankings are performing well. If you look closely, there are two or three quite different combinations of factors that each seem sufficient, on their own, to predict a good outcome. These are not variations on the same story — they are distinct causal pathways. However an additive composite &lt;b&gt;averages&lt;/b&gt; across them, and in doing so obscures the structure.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This is what researchers in the QCA tradition call &lt;strong&gt;equifinality&lt;/strong&gt; — multiple routes to the same outcome. Additive methods cannot find it. A decision tree can.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;What the Rank Explorer does&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The &lt;strong&gt;Rank Explorer&lt;/strong&gt; is a browser-based companion tool to The Ethnographic Explorer. It is designed to import the rankings data that TEE generates from its Contrast tab, though it will also accept ranking data from any other source in the same CSV format.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;[▶ &lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260328_TEE_RankExplorer_Vs20.html&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;Try the Rank Explorer&lt;/a&gt;]&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool has four analysis tabs. The first three — &lt;strong&gt;Individual Factors&lt;/strong&gt;, &lt;strong&gt;Composite Builder&lt;/strong&gt;, and &lt;strong&gt;Scatter Plot&lt;/strong&gt; — provide standard additive analysis: Spearman correlations for each factor, composite rankings using several aggregation methods (equal-weight, correlation-weighted, stepwise greedy, and exhaustive search), and a scatter plot that visualises how well your composite ranking predicts the outcome, with adjustable classification thresholds.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The fourth tab, &lt;strong&gt;Pathway Explorer&lt;/strong&gt;, is where the configurational logic comes in. It builds an optimal classification tree over your data using exhaustive search: at each node, every available factor and every possible rank cut-off is tested, and the split that best separates high-outcome from low-outcome cases is chosen. The result is a tree that shows which &lt;em&gt;specific combinations&lt;/em&gt; of factor ranks distinguish the two groups, displayed in a row-by-level icicle layout that makes the branching structure easy to follow.&lt;/span&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiCw4MlXKbVdwPq-Zth5eI66rO7qKin5EeDgpLi5oK1qB-y8CEqlGM4mUemuMJIp7P1lWEs25BWw-D40a4WlWghPt_JCXzuFFy1YEzu_hWIECBfATLAei-jHMT_QsrcA5nw4s8gsnHAJGN_YYyncNeo3F8VGTWulH9pAdwIT8IUt216gj0AImjJ/s1572/trees%20vs3.png&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;420&quot; data-original-width=&quot;1572&quot; height=&quot;170&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiCw4MlXKbVdwPq-Zth5eI66rO7qKin5EeDgpLi5oK1qB-y8CEqlGM4mUemuMJIp7P1lWEs25BWw-D40a4WlWghPt_JCXzuFFy1YEzu_hWIECBfATLAei-jHMT_QsrcA5nw4s8gsnHAJGN_YYyncNeo3F8VGTWulH9pAdwIT8IUt216gj0AImjJ/w640-h170/trees%20vs3.png&quot; width=&quot;640&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;The tree is not just a visual. Each leaf node shows which cases ended up there, whether they were correctly classified, and the conditions that led to that grouping. A pathway summary below the tree lists the conditions for each leaf in plain language — for instance, &quot;Active travel infrastructure: rank 5 or better AND Deprivation index: rank 8 or better.&quot;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When the gap between additive and configurational results is itself a finding&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;One of the more useful diagnostics the tool enables is comparing the classification accuracy of the best composite ranking against that of the decision tree. If both are similar, the additive story is probably adequate. If the tree substantially outperforms the composite — reaching, say, 90% or 100% accuracy where the composite only reached 75% — that gap is a finding in itself: the causal structure in the data is better described by conjunctions of conditions than by sums of contributions.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This matters for intervention design. If a high-outcome classification requires &lt;em&gt;both&lt;/em&gt; good green space access &lt;em&gt;and&lt;/em&gt; good active travel infrastructure (rather than either being substitutable for the other), then improving one without the other may produce no discernible effect. Additive analysis will not surface that conclusion; configurational analysis will.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A note on scale, depth, and selective deepening&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tree-building algorithm uses exhaustive search, which is thorough but computationally intensive. With datasets of 60–70 cases and 10 factors, a depth-3 tree typically builds in a few seconds. Depth 4 or beyond is a different matter: computation time increases steeply, and more importantly, deeper trees on small datasets will often find spurious distinctions — patterns that reflect the quirks of the sample rather than anything real.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The Rank Explorer addresses this through a &lt;strong&gt;subgroup analysis&lt;/strong&gt; feature that may be a modest innovation in decision tree practice for small-N datasets. Once a tree has been built, each leaf node displays an &lt;em&gt;Analyse subgroup →&lt;/em&gt; button. Clicking it filters the dataset to only the cases in that leaf and opens a fresh analysis session for that group alone — with all four tabs, including the Pathway Explorer, reconfigured for the subgroup. The outcome cut-off resets automatically to the median of the subgroup&#39;s outcome ranks, so the high/low distinction remains balanced within the smaller group.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This allows selective deepening of a specific branch without rebuilding the entire tree at greater depth. If one leaf contains 24 cases that the main tree could not separate further, the subgroup analysis asks a different and analytically legitimate question: within this group, what distinguishes the relatively better-performing cases from the worse ones? The answer applies conditionally — only to cases that reached that leaf — but that conditionality is precisely what makes it interpretable. A banner remains visible throughout the subgroup session as a reminder that the high/low labels are relative to the subgroup, not the full dataset.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;For larger or more complex datasets, the &lt;strong&gt;stepwise greedy&lt;/strong&gt; method in the Composite Builder tab is a useful preliminary step: it adds factors to the composite one at a time, selecting whichever remaining factor most improves the correlation with the outcome at each step. The resulting path table shows the marginal contribution of each factor, making it straightforward to identify a smaller subset that carries most of the predictive weight — before running the Pathway Explorer on that reduced set.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Beyond TEE data&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool is designed as a TEE companion but is not restricted to it. Any CSV with a column of case names and a set of ranking columns will load correctly. Evaluation practitioners who have generated case rankings through other means — expert scoring panels, secondary data, peer comparison exercises — can use the same analytical workflow.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Some framings that could map onto the same tool:&lt;/span&gt;&lt;/p&gt;
&lt;ul class=&quot;[li_&amp;amp;]:mb-0 [li_&amp;amp;]:mt-1 [li_&amp;amp;]:gap-1 [&amp;amp;:not(:last-child)_ul]:pb-1 [&amp;amp;:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3&quot;&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Programme portfolios:&lt;/strong&gt; rank a set of projects on design-quality dimensions and an outcome measure, then identify which combinations of design features distinguish the most successful from the rest&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Organisational assessments:&lt;/strong&gt; rank a set of partner organisations on capability dimensions, use the tree to find which combinations are most predictive of delivery performance&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Cross-country comparison:&lt;/strong&gt; rank a set of countries or regions on contextual factors alongside a development indicator, and look for the configurational patterns that additive index approaches miss&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In each case the structure is the same: cases, factor rankings, an outcome ranking, and the question of what the relationship looks like once you stop assuming it is additive.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;An invitation to experiment&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool is best explored with data you already have. If you have ever built a composite index and felt that it was not quite capturing something you could see in the data, or have had the experience of an outlier case that your model consistently misclassifies, the Pathway Explorer is a reasonable next step. Loading your own data, building a tree at depth 2 or 3, and comparing the pathway classification against your composite should take no more than a few minutes.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I am continuing to develop both tools and would welcome feedback on the approach, the interface, or uses I have not considered.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Accessing the code:&lt;/strong&gt; The Rank Explorer runs entirely in your browser — no login required, no data is transmitted anywhere. To save your own copy of the code, open the tool, right-click, select View Page Source, copy the entire code, paste it into a text file, rename it to end in &lt;code class=&quot;bg-text-200/5 border border-0.5 border-border-300 text-danger-000 whitespace-pre-wrap rounded-[0.4rem] px-1 py-px text-[0.9rem]&quot;&gt;.html&lt;/code&gt; rather than &lt;code class=&quot;bg-text-200/5 border border-0.5 border-border-300 text-danger-000 whitespace-pre-wrap rounded-[0.4rem] px-1 py-px text-[0.9rem]&quot;&gt;.txt&lt;/code&gt;, and open it in any web browser.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Further reading&lt;/span&gt;&lt;/h3&gt;
&lt;ul class=&quot;[li_&amp;amp;]:mb-0 [li_&amp;amp;]:mt-1 [li_&amp;amp;]:gap-1 [&amp;amp;:not(:last-child)_ul]:pb-1 [&amp;amp;:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3&quot;&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Schneider, C. Q., &amp;amp; Wagemann, C. (2012). Set-Theoretic Methods for the Social Sciences: A Guide to Qualitative Comparative Analysis. Cambridge University Press. &lt;a href=&quot;https://doi.org/10.1017/CBO9781139004244&quot;&gt;https://doi.org/10.1017/CBO9781139004244&lt;/a&gt;&amp;nbsp;&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The Ethnographic Explorer, which generates the ranking data this tool is designed to analyse, is described in an earlier post: &lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;https://mandenews.blogspot.com/2026/03/making-implicit-knowledge-explicit.html&quot;&gt;Making implicit knowledge explicit&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;https://evalc3.net&quot;&gt;EvalC3&lt;/a&gt; — a related tool that uses configurational logic to classify cases against user-defined outcome criteria, with a different interface and methodology.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/2266877393149368710/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2026/03/when-rankings-tell-different-stories.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2266877393149368710'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2266877393149368710'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2026/03/when-rankings-tell-different-stories.html' title='When rankings tell different stories: an introduction to the Rank Explorer'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgZQDmJ3vlPJuUfFKNncxYkkWks_JYUYB-WOMZBSOSknVnZMNMhfUhKuc327gQgV4wsCHDKZQX8dRAWF3tlAvvg1gBEreENSqHbljocLoeJCBHUFr4ISu4SnrjG_b5Sc4axYjjuqdvN8TQSuWqbNPInZdvEuOns4ZfWzKXj4bhKq9D1vr_6QELt/s72-c/Rank%20aggregatir.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-6439146721180254351</id><published>2026-03-22T09:46:00.000+00:00</published><updated>2026-03-22T09:46:33.249+00:00</updated><title type='text'>Introducing Rank Order Counterfactuals (ROC)</title><content type='html'>&lt;p class=&quot;MsoNormal&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhODhiWfw1em7tzl3REMp_u0W6B3gItXhfQClSB9foFdqj9T-DnTF72C3Qt7kVn3V71gw_Yy331MC7Zvhyphenhyphen3ZXlstqf4Rt2hV_tHHdmvZ7ikvjebtGyLha9K5wd9XpeeJ2tpTNoTiIRxdLn7SQG_H0z9iF4pRFxhPR-JoBC7c_Ome2jP8az0-fIr/s549/counterfactual%20rankings.png&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;549&quot; data-original-width=&quot;531&quot; height=&quot;640&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhODhiWfw1em7tzl3REMp_u0W6B3gItXhfQClSB9foFdqj9T-DnTF72C3Qt7kVn3V71gw_Yy331MC7Zvhyphenhyphen3ZXlstqf4Rt2hV_tHHdmvZ7ikvjebtGyLha9K5wd9XpeeJ2tpTNoTiIRxdLn7SQG_H0z9iF4pRFxhPR-JoBC7c_Ome2jP8az0-fIr/w621-h640/counterfactual%20rankings.png&quot; width=&quot;621&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;p class=&quot;MsoNormal&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;A counterfactual is a description of what would have
happened, if an intervention had not taken place. The use of randomised control
groups is one way to construct a counterfactual. A population of people are
randomly assigned to either a control group or an intervention group.
Differences in the outcomes of those populations are then
compared.&amp;nbsp;If&amp;nbsp;the difference is sufficiently statistically significant
then a plausible causal claim can be made that difference in outcomes is
because of the intervention.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class=&quot;MsoNormal&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;As might be expected, there are plenty of circumstances when
social programs are designed and implemented, where it is simply not practical
to organise a randomised control group.&amp;nbsp;In addition, the comparisons that
are made will be between averages of the two groups. However, in many social
programmes such averages are of limited practical use, because the
implementation contexts are so varied and no single “solution” is likely to be
applicable. Average effects can still be informative at a high level, but they
need to be complemented by methods that take contextual diversity seriously.&lt;br /&gt;
&lt;br /&gt;
I&#39;m currently working with an evaluation team that is examining a large-scale
public health programme in the United Kingdom,&amp;nbsp;covering many different
locations and involving many different types of local partnerships.&amp;nbsp;But
with one common outcome of concern, which is to increase people&#39;s physical
activity levels in their daily life. In their work the evaluation team is
already making use of a causal configurational approach to the understanding of
what works for whom in what circumstances.&amp;nbsp;It is finding different
configurations of causal conditions across these locations that are associated
with changes in activity levels. This approach is consistent with the high
level of diversity in locations partnerships and interventions.&lt;br /&gt;
&lt;br /&gt;
But what it does not yet have is a counterfactual, a defensible description of
what might have happened in these locations in the absence of this
intervention.&amp;nbsp;This is where the idea of a&amp;nbsp;&lt;b&gt;rank order
counterfactual&lt;/b&gt;&amp;nbsp;becomes relevant.&amp;nbsp;By a&amp;nbsp;rank‑order
counterfactual&amp;nbsp;I mean a very specific kind of “what would have happened
otherwise.” Instead of trying to predict the exact outcome that would have been
achieved in each location without an intervention, we can start by asking a
simpler, comparative question:&amp;nbsp;&lt;i&gt;which&lt;/i&gt;&amp;nbsp;location would probably
have changed more, and&amp;nbsp;&lt;i&gt;which&lt;/i&gt;&amp;nbsp;less, if the intervention had
never existed?&amp;nbsp;The answer will be in the form of a rank ordering of
locations, from those with more to less expected change.&amp;nbsp;That
ranking&amp;nbsp;would be&amp;nbsp;constructed based on&amp;nbsp;all
available&amp;nbsp;baseline information, trends, and contextual knowledge. This
proposed approach falls into a category of counterfactuals known as
&quot;logically constructed counterfactuals&quot;, and it aligns well with
configurational evaluation because it focuses on patterns of relative change
across diverse contexts.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class=&quot;MsoNormal&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A subsequent evaluation of those same locations should also
be able to generate a new rank ordering, which is based on observed outcomes.
These counterfactual and actual rankings can then be compared, using a
scatterplot and correlation measures. The scatterplot is also visually powerful
for communication: it lets people see at a glance which locations behave as
expected and which ones stand out as surprises. If the intervention had no
effect we should see a linear relationship, the observed and counterfactual rankings
should be the same. If the intervention had positive, or perhaps even negative
effects, this should not happen. We might see various locations which are
outliers from that expected trend.&amp;nbsp;When locations we expected to be
“natural leaders” did not improve much, and those we expected to be “natural
laggards” moved to the top of the league table, that pattern is a signal that
the intervention may have been influential, and it gives the evaluation team
clear cases where alternative explanations should be probed. The task of the
evaluation is then to probe those alternative explanations, not to assume the
intervention is the only possible cause. The rankings are not a substitute for
theory‑based
evaluation; they are a way to make its claims sharper and more testable. The focus
on ranking differences can convert a vague theory (“we think these factors
matter”) into a concrete, specific prediction about which locations should do
better.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class=&quot;MsoNormal&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The sensitivity of the rank comparison process will depend
on the number of ranked items.&amp;nbsp;The more rank positions there are, the more
sensitivity&amp;nbsp;there will be to differences in performance, which is
good.&amp;nbsp;But, as shown in research on sorting algorithms,&amp;nbsp;the time
required to generate a complete sorting,&amp;nbsp;using any of the well-known
methods, can be significant. Growing faster than proportionally to the number
of items, though far slower than exponential growth. In addition to the extra
time required,&amp;nbsp;a rank order counterfactual&amp;nbsp;will require a stronger
evidential base where the number of rank positions is greater.&amp;nbsp;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class=&quot;MsoNormal&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When a large number of locations&amp;nbsp;are involved in an
intervention one practical way of addressing this tension is to use a
stratified random sample,&amp;nbsp;and to generate the rankings for that
sample&amp;nbsp;only. Another approach to managing large numbers&amp;nbsp;of locations
is to think of ranked bands of locations rather than individual rankings for
each location.&amp;nbsp;What should be of interest, then, are systematic shifts in
band membership between the counterfactual and actual observations – for
example, locations expected to be in the “low‑change” band
turning up in the “high‑change” band in
practice.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In this way, rank‑order counterfactuals do not
replace theory‑based evaluation, but sharpen it: they turn general
expectations about context into explicit, testable predictions about who should
have changed most in the absence of the programme. In work which I hope to
document in the European Evaluation Society conference&amp;nbsp;later this
year&amp;nbsp;I will explain&amp;nbsp;how the use of the hierarchical card sorting
process&amp;nbsp;was used to generate&amp;nbsp;argument and evidence based
counterfactual&amp;nbsp;rank orderings, and how an LLM was used to support this
process.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/6439146721180254351/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2026/03/introducing-rank-order-counterfactuals.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6439146721180254351'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6439146721180254351'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2026/03/introducing-rank-order-counterfactuals.html' title='Introducing Rank Order Counterfactuals (ROC)'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhODhiWfw1em7tzl3REMp_u0W6B3gItXhfQClSB9foFdqj9T-DnTF72C3Qt7kVn3V71gw_Yy331MC7Zvhyphenhyphen3ZXlstqf4Rt2hV_tHHdmvZ7ikvjebtGyLha9K5wd9XpeeJ2tpTNoTiIRxdLn7SQG_H0z9iF4pRFxhPR-JoBC7c_Ome2jP8az0-fIr/s72-w621-h640-c/counterfactual%20rankings.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-4161592393989058319</id><published>2026-03-07T14:40:00.023+00:00</published><updated>2026-03-15T18:20:29.259+00:00</updated><title type='text'>Making implicit knowledge explicit, contestable and usable: an introduction to The Ethnographic Explorer</title><content type='html'>



&lt;div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5BfVXSV8AYfMbK52FrfUJMKDIBH5O9CXqiqkidqft_tFAQB1X3jXMnEZBUa1z6HCOhwR-waijiqiE8YG83SU1QXXtDgp5z9DSGrIqAWLUg-lxswrnGca-qTOhmldbMw912J-f7OapNm8OHLRYpfGqp242Bie63qYln0jDSEX4oA0NQbnyeBPO/s1403/2026%2003%2015%20screen%20shot.png&quot; style=&quot;font-family: verdana; font-size: large; margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;906&quot; data-original-width=&quot;1403&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5BfVXSV8AYfMbK52FrfUJMKDIBH5O9CXqiqkidqft_tFAQB1X3jXMnEZBUa1z6HCOhwR-waijiqiE8YG83SU1QXXtDgp5z9DSGrIqAWLUg-lxswrnGca-qTOhmldbMw912J-f7OapNm8OHLRYpfGqp242Bie63qYln0jDSEX4oA0NQbnyeBPO/s16000/2026%2003%2015%20screen%20shot.png&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;div&gt;&lt;br /&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;span style=&quot;text-align: left;&quot;&gt;Caveat Emptor: I delegated the writing of this blog posting to Claude AI (Sonnet 4.6), based on an extended prior dialogue on the subject below, then a summary prompt of what was wanted. My post-production edits were quite limited.&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/span&gt;&lt;div&gt;&lt;div style=&quot;text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;--o0o--&lt;/span&gt;&lt;/div&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There is a problem that turns up repeatedly in evaluation practice — and in many other fields — that many of us work around rather than solve directly. The problem is this: &lt;strong&gt;the people who know most about a programme, a portfolio, or a set of cases often cannot easily say what they know, or why they make the judgements they do.&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This is not a failure of intelligence. It is the normal condition of what Michael Polanyi called tacit knowledge — the &quot;we know more than we can tell&quot; problem that is endemic to any field built on experience and judgement. The challenge for evaluation is to find structured ways of drawing it out.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The theoretical anchor: information as difference&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The approach behind The Ethnographic Explorer draws on a deceptively simple idea from Gregory Bateson: &lt;strong&gt;information is a difference that makes a difference&lt;/strong&gt;. What we notice, and consider significant, is always defined by contrast — not by properties of things in isolation. A project is &quot;successful&quot; relative to others; a group is &quot;vulnerable&quot; in comparison to other groups in a given context. And some of those differences have noticeable consequences, they &quot;make a difference&quot;. Together they can be seen as simple &quot;IF...THEN...&quot; rules.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This suggests that a structured method for eliciting knowledge should be built around comparison — specifically, around asking people to identify and articulate differences, and then to explain what those differences imply. The Hierarchical Card Sort methodology is one way of doing this and is the basis of The Ethnographic Explorer&#39;s&amp;nbsp;process of inquiry.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;How the Ethnographic Explorer works&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There are three linked stages.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In the &lt;strong&gt;Sort&lt;/strong&gt; stage, a respondent is presented with a set of cases — projects, organisations, events, beneficiaries, or any entities they know well. They are asked to divide all the cases into two groups representing the most significant difference between them, from their point of view. Each group is named, the difference is recorded, and then the respondent is asked: &lt;em&gt;&quot;What difference does this difference make?&quot;&lt;/em&gt; — a question that surfaces the consequence or implication of the distinction. The process repeats on each subgroup until every group contains a single case. The result is a binary tree: a structured, hierarchical map of the respondent&#39;s view of the case set.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tree is informative in three ways: it reveals the &lt;strong&gt;contents&lt;/strong&gt; of the distinctions the respondent considers important; it identifies the &lt;strong&gt;limits&lt;/strong&gt; of their knowledge (where further differences cannot be found); and it indicates the &lt;strong&gt;direction&lt;/strong&gt; of their attention (where further distinctions could usefully be explored).&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In the &lt;strong&gt;Compare&lt;/strong&gt; stage, the facilitator asks comparison questions at each split in the tree. Questions can be &lt;em&gt;in degree&lt;/em&gt; (&quot;Which group of cases is more likely to face sustainability problems?&quot;) or &lt;em&gt;in kind&lt;/em&gt; (&quot;How do these two groups differ in terms of their relationship with local government?&quot;). Degree questions produce a ranking of all cases; kind questions produce descriptive contrasts. Both types build on the structure already revealed by the sort.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In the &lt;strong&gt;Contrast&lt;/strong&gt; stage, any two degree-based rankings are plotted against each other in a scatter plot. The resulting quadrant analysis shows where the two rankings agree (cases high on both, or low on both) and where they diverge (cases high on one and low on the other). Adjustable cutoff sliders allow the facilitator to explore different thresholds, and a Spearman correlation coefficient summarises the overall relationship. Outlier cases — those that diverge most between the two rankings — are often the most analytically interesting. Strong relationships can be cast as potentially useful IF...THEN rules&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The Ethnographic Explorer&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;With substantial coding help from Claude AI, I have been developing a browser-based implementation of this methodology — &lt;strong&gt;The Ethnographic Explorer&lt;/strong&gt; — as a standalone single-file application. This new version supersedes an earlier WordPress-based tool at &lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;http://ethnographic.mande.co.uk&quot;&gt;ethnographic.mande.co.uk&lt;/a&gt;, which required a server to run. The new version requires nothing beyond a web browser.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EthnographicExplorer_Vs48.html&quot;&gt;▶  Try The Ethnographic Explorer&lt;/a&gt; &lt;em&gt;[When you get there, click on Introduction for guidance on how to explore the tool&#39;s functions]&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool is designed for use in a shared-screen video call with a single respondent, or screened to multiple participants in a workshop setting. The facilitator drives the interface; the respondent provides the knowledge. A typical exercise with 8–12 cases takes between 45 minutes and two hours, depending on range of comparisons made.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A worked example: 12 Largest cities&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Seven jscon files are are available alongside the app to allow you to explore the use of the app. The first is simply a list, which you can then sort and compare and contrast. The second is the same list, already sorted by Claude AI at my request, on the basis of &quot;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&amp;nbsp;their appeal to international tourists, as seen from 6 different perspectives.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;To load either dataset, click &lt;strong&gt;Import JSON&lt;/strong&gt; in the top-right toolbar and select the file (after being downloaded into your computer).&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260307_EE_The_12_largest_cities_in_the_w_Vs1.json&quot; style=&quot;font-family: verdana; font-size: large;&quot;&gt;Download 12 Largest Cities (unsorted)&lt;/a&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EE_12Cities_BudgetBackpacker_Vs2.json&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;Download 12 Largest Cities - Budget Backpacker view (sorted)&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EE_12Cities_FoodAndCulture_Vs2.json&quot; target=&quot;_blank&quot;&gt;Download 12 Largest Cities - Food and Culture view (sorted)&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EE_12Cities_SafetyConscious_Vs2.json&quot; target=&quot;_blank&quot;&gt;Download 12 Largest Cities - Safety Concious view (sorted)&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EE_12Cities_SustainableTourism_Vs2.json&quot; target=&quot;_blank&quot;&gt;Download 12 Largest Cities - Sustainable Tourism view (sorted)&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EE_12Cities_TravelJournalist_Vs2.json&quot; target=&quot;_blank&quot;&gt;Download 12 Largest Cities - Travel Journalist view (sorted)&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2026/03/20260315_EE_12Cities_HeritageSpecialist_Vs2.json&quot; style=&quot;font-family: verdana; font-size: large;&quot; target=&quot;_blank&quot;&gt;Download 12 Largest Cities - Heritage Specialist view (sorted)&lt;/a&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;What the tool produces&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Each exercise generates a structured record of the respondent&#39;s knowledge about a set of cases:&lt;/span&gt;&lt;/p&gt;
&lt;ul class=&quot;[li_&amp;amp;]:mb-0 [li_&amp;amp;]:mt-1 [li_&amp;amp;]:gap-1 [&amp;amp;:not(:last-child)_ul]:pb-1 [&amp;amp;:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3&quot;&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;a binary sort tree, with each split labelled by the most significant difference and its consequence&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;one or more named rankings, each derived from a sequence of binary degree judgements working from the root set down through the subsets&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;one or more descriptive contrasts, capturing how subgroups differ in kind on specified attributes&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;a scatter plot for any pair of degree rankings, with quadrant analysis and Spearman correlation&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;a qualitative responses panel, collecting the in-kind descriptions at each split&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The full exercise exports as a JSON file, preserving the complete tree structure, all difference descriptions, all responses, and the exercise metadata. Files can be imported to resume exactly where you left off, or shared with a colleague for further analysis.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Beyond evaluation&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The underlying structure of the method applies wherever you have a set of cases and a respondent who has differentiated knowledge about them. Some other framings that could be explored with the same tool:&lt;/span&gt;&lt;/p&gt;
&lt;ul class=&quot;[li_&amp;amp;]:mb-0 [li_&amp;amp;]:mt-1 [li_&amp;amp;]:gap-1 [&amp;amp;:not(:last-child)_ul]:pb-1 [&amp;amp;:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3&quot;&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Organisational learning:&lt;/strong&gt; a team reviews a set of completed projects, identifying the distinctions that, in retrospect, most predicted success or failure&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Capacity assessment:&lt;/strong&gt; a trainer sorts a cohort of staff by their readiness for different kinds of work, making the basis for those judgements explicit and discussable&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Stakeholder analysis:&lt;/strong&gt; a key informant sorts a set of stakeholder organisations, revealing the distinctions they consider most consequential for programme implementation&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Policy analysis:&lt;/strong&gt; a policy analyst sorts a set of interventions by their perceived effectiveness, then compares that ranking against a ranking of their political feasibility&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In each case, the sort structure is the same, the comparison logic is identical, and only the cases, the domain framing, and the named differences change.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;An invitation to experiment&lt;/span&gt;&lt;/h3&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool is best understood by using it. I would encourage anyone engaged in evaluation, learning, or knowledge management work to try loading in a set of cases they know well — even with a rough sort to start — and see what structure emerges. The Compare and Contrast stages are particularly useful for surfacing assumptions that are rarely made explicit in standard reporting.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I am continuing to develop the tool and would welcome feedback on the methodology, the interface, or applications I have not yet considered.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;font-claude-response-body break-words whitespace-normal leading-[1.7]&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;strong&gt;Accessing the code:&lt;/strong&gt; In addition to trying the app online, you can download a copy of the code and run it independently. Go to the app in yur directory, right-click, select View Page Source, copy the entire code, paste it into a text file, rename the file to end in &lt;code class=&quot;bg-text-200/5 border border-0.5 border-border-300 text-danger-000 whitespace-pre-wrap rounded-[0.4rem] px-1 py-px text-[0.9rem]&quot;&gt;.html&lt;/code&gt; rather than &lt;code class=&quot;bg-text-200/5 border border-0.5 border-border-300 text-danger-000 whitespace-pre-wrap rounded-[0.4rem] px-1 py-px text-[0.9rem]&quot;&gt;.txt&lt;/code&gt;, and open it in a web browser. The tool runs entirely in your browser — no login required, no data sent anywhere.&lt;/span&gt;&lt;/p&gt;
&lt;h3 class=&quot;text-text-100 mt-2 -mb-1 text-base font-bold&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Further reading&lt;/span&gt;&lt;/h3&gt;
&lt;ul class=&quot;[li_&amp;amp;]:mb-0 [li_&amp;amp;]:mt-1 [li_&amp;amp;]:gap-1 [&amp;amp;:not(:last-child)_ul]:pb-1 [&amp;amp;:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3&quot;&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Polanyi, M. (1966). &lt;em&gt;The Tacit Dimension&lt;/em&gt;. Doubleday. — the source of the &quot;we know more than we can tell&quot; framing.&lt;/span&gt;&lt;/li&gt;&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Bateson, G. (1972). &lt;em&gt;Steps to an Ecology of Mind&lt;/em&gt;. Ballantine Books. — the source of the &quot;difference that makes a difference&quot; framing.&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Kelly, G.A. (1955). &lt;em&gt;The Psychology of Personal Constructs&lt;/em&gt;. Norton. — the intellectual precursor to card sorting methods.&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The 2025 WordPress version of the tool, with documentation and worked examples, is at &lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;http://ethnographic.mande.co.uk&quot;&gt;ethnographic.mande.co.uk&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;whitespace-normal break-words pl-2&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Help pages for the comparison stage, with nine worked examples of question types, are at the &lt;a class=&quot;underline underline underline-offset-2 decoration-1 decoration-current/40 hover:decoration-current focus:decoration-current&quot; href=&quot;https://www.mande.co.uk/wp-content/uploads/2024/03/Help%20Pages%20Comparisons.html&quot;&gt;Comparisons Help page&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/4161592393989058319/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2026/03/making-implicit-knowledge-explicit.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/4161592393989058319'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/4161592393989058319'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2026/03/making-implicit-knowledge-explicit.html' title='Making implicit knowledge explicit, contestable and usable: an introduction to The Ethnographic Explorer'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5BfVXSV8AYfMbK52FrfUJMKDIBH5O9CXqiqkidqft_tFAQB1X3jXMnEZBUa1z6HCOhwR-waijiqiE8YG83SU1QXXtDgp5z9DSGrIqAWLUg-lxswrnGca-qTOhmldbMw912J-f7OapNm8OHLRYpfGqp242Bie63qYln0jDSEX4oA0NQbnyeBPO/s72-c/2026%2003%2015%20screen%20shot.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-7545557538945607695</id><published>2026-03-03T08:55:00.013+00:00</published><updated>2026-03-07T14:45:40.123+00:00</updated><title type='text'>Optimising method selection: an introduction to the set covering problem — and a tool to help</title><content type='html'>
&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhF-aVAaq-43jlhOlHWj8HGL2t5Ym8BS8RT4PEj27fsQNEvgn-d6NKu16HXiPxNLmKeivoE0KMdJNTkDk6nJ18FKzVaXsAcB_VOaVbRAl-27c8RH41LHudUJdLjTo89RWuuH1XFjGKBcG88wt7e_ZHyW09sEexWY_9khHXr0RCdfuIenteZLjxF/s1479/graohic.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;767&quot; data-original-width=&quot;1479&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhF-aVAaq-43jlhOlHWj8HGL2t5Ym8BS8RT4PEj27fsQNEvgn-d6NKu16HXiPxNLmKeivoE0KMdJNTkDk6nJ18FKzVaXsAcB_VOaVbRAl-27c8RH41LHudUJdLjTo89RWuuH1XFjGKBcG88wt7e_ZHyW09sEexWY_9khHXr0RCdfuIenteZLjxF/s16000/graohic.png&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b style=&quot;font-family: verdana;&quot;&gt;Caveat Emptor:&lt;/b&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt; I delegated the writing of this blog posting to Claude AI(Sonnet 4.6), based on an extended prior dialogue on the subject below, then a summary prompt of what was wanted. My post production &lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;edits&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt; were quite limited&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;--0O0--&lt;/span&gt;&lt;/div&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There is a class of problem that turns up repeatedly in evaluation planning, and in many other fields, that most of us solve informally and imprecisely. The problem is this: &lt;b&gt;given a set of needs to be addressed, what is the smallest combination of responses that covers all of them?&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In evaluation this might be: given a set of evaluation questions I need to answer, what is the minimum combination of methods that gives me adequate coverage across all of them? In public health it might be: which combination of clinics ensures every neighbourhood has access to at least one? In software testing: which set of test cases exercises every code path? In logistics: which depots can serve every delivery zone?&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;These are all instances of what computer scientists call the &lt;b&gt;Set Covering Problem&lt;/b&gt; — a well-studied problem in combinatorial optimisation that dates back to the 1970s. The formal version asks: given a collection of sets, find the smallest sub-collection whose union covers all elements of a target universe.&lt;/span&gt;&lt;/p&gt;

&lt;h3&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Why it matters for evaluation practice&lt;/span&gt;&lt;/h3&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When designing an evaluation, practitioners face exactly this structure. We have a list of evaluation questions (or dimensions of quality, or stakeholder concerns), and a repertoire of methods — each of which can address some but not all of those questions to a satisfactory level. Choosing methods one at a time, based on familiarity or habit, rarely produces the most efficient combination. We either end up with redundant overlap&lt;span style=&quot;color: #666666;&quot;&gt;s&lt;/span&gt; in some areas and blind spots in others, or with far more methods than the budget or timeline can support.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A more systematic approach asks: which combination of methods is both &lt;i&gt;complete&lt;/i&gt; (covers all questions at an adequate level) and &lt;i&gt;minimal&lt;/i&gt; (uses as few methods, and as little resource, as possible)?&lt;/span&gt;&lt;/p&gt;

&lt;h3&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;How the optimisation works&lt;/span&gt;&lt;/h3&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;For a small number of methods and questions, you could check all possible combinations by hand. But the number of combinations grows exponentially — with ten methods, there are over a thousand possible subsets to evaluate. This is where an algorithm helps.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The simplest approach is a &lt;b&gt;greedy algorithm&lt;/b&gt;: at each step, pick the method that covers the most currently-uncovered questions, then repeat until everything is covered. This is fast and usually finds a good solution, but not necessarily the best one.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A more thorough approach — &lt;b&gt;exhaustive search&lt;/b&gt; — systematically checks all combinations up to a specified size and returns every minimal solution. This is slower but reveals the full landscape of equally-good options, which is often more useful than a single answer, particularly when cost or other practical constraints come into play.&lt;/span&gt;&lt;/p&gt;

&lt;h3&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The Coverage Optimiser&lt;/span&gt;&lt;/h3&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;With substantial coding help from Claude AI, I have been developing a small browser-based tool — the &lt;b&gt;Coverage Optimiser&lt;/b&gt; — that applies both approaches to a user-defined matrix of methods and question types.&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin: 1.5em 0px;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads//2026/03/20260303_CoverageOptimiser_Vs9b.html&quot; style=&quot;background-color: #1a73e8; border-radius: 4px; color: white; font-weight: bold; padding: 10px 20px; text-decoration: none;&quot; target=&quot;_blank&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;▶&amp;nbsp; Try the Coverage Optimiser&lt;/span&gt;&lt;/a&gt;&amp;nbsp;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;[When you get there click on Introduction, to get suggestions on how to explore the functions of the app]&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The default example matrix uses ten foresight methods (Scenario Planning, Delphi, Horizon Scanning, and others) rated against five question types — Descriptive, Valuative, Explanatory, Predictive, and Prescriptive — at HIGH, MEDIUM, or LOW in terms of their usefulness in building a futures perspective into an evaluation. The tool finds the minimal combinations of methods that achieve the desired coverage level across all question types.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Each solution is displayed with:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;the methods involved, each with its cost rating&lt;/span&gt;&lt;/li&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;a question-by-question coverage check&lt;/span&gt;&lt;/li&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;the total cost of the combination&lt;/span&gt;&lt;/li&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;an overlap score — the number of questions covered by more than one method in the solution&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Overlap is worth attending to: it indicates redundancy, which in evaluation terms means resilience. If one method proves impractical in the field, a solution with higher overlap is more likely to remain viable.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The matrix itself is fully editable in a dedicated Matrix Editor tab. You can rename methods and question types, adjust HIGH/MEDIUM/LOW ratings on a choosen criteria, set importance weights for each question type (0–10), set cost ratings for each method (0–10), sort rows by any column, import and export as JSON, and print or save the matrix or results as PDF.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Note:&amp;nbsp;&lt;/span&gt;&lt;i style=&quot;font-family: &amp;quot;Times New Roman&amp;quot;;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;The Coverage Optimiser runs entirely in your browser — no login required, no data sent anywhere.&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;

&lt;h3&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Beyond methods and questions&lt;/span&gt;&lt;/h3&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool is not limited to foresight methods or evaluation questions. The underlying logic applies wherever you have a set of &lt;b&gt;options&lt;/b&gt; and a set of &lt;b&gt;requirements&lt;/b&gt;, and where each option partially addresses some requirements. Some other framings that could be loaded into the same tool:&lt;/span&gt;&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Solutions × Problems:&lt;/b&gt; which combination of policy interventions covers the broadest range of identified problems?&lt;/span&gt;&lt;/li&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Stakeholders × Information needs:&lt;/b&gt; which combination of engagement activities ensures all key stakeholder groups have their core information needs met?&lt;/span&gt;&lt;/li&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Data sources × Indicators:&lt;/b&gt; which combination of data collection instruments covers all required indicators, at minimum cost?&lt;/span&gt;&lt;/li&gt;
  &lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Partners × Geographic areas:&lt;/b&gt; which combination of implementing partners ensures all target districts are reached?&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In each case the matrix structure is the same, the optimisation logic is identical, and only the labels and rating criteria change.&lt;/span&gt;&lt;/p&gt;

&lt;h3&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;An invitation to experiment&lt;/span&gt;&lt;/h3&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The tool is best understood by using it. I would encourage anyone planning a multi-method evaluation, or foresight exercise, to load in their own methods and questions — even with rough ratings to start — and see what combinations emerge. The exhaustive search mode is particularly useful for revealing that several equally-minimal combinations exist, which opens up a more deliberate conversation about which is preferable given cost, feasibility, or complementarity.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I am continuing to develop the tool and would welcome feedback on the matrix structure, the rating scales, or applications I have not yet considered.&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Those who have used &lt;a href=&quot;https://evalc3online.org&quot; target=&quot;_blank&quot;&gt;EvalC3&lt;/a&gt; will find a family resemblance here: both tools use systematic search to find efficient combinations — EvalC3 searching for attribute combinations that predict specific individual outcomes, the Coverage Optimiser searching for method combinations that cover multiple requirements. The underlying computational logic is related, even if the problems look different on the surface.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Accessing the code&lt;/b&gt;: In addition to trying out the app, as it already exists online, you can also download a copy of the code and have it working independently on your own website. Go to the app online, right click your mouse, select View Page Source, copy the entire code, paste it into a txt file, rename the text file to end in .html not .txt, click on that file in a directory to open it up in a web browser. Simples, yes?&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;

&lt;h3&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Further reading&lt;/span&gt;&lt;/h3&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;

&lt;span&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li style=&quot;font-size: large;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;A lot of the available reading on set covering algorithms is in the computer science domain. For a more accessible starting point, see the Wikipedia article on the &lt;/span&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Set_cover_problem&quot; style=&quot;font-family: verdana;&quot; target=&quot;_blank&quot;&gt;Set cover problem&lt;/a&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;font-size: large;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;The next blog posting explores comparisons with QCA.&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;font-size: large;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;This exercise in method development was an unplanned outcome of the writing of a book chapter on bridging the fields of evaluation and foresight. The book is...&lt;/span&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;&lt;span face=&quot;Arial, Helvetica, sans-serif&quot; style=&quot;background-color: white; color: #222222;&quot;&gt;Lea Kleinsorg, Jan Tobias Polak, Christian Grünwald (2027): Futures-informed evaluation: Methodological approaches and empirical applications, SpringerNature, Heidelberg.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;/span&gt;&lt;div&gt;&lt;span&gt;

&lt;/span&gt;&lt;hr /&gt;

&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/7545557538945607695/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2026/03/optimising-method-selection.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7545557538945607695'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7545557538945607695'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2026/03/optimising-method-selection.html' title='Optimising method selection: an introduction to the set covering problem — and a tool to help'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhF-aVAaq-43jlhOlHWj8HGL2t5Ym8BS8RT4PEj27fsQNEvgn-d6NKu16HXiPxNLmKeivoE0KMdJNTkDk6nJ18FKzVaXsAcB_VOaVbRAl-27c8RH41LHudUJdLjTo89RWuuH1XFjGKBcG88wt7e_ZHyW09sEexWY_9khHXr0RCdfuIenteZLjxF/s72-c/graohic.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-7190151461933707238</id><published>2025-12-25T01:38:00.016+00:00</published><updated>2026-01-06T20:32:14.273+00:00</updated><title type='text'>Exracting additional knowledge and performance from a configurational model that already has wide coverage</title><content type='html'>&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A &lt;a href=&quot;https://evaluatingcomplexity.org/resources/how-to-use-search-for-new-models&quot; target=&quot;_blank&quot;&gt;decision tree algorithm&lt;/a&gt;, as available within &lt;a href=&quot;https://evalc3online.org/authenticate&quot; target=&quot;_blank&quot;&gt;EvalC3,&lt;/a&gt; can generate a classification tree&amp;nbsp; (a set of predictive models) of the kind shown here.&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/a/AVvXsEhP-LvMCaaXuRNP4NO21y20UX_CmgmLf7VGXnPNEr_J_UefxdrJ7IGvN1B94bN5-4nGnJ8500hUxFhOOGbTDt6_wC0tbUW2-gk1Hu7bE3wvMQxPq5bm69830VQCDqEDPeac5fStMZrb4g7WK_gW0O-L2GjNO3bhajuKuKFHwwuMPJm7a_elQTxW&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img alt=&quot;&quot; data-original-height=&quot;796&quot; data-original-width=&quot;1114&quot; height=&quot;458&quot; src=&quot;https://blogger.googleusercontent.com/img/a/AVvXsEhP-LvMCaaXuRNP4NO21y20UX_CmgmLf7VGXnPNEr_J_UefxdrJ7IGvN1B94bN5-4nGnJ8500hUxFhOOGbTDt6_wC0tbUW2-gk1Hu7bE3wvMQxPq5bm69830VQCDqEDPeac5fStMZrb4g7WK_gW0O-L2GjNO3bhajuKuKFHwwuMPJm7a_elQTxW=w640-h458&quot; width=&quot;640&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Some of the models (each branch is a model) are very detailed (i.e. has lots of attributes) and have narrow coverage. Such as HasQuotas+NotPost Conflict Situation+High Level of Human Development+ Low Womens Status = Low levels of womens representation in Parliament, which covers two cases (Senegal, Tanzania).&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Others are quite simple, with only two or three attributes and can have much wider coverage. Such as HasQuotas+ IsPostConflict = High levels of womens representation in Parliament, which covers two six cases (Burundi,&lt;span style=&quot;white-space: pre;&quot;&gt;	&lt;/span&gt;Ethiopia,&lt;span style=&quot;white-space: pre;&quot;&gt;	&lt;/span&gt;Mozambique,&lt;span style=&quot;white-space: pre;&quot;&gt;	&lt;/span&gt;Namibia,&lt;span style=&quot;white-space: pre;&quot;&gt;	&lt;/span&gt;South Africa,&lt;span style=&quot;white-space: pre;&quot;&gt;	&lt;/span&gt;Uganda)&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;These wide coverage models may have unexplored potential, in the form of unexploited information content within the cases they cover. The raw (i.e. numerical) outcome data for &lt;u&gt;the cases they cover only&lt;/u&gt; can be examined and recalibrated i.e re-dichotomised into two new sub-groups representing relatively higher versus lower outcome values &lt;u&gt;within that set only&lt;/u&gt;.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A new configurational analysis can then focus on that sub-set of cases to see if (a) any of the pre-existing attrubutes could predict membership of the two sub-groups, or (b) if any additional attributes, based on other knowledge of these cases, could do so.The ability to predict such finer grained performance differences would be a significant improvement.&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This analytic step is a complementary move to that known as &quot;pruning&quot;, where the removal of a mode attribute improves coverage, at the cost of precision.&amp;nbsp; Here an extra attribute is sought that will improve precision but at the cost of coverage. Perhaps it could be called &quot;grafting&quot;...&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Postscript: But how significant will this addition to the model be? If, as above, there are six cases involved, there are 2^6 possible binary groupings of these case i.e 32. So any one grouping of two sets of cases has a 1/32 or 3.125% chance of occuring randomly (if the cases are causally independent).&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/7190151461933707238/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2025/12/recalibration-of-numerical-outcome-data.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7190151461933707238'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7190151461933707238'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2025/12/recalibration-of-numerical-outcome-data.html' title='Exracting additional knowledge and performance from a configurational model that already has wide coverage'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/a/AVvXsEhP-LvMCaaXuRNP4NO21y20UX_CmgmLf7VGXnPNEr_J_UefxdrJ7IGvN1B94bN5-4nGnJ8500hUxFhOOGbTDt6_wC0tbUW2-gk1Hu7bE3wvMQxPq5bm69830VQCDqEDPeac5fStMZrb4g7WK_gW0O-L2GjNO3bhajuKuKFHwwuMPJm7a_elQTxW=s72-w640-h458-c" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-3424930256320307335</id><published>2025-12-02T22:44:22.363+00:00</published><updated>2025-12-10T07:01:24.189+00:00</updated><title type='text'>Objectives as data: The potential uses of updatable outcome targets</title><content type='html'>&lt;p&gt;&amp;nbsp;&lt;b style=&quot;font-family: verdana; font-size: large;&quot;&gt;The context&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A specialist agency is funding more than 40 different partner organisations, each working in a different part of the country but with the same overall objective of increasing people&#39;s levels of physical activity (because of the positive health consequences).&amp;nbsp;These partners are often working with quite different communities, and all have substantial degree of independence about how they work towards the overall objective.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Some agency&amp;nbsp;representatives have asked about the nature of the target that program as a whole is working towards, and have emphasised how essential it is that there be clarity in this area.&amp;nbsp;By target they mean an actual number. Specifically the percentage of people self-reporting that they achieve a certain level of physical activity each week, as identified by an annual survey that is already underway and will be repeated in future.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Possible responses&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In principle it would be possible to set a target for the proportion of the population reporting being physically active. Such as 75%.&amp;nbsp; But it would be very hard to identify an optimal target percentage, given the diversity of partner localities, and the communities within these.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Relative targets may be more&amp;nbsp;appropriate. Such as a 25% increase in reported activity levels. Especially&amp;nbsp;if partners were each asked to identify what they think are achievable percentage increases in their own localities within the next survey period. This estimation would take place in a context where these partners already have experience working in those locations, identifying some of the things that work and dont work. My hypothesis, yet to be tested, would be that these partners will make quite conservative estimates. And if so, this might come as some surprise to the donor and perhaps lead to some revision of their own expectations&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Taking this idea further, partners could be periodically asked if they wanted to adjust their expectations upwards or downwards , of the change that could be achieved - in the time remaining in the interventions lifespan. Subject to being able to explain the rationale for doing so. My second hypothesis is that this number, and commentary, could be a valid and useful form of progress reporting in its own right.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Making sense of the responses&lt;/b&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;An assessment of overall progress over longer time scale would need to consider both the scale of ambitions and the extent of their achievement. These can&#39;t be combined into one number based on a simple formula because any such number could be achieved by adjustment of expectations and or performance. However it could be usefully represented by a scatterplot, with data points reflecting each of the partners, of the kind shown below.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/a/AVvXsEjXC8VbVkP6MJQMSyDXj0VLGNvJxYAWlni03e8xyNHclIXJ-NExqzlQFHqYUEkJ5XRVMVyjwoBTG5LoMyqBjkQifEq0G5J-zkTDttoNfnRm6tpXyZ7O3jZs1T95AFBMHu9mANK1Au-ZlyS1QtEnLu28JC_AZWxo1PeuMjQKCWnVwDoNz4KSrXWVQumgV75V&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;img alt=&quot;&quot; data-original-height=&quot;413&quot; data-original-width=&quot;606&quot; height=&quot;218&quot; src=&quot;https://blogger.googleusercontent.com/img/a/AVvXsEjXC8VbVkP6MJQMSyDXj0VLGNvJxYAWlni03e8xyNHclIXJ-NExqzlQFHqYUEkJ5XRVMVyjwoBTG5LoMyqBjkQifEq0G5J-zkTDttoNfnRm6tpXyZ7O3jZs1T95AFBMHu9mANK1Au-ZlyS1QtEnLu28JC_AZWxo1PeuMjQKCWnVwDoNz4KSrXWVQumgV75V&quot; width=&quot;320&quot; /&gt;&lt;/span&gt;&lt;/a&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The location of partners in different quadrants suggests different implications about how the different partners should be managed&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;High ambition/low achievement: May need additional support, capacity building, or problem-solving&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Low ambition/low achievement: May need fundamental partnership restructuring or exit considerations&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;High ambition/high achievement: Candidates for scaling, sharing learning, reduced support intensity&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Modest ambition/high achievement: Opportunities to stretch ambitions&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This framework also provides plenty of potentially useful analytic questions&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Are ambitions increasing or decreasing?&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Is the gap between expected and actual narrowing or widening?&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;For a given level of actual achievement did differences in expectations have any role or consequences&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;For a given level of expected change what might explain&amp;nbsp;the differences in the partners&amp;nbsp;actual achievements&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;How do individual partners positions within this matrix change over time? Are there distinct types of trajectories and how can these differences be explained?&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;In summary&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A single numerical value based on the data in this matrix will provide a meaningless simplification.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In contrast, a&amp;nbsp;scatterplot visualisation&amp;nbsp;can generate multiple potentially useful perspectives.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;It is more useful to see targets&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;as necessarily malleable responses to changing conditions, than as unarguable reference points.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;text-indent: -18pt;&quot;&gt;&lt;b&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Postscript&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: 0px;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;text-indent: -18pt;&quot;&gt;There is a type&amp;nbsp; of reinforcement learning algorithm known as Temporal Difference Learning (TDD), that embodies a very similar process.&amp;nbsp;&lt;/span&gt;It is described as &quot;a model-free reinforcement learning method that learns by updating its predictions based on the difference between future predictions and the current estimate&quot;. Model-free means it has no built in model of the world it is working in.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: 0px;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When implemented as a human process it is vulnerable to gaming, because the agents (humans) are aware of the system&#39;s mechanics, unlike the neural networks or simplified agents typically used in computational TD learning. But one adaptation, suggested by Gemini AI, is to &quot;reward partners not just for the +/-gap, but for the accuracy of their final predictions over multiple cycles&quot;. &lt;b&gt;Relatively higher accuracy, over multiple time periods, might be indicative of potentially generalisable / replicable delivery capacity, usable beyound the current context.&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/3424930256320307335/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2025/12/the-context-specialist-agency-is.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/3424930256320307335'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/3424930256320307335'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2025/12/the-context-specialist-agency-is.html' title='Objectives as data: The potential uses of updatable outcome targets'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/a/AVvXsEjXC8VbVkP6MJQMSyDXj0VLGNvJxYAWlni03e8xyNHclIXJ-NExqzlQFHqYUEkJ5XRVMVyjwoBTG5LoMyqBjkQifEq0G5J-zkTDttoNfnRm6tpXyZ7O3jZs1T95AFBMHu9mANK1Au-ZlyS1QtEnLu28JC_AZWxo1PeuMjQKCWnVwDoNz4KSrXWVQumgV75V=s72-c" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-4571254254610915432</id><published>2024-06-18T14:40:00.008+00:00</published><updated>2024-07-05T18:15:43.927+00:00</updated><title type='text'>On two types of Theories of Change: Temporal and atemporal, and how they might be bridged</title><content type='html'>&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEitpluhCSNO3rFbaIjA06zDxu6C-jglVjoiijY2eqriw8dyGP14XIXo0nNq9yb4qs9a21BwCBfSHoKbRkFkqELMT0Ra62hS6G6bATxxKVJKNy30ThEo7-M6ZdpV90xs_Tya7N4FS2bp36gtHIwWwM2FCm0xpFxSnKjVwFoeItmMExNsBEWQ0ARI/s911/Precambrian.jpg&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;911&quot; data-original-width=&quot;800&quot; height=&quot;514&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEitpluhCSNO3rFbaIjA06zDxu6C-jglVjoiijY2eqriw8dyGP14XIXo0nNq9yb4qs9a21BwCBfSHoKbRkFkqELMT0Ra62hS6G6bATxxKVJKNy30ThEo7-M6ZdpV90xs_Tya7N4FS2bp36gtHIwWwM2FCm0xpFxSnKjVwFoeItmMExNsBEWQ0ARI/w451-h514/Precambrian.jpg&quot; width=&quot;451&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There are two quite different ways of representing theories of change – of the kind that might be useful when planning and monitoring development programmes of one kind or another.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;The first kind is seen in representational devices such as the Logical Framework, Logic Models and boxes-and-arrows type diagrams.  &lt;b&gt;These differentiate events according to their location at different points over time,&lt;/b&gt; taking place between the initial provision of funding, its allocation and use and then it&#39;s subsequent effects and final impacts. These are &lt;i&gt;temporal &lt;/i&gt;models.&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The second kind, seen much less often, are seen in the analyses generated by Qualitative Comparative Analysis (QCA) and simple machine learning methods known as Decision Trees or Classifiers.&amp;nbsp; Here the theory is in the form of multiple configurations of different attributes that are associated with desired outcome, and its absence.  Those attributes may be of the intervention and/or its context.&amp;nbsp;&amp;nbsp;&lt;b&gt;The defining feature of this approach is the focus on cases and differences between cases, rather than different points or periods in time. &lt;/b&gt; These cases are often geographical entities, or groups or persons, which have some persistence over time.  They are effectively&lt;i&gt;&amp;nbsp;atemporal &lt;/i&gt;models&lt;i&gt;.&lt;/i&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Each of these approaches have their own merits.  Theory of change which describes the sequence of expected events over time and how they relate to each other is useful for planning, monitoring and evaluation purposes.&amp;nbsp; But it runs the risk of assuming a homogeneity of effects across all locations where it is&amp;nbsp;is implemented.  On the other hand, a QCA-type configurational approach helps us identify diversity in contexts and implementations, and its consequences. But it may not have any immediate management consequences, about what needs to be done when.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;i&gt;One &lt;/i&gt;of my current interests is exploring the possibility of combining these two approaches, such that we have theories of&amp;nbsp;change that differentiate those events over time, while also differentiating cases across space where those events may or may not be happening.&amp;nbsp;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;One paper which I&#39;ve just been told about is exploring these possibilities, as seen from a QCA starting point:Pagliarin, S., &amp;amp; Gerrits, L. (2020). &lt;a href=&quot;https://doi.org/10.1177/2059799120959170&quot; target=&quot;_blank&quot;&gt;Trajectory-based Qualitative Comparative Analysis: Accounting for case-based time dynamics.&lt;/a&gt; Methodological Innovations, 13.&amp;nbsp; In this paper the authors introduce the innovative idea of cases as &lt;i&gt;different periods of time in the same location&lt;/i&gt;, where each of those subsequent periods of time may have various attributes of interest present or absent, along with an outcome of interest being present or absent.&amp;nbsp; This approach seems to have potential for enabling a systematic approach to within-case investigations complementing what might have been prior cross-case investigations.&amp;nbsp; There is the potential to identify specific attributes, or combinations of these, which are necessary or sufficient for changes to take place within a given case.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Somewhat tangentially...&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The same paper reminded me reminded me of some evaluation fieldwork I did in Burkina Faso in 1992, where I was interviewing farmers about the history of their development of a small market garden using irrigation water obtained from a nearby lake.  Looking back at the history of the market, which I think was about six years old at the time, I asked them to identify the most significant change that had taken place during the period of time.  They identified installation of the water pump in year 198?, and pointed out how it expanded the scale of their cultivation thereafter.  I can remember also asking, but with less recall of what they then said, follow-up questions about the most significant change that it taken place in each smaller time period either side of that event, and then its consequences.&amp;nbsp; I was in effect asking them to carve up the history of the garden into segments, and sub-segments, of time not defined by calendar, but by key events – each of which had consequences. These were in effect temporal &quot;cases&quot;. Each of these had a configuration of multiple attributes, i.e. being attributes of the nested set of time periods that it belonged to. Associated with each of these were differenting judgements about the&amp;nbsp; the productivity of the market garden.&amp;nbsp; But with our team&#39;s time being short supply, I never got the opportunity to gather a full data set, so to speak.&amp;nbsp;&lt;/span&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;i&gt;Another&lt;/i&gt; of my current interests, prompted by the above conjectures, is the possible use of specific form of Hierarchical Card Sorting (HCS) as a means introducing a temporal element into case-based configurational analysis. The HCS process generates a tree structure of nested binary distinctions between cases. It is concievable that different broad criteria could be introduced for the type of differences being identified at each level of the branching structure. For example, at the top level the &quot;most significant difference&quot; being sought could be specified as being&amp;nbsp; &quot;in terms of funding received&quot;, then at the next level, &quot;in terms of outputs generated&quot; , and so on (Criteria 1,2,3 etc in Figure 1 below) .&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/a/AVvXsEiDW6Dk4BSO66mvQ8Dwo8_jbjVzCVpDbd5oePrU00kHvGYMzOal-yvZNVRN9mq09aqTbFvS5dwdvVka4oF1blyVoX6C1yuWgoFEt6VjCTZdvXykZH0ggpCMKZp1JV0GJE2CnKlKXpMaPqsS6XBeh0T9T7Vy1Y8qqUfHwt64PrMzdxWBDkpl18jN&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img alt=&quot;&quot; data-original-height=&quot;403&quot; data-original-width=&quot;663&quot; height=&quot;340&quot; src=&quot;https://blogger.googleusercontent.com/img/a/AVvXsEiDW6Dk4BSO66mvQ8Dwo8_jbjVzCVpDbd5oePrU00kHvGYMzOal-yvZNVRN9mq09aqTbFvS5dwdvVka4oF1blyVoX6C1yuWgoFEt6VjCTZdvXykZH0ggpCMKZp1JV0GJE2CnKlKXpMaPqsS6XBeh0T9T7Vy1Y8qqUfHwt64PrMzdxWBDkpl18jN=w557-h340&quot; width=&quot;557&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;Figure 1 below&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/4571254254610915432/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2024/06/on-two-types-of-theories-of-change.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/4571254254610915432'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/4571254254610915432'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2024/06/on-two-types-of-theories-of-change.html' title='On two types of Theories of Change: Temporal and atemporal, and how they might be bridged'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEitpluhCSNO3rFbaIjA06zDxu6C-jglVjoiijY2eqriw8dyGP14XIXo0nNq9yb4qs9a21BwCBfSHoKbRkFkqELMT0Ra62hS6G6bATxxKVJKNy30ThEo7-M6ZdpV90xs_Tya7N4FS2bp36gtHIwWwM2FCm0xpFxSnKjVwFoeItmMExNsBEWQ0ARI/s72-w451-h514-c/Precambrian.jpg" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-6567698763151036404</id><published>2024-04-24T12:04:00.030+00:00</published><updated>2024-04-30T15:57:26.982+00:00</updated><title type='text'>Developing and using a Configurational Theory of Change within an evaluation</title><content type='html'>&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgK8e8U1bg4SCSRWXCYsaPuC7nkWYD5mMZdPSnJY2ZWe_-HQu2aBAJxC-IGkAhS5zvY26aJezVrWvaqGdJMvM3MfTd5KFskjoGJjbcAhtP1Di74cGPHs6PepxoyXYi_eqHIVhyphenhyphenTUqUavLzyvPJ3hYJMPU979wo2JwQ2waQK1KDscm-ENWux5MJ-/s702/CAA%20tree.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;510&quot; data-original-width=&quot;702&quot; height=&quot;464&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgK8e8U1bg4SCSRWXCYsaPuC7nkWYD5mMZdPSnJY2ZWe_-HQu2aBAJxC-IGkAhS5zvY26aJezVrWvaqGdJMvM3MfTd5KFskjoGJjbcAhtP1Di74cGPHs6PepxoyXYi_eqHIVhyphenhyphenTUqUavLzyvPJ3hYJMPU979wo2JwQ2waQK1KDscm-ENWux5MJ-/w640-h464/CAA%20tree.png&quot; width=&quot;640&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;Figure 1&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium; text-align: left;&quot;&gt;Ho-hum, yet another evaluation brand being promoted in an already crowded marketplace.FFS...&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Yes, I think this reaction is understandable, but I think there is something here captured under this title (Developing and using a Configurational Theory of Change... ) which has potential value.&amp;nbsp;&lt;/span&gt; &lt;span style=&quot;font-family: verdana;&quot;&gt;I will try to explain...&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Many evaluators make use of theories of change, as part of a theory-based approach to evaluation.  Many theories of change are described in some type of diagrammatic form.  And a typical feature of those diagrams is their convergent nature.  That is, they start of with a range of different types of inputs and activities which follow various causal pathways towards a limited number of final outcomes.&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This image is almost the complete opposite of what happens in actual practice on the ground.  Financial inputs come from a limited number of sources, these become available to a small range of partners who carry out their own range of activities, in a variety of different locations each with their own populations, including those intended and not intended to be affected.  This description is of course a simplification, but it applies to many development aid programme designs.  The point I&#39;m making here is that this in-reality process is not convergent it is divergent!&amp;nbsp; It seems like the diagrammatic theories of change I have described are a type of &lt;a href=&quot;https://en.wikipedia.org/wiki/Procrustes#:~:text=A%20Procrustean%20bed%20is%20an,method%20of%20looking%20for%20clues.&quot; target=&quot;_blank&quot;&gt;Procrustean bed&lt;/a&gt;.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;This blog posting has been most immediately prompted by a report I have just reviewed on potential evaluation strategies for a large national level climate finance strategy (CFS).  The theory of change describes multiple causal pathways connecting the initial provision of government finance through to four expected types of expected impacts.&amp;nbsp; With two of these causal pathways alone the number of projects being funded is in the hundreds.  The report struggled with the issue of how to measure the expected impacts given the scale and likely diversity of events on the ground.  And the corresponding challenge of how to sample those projects.  Part of my diagnosis of the problem here was the evaluation team&#39;s measurement-led approach.  And the weakness of the conceptual framework i.e. the incapacity of&amp;nbsp;the theory of change to capture the diversity of what was taking place.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Describing the alternative to my client is now my challenge.  I think the alternative has two parts.  Firstly, one should start at the beginning, where the money becomes available, and then follow the money (and the people responsible) as it gets distributed according to its intended purposes.  If things are not happening as expected early on in this process then this affects expectations of what might and might not be observable later on in the form of &#39;outcomes&#39; or &#39;impacts&#39;.  Put crudely, there is no point trying to observe the impact of something that has not yet been delivered. And in the case of strategies like the CFS, a large part of success can simply be gettting the money where it should be spent.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Secondly, as money is distributed from a central fund, decisions are going to be made about how it should be parcelled out in different amounts for different purposes through different institutions.&amp;nbsp; Each time that happens the decisions that have been made about how to do this are hopefully not random.  Evaluating how those decisions were made may not necessarily be all that useful,&amp;nbsp;because often there will be&amp;nbsp;opaque&amp;nbsp;mini, meso and macro political processes involved.  But the announced decisions may include some intentionally explicit expectations about the official purposes of different allocations.  Interviews those responsible for those allocations might also elicit more informal and more current expectations about what might be the short and longer terms effects of some of these allocations, when compared to others.The point I am emphasising here is that sometimes we can come to evaluative judgements not through the use of any overriding predetermined criteria, but by using a more inductive process, where we compare one option to another.  This is an excuse for me to quote Marx (G):&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Friend says to Marx – &#39;Life is difficult&#39;. &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Marx replies to friend – &#39;Compared to what?&#39;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;This type of inductive comparative evaluation doesn&#39;t have to be completely free form.  It is conceivable for example that we could look at two tranches of government climate finance funding and ask (those with proximate responsibilities for that funding) what difference there might between those blocks of funding in terms of how each might meet one or more of the OECD criteria (These range in their concerns from the more immediate issues of coherence and efficiency to later concerns with effectiveness and impact).&amp;nbsp;Respondents answers in the form of expectations can be seen as mini theories a.k.a. hypotheses that then might be testable through the gathering of relevant data.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Before these questions can be posed the cases that are going to be compared would need to be identified.  The &#39;cases&#39;&amp;nbsp;in this example would be particular blocks of funding.  Further along the implementation process the cases could be partners who are receiving funding, or activities that those partners implementing, or communities those activities are directed towards. Nevertheless, at any point along this &lt;/span&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;chain there is still a challenge, which is how to select cases for comparison. For example, i&lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;f we are looking at a particular budget document which distributes funding into multiple purpose categories we will be faced with the question of which of these categories to compare.&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span&gt;One way forward is to &lt;i&gt;let the interviewed person decide&lt;/i&gt;, especially if they have responsibilities in this area. Using&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;a href=&quot;https://mande.co.uk/special-issues/hierarchical-card-sorting-hcs/&quot; style=&quot;font-family: verdana;&quot; target=&quot;_blank&quot;&gt;hierarchical card sorting (HCS)&lt;/a&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;the interviewer starts with a&amp;nbsp;request, which is phrased like this:&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt; &#39;&lt;/span&gt;&lt;i style=&quot;font-family: verdana;&quot;&gt;What is the most significant difference between all these budget categories in terms of how they will achieve the objectives of the climate Finance strategy?  Please sort the budget categories into two piles according to this difference and then explain it to me&lt;/i&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&quot;.&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt; Having identified ppiles of types of cases that can be compared the respondent can then be asked for details about their expectations of the cases in one pile versus the other (See FN1).&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;The same question can then be reiterated by focusing on each of those two piles in turn and getting the respondent to break them into two smaller sub- piles. When their answers are followed by explanations this will help differentiate expectations in further detail.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;/span&gt;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjavX0BvWpts_iiJP_HY6ZWyu4MFek2Fj45AYqv39ApouWqP1EnZsT6tNjryPCZItJnCt4gXCc9Xoa2iP52RoBiRGzyGnXtd-fivJsoWilsEYxZMPuvYhD4xohmqPgv1DHE0NOB73UGokWLCkPEa5vqABVcmbLDVgdREnpPKMr05AzMZTTlkEUA/s3597/Annotated%20tree%20Anon1_1.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;1164&quot; data-original-width=&quot;3597&quot; height=&quot;207&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjavX0BvWpts_iiJP_HY6ZWyu4MFek2Fj45AYqv39ApouWqP1EnZsT6tNjryPCZItJnCt4gXCc9Xoa2iP52RoBiRGzyGnXtd-fivJsoWilsEYxZMPuvYhD4xohmqPgv1DHE0NOB73UGokWLCkPEa5vqABVcmbLDVgdREnpPKMr05AzMZTTlkEUA/w640-h207/Annotated%20tree%20Anon1_1.png&quot; title=&quot;Figure 2&quot; width=&quot;640&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;Figure 2 (click on to enlarge)&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Figure 2 shows the results of such an exercise, where the respondents were NGO staff responsible for the development and management of a portfolio of projects. They were asked to sort the projects into two piles according to &quot;&lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;i&gt;What they saw as the most significant difference between the projects, of a kind that would make a difference to what they could achieve&lt;/i&gt;&quot;. Their choices generated the tree structure. They were then asked to make a series of binary choices at each branching point, indentifying which of the two types of projects described there that &quot;&lt;i&gt;they expected to be most successful, in terms of the extent to which they will contribute to the achievement of the overall objectives of the portfolio&lt;/i&gt;&quot; . Their choices are shown by the red links. In this diagram their responses have been sorted such that the preferred red option is always shown above the non-preferred option. The aggregate result is a ranked set of 8 types of projects, with the highest rank (1) at the top. Each of these types is not an isolated category of its own, but part of a configuration that can be read along each branch, from left to right.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Here are some of the type descriptions and the reasons why one versus the other was selected most likely to contirbute to the portfolio objectives. Further discusison would be needed to esytablish how the presence/absence of these characteristics could be identified on the ground.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;MsoTableGrid&quot; style=&quot;border-collapse: collapse; border: none; margin-left: 5.4pt; mso-border-alt: solid windowtext .5pt; mso-padding-alt: 0cm 5.4pt 0cm 5.4pt; mso-table-layout-alt: fixed; mso-yfti-tbllook: 1184;&quot;&gt;
 &lt;tbody&gt;&lt;tr style=&quot;height: 13.5pt; mso-yfti-firstrow: yes; mso-yfti-irow: 0;&quot;&gt;
  &lt;td rowspan=&quot;2&quot; style=&quot;background: rgb(242, 219, 219); border: 1pt solid windowtext; height: 13.5pt; mso-background-themecolor: accent2; mso-background-themetint: 51; mso-border-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 99.2pt;&quot; valign=&quot;top&quot; width=&quot;132&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;Wider focus&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style=&quot;border-left: none; border: 1pt solid windowtext; height: 13.5pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;Aim to
  influence wider policy and environment, and have more sustainable and wider
  impact beyond children and their families.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style=&quot;height: 13.5pt; mso-yfti-irow: 1; mso-yfti-lastrow: yes;&quot;&gt;
  &lt;td style=&quot;border-bottom: 1pt solid windowtext; border-left: none; border-right: 1pt solid windowtext; border-top: none; height: 13.5pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; mso-border-top-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;&lt;span style=&quot;color: #0070c0;&quot;&gt;Likely to be more successful: Because it will have a
  wider reach and be more sustainable&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;MsoTableGrid&quot; style=&quot;border-collapse: collapse; border: none; margin-left: 5.4pt; mso-border-alt: solid windowtext .5pt; mso-padding-alt: 0cm 5.4pt 0cm 5.4pt; mso-table-layout-alt: fixed; mso-yfti-tbllook: 1184;&quot;&gt;
 &lt;tbody&gt;&lt;tr style=&quot;height: 20.1pt; mso-yfti-firstrow: yes; mso-yfti-irow: 0;&quot;&gt;
  &lt;td rowspan=&quot;2&quot; style=&quot;background: rgb(242, 219, 219); border: 1pt solid windowtext; height: 20.1pt; mso-background-themecolor: accent2; mso-background-themetint: 51; mso-border-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 99.2pt;&quot; valign=&quot;top&quot; width=&quot;132&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;Local focus&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style=&quot;border-left: none; border: 1pt solid windowtext; height: 20.1pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;More hands on
  work with children on a day to day basis. Impact may be sustained but it will
  be limited to children and their families.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style=&quot;height: 16.65pt; mso-yfti-irow: 1; mso-yfti-lastrow: yes;&quot;&gt;
  &lt;td style=&quot;border-bottom: 1pt solid windowtext; border-left: none; border-right: 1pt solid windowtext; border-top: none; height: 16.65pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; mso-border-top-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;&lt;span style=&quot;color: #0070c0;&quot;&gt;Likely to be less successful:&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;MsoTableGrid&quot; style=&quot;border-collapse: collapse; border: none; margin-left: 5.4pt; mso-border-alt: solid windowtext .5pt; mso-padding-alt: 0cm 5.4pt 0cm 5.4pt; mso-table-layout-alt: fixed; mso-yfti-tbllook: 1184;&quot;&gt;
 &lt;tbody&gt;&lt;tr style=&quot;height: 20.1pt; mso-yfti-firstrow: yes; mso-yfti-irow: 0;&quot;&gt;
  &lt;td rowspan=&quot;2&quot; style=&quot;background: rgb(198, 217, 241); border: 1pt solid windowtext; height: 20.1pt; mso-background-themecolor: text2; mso-background-themetint: 51; mso-border-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 99.2pt;&quot; valign=&quot;top&quot; width=&quot;132&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;Locally driven&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style=&quot;border-left: none; border: 1pt solid windowtext; height: 20.1pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;Partner and
  the projects are locally rooted, driven by local needs and priorities. They
  are more likely to “get it right”. They can’t walk away when Comic Relief
  funding ends. More likely to be sustainable.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style=&quot;height: 20.1pt; mso-yfti-irow: 1; mso-yfti-lastrow: yes;&quot;&gt;
  &lt;td style=&quot;border-bottom: 1pt solid windowtext; border-left: none; border-right: 1pt solid windowtext; border-top: none; height: 20.1pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; mso-border-top-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;&lt;span style=&quot;color: #0070c0;&quot;&gt;Likely to be more successful: More embedded in the
  context, will outlast the project, be more responsive.&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;MsoTableGrid&quot; style=&quot;border-collapse: collapse; border: none; margin-left: 5.4pt; mso-border-alt: solid windowtext .5pt; mso-padding-alt: 0cm 5.4pt 0cm 5.4pt; mso-table-layout-alt: fixed; mso-yfti-tbllook: 1184;&quot;&gt;
 &lt;tbody&gt;&lt;tr style=&quot;height: 13.5pt; mso-yfti-firstrow: yes; mso-yfti-irow: 0;&quot;&gt;
  &lt;td rowspan=&quot;2&quot; style=&quot;background: rgb(198, 217, 241); border: 1pt solid windowtext; height: 13.5pt; mso-background-themecolor: text2; mso-background-themetint: 51; mso-border-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 99.2pt;&quot; valign=&quot;top&quot; width=&quot;132&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;UK driven&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style=&quot;border-left: none; border: 1pt solid windowtext; height: 13.5pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;UK driven
  projects, almost sub-contracting. They have a set end-point.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style=&quot;height: 13.5pt; mso-yfti-irow: 1; mso-yfti-lastrow: yes;&quot;&gt;
  &lt;td style=&quot;border-bottom: 1pt solid windowtext; border-left: none; border-right: 1pt solid windowtext; border-top: none; height: 13.5pt; mso-border-alt: solid windowtext .5pt; mso-border-left-alt: solid windowtext .5pt; mso-border-top-alt: solid windowtext .5pt; padding: 0cm 5.4pt; width: 307.85pt;&quot; valign=&quot;top&quot; width=&quot;410&quot;&gt;
  &lt;p class=&quot;MsoNormal&quot; style=&quot;line-height: normal; margin-bottom: 0cm;&quot;&gt;&lt;span style=&quot;color: #0070c0;&quot;&gt;Likely to be less successful:&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;There is a larger question here of course that also relates to sampling.  Who are you going to interview in this way?  The suggestion above was &#39;to follow the money &#39;. In other words, to follow lines of responsibility and interview people about the domains of activity they are responsible for, using HCS as a means of structuring the discussion.  There is a strategy choice here between what is known as a breadth-first search versus a depth-first search strategies.  From a given point in a flow of funds (and of responsibilities) there can be distributions going in different directions, each of which could all be explored.  Following all of these is a form of breadth-first search.  Alternatively the focus could be just on one of those developments, and following the subsequent distribution of funding and responsibility further down one (or few) line.  This is a form of depth-first search.  Which of those search strategies to pursue is probably a matter to be decided by the evaluation client. But may also need to be adaptive, informed by what was found by the evaluation team in prior interviews.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;table cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto; text-align: left;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg_PbDqhHFm6XGjHCIOKNgKERhvO1TpQfrTywNjDypE-H9sKM-XmSOIB1p4_a4eDWPbNsEdUx-glaFIgikEEQgY0RE7C3KpehtMkslwVAVWe4ZBvsLsPvb5p8asETfe0DqCHVYqsgHk6hRsBdfLFvohyaigbPdTi1XE076CUmcgFa5SxGxMpcCp/s782/BFS%20DFS.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;392&quot; data-original-width=&quot;782&quot; height=&quot;325&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg_PbDqhHFm6XGjHCIOKNgKERhvO1TpQfrTywNjDypE-H9sKM-XmSOIB1p4_a4eDWPbNsEdUx-glaFIgikEEQgY0RE7C3KpehtMkslwVAVWe4ZBvsLsPvb5p8asETfe0DqCHVYqsgHk6hRsBdfLFvohyaigbPdTi1XE076CUmcgFa5SxGxMpcCp/w647-h325/BFS%20DFS.png&quot; title=&quot;Courtesy: Jacky Liu&quot; width=&quot;647&quot; /&gt;&lt;/span&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;Courtesy &lt;a href=&quot;https://www.linkedin.com/pulse/comparison-breadth-first-search-depth-first-methods-uses-liu-ud4ce/&quot; target=&quot;_blank&quot;&gt;Jacky Lieu: Comparison of Breadth-First Search and Depth-First Search: Understanding Their Methods and Uses&lt;/a&gt;&amp;nbsp;&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;h2 style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;But what about aggregation?&lt;/span&gt;&lt;/h2&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;If you followed my suggested approach, the closer you got to the people whose lives were of&amp;nbsp; final/main concern, the small the segments of all the funding you would be looking at. These would be more comparable than when looking at as part of a larger group, with more customised context specific assessments of expected and actual impact. But how would you / the evaluation team then be able to make any overall statement about the strategy as a whole?&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;The way forward is to think of performance measurement in slightly different terms, than just using a simple indicator based measure. Imagine a scatter plot, with one dimension X describing relative i.e. ranked &lt;i&gt;expectations&lt;/i&gt; of achievement and the other dimension Y describing ranked&amp;nbsp;&lt;i&gt;actual/observed/assessed&lt;/i&gt; achievements. The entities in the scatter plot are the groups of cases in the smallest available sub-categories that were developed. Their rank position, relative to each other, is evident&amp;nbsp; when all the binary assessments of expected performance are generated through the process described above. &lt;a href=&quot;https://mande.co.uk/special-issues/hierarchical-card-sorting-hcs/#exploring&quot; target=&quot;_blank&quot;&gt;See here &lt;/a&gt;for more on how this is done. The scatter plot can in turn be summarised in at least two different ways: using a measure of rank correlation (or how achievement relates to expectations) and using Classification Accuracy, if and when a minimum rank position of achievement is identfiied. Equally importantly, qualitative descriptions can be given of cases that exemplify performance that most meets expectations, and the reverse, along with&amp;nbsp; positive and negative deviants (outliers).&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;What we could end up with is a tree structure documenting multiple routes to both high and low performance, implemented in varyingly different&amp;nbsp; contexts (describable at different levels of scale).&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Other scatter plot designs are more relevant to assessments of strategies. The ranking generated by Figure 2 was plotted against the age of the projects and their grant size, which might be expected to be influenced by the contents of a funding strategy. Neither of these two measures showed any relationship to perceived strategic priorities!&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;h2 style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;To be continued....&lt;/span&gt;&lt;/span&gt;&lt;/h2&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;PS1: When asking about expected effects of one type of allocation versus another, it may make sense to encourage a focus on more immediately expected effects first, and then later ones. They may be more likely, more easily articulated and more evaluable.&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span&gt;PS2:&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Hughes-McLure, S. (2022). Follow the money. Environment and Planning A: Economy and Space, 54(7), 1299–1322. &lt;a href=&quot;https://doi.org/10.1177/0308518X221103267&quot;&gt;https://doi.org/10.1177/0308518X221103267&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/6567698763151036404/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2024/04/case-based-comparative-evaluation-cce.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6567698763151036404'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6567698763151036404'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2024/04/case-based-comparative-evaluation-cce.html' title='Developing and using a Configurational Theory of Change within an evaluation'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgK8e8U1bg4SCSRWXCYsaPuC7nkWYD5mMZdPSnJY2ZWe_-HQu2aBAJxC-IGkAhS5zvY26aJezVrWvaqGdJMvM3MfTd5KFskjoGJjbcAhtP1Di74cGPHs6PepxoyXYi_eqHIVhyphenhyphenTUqUavLzyvPJ3hYJMPU979wo2JwQ2waQK1KDscm-ENWux5MJ-/s72-w640-h464-c/CAA%20tree.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-7773462035444030312</id><published>2023-12-22T06:18:00.024+00:00</published><updated>2024-02-01T09:03:07.521+00:00</updated><title type='text'>Using the Confusion Matrix as a general-purpose analytic framework</title><content type='html'>&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiRxdrRk2oszhunUdpw0WMkjacOhBj74GOg7vZ_oqfdhP5DS2TVsrzHq6PXWtg200lMz-DVK3GvCbEj8RrobVTfDxxDq-6dNUNWvpGj-A4TVJutdtgpjvJAB-Ihm97xbhhcQrOLageNYVd3yVLM6rhYmL9z_uZ3Rp5QSa2AOiCOvOsoMIXjmfO0/s300/download%20(1).png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;168&quot; data-original-width=&quot;300&quot; height=&quot;168&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiRxdrRk2oszhunUdpw0WMkjacOhBj74GOg7vZ_oqfdhP5DS2TVsrzHq6PXWtg200lMz-DVK3GvCbEj8RrobVTfDxxDq-6dNUNWvpGj-A4TVJutdtgpjvJAB-Ihm97xbhhcQrOLageNYVd3yVLM6rhYmL9z_uZ3Rp5QSa2AOiCOvOsoMIXjmfO0/s1600/download%20(1).png&quot; width=&quot;300&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Background&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This posting has been prompted by work I have done this year for the World Food Programme (WFP) as member of their &lt;a href=&quot;https://www.wfp.org/publications/evaluation-methods-advisory-panel-wfp-2022-review#:~:text=Launched%20in%20January%202022%2C%20WFP&#39;s,innovations%20on%20methods%20in%20evaluation.&quot; target=&quot;_blank&quot;&gt;Evaluation Methods Advisory Panel (EMAP)&lt;/a&gt;. One task was to carry out a review, along with colleague Mike Reynolds, of the methods used in the 2023 Country Strategic Plans evaluations. You will be able to read about these, and related work, in a forthcoming report on the panel&#39;s work, which I will link to here when it becomes available.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;One of the many findings of potential interest was: &quot;&lt;span style=&quot;text-align: justify;&quot;&gt;&lt;i&gt;there were relatively very few
references to how data would be analysed, especially compared to the detailed
description of data collection methods&lt;/i&gt;&quot;. In my own experience, this problem is widespread, found well beyond WFP. In the same report I proposed the use of what is known as the Confusion Matrix, as &lt;b&gt;a general purpose analytic framework&lt;/b&gt;. Not as the only framework, but as one that could be used alongside more specific frameworks associated with particular intervention theories such as those derived from the social sciences.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;font-size: medium; text-align: justify;&quot;&gt;&lt;b&gt;What is a Confusion Matrix?&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A Confusion Matrix is a type of truth table,&amp;nbsp; i.e., a table representing all the logically possible combinations of two variables or characteristics. In an evaluation context these two characteristics could be the presence and absence of an intervention, and the presence and absence of an outcome.&amp;nbsp; An intervention represents a specific theory (aka model), which includes a prediction that a specific type of outcome will occur if the intervention is implemented.&amp;nbsp; In the 2 x 2 version you can see above, there are four types of possibilities:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ol style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The intervention is present and the outcome is present. Cases like this are known as True Positives&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&amp;nbsp;The intervention is present but the outcome is absent. Cases like this are known as False Positives.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The intervention is absent and the outcome is absent. Cases like this are known as True Negatives&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The intervention is absent but the outcome is present. Cases like this are known as False Negatives.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Common uses of the Confusion Matrix&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The use of Confusion Matrices is most commony associated with the field of machine learning and predictive analytics, but it has much wider application. These include the fields of medical diagnostic testing, predictive maintenance,&amp;nbsp; fraud detection,&amp;nbsp; customer churn prediction,&amp;nbsp;remote sensing and geospatial analysis, cyber security,&amp;nbsp;computer vision, and natural language processing. In these applications the Confusion Matrix is populated by the number of cases falling into each of the four categories. These numbers are in turn the basis of a wide range of performance measures, which are described in detail in the &lt;a href=&quot;https://en.wikipedia.org/wiki/Confusion_matrix&quot; target=&quot;_blank&quot;&gt;Wikipedia article on the Confusion Matrix&lt;/a&gt;. A selection of these is &lt;a href=&quot;https://evalc3.net/4-evaluate-model-2-copy/&quot;&gt;described here&lt;/a&gt;, in this blog on the use of the EvalC3 Excel app&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;The claim&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Although the use of a Confusion Matrix is&amp;nbsp; commonly associated with quantitative analyses of performance, such as the accuracy of predictive models, it can also be a useful framework for thinking in more qualitative terms. This is a less well known and publicised use, which I elaborate on below. It is the inclusion of this wider potential use that is the basis of my claim that &lt;b&gt;the Confusion Matrix can be seen as a general-purpose analytic framework&lt;/b&gt;.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;The supporting arguments&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The claim has at least four main arguments:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ol style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The structure of the Confusion Matrix serves as &lt;b&gt;a useful reminder and checklist&lt;/b&gt;, that at least four different kinds of cases should be sought after, when constructing and/or evaluating a claim that X (e.g. an intervention) lead to Y (e.g an outcome).&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;True Positive cases, which we will usually start looking for first of all. At worst, this is all we look for.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;False Positive cases, which we are often advised to do, but often dont invest much time in actually doing so. Here we can learn what does not work and why so.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;False Negative cases, which we probably do even less often. Here we can learn what else works, and perhaps why so,&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;True Negative cases, because sometimes there are asymmetric causes at play i.e not just the absence of the expected causes&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The contents of the Confusion Matrix helps us to &lt;b&gt;identify interventions that are necessary, sufficient or both&lt;/b&gt;. This can be practically useful knowledge&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;If there are no FP cases, this &lt;b&gt;suggests&lt;/b&gt; an intervention is sufficient for the outcome to occur. The more cases we investigate , without still finding a TP, the stronger this suggestion is. But if only one FP is found, that tells us the intervention is not sufficient. Single cases can be informative. Large numbers of cases are not aways needed.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;If there are no FN cases, this&amp;nbsp;&lt;b&gt;suggests&lt;/b&gt;&amp;nbsp;an intervention is necessary for the outcome to occur. The more cases we investigate , without still finding a FN, the stronger this suggestion is. But if only one FN is found, that tells us the intervention is not necessary.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;If there are no FP or FN cases, this suggests an intervention is sufficient and necessary for the outcome to occur. The more cases we investigate, without still finding a TP or FN, the stronger this suggestion is. But if only one FP, or FN is found, that tells us that the intervention is not sufficient or not necessary, respectively.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The contents of the Confusion Matrix help us&amp;nbsp;&lt;b&gt;identify the type and scale of errors&amp;nbsp; and their acceptability&lt;/b&gt;. FP and FN cases are two different types of error that have different consequences in different contexts. A brain surgeon will be looking for an intervention that has a very low FP rate, because errors in brain surgery can be fatal, so cannot be recovered. On the other hand, a stockmarket investor is likely to be looking for a more general purpose model, with few FNs. However, it only has to be right 55% of the time to still make them money. So a high rate of FPs may not be a big&amp;nbsp; concern. They can recover their losses through further trading. In the field of humanitarian assistance the corresponding concerns are with coverage (reaching all those in need, i.e minimising False Negatives) and leakage (minimising inclusion of those not in need i.e False Positives). There are Confusion Matrix based performance measures for both kinds error and for the degree that both kinds of error are balanced (See the Wikipedia entry)&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The contents of the Confusion Matrix can help us &lt;b&gt;identify usefull case studies for comparison purposes.&lt;/b&gt; These can include&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Cases which exemplify the True Positive results&lt;/b&gt;, where the model (e.g an intervention) correctly predicted the presence of the outcome. Look within these cases to find any likely causal mechanisms connecting the intervention and outcome. Two sub-types can be useful to compare:&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Modal cases, which represent the most common characteristics seen in this group, taking all comparable attributes into account, not just those within the prediction model.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Outlier cases, which represent those which were most dissimilar to all other cases in this group, apart from having the same prediction model characteristics&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;background-color: white; box-sizing: inherit; color: #1a1a1a; font-weight: 700;&quot;&gt;Cases which exemplify the&amp;nbsp;False Positives&lt;/span&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;, where the model incorrectly predicted the presence of the outcome.There are at least two possible explanations that can be explored:&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In the False Positive cases, there are one or more&amp;nbsp;other factors that all the cases have in common, which&amp;nbsp;are&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;em style=&quot;background-color: white; box-sizing: inherit; color: #1a1a1a;&quot;&gt;blocking&lt;/em&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;the model configuration&amp;nbsp;from working i.e. delivering the outcome&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In the True Positive cases, there are one or more other factors that&amp;nbsp;all the cases have in common, which&amp;nbsp;are&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;em style=&quot;background-color: white; box-sizing: inherit; color: #1a1a1a;&quot;&gt;enabling&lt;/em&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;&amp;nbsp;the model configuration&amp;nbsp;from working i.e. delivering the outcome, but which are absent in the False Positive cases&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;&lt;span style=&quot;background-color: white;&quot;&gt;Note:&amp;nbsp;&lt;/span&gt;&lt;/span&gt;For comparisons with TPs cases, TP and FP cases should be maximally&amp;nbsp; similar in their case attributes. I think this is called&amp;nbsp;&amp;nbsp;MSDO (most similar, different outcome) based case selection&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Cases which exemplify the False Negatives&lt;/b&gt;, where the outcome occurred despite the absence the attributes of the model. There are three possibilities of interest here:&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There may&amp;nbsp;be some False Negative cases&amp;nbsp;that have all but one of&amp;nbsp;the attributes found in the prediction model. These cases would be worth examining, in order to understand why&amp;nbsp;the absence of a&amp;nbsp;particular attribute that is part of the predictive model does not prevent the outcome from occurring. There may be some counter-balancing enabling factor at work, enabling the outcome.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;It is possible that some cases have been classed as FNs because they missed specific data on crucial attributes that would have otherwise classed them as TPs.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Other cases may represent genuine alternatives, which need within-case investigation to identify the attributes that appear to make them successful&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;background-color: white; box-sizing: inherit; color: #1a1a1a; font-weight: 700;&quot;&gt;Cases which exemplify the True&amp;nbsp;Negatives&lt;/span&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;, where&amp;nbsp;the absence the attributes of the model is associated with the absence of the outcome.&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;background-color: white; color: #1a1a1a;&quot;&gt;Normally this are seen as not being of much interest. But t&lt;/span&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;here may cases here with all but one of the intervention attributes. If found then the missing attribute may be viewed as:&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;ol&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;color: #1a1a1a; font-size: medium;&quot;&gt;A necessary attribute, without which the outcome can occur&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;span style=&quot;color: #1a1a1a; font-size: medium;&quot;&gt;An INUS attribute i.e. an attribute that is Insufficient but Necessary in a configuration that is Unnecessary but Sufficient for the outcome (See Befani, 2016). It would then be worth investigating how these critical attributes have their effects by doing a detailed within-case analysis of the cases with the critical missing attribute.&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;li&gt;&lt;span style=&quot;color: #1a1a1a; font-family: verdana; font-size: medium;&quot;&gt;Cases may become TNs for two reasons. The first, and most expected, is that the causes of positive outcomes are absent. The second, which is worth investigating, is that there are additional and different causes at work which are causing the outcome to be absent. The first of these is described as causal symmetry, the second of these is described as causal asymmetry. Because of the second possibility is worthwhile paying close attention to TN cases to identify the extent to which symmetrical causes or asymmetrical causes are at work. The findings could have significant implications for any intervention that is being designed. Here a useful comparision would be&amp;nbsp; between maximally similar TP and TN cases.&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/ol&gt;&lt;/ol&gt;&lt;div&gt;&lt;span style=&quot;color: #1a1a1a; font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Resources&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;color: #1a1a1a; font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;color: #1a1a1a; font-family: verdana; font-size: medium;&quot;&gt;Some of you may know that I have built the Confusion Matrix into the design of &lt;a href=&quot;https://evalc3.net/&quot; target=&quot;_blank&quot;&gt;EvalC3&lt;/a&gt;, an Excel app for cross-case analysis, that combines measurement concepts from the disparate fields of machine learning and QCA (Qualitative Comparative Analysis). With fair winds. this should become available as a free to use web app in early 2024, courtesy of a team at Sheffield Hallam University. There you will be able to explore and exploit the uses of the Confusion Matrix for both quantative and qualitative analyses.&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/7773462035444030312/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2023/12/using-confusion-matrix-as-multi-purpose.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7773462035444030312'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7773462035444030312'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2023/12/using-confusion-matrix-as-multi-purpose.html' title='Using the Confusion Matrix as a general-purpose analytic framework'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiRxdrRk2oszhunUdpw0WMkjacOhBj74GOg7vZ_oqfdhP5DS2TVsrzHq6PXWtg200lMz-DVK3GvCbEj8RrobVTfDxxDq-6dNUNWvpGj-A4TVJutdtgpjvJAB-Ihm97xbhhcQrOLageNYVd3yVLM6rhYmL9z_uZ3Rp5QSa2AOiCOvOsoMIXjmfO0/s72-c/download%20(1).png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-7228537941923922996</id><published>2023-10-28T13:34:00.002+00:00</published><updated>2023-10-31T09:20:31.334+00:00</updated><title type='text'>Beyond summarisation by AI and/or editors- Readers can now interrogate full transcripts of meeting discussions</title><content type='html'>&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Over the last two months, a small group of us have been managing a MSC Monthly Online Gathering. In each meeting we have recorded the discussions, then generated a transcript, both using Otter.AI. Then I have used Claude AI, to generate a one-page summary of each discussion. That itself seems likely to be useful to both attendeess and non-attendees. (Though I have yet obtain feedback on this meeting output). You can view two AI summaries of discussions in the October meeting, here:&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2023/10/18th-October-MSC-AM-Rick.pdf&quot;&gt;https://mande.co.uk/wp-content/uploads/2023/10/18th-October-MSC-AM-Rick.pdf&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;https://mande.co.uk/wp-content/uploads/2023/10/18th-October-PM-Konny.pdf&quot;&gt;https://mande.co.uk/wp-content/uploads/2023/10/18th-October-PM-Konny.pdf&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;But why not jump ahead and give people more than a simple feedback opportunity. Let&#39;s enable them to question the full text of the transcript, in their own individual way, albeit after being informed about the overall topics covered during the discussion via the AI summaries above. This is now possible using a third party app known as &lt;a href=&quot;https://beta.pickaxeproject.com/&quot; target=&quot;_blank&quot;&gt;Pickaxe&lt;/a&gt;. Here you can design an AI prompt that can then be made publically usable, preloaded with a given discussion transcript.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Here is a link to the two very simple Pickaxe public prompts I have developed that you can now use to interrogate the two discussions.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px; text-align: left;&quot;&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;AM session&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;white-space-collapse: preserve;&quot;&gt;&lt;span style=&quot;color: #2b00fe;&quot;&gt;&lt;a data-saferedirecturl=&quot;https://www.google.com/url?q=https://beta.pickaxeproject.com/axe?id%3DInterrogate_the_transcript_of_a_meeting_WA76O&amp;amp;source=gmail&amp;amp;ust=1698574108686000&amp;amp;usg=AOvVaw2Sal-GBGEhP5LzFcsQSj09&quot; href=&quot;https://beta.pickaxeproject.com/axe?id=Interrogate_the_transcript_of_a_meeting_WA76O&quot; target=&quot;_blank&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;https://beta.pickaxeproject.&lt;wbr&gt;&lt;/wbr&gt;com/axe?id=Interrogate_the_&lt;wbr&gt;&lt;/wbr&gt;transcript_of_a_meeting_WA76O&lt;/span&gt;&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;white-space-collapse: preserve;&quot;&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;PM session&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #2b00fe; font-family: verdana; font-size: medium;&quot;&gt;&lt;a data-saferedirecturl=&quot;https://www.google.com/url?q=https://beta.pickaxeproject.com/axe?id%3DExplore_issues_discussed_in_the_18th_October_MSC_Monthly_Online_Gathering_PM_session_1OPYP&amp;amp;source=gmail&amp;amp;ust=1698574108686000&amp;amp;usg=AOvVaw0oAeKZ3_F6DCIAO_nlYMYY&quot; href=&quot;https://beta.pickaxeproject.com/axe?id=Explore_issues_discussed_in_the_18th_October_MSC_Monthly_Online_Gathering_PM_session_1OPYP&quot; target=&quot;_blank&quot;&gt;https://beta.pickaxeproject.&lt;wbr&gt;&lt;/wbr&gt;com/axe?id=Explore_issues_&lt;wbr&gt;&lt;/wbr&gt;discussed_in_the_18th_October_&lt;wbr&gt;&lt;/wbr&gt;MSC_Monthly_Online_Gathering_&lt;wbr&gt;&lt;/wbr&gt;PM_session_1OPYP&lt;/a&gt;&amp;nbsp;&lt;span style=&quot;white-space-collapse: preserve;&quot;&gt; &lt;/span&gt;&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;You can ask follow up questions, click on &quot;Go to Chat&quot;&lt;/span&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;If you try these out, I will get feedback, in the form of a visible record of how you used it. You could also provide feedback on this experience, using the Comment function below&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Give it a go, now...&lt;/span&gt;!&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Postscript 31 October&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I think the performance of Pickaxe on this task is poor, compared to that of Claue AI on the same task.&amp;nbsp; I will be disabling this implementation in the next day or so&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/7228537941923922996/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2023/10/beyond-summarisation-by-ai-andor.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7228537941923922996'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7228537941923922996'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2023/10/beyond-summarisation-by-ai-andor.html' title='Beyond summarisation by AI and/or editors- Readers can now interrogate full transcripts of meeting discussions'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-6910956305926827233</id><published>2023-08-31T13:30:00.247+00:00</published><updated>2023-09-12T07:35:41.527+00:00</updated><title type='text'>Evaluating thematic coding and text summarisation work done by artificial intelligence (LLM)</title><content type='html'>&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Evaluation is a core part of the workings of artificial intelligence algorithms.  It is something that can be built in, in the shape of specific segments of code.  But it is also an additional human element which needs to complement and inform the subsequent use of any outputs of artificial intelligence systems.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;If we take supervised machine learning algorithms as one of the simpler forms of artificial intelligence, all of these have a very simple basic structure.&amp;nbsp; Their operations involve &lt;b&gt;the reiteration of search followed by evaluation&lt;/b&gt;.  For example,&amp;nbsp;we have a dataset which describes a number of cases, which could be different locations where a particular development intervention is taking place.  Each of these cases have a number of attributes which we think may be useful predictors of an outcome we are interested in.  And in addition, some of those predictors (or combinations thereof) might reflect some underlying causal mechanisms which would be useful for us to know about.  The simplest form of machine learning will involve what is called an exhaustive or brute force search of each possible combination of those attributes (defined in terms of their presence or absence, in this simple example).  Taking one combination at a time, the algorithm will evaluate whether it predicted the outcome or not, and then store that judgement.  Reiterating that process, it will then compare the next judgement to this earlier judgement and replace that earlier judgement if the new one is better.  And so on until all possible combinations have been evaluated and compared to previous judgement.  In more complex machine learning algorithms involving artificial neural networks the&amp;nbsp;evaluation and feedback processes can be much more complex, but the abstract description still fits.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;What I&#39;m interested in talking about here is what happens outside the block of code that&amp;nbsp;does this type of processing.  Specifically, with the products that are produced and how we humans can evaluate its value.  This is territory where a lot of effort has already been expended, most noticeably on the subject of &lt;a href=&quot;https://link.springer.com/article/10.1007/s44176-022-00006-z&quot; target=&quot;_blank&quot;&gt;algorithmic fairness &lt;/a&gt;and&amp;nbsp;what is known as &lt;a href=&quot;https://en.wikipedia.org/wiki/AI_alignment&quot; target=&quot;_blank&quot;&gt;the alignment problem&lt;/a&gt;.&amp;nbsp; These could be crudely described as representing both short and long-term concerns respectively.&amp;nbsp; I won&#39;t be exploring that literature here, interesting and important as it is.&lt;br /&gt;&lt;br /&gt;What I will be talking about here is two examples of my own current experiments with the use of one AI application known as &lt;a href=&quot;https://claude.ai/chats&quot; target=&quot;_blank&quot;&gt;Claude AI&lt;/a&gt;, used to do some forms of qualitative data analysis.  In the field that I work in, which is largely to do with international development aid programs, a huge amount of qualitative data i.e text is generated and I think it is fair to say that its analysis is a lot more problematic than when we are dealing with many forms of quantitative data.  So the arrival of large language model&amp;nbsp;(LLM) versions of artificial intelligence appears to offer&amp;nbsp;some interesting opportunities for making some usable progress in this difficult area.&lt;br /&gt;&lt;br /&gt;The text data that I have been working with as been generated by participants in a participatory scenario planning process, carried out using &lt;a href=&quot;http://ParEvo.org&quot;&gt;ParEvo.org&lt;/a&gt;, and implemented by the International Civil Society Centre in Germany this year.  The full details of that exercise will be available soon in a ICSC publication.&amp;nbsp; The exercise generated a branching tree structure of storylines about the future, built with 109&amp;nbsp;paragraphs of text, contributed by 15 participants, over eight iterations.What I will be describing here concerns two types of analysis of that text data.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Text summarisation&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;[this section has been redrafted] The first was a text summarisation task, where I asked Claude AI to produce one sentence headline summaries of each of these 109 texts. Text summarisation is a very common application of LLMs. This it did quickly, as usual, and the results looked plausible. But&amp;nbsp;but&amp;nbsp;by&amp;nbsp;now I had&amp;nbsp;also learned to be appropriately&amp;nbsp;sceptical&amp;nbsp;and was asking myself&amp;nbsp;how &#39;accurate&#39;&amp;nbsp;these headlines were. I could examine each headline and its associated text, but this would take time. So I tried another approach.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I opened up&amp;nbsp;a new prompt window in Claude AI and uploaded 2 files. One containing the headlines, and the other containing each of the 109 texts preceded by an identification number.  I then asked Claude AI to match each headline with the text that it best described, and to display the results using the ID number of the text (rather than its full&amp;nbsp;contents) and the predicted associated headline. This process has some similarities with &lt;a href=&quot;https://www.languagescientific.com/the-what-and-why-of-back-translation-and-reconciliation/&quot; target=&quot;_blank&quot;&gt;back translation.&lt;/a&gt;  What I was interested in here was how well it could reassign the headlines to their original texts.&amp;nbsp;&amp;nbsp;If it did well this would give me some confidence in the accuracy of its analytic processes, and might obviate the need for a manual check up of the headlines&#39; fit with content.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;My first attempt was a clear failure, with classification accuracy of 21%, being far worse than chance. On examination this was caused by the way I had formated the uploaded data. The second attempt, using two separated data files,&amp;nbsp;was more successful This time the classification accuracy was 63%. Given that the 27% error could occur at two stages (headline creation and headline matching) it could be argued that the classification error was more like half this value i.e 13.5% and so the classication accuracy was more like 76.5%. At this point it seemed worthwhile to also examine the misclassifications ( a back translation stage called reconciliation) - what headline was mismatched with what headline.&amp;nbsp; An examination of the false classifications suggested that around 40% of the mismatches may have been because of words they had in common, despote the full headline being different.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Where does that leave me? With some confidence in the headline generation process, but could we do better?&amp;nbsp; Could we find a better way to generate reproducable headlines...See further below where I talk about ensemble methods.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Content analysis&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The second task&amp;nbsp;was a type of content analysis. Because of a specific interest, I had separated&amp;nbsp;a subset of the hundred nine paragraphs into two groups, the first of which had been the subject to further narrative development by the participants (aka surviving storylines), and the second being others which were not developed any further (aka extinct storylines). I asked Claude AI to analyse the subset of the texts in terms of three attributes: the vocabulary, the style of writing, and the genre. Then for each attribute, to sort the texts into two groups, and describe what each group had in common and how they differed from the other group. It then did so. Here is an image of its output.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjIfhhW0WBpFsHLtWq6i_nIizjOUu11p6a2g86k5Z83SpKn0ZjmRbMJNclFRWPpcJuKKNfpIcrQkUKlOzAm1h580e8e_DdN6TLYfE9cvJsJd4VnPC5jfjQ6ognlj9P4dghAI_h-93n1nRIhFgrifziPh_INXu0dybqUMpum99S7GQJW1rQWFtZd/s582/example%20output.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;420&quot; data-original-width=&quot;582&quot; height=&quot;420&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjIfhhW0WBpFsHLtWq6i_nIizjOUu11p6a2g86k5Z83SpKn0ZjmRbMJNclFRWPpcJuKKNfpIcrQkUKlOzAm1h580e8e_DdN6TLYfE9cvJsJd4VnPC5jfjQ6ognlj9P4dghAI_h-93n1nRIhFgrifziPh_INXu0dybqUMpum99S7GQJW1rQWFtZd/w581-h420/example%20output.png&quot; width=&quot;581&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;But how can I evaluate this output? If I looked at one of the texts in a particular group would I find the attributes that Claude AI was telling me that the group it belonged to possessed?&amp;nbsp; In order to make this form of verification easier, and smaller in scale,&amp;nbsp; I gave Claude AI a follow-up task: for each of the two groups under each of the three attributes of the text Claude AI should provide the ID number of an &lt;i&gt;exemplar &lt;/i&gt;body of text which best represented the presence of the characteristics that were described.&amp;nbsp; This it was able to do, and in my first use of&amp;nbsp;the specific case examples I found that 9/10 did fit the summary description provided for the group.  This strategy is similar to another one which I&#39;ve used with GPT4,&amp;nbsp;when trying to extract specific information about evaluation methods used in a set of evaluation reports.  There I have asked it to provide page or paragraph references for any claim about what methods are being used in the evaluation.  Broadly speaking, in&amp;nbsp;a large majority of cases, these page references pointed to relevant sections of text.&amp;nbsp;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;My second strategy was another version of back translation, connecting concrete instances with pre-existing abstract descriptions. This time I opened a new prompt session, still within Claude AI, and uploaded a file containing the same subset of paragraphs, and then in the prompt window I copy and pasted the description of the attributes of the three sets of two groups identified earlier (without information on which text belnged to which group).&amp;nbsp; I then asked Claude AI to identify which paragraphs of text fitted which of the 3 x 2 groups, which it did.&amp;nbsp; I then collated the results of the two tasks in an Excel file, which you can see here below (click on image to magnify it). The green cells are where the predicted group matches the original group, and the yellow cells are where there were mismatches.&amp;nbsp; The overall classification accuracy was 67%, whch is better than chance but not great either. I should also add that this was done with prompt information that included the IDs of the exemplars mentioned above (a format called &quot;one-shot learning&quot;)&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiphfuRAnXCrWgRhfm12DWVsNRKlBfygppyU0C0TsYHXKUx96kSSmToAfIFefUblwLAY4QXEY6Dznhk1_gXLENnzqsv1J5WduAMKK4759-Xs3vGJwLl1RAdIcAzIwnIb4cfLwc8BzOLSMFSDlU5mVOx-zDSy-PWrM_4Av7V20sDYALIQ04W5tF/s755/evalauting%20thematic%20coding.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;343&quot; data-original-width=&quot;755&quot; height=&quot;281&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiphfuRAnXCrWgRhfm12DWVsNRKlBfygppyU0C0TsYHXKUx96kSSmToAfIFefUblwLAY4QXEY6Dznhk1_gXLENnzqsv1J5WduAMKK4759-Xs3vGJwLl1RAdIcAzIwnIb4cfLwc8BzOLSMFSDlU5mVOx-zDSy-PWrM_4Av7V20sDYALIQ04W5tF/w622-h281/evalauting%20thematic%20coding.png&quot; width=&quot;622&quot; /&gt;&lt;/span&gt;&lt;/a&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;What was I evaluating when I was doing these &quot;reverse translations&quot;? It could probably be described as a test of, or search for, some form of construct validity. Was there any stable concept involved?&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Ensemble methods&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Given the two results reported above, which were better than chance, but not much better, what else could be done? There is one possible way forward, which might give us more confidence in the products generated by LLM analyses.&amp;nbsp; Both Claude AI and ChatGPT4, and probably others, allow users to hit a&amp;nbsp;Retry button,&amp;nbsp;to generate another response to the same&amp;nbsp;prompt. These will usually vary, and the degree of variation can be controlled by a parameter known as &quot;temperature&quot;.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;An &lt;a href=&quot;https://en.wikipedia.org/wiki/Ensemble_learning&quot; target=&quot;_blank&quot;&gt;ensemble approach&lt;/a&gt; in this context would be to generate multiple responses using the same prompt and then use some type of aggregation process to find the best result.  Similar to &#39;wisdom of crowds&quot;&amp;nbsp;processes.&amp;nbsp; In its simplest form this would, for example, involve counting the number of times each different headlines were proposed for the same item of text, and selecting one with the highest count. This approach will work where you have predefined categories as &quot;targets&quot;. Those categories could have been developed inductively (as above) or deductively, from prior theory. It may even be possible to design a prompt script that include multiple genetration steps, and even the aggregation and evaluation stages.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;But to begin with i will probably focus on testing a manual version of the process. I will report on some experiments with this approach in the next few days....&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Update 02/09/23: A yet to be tested draft prompt that could automate the process&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjNjzEiAScLKqVbG5v6215-O8ZfksEcIcubc-rk3z-YwIq3XQiJEuQ4u_PorxaCDh8ebUN-fyfoU5DZXKHuTbN-hCCvmt0E2haitnGr8v7YmmZ_070jhZdCX0dXE4IPk2lQSKaLuk00-LRecDALCKfAKVEOh3nD5rRRQxqMSAjFbBqRKEIRywxL/s680/draft%20prompt.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;658&quot; data-original-width=&quot;680&quot; height=&quot;490&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjNjzEiAScLKqVbG5v6215-O8ZfksEcIcubc-rk3z-YwIq3XQiJEuQ4u_PorxaCDh8ebUN-fyfoU5DZXKHuTbN-hCCvmt0E2haitnGr8v7YmmZ_070jhZdCX0dXE4IPk2lQSKaLuk00-LRecDALCKfAKVEOh3nD5rRRQxqMSAjFbBqRKEIRywxL/w505-h490/draft%20prompt.png&quot; width=&quot;505&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A lesson learned on the way: I initially wrote out a rough draft of a Claude AI prompt that might help automate the process I&#39;ve described above.&amp;nbsp; I then ask Claude AI to convert this into a prompt which would be understood and generate reliable and interpretable results.&amp;nbsp; When it did this it was clear that part of my intentions had not been understood correctly (however you interpret the word understood).&amp;nbsp; This could be just an epiphenomenon, in the sense of it only being generated by this particular enquiry.&amp;nbsp; Or, it could point to a deeper or more structurally embedded analytic risk that would have consequences if I actually ask Claude AI to implement the rough draft in its original form (as distinct from simply refine that text as a prompt).&amp;nbsp; The latter possibility concerned me, so I edited the prompt text that had been revised by Claude AI to remove the misunderstood part of the process. The version you see above is Claude AIs interpretation of my revised version, which I think will now work. Lets see,,,!&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;color: red;&quot;&gt;Update 03/09/23&lt;/span&gt;: It looks like the ensemble method may work as expected. Using 10 iterations only, which is a small number compared to how they are normally used, the classication accuracy increased to 84%. In the data displayed about numbers of time each predicted headline was matched to a given text there were 4 instances where there were ties. There were also 8 instances where the best match was still only found in less than 5 of the 10 iterations. More iterations might generate more definitive best matches and increase the accuracy rate. The correct match was already visible in the second and third ranking best matches of 4 of the 18 incorrectly matches headlines.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Another lesson learned, perhaps: Careful wording of prompts is important, the more explicit the instructions are the better. I learned to preface the word &quot;match&quot; with a more specific &quot;analyze the content of all the numbered texts in File 1 and identify which one the headline best describes&quot; . And careful formating of the text data files was also potentially important. making it clear where each text began and ended and removing any formating artifacts that could cause confusion.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;And because of experiences with such sensitivities, I think i should re-do the whole analysis, to see if I generate the same or similar results!!!&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Ensembles of brittle prompts?&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I just came across this glimpse of a paper &quot;&lt;i&gt;&lt;a href=&quot;https://towardsdatascience.com/prompt-ensembles-make-llms-more-reliable-ae57ec35b5f7&quot; target=&quot;_blank&quot;&gt;Prompt Ensembles Make LLMs More Reliable&lt;/a&gt;&lt;/i&gt;&quot; which is a different version of the idea I explored above. Here the prompt that is in use is also varied, from iteration to iteration.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Ensembles of brittel prompts?&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/6910956305926827233/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2023/08/evaluating-thematic-coding-and-text.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6910956305926827233'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6910956305926827233'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2023/08/evaluating-thematic-coding-and-text.html' title='Evaluating thematic coding and text summarisation work done by artificial intelligence (LLM)'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjIfhhW0WBpFsHLtWq6i_nIizjOUu11p6a2g86k5Z83SpKn0ZjmRbMJNclFRWPpcJuKKNfpIcrQkUKlOzAm1h580e8e_DdN6TLYfE9cvJsJd4VnPC5jfjQ6ognlj9P4dghAI_h-93n1nRIhFgrifziPh_INXu0dybqUMpum99S7GQJW1rQWFtZd/s72-w581-h420-c/example%20output.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-4315222800374575634</id><published>2023-05-26T10:56:00.005+00:00</published><updated>2025-01-22T02:28:27.161+00:00</updated><title type='text'>Finding useful distinctions between different futures</title><content type='html'>&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This blog posting is a response to &lt;a href=&quot;https://thevoroscope.com/2017/02/24/the-futures-cone-use-and-history/&quot; target=&quot;_blank&quot;&gt;Joseph Voros&#39;s informative blog posting about the Futures Cone&lt;/a&gt;. It is a useful contribution in as much as it helps us think about the future in terms of different sets of possibilities.&amp;nbsp;Here is a copy of his edited version.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjU1RFwxJJSKp0ayxkvJ0sQpc4bbCXl-UqBEX2UpcO_4KcBkGC50FFkvYv5-72NvFlbHyHiXgACq1AqPugUnPaj6_np-1U7vOQ62hxuaAxD2ZfApLFbmg9-pfAF4gQ5HZ86B4FxFUtFRTm8_VPoi4AEDqwBznO-cOMKCntJ6eQWMHvbu9y9Fw/s844/futures%20cone.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;572&quot; data-original-width=&quot;844&quot; height=&quot;413&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjU1RFwxJJSKp0ayxkvJ0sQpc4bbCXl-UqBEX2UpcO_4KcBkGC50FFkvYv5-72NvFlbHyHiXgACq1AqPugUnPaj6_np-1U7vOQ62hxuaAxD2ZfApLFbmg9-pfAF4gQ5HZ86B4FxFUtFRTm8_VPoi4AEDqwBznO-cOMKCntJ6eQWMHvbu9y9Fw/w609-h413/futures%20cone.png&quot; width=&quot;609&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Figure 1: Voros, 2017&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;My alternative, shown below, was developed in the context of supporting &lt;a href=&quot;http://ParEvo.org&quot;&gt;ParEvo.org&lt;/a&gt; explorations of alternative futures. It has some similarities and differences. For a start, here is the diagram.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgawta_3EvqYFD8CRaudhDVHe3ItO9aUCkFZyobWTbUrOF8RFgmU50rqNZaG_pX9ih_06se0ZoOYnrNwzS27ZjofhobZ0dF1aHxUZeCfOGVV7OPJlXeUAe0cYV93bkBwvmviKdZiGl7URuJkaC1uKnBy34H1QDD0kvhK0geaJVK0EZVTdAiEw/s1232/Framework%205.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;687&quot; data-original-width=&quot;1232&quot; height=&quot;353&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgawta_3EvqYFD8CRaudhDVHe3ItO9aUCkFZyobWTbUrOF8RFgmU50rqNZaG_pX9ih_06se0ZoOYnrNwzS27ZjofhobZ0dF1aHxUZeCfOGVV7OPJlXeUAe0cYV93bkBwvmviKdZiGl7URuJkaC1uKnBy34H1QDD0kvhK0geaJVK0EZVTdAiEw/w634-h353/Framework%205.png&quot; width=&quot;634&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Figure 2: Sets and sub-sets of alternative futures Davies, 2023&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&lt;br /&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;I will now list Joseph&#39;s explanation of each of the terms he used, and how they might relate to mine (in red)&lt;/span&gt;&lt;/span&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li style=&quot;box-sizing: inherit;&quot;&gt;&lt;span style=&quot;box-sizing: inherit; color: #1a1a1a; font-weight: 700;&quot;&gt;Possible&lt;/span&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;&amp;nbsp;&lt;i&gt;– these are those futures that we think ‘might’ happen, based on some future knowledge we do not yet possess, but which we might possess someday (e.g., warp drive).&lt;/i&gt; &lt;/span&gt;&lt;span style=&quot;color: red;&quot;&gt;I think these fall in the grey area above (which also contain the dark and light green).&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;box-sizing: inherit;&quot;&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;&lt;span style=&quot;box-sizing: inherit; font-weight: 700;&quot;&gt;Plausible&lt;/span&gt;&amp;nbsp;–&lt;i&gt; those we think ‘could’ happen based on our current understanding of how the world works (physical laws, social processes, etc).&lt;/i&gt;&lt;/span&gt;&lt;span style=&quot;color: red;&quot;&gt;I think these fall somewhere within the green matrix&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;box-sizing: inherit;&quot;&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;&lt;span style=&quot;box-sizing: inherit; font-weight: 700;&quot;&gt;Probable&lt;/span&gt;&amp;nbsp;– &lt;i&gt;those we think are ‘likely to’ happen, usually based on (in many cases, quantitative) current trends. &lt;/i&gt;&lt;/span&gt;&lt;span style=&quot;color: red;&quot;&gt;These probably fall within the Likely row of the green matrix&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;box-sizing: inherit;&quot;&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;&lt;span style=&quot;box-sizing: inherit; font-weight: 700;&quot;&gt;Preferable&lt;/span&gt;&amp;nbsp;– &lt;i&gt;those we think ‘should’ or ‘ought to’ happen: normative value judgements as opposed to the mostly cognitive, above. There is also of course the associated converse class—the&amp;nbsp;&lt;span style=&quot;box-sizing: inherit; font-weight: 700;&quot;&gt;&lt;span style=&quot;box-sizing: inherit;&quot;&gt;un&lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;box-sizing: inherit;&quot;&gt;-preferred&lt;/span&gt;&amp;nbsp;futures—a ‘shadow’ form of&amp;nbsp;&lt;span style=&quot;box-sizing: inherit;&quot;&gt;anti&lt;/span&gt;-normative futures that we think should&amp;nbsp;&lt;span style=&quot;box-sizing: inherit;&quot;&gt;not&lt;/span&gt;&amp;nbsp;happen nor ever be allowed to happen (e.g., global climate change scenarios comes to mind).&lt;/i&gt;&lt;/span&gt;&lt;span style=&quot;color: red;&quot;&gt;These probably fall within the Desirable column of the green matrix&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;box-sizing: inherit;&quot;&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;&lt;span style=&quot;box-sizing: inherit; font-weight: 700;&quot;&gt;Projected&lt;/span&gt;&amp;nbsp;–&lt;i&gt; the (singular) default,&amp;nbsp;&lt;span style=&quot;box-sizing: inherit;&quot;&gt;business as usual&lt;/span&gt;, ‘baseline’, extrapolated ‘continuation of the past through the present’ future. This single future could also be considered as being ‘the most probable’ of the Probable futures.&lt;/i&gt; &lt;/span&gt;&lt;span style=&quot;color: red;&quot;&gt;As suggested above, probably at the most likely end of the Likely row in the above green matrix&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;box-sizing: inherit;&quot;&gt;&lt;span style=&quot;color: #1a1a1a;&quot;&gt;(&lt;span style=&quot;box-sizing: inherit; font-weight: 700;&quot;&gt;Predicted&lt;/span&gt;) –&amp;nbsp;&lt;em style=&quot;box-sizing: inherit;&quot;&gt;the&lt;/em&gt;&lt;i&gt;&amp;nbsp;future that someone claims ‘&lt;span style=&quot;box-sizing: inherit;&quot;&gt;will&lt;/span&gt;’ happen. I briefly toyed with using this category for a few years quite some time ago now, but I ended up not using it anymore because it tends to cloud the openness to possibilities (or, more usefully, the ‘preposter-abilities’!) that using the full Futures Cone is intended to engender. &lt;/i&gt;&lt;/span&gt;&lt;span style=&quot;color: red;&quot;&gt;Probably also at the most likely end of the Likely row in the above green matrix&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/span&gt;&lt;/div&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Preposterious&lt;/b&gt; events &lt;span style=&quot;color: red;&quot;&gt;are not really covered. Perhaps they are at the extreme end of the Unlikely events with known probabilities i.e zero likelihood.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span style=&quot;color: red; font-size: medium;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Though lacking in alliteration my schema does have some more practically useful features&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The primary additional feature is that for each different kind of future there are some conjectured consequences in terms of likely appropriate responses. Some of these are shown red:&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Organisational &quot;slack&quot; i.e. uncommitted resources or reserves that could enable responses to the unforeseen (though, of course,&amp;nbsp; not every kind of unforseen event)&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Fringe investments, such as blue sky research, can be appropriate where a possibility is in sight but its likelihood of happening is far from clear&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Robust responses are those that might work, though not necessarily be the most effective or most efficient, across a span of possibilities having varying probabilities and desirabilities&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Customised responses are those more tailored to specific combinations of un/likely and un/desirable events. The following more detailed version of the green martix describes some major possible variations of this kind&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg_k97PLbc5PiMnnav8zeCqGFXyQQwU8XvK-ltAWIat1SM4FqsoAB_Qw4wPMqxbFu0aL8zlTHuhvwppjAi8tRxriJyJCKIya_nMzVnKI5yrV7nduq1fEd-IBLbJZHKAxlABpJ7EeggIxMjhc07x4_iRAZrc3GUvbbQJWW_iHtEhUzR53BPI-g/s838/Desirability%20x%20likelihood%20matrix.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;398&quot; data-original-width=&quot;838&quot; height=&quot;261&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg_k97PLbc5PiMnnav8zeCqGFXyQQwU8XvK-ltAWIat1SM4FqsoAB_Qw4wPMqxbFu0aL8zlTHuhvwppjAi8tRxriJyJCKIya_nMzVnKI5yrV7nduq1fEd-IBLbJZHKAxlABpJ7EeggIxMjhc07x4_iRAZrc3GUvbbQJWW_iHtEhUzR53BPI-g/w550-h261/Desirability%20x%20likelihood%20matrix.png&quot; width=&quot;550&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Figure 3&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Where to next?&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I would like to hear from readers their views on the possible utility of these distinctions. And whether any other distinctions could be added to or replace those I have used.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;h3 style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Postscript 2025 01 22&lt;/span&gt;&lt;/h3&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I came across this useful matrix view in &lt;br /&gt;Luís, A., Garnett, K., Pollard, S. J. T., Lickorish, F., Jude, S., &amp;amp; Leinster, P. (2021). Fusing strategic risk and futures methods to inform long-term strategic planning: Case of water utilities. Environment Systems &amp;amp; Decisions, 41(4), 523–540. https://doi.org/10.1007/s10669-021-09815-1&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhOasM-tEOwYGDwxpr0jr0xg4pzLohogptavIoBpMTlMFgFxu_rSI7yZlsTo3w993AUTmPTZhQb7KSziTDLWNiUv7iKeT3p6sM1-OEMXLsqVjOFPFLJkOWZuR2nEOFj_SQMPEC3ArnHuB6XUAmVzJCIGXFg-Or5nrR8PaxgM0Fgq636OeE5WIsb/s708/level%20of%20contro.png&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;482&quot; data-original-width=&quot;708&quot; height=&quot;324&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhOasM-tEOwYGDwxpr0jr0xg4pzLohogptavIoBpMTlMFgFxu_rSI7yZlsTo3w993AUTmPTZhQb7KSziTDLWNiUv7iKeT3p6sM1-OEMXLsqVjOFPFLJkOWZuR2nEOFj_SQMPEC3ArnHuB6XUAmVzJCIGXFg-Or5nrR8PaxgM0Fgq636OeE5WIsb/w475-h324/level%20of%20contro.png&quot; width=&quot;475&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/4315222800374575634/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2023/05/finding-useful-distinctions-between.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/4315222800374575634'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/4315222800374575634'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2023/05/finding-useful-distinctions-between.html' title='Finding useful distinctions between different futures'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjU1RFwxJJSKp0ayxkvJ0sQpc4bbCXl-UqBEX2UpcO_4KcBkGC50FFkvYv5-72NvFlbHyHiXgACq1AqPugUnPaj6_np-1U7vOQ62hxuaAxD2ZfApLFbmg9-pfAF4gQ5HZ86B4FxFUtFRTm8_VPoi4AEDqwBznO-cOMKCntJ6eQWMHvbu9y9Fw/s72-w609-h413-c/futures%20cone.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-3192800117599116898</id><published>2023-03-06T12:02:00.004+00:00</published><updated>2023-03-06T12:03:46.824+00:00</updated><title type='text'>How can evaluators practically think about multiple Theories of Change in a particular context?</title><content type='html'>&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;This blog posting is been prompted by&amp;nbsp;participation in two recent events.  One was some work I was doing with the ICRC, reviewing Terms of Reference for an evaluation.&amp;nbsp; The other was listening in as a participant to this week&#39;s European Investment Bank conference titled &quot;&lt;i&gt;Picking up the pace: Evaluation in a rapidly changing world&lt;/i&gt;&quot;.&amp;nbsp;&lt;/p&gt;&lt;p&gt;When I was reviewing some Terms of Reference for an evaluation I noticed a gap which I have seen many times before.  While there was a reasonable discussion of the types of information that would need to be gathered there was a conspicuous absence of any discussion of how that data would be analysed.  My feedback included the suggestion that the Terms of Reference needed to ask the evaluation team for a description of the &lt;i&gt;analytical framework&lt;/i&gt; they would use to analyse the data they were collecting.&lt;br /&gt;&lt;br /&gt;The first two sessions of this week&#39;s EIB conference were on the subject of foresight and evaluation.  In other words how evaluators can think more creatively and usefully about&amp;nbsp; possible futures – a subject of considerable interest to me.  You might notice that I&#39;ve referred to&amp;nbsp;futures rather than the future, intentionally emphasising the fact that there may be many different kinds of futures, and with some exceptions (e.g. climate change) is not easy to identify which of these will actually eventuate.&lt;br /&gt;&lt;br /&gt;To be honest, I wasn&#39;t too impressed with the ideas that came up in this morning&#39;s discussion about how evaluators could pay more attention to the plurality of possible futures.  On the other hand, I did feel some sympathy for the panel members who were put on the spot to answer some quite difficult questions on this topic.&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Benefiting from the luxury of more time to think about this topic, I would like to make a suggestion that might be practically usable by evaluators, and worth considering by commissioners of evaluations. The suggestion is how an evaluation team could realistically give attention not just to a single &quot;official&quot;&amp;nbsp; Theory Of Change about an intervention, but to multiple relevant Theories Of Change about an intervention and&amp;nbsp;its expected outcomes.  In doing so I hope to address both issues I have raised above: (a) the need for an evaluation team to have a conceptual framework structuring how it will analyse the data it collects, and (b) the need to think about more than one possible future and how that might be realised i.e. more than one Theory of Change.&lt;br /&gt;&lt;br /&gt;The core idea is to make use of something which I have discussed many times previously in this blog, known as the &lt;a href=&quot;https://en.wikipedia.org/wiki/Confusion_matrix&quot; target=&quot;_blank&quot;&gt;Confusion Matrix &lt;/a&gt;– to those involved in machine learning, and more generally described simply as a truth table - one that describes four types of possibilities.&amp;nbsp;It takes the following form:&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcgaD0hfqrSCKTPlh6jd0feQ46CYm9uxzmSAJDG4trLCYES4D5KBWR7eF0I4Nqd81_Zu7pxpaKXOEdtoRVYEK7qx9R2Zyy2oL1qr-RxAfoQRjJgEsHPo7eEqR52hmM9pJOcxAjHMlAa0CKpqFh6egD7FSQbyji9eDdgutShv2pYTVahKoi3Q/s847/confusion%20matrix%202.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;204&quot; data-original-width=&quot;847&quot; height=&quot;144&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcgaD0hfqrSCKTPlh6jd0feQ46CYm9uxzmSAJDG4trLCYES4D5KBWR7eF0I4Nqd81_Zu7pxpaKXOEdtoRVYEK7qx9R2Zyy2oL1qr-RxAfoQRjJgEsHPo7eEqR52hmM9pJOcxAjHMlAa0CKpqFh6egD7FSQbyji9eDdgutShv2pYTVahKoi3Q/w601-h144/confusion%20matrix%202.png&quot; width=&quot;601&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;In the field of machine learning the main interest in the Confusion Matrix is the associated performance measures that can be generated, and used to analyse and assess the performance of different predictive models.&amp;nbsp; While these are of interest, what I want to talk about here is how we can use the same framework to think about different types of theories, as distinct from different types of observed results.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;There are four different types of Theories of Change that can be seen in the Confusion Matrix.  The first (1) describes&amp;nbsp;what is happening when intervention is present and the expected outcome of that intervention is present.  This is the familiar territory of the kind of Theories of Change that an evaluator will be asked to examine.&lt;br /&gt;&lt;br /&gt;The second (2) describes&amp;nbsp;what is happening when intervention is present and the expected outcome of that intervention is absent.  This theory would describe what additional conditions are present, or what expected conditions are absent, which will make a difference – leading to the expected outcome being absent.&amp;nbsp; When it comes to analysing data on what actually happened identifying these conditions can lead to modification of the first (1) Theory of Change&amp;nbsp;such that it becomes a better predictor of the outcome and there are fewer False Positives (found in cell 2).  Ideally the less False Positives the better.  But from a theory development point of view there should always be some situations described in cell 2 because there will never be an all-encompassing theory that works everywhere.  There will always be boundary conditions beyond which the theory is not expected to work.  So an important part of an evaluation is not just to refine the theory about what works (1)&amp;nbsp;but also to refine the theory of the circumstances in which it will not be expected to work&amp;nbsp; (2),&amp;nbsp; sometimes known as conditions or boundary conditions.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The third theory (3) describes what is happening when the intervention is absent but nevertheless the&amp;nbsp;outcome is present.  Consideration of this possibility involves recognition of what is known as &quot;multi-finality&quot; i.e. that some events can arise from multiple alternative causal conditions (or combinations of&amp;nbsp; causal conditions).&amp;nbsp; It&#39;s not uncommon to find advice to evaluators that they should consider alternative theories to those they are currently focused on.  For example in the literature on contribution analysis.  But it strikes me that this is often close to&amp;nbsp;a ritualistic requirement, or at least treated that way in practice.  In this perspective alternative theories are a potential threat to the theory being focused on (1).  But a much more useful&amp;nbsp;&lt;span style=&quot;text-align: center;&quot;&gt;perspective&lt;/span&gt; would be to treat these alternative theories&amp;nbsp;as potentially useful other courses of an action that an agent could take, which warrant serious attention in their own right.  And if they are shown to have some validity this does not by definition mean that the main theory of change (1) is wrong.  It&#39; simply means that there are alternative ways of achieving the outcome, which can only be a bonus finding.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The fourth theory describes what is happening when intervention is absent and the outcome is also absent (4).&amp;nbsp; In its simplest interpretation, it may be that the actual absence of the attributes of the intervention is the reason why the outcome is not present.  But this can&#39;t be assumed.  There may be other factors which have been more important causes.  For example the presence of an earthquake,&amp;nbsp;or the holding of a very contested election.  This possibility is captured by the term&amp;nbsp;&quot;asymmetric&amp;nbsp;causality&quot; i.e. that the causes of something not happening may not simply be the absence of the causes of something happening.  Knowing about these&amp;nbsp;other possible causes of desired outcome not happening is surely important, in addition to and alongside knowing about how an intervention does cause the outcome.  Knowing more about these causes might help other parties with other interventions in mind move cases with this experience from being True Negatives (4) to being False Negatives (3)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In summary, I think there is an argument for evaluators not being too myopic when they are thinking about Theories of Change they need to pay attention to.&amp;nbsp; It should not be all about testing the&amp;nbsp;first (1) type of Theory of Change, and considering all the other possibility is simply as challengers,&amp;nbsp;which may or may not&amp;nbsp;then be dismissed&amp;nbsp; Each of those other types of theories (2-3-4)&amp;nbsp;are important and useful in their own right and deserve attention.&lt;br /&gt;&lt;br /&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/3192800117599116898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2023/03/how-can-evaluators-practically-think.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/3192800117599116898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/3192800117599116898'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2023/03/how-can-evaluators-practically-think.html' title='How can evaluators practically think about multiple Theories of Change in a particular context?'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcgaD0hfqrSCKTPlh6jd0feQ46CYm9uxzmSAJDG4trLCYES4D5KBWR7eF0I4Nqd81_Zu7pxpaKXOEdtoRVYEK7qx9R2Zyy2oL1qr-RxAfoQRjJgEsHPo7eEqR52hmM9pJOcxAjHMlAa0CKpqFh6egD7FSQbyji9eDdgutShv2pYTVahKoi3Q/s72-w601-h144-c/confusion%20matrix%202.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-5167672469555319600</id><published>2022-10-18T10:58:00.001+00:00</published><updated>2022-10-18T10:59:34.526+00:00</updated><title type='text'>Four types of futures that should be covered by a Theories of Change</title><content type='html'>&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;a href=&quot;http://ParEvo.org&quot;&gt;ParEvo.org&lt;/a&gt; is a web app that enables the collaborative exploration of alternative futures, online. In the evaluation stage, participants are asked to identify which of the surviving storylines fall into each of these categories:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Most desirable&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Least desirable&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Most likely&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Least likely&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In one part of the analysis of storylines&amp;nbsp;generated during a ParEvo exercise&amp;nbsp;the storylines are plotted on scatter plot, where the two dimensions are likelihood and desirability, as seen in this example&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhqjJFPCEYROwK9LLVdTdFVQp4IcT41V2NW93pMSIGA_jrCqJMEVzUQ9ER5mf4ZVIkbNrvZto1NkAlRwqDKbScWIRrskVcrtJCbbHmWpt6Sc2-VSz9-AaB1MgT4wtEAUtSCOOHMU86c0t9dgs7piBt2FmWzgXcae8Z6HXiEx6aKDAR3hExlVg/s516/desirability%20&amp;amp;%20likelihood.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;465&quot; data-original-width=&quot;516&quot; height=&quot;485&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhqjJFPCEYROwK9LLVdTdFVQp4IcT41V2NW93pMSIGA_jrCqJMEVzUQ9ER5mf4ZVIkbNrvZto1NkAlRwqDKbScWIRrskVcrtJCbbHmWpt6Sc2-VSz9-AaB1MgT4wtEAUtSCOOHMU86c0t9dgs7piBt2FmWzgXcae8Z6HXiEx6aKDAR3hExlVg/w538-h485/desirability%20&amp;amp;%20likelihood.png&quot; width=&quot;538&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Most Theories of Change that I have come across, when working as an evaluator, focus on a future that is seen as desirable and likely (as in expected). At best, the undesirable futures will be mentioned in an accompanying section on risks and their management.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A less myopic approach might be useful, one which would orient the users of the Theory of Change&amp;nbsp;to a more adaptive&amp;nbsp;stance towards the future.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;One way forward would be to think of a four-part Theory of Change, each of which has different implications. as follows&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi6kyza0qwwgLPjIlYpAqo08CLjJwZqbzwySOOOsANt4GrfN3ku6fyqC14ibNLjwqxdiAWoBdWvIIzGLtF94lzP3qK6b5LqoOlPMt90dYB6O4_vJr9XtkdfER1FtxtVs9vXD65vzPgGRKGM1zw24xRT16qnvIgSWZZWmImLEJ_oEJWEYeHHYw/s638/4%20futures.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;172&quot; data-original-width=&quot;638&quot; height=&quot;184&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi6kyza0qwwgLPjIlYpAqo08CLjJwZqbzwySOOOsANt4GrfN3ku6fyqC14ibNLjwqxdiAWoBdWvIIzGLtF94lzP3qK6b5LqoOlPMt90dYB6O4_vJr9XtkdfER1FtxtVs9vXD65vzPgGRKGM1zw24xRT16qnvIgSWZZWmImLEJ_oEJWEYeHHYw/w685-h184/4%20futures.png&quot; width=&quot;685&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The top right cell may already be covered by a Theory of Change. In the desirable but unlikely, and undesirable but likely two cells it would be useful to have ordered lists that describe events, what needs to be done before they happen, and what needs to be done after they happen. In the unlikely and undesirable&amp;nbsp;cell plans for monitoring the status of these events need to be spelled out, and updated on an ongoing basis&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;div&gt;&lt;/div&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/5167672469555319600/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2022/10/four-types-of-futures-that-should-be.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/5167672469555319600'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/5167672469555319600'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2022/10/four-types-of-futures-that-should-be.html' title='Four types of futures that should be covered by a Theories of Change'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhqjJFPCEYROwK9LLVdTdFVQp4IcT41V2NW93pMSIGA_jrCqJMEVzUQ9ER5mf4ZVIkbNrvZto1NkAlRwqDKbScWIRrskVcrtJCbbHmWpt6Sc2-VSz9-AaB1MgT4wtEAUtSCOOHMU86c0t9dgs7piBt2FmWzgXcae8Z6HXiEx6aKDAR3hExlVg/s72-w538-h485-c/desirability%20&amp;%20likelihood.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-778178067769976186</id><published>2022-10-13T15:30:00.016+00:00</published><updated>2022-10-18T10:22:46.661+00:00</updated><title type='text'>We need more doubt and uncertainty!</title><content type='html'>&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This week the &lt;a href=&quot;https://svuf.nu/konferens-2022/&quot; target=&quot;_blank&quot;&gt;Swedish Evaluation Society (SVUK)&lt;/a&gt;&amp;nbsp; is holding its annual conference.  I took part in a session today on Theories of Change.  The first part of my presentation summarised the points I made in a 2018 CEDIL Inception Report titled &#39;&lt;a href=&quot;https://richardjdavies.files.wordpress.com/2018/11/2018-08-31-inception-paper-no-15-mande-news-pdf-copy-2.pdf&quot; target=&quot;_blank&quot;&gt;Theories of Change: Technical Challenges with Evaluation Consequences&lt;/a&gt;&#39;.  Following the presentation I was asked by&amp;nbsp;&lt;span style=&quot;background-color: white;&quot;&gt;Gustav Petersson, the discus&lt;/span&gt;&lt;span style=&quot;background-color: white;&quot;&gt;sant, w&lt;/span&gt;hether we should pay more attention to the &lt;i&gt;process &lt;/i&gt;of generating diagrammatic Theories of Change.  I could only agree, reflecting that for example it was not uncommon that a representative of a conference working group might summarise a very comprehensive and in-depth discussion in all too brief and succinct terms when reporting back to a plenary.  Leaving out, or understating, the uncertainties , ambiguities and disagreements.  Similarly the completed version of a diagrammatic Theory of Change is likely to suffer from the same limitations ...&amp;nbsp;being an overly simplified version of a much more complex and nuanced discussions between those involved in its construction that went on beforehand.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Later in the day I was reminded of &lt;a href=&quot;http://www.earthstar.co.uk/deep1.htm&quot; target=&quot;_blank&quot;&gt;this section in the Hitchhiker&#39;s Guide to the Galaxy&lt;/a&gt; where Vroomfondel, representing a group of striking philosophers said &#39;&quot;&lt;i&gt;That&#39;s right!&lt;/i&gt;&quot; and shouted , &quot;&lt;i&gt;we demand rigidly defined areas of doubt and uncertainty!&lt;/i&gt;&quot;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I&#39;m inclined to make a similar kind of request of those developing Theories of Change.&amp;nbsp; And of those subsequently charged with assessing the evaluability of the associated intervention, including its Theory of Change. What I mean is that the description of the Theory of Change should make it clear which various parts of the theory the owner(s) of that theory are more confident in verses less confident. Along with descriptions of the nature of the doubt or uncertainty and its causes e.g. first-hand experience, or supporting evidence (or lack of) from other sources.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Those undertaking an evaluability assessment could go a step further and convert various specific forms of doubt and uncertainty into evaluation questions that could form an important part of the Terms of Reference for an evaluation.&amp;nbsp; This might go some way to remedying another problem discussed during the session, which is the all too common (in my experience) phenomena of Terms of Reference only making generic references to an intervention&#39;s Theory of Change.  For example, by asking in broad terms about &quot;what works and in what circumstances&quot;.  Rather than the testing of various specific parts of that theory,&amp;nbsp;which would arguably&amp;nbsp;be more useful, and better use of limited time and resources.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;The bottom line&lt;/b&gt;: The articulation of a Theory of Change should conclude with a list of important evaluation questions. Unless there are good reasons to the contrary, those questions should then appear in the Terms of Reference for a subsequent evaluation&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjRjpkJtllQ2z9XDqqj5sdpYKxGWr3QKM1u0v7LF71iPhI8kqVm63d7d1AgfnObtWSzJyAvxokJMXXnt03TQ-7iV9BAfLNlO0OAN9wdFnDX7C8xHFdp1J4ff9lJHLCcu_IrHT_rTpOHkFNUwhnzpVQjVUgzdizgIjAkV9Uy_1qvTX9QhQtRPw/s640/vroom.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;480&quot; data-original-width=&quot;640&quot; height=&quot;240&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjRjpkJtllQ2z9XDqqj5sdpYKxGWr3QKM1u0v7LF71iPhI8kqVm63d7d1AgfnObtWSzJyAvxokJMXXnt03TQ-7iV9BAfLNlO0OAN9wdFnDX7C8xHFdp1J4ff9lJHLCcu_IrHT_rTpOHkFNUwhnzpVQjVUgzdizgIjAkV9Uy_1qvTX9QhQtRPw/s320/vroom.png&quot; width=&quot;320&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;PS:&amp;nbsp;Vroomfondel is a philosopher. He appears in chapter 25 of The Hitchhiker&#39;s Guide to the Galaxy, along with his collegue Majikthise, as a representative of the Amalgamated Union of Philosophers, Sages, Luminaries and Other Thinking Persons (AUPSLOTP; the BBC TV version inserts &#39;Professional&#39; before &#39;Thinking&#39;). The Union is protesting about Deep Thought, the computer which is being asked to determine the Answer to the Ultimate Question of Life, the Universe and Everything. See&amp;nbsp;&lt;a href=&quot;https://hitchhikers.fandom.com/wiki/Vroomfondel&quot;&gt;https://hitchhikers.fandom.com/wiki/Vroomfondel&lt;/a&gt;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/778178067769976186/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2022/10/we-need-more-ambiguity-and-uncertainty.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/778178067769976186'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/778178067769976186'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2022/10/we-need-more-ambiguity-and-uncertainty.html' title='We need more doubt and uncertainty!'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjRjpkJtllQ2z9XDqqj5sdpYKxGWr3QKM1u0v7LF71iPhI8kqVm63d7d1AgfnObtWSzJyAvxokJMXXnt03TQ-7iV9BAfLNlO0OAN9wdFnDX7C8xHFdp1J4ff9lJHLCcu_IrHT_rTpOHkFNUwhnzpVQjVUgzdizgIjAkV9Uy_1qvTX9QhQtRPw/s72-c/vroom.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-2469023829967379080</id><published>2022-06-30T14:34:00.010+00:00</published><updated>2022-09-27T14:00:52.444+00:00</updated><title type='text'>Using ParEvo to conduct thought experiments</title><content type='html'>&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I have just had an interesting conversation with an NGO network who have been developing some criteria to: (a)&amp;nbsp; help speed up the approval and release of funding in humanitarian emergencies, but (b) at same time minimising risk of poor use of those funds.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;They think these criteria are useful but are not entirely sure whether those seeking funding will agree.&amp;nbsp; So they are exploring ways of testing out their applicability through a wider consultation process.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;One way doing this, which we have been discussing, involves the use of &lt;a href=&quot;http://ParEvo.org&quot;&gt;ParEvo.org&lt;/a&gt;. The plan is that a group of participants representing potential grantees will develop a set of storylines which starts off with a particular organisation seeking funding for a particular humanitarian emergency. Then a branching structure of possible subsequent storyline developments will be articulated through the usual ParEvo process&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;After those storylines been developed there will be an evaluation phase, as is common practice now with most ParEvo exercises.&amp;nbsp; At this point the participants will be asked two generic types of questions ( and variations on these), as described below:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;1.&amp;nbsp; Which of the criteria in the current framework would be most likely to help avoid or mitigate the problems seen in storyline X? (Answer=Description &amp;amp; Explanation)&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;and if the answer is none, are there any other criteria that could be included in the framework that might have helped?&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;2.&amp;nbsp; Which of the storylines in the current exercise would have most benefited by criteria X in the current framework, in the sense of problems described there would have been avoided or mitigated. (Answer=Description &amp;amp; Explanation)&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;and if the answer is none, does this suggest that the criteria is irrelevant and could be removed?&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Postscript: One&amp;nbsp;interesting thing about this type of &lt;a href=&quot;https://en.wikipedia.org/wiki/Thought_experiment&quot; target=&quot;_blank&quot;&gt;thought experiment&lt;/a&gt; is that the theory (the proposed funding criteria) and the possible realities that they may be applied to (where the theories may or may not work there as expected) are constructed by different parties who are independent from each other.&amp;nbsp; This is not usually the case with thought experiments,&amp;nbsp;and could be seen as a positive variation.&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Stay tuned for if and when this idea flies, then soars or crashes&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://imgs.xkcd.com/comics/the_difference.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;740&quot; data-original-width=&quot;393&quot; height=&quot;740&quot; src=&quot;https://imgs.xkcd.com/comics/the_difference.png&quot; width=&quot;393&quot; /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;Courtesy&lt;a href=&quot;https://xkcd.com/&quot; target=&quot;_blank&quot;&gt; https://xkcd.com/&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;For more on thought experiments, see &lt;a href=&quot;https://aeon.co/essays/do-thought-experiments-really-uncover-new-scientific-truths&quot; target=&quot;_blank&quot;&gt;Armchair science&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both;&quot;&gt;&lt;a href=&quot;https://aeon.co/essays/do-thought-experiments-really-uncover-new-scientific-truths&quot; target=&quot;_blank&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Thought experiments played a crucial role in the history of science. But do they tell us anything about the real world?&lt;/span&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/2469023829967379080/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2022/06/using-parevo-to-conduct-thought.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2469023829967379080'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2469023829967379080'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2022/06/using-parevo-to-conduct-thought.html' title='Using ParEvo to conduct thought experiments'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-2702566015712788899</id><published>2022-06-17T17:12:00.006+00:00</published><updated>2022-06-17T21:40:28.953+00:00</updated><title type='text'>Alternative futures as &quot;search strategies&quot;</title><content type='html'>&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiW1NWTFNjAz0w4X-dvb-kMJNWMlnC-iC8K8iYFKe89B18udrMPeK-uuW4dHGtD5EtD_NjHQCBTXG9ZUJhdvc9wwu4K9ZtLpbrnlVsTX1YO8g-oRNnW6qGKJdE46wm9Kv3RL992uT3op3JPMi-gNv-VHvmoSmmfHKrsYkxf0EKA6AiBg5cowQ/s348/search%20strategy.jpg&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;145&quot; data-original-width=&quot;348&quot; height=&quot;266&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiW1NWTFNjAz0w4X-dvb-kMJNWMlnC-iC8K8iYFKe89B18udrMPeK-uuW4dHGtD5EtD_NjHQCBTXG9ZUJhdvc9wwu4K9ZtLpbrnlVsTX1YO8g-oRNnW6qGKJdE46wm9Kv3RL992uT3op3JPMi-gNv-VHvmoSmmfHKrsYkxf0EKA6AiBg5cowQ/w640-h266/search%20strategy.jpg&quot; width=&quot;640&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When you read the phrase &quot;search strategy&#39; this may bring to mind what you need when you are doing a literature search on the Internet.&amp;nbsp; Or you may be thinking about different forms of supervised machine learning, which involve different types of search strategies.&amp;nbsp; For example in my Excel-based &lt;a href=&quot;https://evalc3.net/how-it-works/search-options/&quot; target=&quot;_blank&quot;&gt;EvalC3&lt;/a&gt; prediction modelling app there are four different search strategies that users can choose from, to help find the most accurate predictive model describing what combinations of attributes are the best predictor of a particular outcome.&amp;nbsp; Or you may have heard of James March, an organisational theorist who in 1981 wrote a paper called &#39;&lt;a href=&quot;https://www.sciencedirect.com/science/article/abs/pii/0167268181900123&quot; target=&quot;_blank&quot;&gt;A model of adaptive organizational search &lt;/a&gt;&#39; where he talks about how organisations find the right new technologies to develop and explore.This is probably the closest thing to the type of search process that I&#39;m describing below.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Right now I am in the process of helping some other consultants design a &lt;a href=&quot;https://parevo.org&quot; target=&quot;_blank&quot;&gt;ParEvo exercise&lt;/a&gt;, in which recipients of research grants from the same foundation will collaboratively develop a number of alternative storylines describing how their efforts to ensure the uptake and use of the research findings takes place (and sometimes fails to take place) over the coming three years.&amp;nbsp; Because these are descriptions of possible futures they are inherently a form of fiction.&amp;nbsp; But please note they are not an attempt at &quot;predicting&quot; fiction.&amp;nbsp; Rather, they are more like a form of &#39;preparedness enabling &#39; fiction.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;As part of the planning process for this exercise we have had to articulate our expectations of what will come out of this exercise, in terms of possible desirable benefits for both the participants and the foundation.&amp;nbsp; In other words the beginnings of a Theory of Change, which needs to be supplemented by details of how the exercise will be best be run in this particular instance, and thus hopefully deliver these results.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When thinking about reasonable expectations for this exercise I came up with the following possibilities, which are now under discussion:&lt;/span&gt;&lt;/p&gt;&lt;div style=&quot;background-color: white; color: #222222;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;1&amp;nbsp;Participants will hear&amp;nbsp;different interpretations and views of&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white; color: #222222;&quot;&gt;&lt;ol&gt;&lt;li style=&quot;margin-left: 15px;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;What&amp;nbsp;&lt;b&gt;other participants mean when they use the term &quot;research uptake &#39;&amp;nbsp;&lt;/b&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li style=&quot;margin-left: 15px;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;What&amp;nbsp;&lt;b&gt;successful, and unsuccessful, research uptake&lt;/b&gt;&amp;nbsp;looks like in its various forms, to&amp;nbsp;various participants&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li style=&quot;margin-left: 15px;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;How the&amp;nbsp;&lt;b&gt;process of research uptake can be facilitated,&amp;nbsp;and inhibited&lt;/b&gt;, by a range of factors – some within researchers control and some beyond their control.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;2.&amp;nbsp; This experience may then i&lt;b&gt;nform how each of the participants proceed with their own work&lt;/b&gt;&amp;nbsp;on facilitating research uptake&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white; color: #222222;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white; color: #222222;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;3. The storylines that are generated by the end of the exercise will provide the participants and the XXXX trust with a&amp;nbsp;&lt;b&gt;flexible set of expectations&lt;/b&gt;&amp;nbsp;against which actual progress with research uptake can be compared at a later date.&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white; color: #222222;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;So, my current thinking is that what we have here is a description of &lt;b&gt;a particular kind of search strategy where both the objectives worth pursuing, and the means of achieving them, are both being explored at the same time&lt;/b&gt;, at least within the ParEvo exercise.&amp;nbsp; Though other things will also be happening after the exercise, hopefully involving some use of the ideas generated during exercise (see possibility 2)&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;There is also another facet of the idea of search strategies which needs to be mentioned here.&amp;nbsp; When search is used in a machine learning context it is always accompanied by an evaluation function which determines whether the search continues or comes to a stop because the best possibility has now been identified (a stopping rule, I think is the term involved).&amp;nbsp; So, in the three possibilities listed above the last one describes the possibility of an evaluation function.&amp;nbsp; Exactly how it will work needs more thinking, but I think it will be along the lines of asking participants in the prior exercise to identify the extent to which their experience in the interim period has fitted any of the storylines that were developed earlier, and in what ways it has and has not, and why so in both cases.&amp;nbsp; Stay tuned...&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white; color: #222222;&quot;&gt;&lt;i&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white; color: #222222; font-family: Arial, Helvetica, sans-serif; font-size: small;&quot;&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/div&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/2702566015712788899/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2022/06/alternative-futures-as-search-strategies.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2702566015712788899'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2702566015712788899'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2022/06/alternative-futures-as-search-strategies.html' title='Alternative futures as &quot;search strategies&quot;'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiW1NWTFNjAz0w4X-dvb-kMJNWMlnC-iC8K8iYFKe89B18udrMPeK-uuW4dHGtD5EtD_NjHQCBTXG9ZUJhdvc9wwu4K9ZtLpbrnlVsTX1YO8g-oRNnW6qGKJdE46wm9Kv3RL992uT3op3JPMi-gNv-VHvmoSmmfHKrsYkxf0EKA6AiBg5cowQ/s72-w640-h266-c/search%20strategy.jpg" height="72" width="72"/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-7511706271909663785</id><published>2022-04-28T11:02:00.009+00:00</published><updated>2022-04-28T14:51:25.301+00:00</updated><title type='text'>Budgets as theories</title><content type='html'>&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh-jXA-7gCGogVFxGICYWoNhuWqWjZFwr27_xDHNT-clg7XasxpGNOCRMUJcKnJhiBz9QyCoIXnKkv7jiMd_CY9Z8IkwNEYj75Uh4Jb7EreSR0wo2SzY2oCR44q-JPSwVo2gDbl6kkwGruIhOk-suOdT6TKLUhMtR_fDL2ilGa3ck0bH-Gxzw/s274/images%20(1).jpg&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;184&quot; data-original-width=&quot;274&quot; height=&quot;269&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh-jXA-7gCGogVFxGICYWoNhuWqWjZFwr27_xDHNT-clg7XasxpGNOCRMUJcKnJhiBz9QyCoIXnKkv7jiMd_CY9Z8IkwNEYj75Uh4Jb7EreSR0wo2SzY2oCR44q-JPSwVo2gDbl6kkwGruIhOk-suOdT6TKLUhMtR_fDL2ilGa3ck0bH-Gxzw/w400-h269/images%20(1).jpg&quot; width=&quot;400&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A government has a new climate policy. It outlines how climate investments will be spread through a number of different ministries, and implemented by those ministries using a range of modalities.  Some funding will be channelled to various multilateral organisations. Some will be spent directly by the ministries. Some will be channelled on to the private-sector.  At some stage in the future this government wants to evaluate the impact of this climate policy. But before then it is been suggested that an evaluability assessment might be useful, to ask if how and when such an evaluation might be feasible.&lt;/span&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This could be a challenge to those with the task of undertaking the evaluability assessment.  And even for those planning the Terms of Reference for that evaluability assessment.  The climate policy is not yet finalised.  And if the history of most government policy statements (that I have seen) has any lessons it is that you can&#39;t expect to see a very clearly articulated Theory of Change of the kind that you might expect to find in&amp;nbsp;the design of a particular aid programme.&lt;br /&gt;&lt;br /&gt;My provisional suggestion at this stage is that the evaluability assessment should treat the government&#39;s budget, particularly those parts involving funding of climate investments, as a theory of what is intended.  And to treat the actual flows of funding that subsequently occur as the implementation of that theory.&amp;nbsp;&amp;nbsp;My naïve understanding of the budget is that it consists of categories of funding, along with subcategories and sub- subcategories, et cetera.  In other words a type of tree structure involving a nested series of choices about where more versus less funds should go.&amp;nbsp; So, the first task of an evaluability assessment would be to map out the theory i.e. the intentions as captured by budget statements at different levels of detail, moving from national to ministerial and then to small units thereafter.  And to comment on the adequacy of these descriptions and and gaps that need to be addressed.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This exercise on its own will not be sufficient as an explication of the climate policy theory because it will not tell&amp;nbsp;us how these different flows of funding are expected to do their work.  One option would be to follow each flow down to its &#39;final recipient&#39;,&amp;nbsp;if such a thing can actually be identified.  But that would be a lot of work and probably leave us with a huge diversity of detailed mechanisms.  Alternatively, one might do this on sampling basis, but how would appropriate samples be selected?&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;There is an alternative which could be seen as a necessity that could then be complemented by a sampling process.  This would involve examining each binary choice, starting from the very top of the budget structure and asking &#39;key informants&quot;&amp;nbsp;questions about why climate funding was present&amp;nbsp;in one category but not the other, or more in one category than the other.&amp;nbsp; This question on its own might have limited value because budgeting decisions are likely to have a complex and often muddy history, and the responses received might have a substantial element of &#39;constructed rationality&#39;&amp;nbsp;.  Nevertheless the answers could provide some useful context.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A more useful follow-up question would be to then ask the same informants about their &lt;i&gt;expectations of differences in performance&lt;/i&gt; of the amount of climate financing via category X versus category Y.&amp;nbsp; Followed by a question about how &lt;i&gt;they expect to hear about&lt;/i&gt; the achievement of that performance, if at all. &amp;nbsp;Followed by a question about what they would &lt;i&gt;most like to know&lt;/i&gt; about performance in this area.  Here performance could be seen in terms of the continuum of behaviours, ranging from simple delivery of the amount of funds as originally planned, to their complete expenditure,&amp;nbsp;followed by some form of reporting on outputs and outcomes, and maybe even some form of evaluation, reporting some form of changes.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;These three follow-up questions would address three facets of an evaluability assessments (EA): a) The ToC - about expected&amp;nbsp; changes, b) Data availability , c) Stakeholder interests.&amp;nbsp;&amp;nbsp;Questions would involve two types of comparisons:&amp;nbsp;funding versus no funding, and more versus less funding. The fourth EA question, about the surrounding&amp;nbsp;institutional context, typically asks about the factors that may enable and/or limit an evaluation of what actually happened&amp;nbsp;(more on &lt;a href=&quot;https://www.entwicklung.at/fileadmin/user_upload/Dokumente/Evaluierung/GL_for_Evaluability_Assessments.pdf&quot; target=&quot;_blank&quot;&gt;evaluability assessments here&lt;/a&gt;).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&amp;nbsp;There will of course be complications in this sort of approach..&amp;nbsp;Budget documents will not simply be a nested series of binary choices, at each level their work may be multiple categories available rather than just two.&amp;nbsp; However informants could be asked to identify &#39;&lt;a href=&quot;https://mande.co.uk/special-issues/hierarchical-card-sorting-hcs/&quot; target=&quot;_blank&quot;&gt;the most significant difference&lt;/a&gt; &#39;between all these categories, in effect introducing an intermediary binary category.  There could also be a great number of different levels to the budget documents, with each new level in effect doubling the number of choices and associated questions that need to be asked. Prioritisation of enquiries would be needed, possibly based on a &#39;follow the (biggest amount of) money &#39;principle.&amp;nbsp; It is also possible that quite a few informants will have limited ideas or information about the binary comparisons they are asked about.&amp;nbsp; A wider selection of informants might help fill that gap.&amp;nbsp; Finally there is the question of how to &#39;validate&quot; the views expressed about expected differences in performance,&amp;nbsp;availability of performance information and relevant questions about performance.&amp;nbsp; Validation might take the form of a survey of a wider constituency of stakeholders within the organisation of interest, of the views expressed by the informants.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;PS: Re this comment in the third para above: &quot;And to treat the actual flows of funding that subsequently occur as the implementation of that theory&quot;&amp;nbsp; One challenge the EA team might find is that while it may have accessed to detailed budget documents, in many places it may not yet be clear where funds have been tagged as climate finance spending. That itself would be an important EA finding.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;To be continued...&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/7511706271909663785/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2022/04/budgets-as-theories.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7511706271909663785'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/7511706271909663785'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2022/04/budgets-as-theories.html' title='Budgets as theories'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh-jXA-7gCGogVFxGICYWoNhuWqWjZFwr27_xDHNT-clg7XasxpGNOCRMUJcKnJhiBz9QyCoIXnKkv7jiMd_CY9Z8IkwNEYj75Uh4Jb7EreSR0wo2SzY2oCR44q-JPSwVo2gDbl6kkwGruIhOk-suOdT6TKLUhMtR_fDL2ilGa3ck0bH-Gxzw/s72-w400-h269-c/images%20(1).jpg" height="72" width="72"/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-6225096118089063141</id><published>2022-04-24T16:05:00.022+00:00</published><updated>2022-04-29T14:13:16.500+00:00</updated><title type='text'>Making small samples of large populations useful</title><content type='html'>&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I was recently contacted by someone who is working for a consulting firm that has a contract to evaluate the implementation of a large-scale health program covering a huge number of countries.&amp;nbsp; Their client had questioned their choice of 6 countries as case studies.&amp;nbsp; They were encouraging the consultancy firm to expand the number of country case studies, apparently because they thought this would make this sample of country cases more representative of the population of countries as a whole.&amp;nbsp; However, the consulting firm wasn&#39;t planning to aggregate results of the six country case studies and then make a claim about generalisability of findings across the whole population of countries.&amp;nbsp; Quite the opposite, the intention was that each country case study would provide a detailed perspective on one or more particular issues that was well exemplified by that case.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In our discussions, I ended up suggesting a strategy that might satisfy both parties in that it addressed to some extent the question of generalisable knowledge at the same time was designed to exploit the particularities of individual country cases.&amp;nbsp; My suggestion was relatively simple, although implementing might take a bit of work making use of whatever data is available on the full population of countries.&amp;nbsp; The suggestion was that for each individual case study the first step in the process would be to identify and explain the interesting particularities of that case, within the context of the evaluation&#39;s objectives.&amp;nbsp; &amp;nbsp; Then the evaluation team would look through whatever data is available on the whole population of countries, with the aim of identifying a sub-set of other countries that had similar characteristics (perhaps both generic {political, socio-economic indicators} and issue specific) with the case study country. These would then be assumed to be the countries where the case study findings and recommendations could be most relevant.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;A shown in the diagram below, it is possible that the sub-set of countries relevant to each case study county might overlap to some extent. Even when one case study country is examined it is possible that it might have more than one particularity of interest, each of whose analysis might be usefully generalised to a limited number of other countries. And those different sub-sets of countries may themselves overlap to some extent (not shown below).&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;table align=&quot;center&quot; cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;tr-caption-container&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style=&quot;text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhnnj-928C82jWv5aoo1IHAzMtq201fXxuw9k4c8V58ELTbdLaiZ5TXHiliZ3WNx3Wux4h-Hcb_X_wWHxfqp5rFRcJIV1BChnIgVHKzQQVNzQRppzU8ZpSaoWTDX3zGW6GDs3JDaepR23bwZnnkKPwSxJhsrXugvTc3qpZTmYjHhwMC-sIUgA/s829/limited%20representativeness.png&quot; style=&quot;margin-left: auto; margin-right: auto;&quot;&gt;&lt;span style=&quot;color: black;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;787&quot; data-original-width=&quot;829&quot; height=&quot;380&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhnnj-928C82jWv5aoo1IHAzMtq201fXxuw9k4c8V58ELTbdLaiZ5TXHiliZ3WNx3Wux4h-Hcb_X_wWHxfqp5rFRcJIV1BChnIgVHKzQQVNzQRppzU8ZpSaoWTDX3zGW6GDs3JDaepR23bwZnnkKPwSxJhsrXugvTc3qpZTmYjHhwMC-sIUgA/w400-h380/limited%20representativeness.png&quot; width=&quot;400&quot; /&gt;&lt;/span&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class=&quot;tr-caption&quot; style=&quot;text-align: center;&quot;&gt;&lt;br /&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Green nodes = case study countries&lt;br /&gt;Red nodes = remainder of the whole population&amp;nbsp;&lt;br /&gt;Red nodes connected to green nodes = countries that might find green node country case study findings relevant&lt;br /&gt;Unconnected red nodes = Parts of whole population where case study findings not expected to have any relevance&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Another possibility, perhaps seen as unadvisable in normal circumstances, would be to identify the relevant countries to any case study analysis after the fact, not necessarily or only before.&amp;nbsp; After the case study had actually been carried out there would be much more information available on the relevant particularities of the case study country that might make it easier to identify which other countries these finding were most relevant to.&amp;nbsp;However the client of the evaluation might need to be given some reassurance in advance. For example, by ensuring that at least some of these (red node) countries were identified at the beginning, before the case studies were underway.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;PS: It is possible to quantify the nature of this kind of sampling. For example, in the above diagram&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Total number of cases =&amp;nbsp; 37&amp;nbsp; (red and green).&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Case study cases = 5 (14%)&amp;nbsp; of all cases&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Relevant-to-case-study cases = 17 (46%) of all cases&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Relevant-to-&amp;gt;1-case-study cases = 3 (8%) of all cases&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Not-relevant-to-case-study* cases = 15 (40%) of all cases&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;div style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;*Bear in mind also that in many evaluations case studies will not be the only means of inquiry. For example, there are probably internal and external data sets that can be gathered and analysed re the whole set of 37 countries.&lt;/span&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Conclusion&lt;/b&gt;: We should not be thinking in terms of binary options. It is not true that either&amp;nbsp; a case is part of a representative sample of a whole population, or it is representative and of interest only to itself. It can be relevant to a sub-set of the population.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/6225096118089063141/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2022/04/making-small-samples-of-large.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6225096118089063141'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/6225096118089063141'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2022/04/making-small-samples-of-large.html' title='Making small samples of large populations useful'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhnnj-928C82jWv5aoo1IHAzMtq201fXxuw9k4c8V58ELTbdLaiZ5TXHiliZ3WNx3Wux4h-Hcb_X_wWHxfqp5rFRcJIV1BChnIgVHKzQQVNzQRppzU8ZpSaoWTDX3zGW6GDs3JDaepR23bwZnnkKPwSxJhsrXugvTc3qpZTmYjHhwMC-sIUgA/s72-w400-h380-c/limited%20representativeness.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-8297098833519733181</id><published>2021-11-25T11:00:00.001+00:00</published><updated>2021-11-27T22:45:24.990+00:00</updated><title type='text'>Choosing between simpler and more complex versions of a Theory of Change</title><content type='html'>&lt;p&gt;&lt;span style=&quot;background-color: white;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Background: Over the last few months I have been involved as a member of the Evaluation Task Force, convened by the &lt;a href=&quot;https://www.apf.org/page/Evaluation&quot; target=&quot;_blank&quot;&gt;Association of&amp;nbsp; Professional&amp;nbsp;Futurists.&lt;/a&gt; Futurists being people who explore alternative&amp;nbsp;futures using various foresight and scenario&amp;nbsp;planning methods. The intention is to help strengthen the evaluation&amp;nbsp;capacity of those doing this kind of work&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;One part of this work will involve the development of various forms of introductory materials&amp;nbsp;and guidelines&amp;nbsp;documents. These will inevitably&amp;nbsp;include discussion of the use of Theories&amp;nbsp;of Change, and questions about appropriate&amp;nbsp;levels&amp;nbsp;of detail and complexity&amp;nbsp;that they should involve.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In my dialogues with other Task Force members I have recently&amp;nbsp; made the following&amp;nbsp;comments, which may be of wider interest:&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;As already noted, a ToC can take&amp;nbsp;various forms, from very simple linear versions to very complex network versions.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;I have a hypothesis that may be useful when we are developing&amp;nbsp;guidance&amp;nbsp;on use of ToC by futurists. In fact I have two hypotheses:&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;H1: A simple&amp;nbsp;linear ToC is more likely to be appropriate when dealing with outcomes that are closest in time to a given foresight&amp;nbsp;activity of interest. Outcomes that are more distant&amp;nbsp;in time, happening long after the foresight&amp;nbsp;activity has finished, would be better represented&amp;nbsp;in a ToC that took a more complex network (i.e. systems map type) form&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;Why so?: As time passes after a foresight activity, more and more other forces, or various kinds, are likely to come into play and influence the longer term outcome of interest. As a proportion of all influences, the foresight&amp;nbsp;activity will grow progressively smaller and smaller. A type of ToC that takes into account this widening set of influences would seem essential&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;H2: This need for progressively&amp;nbsp;more complex ToC, as the outcome&amp;nbsp;of interest is located further away&amp;nbsp;in time, can be moderated by a second&amp;nbsp;variable, which is the social distance between those involved in the foresight activity and those involved in the outcome&amp;nbsp;of interest . [Social distance is measured in social network analysis (SNA) terms by units known as&amp;nbsp; &quot;degree&quot;, i.e, the number of person-to-person linkages needed for information to flow between one person and another]. So, if the outcome is a change in the functioning&amp;nbsp;of the same organization that the foresight exercise participants they themselves&amp;nbsp;belong to, this distance will be short, relative to an outcome relating to another organisation altogether - where there may be few if any direct&amp;nbsp;links between the exercise participants and staff of that organisation&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;The implications of these two perspectives could be graphically represented&amp;nbsp;in a scatter&amp;nbsp;plot or two-by-two matrix e.g.&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;color: #0b5394; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhS7Y8GK2dVRhhIzEDskQ3KnQb6c6eSeaYmWwGh7Z0rsSHEsHfxT3uPt1Jv5i3-cle6vjAmRhaSWhWXJG1p9o41GbV4ecRY6R92o3avRZVkPky7FlV4-8T_t6bYD0qhRhdYgxrZ/s610/time+and+social+distance.png&quot; imageanchor=&quot;1&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;464&quot; data-original-width=&quot;610&quot; height=&quot;450&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhS7Y8GK2dVRhhIzEDskQ3KnQb6c6eSeaYmWwGh7Z0rsSHEsHfxT3uPt1Jv5i3-cle6vjAmRhaSWhWXJG1p9o41GbV4ecRY6R92o3avRZVkPky7FlV4-8T_t6bYD0qhRhdYgxrZ/w591-h450/time+and+social+distance.png&quot; width=&quot;591&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;div style=&quot;background-color: white; color: #222222; font-family: Arial, Helvetica, sans-serif; font-size: small; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;On reflection, this view probably needs some more&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;background-color: transparent;&quot;&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;articulation. Social distance will probably not be present&amp;nbsp;in the form of a single pathway through a network of actors. Especially given that any foresight activity will typically&amp;nbsp;involve multiple participants, each with their own access&amp;nbsp;to relevant networks. So there may be a third relevant dimension here to think about, which is the diversity&amp;nbsp;of the participants. Greater diversity&amp;nbsp;being plausibly associated with a greater&amp;nbsp;range of social (and causal) pathways to the outcome of interest. And thus the need for more complex representations&amp;nbsp;of the Theory of Change.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;background-color: transparent;&quot;&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style=&quot;background-color: white;&quot;&gt;&lt;span style=&quot;background-color: transparent;&quot;&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgEoPpTIZ1VPgve0PZ2BtOYR2psMGUvXpkPyO8vGEoLOQY2cQrI6B2JZA3o-Ko5eUyj7cNNt26-66o4w82UkKlCScKUZ05VYNi5BitBDGu4x2lsPoyjwI8NQvwAlY0P_2mQKSdL/s895/cube.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;518&quot; data-original-width=&quot;895&quot; height=&quot;354&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgEoPpTIZ1VPgve0PZ2BtOYR2psMGUvXpkPyO8vGEoLOQY2cQrI6B2JZA3o-Ko5eUyj7cNNt26-66o4w82UkKlCScKUZ05VYNi5BitBDGu4x2lsPoyjwI8NQvwAlY0P_2mQKSdL/w612-h354/cube.png&quot; width=&quot;612&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style=&quot;color: #222222; font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/8297098833519733181/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2021/11/choosing-between-simpler-and-more.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/8297098833519733181'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/8297098833519733181'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2021/11/choosing-between-simpler-and-more.html' title='Choosing between simpler and more complex versions of a Theory of Change'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhS7Y8GK2dVRhhIzEDskQ3KnQb6c6eSeaYmWwGh7Z0rsSHEsHfxT3uPt1Jv5i3-cle6vjAmRhaSWhWXJG1p9o41GbV4ecRY6R92o3avRZVkPky7FlV4-8T_t6bYD0qhRhdYgxrZ/s72-w591-h450-c/time+and+social+distance.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-1324633703275855295</id><published>2021-11-01T17:57:00.041+00:00</published><updated>2021-11-09T15:39:08.927+00:00</updated><title type='text'>Exploring counterfactual histories of an intervention</title><content type='html'>&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Background&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I and others&amp;nbsp; are providing technical advisory support to the evaluation of a large complex multilateral health intervention, one which is still underway. The intervention has multiple parts implemented by different partners and the surrounding context is changing. The intervention design is being adapted as time moves on. Probably not a unique situation.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;The proposal&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;As &lt;i&gt;one &lt;/i&gt;of a number of parts of a multi-year evaluation process I &lt;i&gt;might&amp;nbsp;&lt;/i&gt;suggest the following:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;1. A timeline is developed describing what the partners in the intervention see as key moments in its history, defined as where decisions have been taken to change , stop or continue, a particular course(s) of action.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;2. Take each of those moments as a kind of case study, which the evaluation team then elaborates in detail: (a) the options that were discussed, and others not, (d) the rationales for choosing for and against those discussed options at the time, (d) the current merit of those assessments, as seen in the light of subsequent events. [See more details below]&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;The objective?&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;To identify (a) how well the intervention has responded to changing circumstances and (b) any lessons that might be relevant to the future of the intervention, or generalisable to other similar intervention.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span&gt;This seems like a form of (contemporary and micro-level) historical research, investigating actual versus possible causal pathways. It seems different from a Theory of Change (ToC) based evaluation, where the focus is on&amp;nbsp;&lt;/span&gt;what was&amp;nbsp;&lt;i&gt;expected&lt;/i&gt;&amp;nbsp;to happen and then what&amp;nbsp;&lt;i&gt;did&lt;/i&gt;&amp;nbsp;happen. Whereas with this proposed historical research&amp;nbsp;in to decisions&amp;nbsp;taken the primary reference point is what &lt;i&gt;did&lt;/i&gt; happen, then what &lt;i&gt;could&lt;/i&gt; have happened.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;It also seems different from what I understand is a process tracing form of inquiry where, I think, the focus is on particular hypothesised causal pathway. Not the consideration of multiple alternative possible pathways, as would be the case within each of a series of decision making case studies proposed here. There may be a breadth rather than depth of inquiry difference here.[Though I may be over-emphasising the difference here, ...I am just reading &lt;a href=&quot;https://doi.org/10.1080/09636412.2015.1036610&quot; target=&quot;_blank&quot;&gt;Mahoney, 2013&lt;/a&gt; on use of process tracing in historical research]&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The multiple possible alternatives that could have been chosen are the counterfactuals I am referring to here.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;The challenges?&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;As Henry Ford allegedly said &quot;History is just one damn&amp;nbsp;thing after another&quot; There are way too many events in most interventions where alternative histories could have taken off in a different direction. For example, at a normally trivial level, someone might have missed their train. So to be practical but also systematic and transparent the process of inquiry would need to focus on specific types of events, involving particular kinds of actors. Preferably where decisions were made about courses of action. Such as Board Meetings. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;And in such individual settings how wide should the should the evaluation focus be? For example, only on where decisions were made to change something, or also where decisions were made to continue doing something? And what about the absence of decisions being even considered, when they might have been expected to be considered. That is, decisions about decisions.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Reading &lt;a href=&quot;https://www.jstor.org/stable/10.20851/j.ctt1t304sf.12?seq=1#metadata_info_tab_contents&quot; target=&quot;_blank&quot;&gt;some of the literature&lt;/a&gt; about counter-factual history, written by historians, there is clearly a risk of developing historical counterfactuals that stray too far from what is known to have happened, in terms of imagined consequences of consequences, etc. In response, some historians talk about the need to limit inquiries to&quot;constrained counterfactuals&quot; and the use of a &quot;Minimal Rewrite Rule&quot;. [I will find out more about these]&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Perhaps another way forward is to talk about counter-factual reasoning, rather than counterfactual history (&lt;a href=&quot;http://www.technologystories.org/counterfactual-history-and-the-history-of-technology/&quot; target=&quot;_blank&quot;&gt;Schatzberg, 2014&lt;/a&gt;) . This seems to be more like what the proposed line of inquiry might be all about i.e. how the alternatives to what actually was decided and happened were considered (or not even considered) by the intervening agency. But even then, the evaluators&#39; assessments of these reasonings would seem to necessarily involve some exploration of consequences of these decisions, and only some of which will have been observable, and others only conjectured.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;The merits?&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;When compared to a ToC testing approach this historical approach does seem to have some merit. One of the problems of a ToC approach, particularly when applied to a complex intervention is the multiplicity of possible causal pathways, relative to the limited time and resources available available to an evaluation team. Choices usually need to be made, because not all avenues can be explored (unless some can excluded by machine learning explorations or other quantitative processes of analysis).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;However, on reflection, the contrast with a historical analysis of the reality of what actually happened is not so black and white. In large complex programmes there are typically many people working away in parallel, generating their own local and sometimes intersecting histories. There is not just one history from within which to sample decision making events&amp;nbsp; In this context a well articulated ToC may be a useful map, a means of identifying where to look for those histories in the making.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;&lt;span style=&quot;color: red;&quot;&gt;Where next&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I have since found that that the evaluation team has been thinking along similar lines to myself i.e. about the need to document and analyse the history of key decisions made. If so, the focus now should be on elaborating questions that would be practically useful, some of which are touched on above. Including:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;1. How to identify and sample important decision making points&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;At least two options here:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;1. Identify a specific type of event where it is know that relevant decisions are made. E.g, Board Meetings. This is a top-down deductive approach. Risk here is that many decisions will (and have to be) made outside and prior to these events, and just receive official authorisation at these meetings. Snowball sampling backwards to original decisions may be possible...&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;2. Try using the &lt;a href=&quot;https://mande.co.uk/special-issues/hierarchical-card-sorting-hcs/&quot;&gt;HCS method&lt;/a&gt; to partition the total span of time of interest into smaller (and nested) periods of time. Then identify decisions that have generated the differences observed between these periods (which will sought about the intervention strategy). This is a more bottom-up inductive approach.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;2. How to analyses individual decisions.&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The latter includes interesting issues such as&amp;nbsp; how much use should be made of prior/borrowed theories about what constitute good decision making, versus using a more inductive approach that emphasises understanding how the decisions were made within their own particular context. I am more in favor of the latter at present&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Here is a VERY provisional framework/checklist for what could be examined, when looking at each decision making event:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEimRdCr9mpb3tZhxMIpV_je53C6u0-yI0OG852F2n1KFQzxEV1NCZjQqUSBpBiu6eGnvK8wy2rx1nAjUvKit6eBmkwOXzXEUseKaJ8yi_-6OvRTFCa7NLdpuTSnU1p_-rlDcrHB/s741/dm+matrix2.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;307&quot; data-original-width=&quot;741&quot; height=&quot;266&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEimRdCr9mpb3tZhxMIpV_je53C6u0-yI0OG852F2n1KFQzxEV1NCZjQqUSBpBiu6eGnvK8wy2rx1nAjUvKit6eBmkwOXzXEUseKaJ8yi_-6OvRTFCa7NLdpuTSnU1p_-rlDcrHB/w641-h266/dm+matrix2.png&quot; width=&quot;641&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;In this context it may also be useful to think about a wider set of relevant ideas like the role of &quot;&lt;/span&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Path_dependence&quot; style=&quot;font-family: verdana;&quot; target=&quot;_blank&quot;&gt;path dependency&lt;/a&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&quot; and &quot;&lt;/span&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Sunk_cost&quot; style=&quot;font-family: verdana;&quot;&gt;sunk costs&lt;/a&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&quot;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;3. How to aggregate/synthesise/summarise the analysis of multiple individual decision making cases&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This is still being thought about, so caveat emptor:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;span style=&quot;background-color: #f6f6f6; font-size: medium;&quot;&gt;Objectives were to identify:&lt;/span&gt;&lt;/span&gt;&lt;span style=&quot;background-color: #f6f6f6; font-family: verdana;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;background-color: #f6f6f6; font-family: verdana;&quot;&gt;(a) how well the intervention has responded to changing circumstances.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Possible&lt;/span&gt;&lt;span style=&quot;background-color: #f6f6f6;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;&amp;nbsp;summarising device? Rate each decision&amp;nbsp;making event on degree to which it was optimal under the circumstances. Backed by a rubric explaining rating values.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Cross tabulate these ratings against&amp;nbsp;a ratings of the subsequent&amp;nbsp;impact of the decision&amp;nbsp;that was made? An &quot;Increased/decreased potential impact&quot; scale.? Likewise supported by a rubric (i.e. annotated scale).&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span style=&quot;background-color: #f6f6f6; font-family: verdana;&quot;&gt;(b) any lessons that might be relevant to the future of the intervention, or generalisable to other similar intervention.&lt;/span&gt;&lt;/p&gt;&lt;blockquote style=&quot;border: none; margin: 0px 0px 0px 40px; padding: 0px;&quot;&gt;&lt;p style=&quot;text-align: left;&quot;&gt;&lt;span style=&quot;background-color: #f6f6f6;&quot;&gt;&lt;span style=&quot;font-family: verdana;&quot;&gt;Text summary of implications identified from the analysis of each decision making event, with priority to more impactful/consequential decisions?&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Lot more thinking yet to be done here...&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;color: red; font-family: verdana; font-size: medium;&quot;&gt;&lt;b&gt;Miscellaneous points of note hereafter...&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Postscript 1: There must be a wider literature on this type of analysis, where there may be some useful experiences.&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&amp;nbsp;&quot;“&lt;i&gt;Ninety per cent of problems have already been solved in some other field. You just have to find them&lt;/i&gt;.”&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: large;&quot;&gt;&amp;nbsp;&lt;/span&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;McCaffrey, T. (2015)&amp;nbsp;&amp;nbsp;&lt;a href=&quot; https://www.newscientist.com/article/mg22730360-400-how-creative-computers-will-dream-up-things-wed-never-imagine/&quot; target=&quot;_blank&quot;&gt;New Scientist.&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Postscript 2: I just came across the idea of an &quot;even if...&quot; type counterfactual. As in &quot;Even if I did catch the train, I would still not have got the job&quot;. This is where when an imagined action, different from what really what happened, still leads to the same outcome as when the real action took place.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;
  


&lt;a href=&quot;https://commons.wikimedia.org/wiki/File:Conceptual_Thinking_by_Andy_McMahon.jpg&quot; title=&quot;Wishmi, CC BY-SA 4.0 &amp;lt;https://creativecommons.org/licenses/by-sa/4.0&amp;gt;, via Wikimedia Commons&quot;&gt;&lt;img alt=&quot;Conceptual Thinking by Andy McMahon&quot; src=&quot;https://upload.wikimedia.org/wikipedia/commons/thumb/f/f6/Conceptual_Thinking_by_Andy_McMahon.jpg/512px-Conceptual_Thinking_by_Andy_McMahon.jpg&quot; width=&quot;512&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;  
  &lt;a href=&quot;https://commons.wikimedia.org/wiki/File:Conceptual_Thinking_by_Andy_McMahon.jpg&quot;&gt;Wishmi&lt;/a&gt;, &lt;a href=&quot;https://creativecommons.org/licenses/by-sa/4.0&quot;&gt;CC BY-SA 4.0&lt;/a&gt;, via Wikimedia Commons&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/1324633703275855295/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2021/11/when-and-how-it-may-be-worthwhile.html#comment-form' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/1324633703275855295'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/1324633703275855295'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2021/11/when-and-how-it-may-be-worthwhile.html' title='Exploring counterfactual histories of an intervention'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEimRdCr9mpb3tZhxMIpV_je53C6u0-yI0OG852F2n1KFQzxEV1NCZjQqUSBpBiu6eGnvK8wy2rx1nAjUvKit6eBmkwOXzXEUseKaJ8yi_-6OvRTFCa7NLdpuTSnU1p_-rlDcrHB/s72-w641-h266-c/dm+matrix2.png" height="72" width="72"/><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-8869907095590335766</id><published>2021-08-22T12:08:00.007+00:00</published><updated>2021-10-26T15:45:59.253+00:00</updated><title type='text'>Reconciling the need for both horizontal and vertical dimensions in a Theory of Change diagram</title><content type='html'>&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;In their usual table form, Logical Frameworks are strong on their horizontal dimension but weak on their vertical dimension. On the horizontal dimension is the explanation of what kind of data will be collected and used to measure the changes that are described. This is good for accountability. On the vertical dimension is the explanation of how events at one level will connect and cause events at another level. This is good for learning.&amp;nbsp; But unfortunately LogFrames often simply provide lists of events at each level, with relatively little description of which event will connect to which, especially where multiple and mixed sets of connections might be expected between events. On the other hand diagrammatic versions of a Theory of&amp;nbsp; Change tend to be much better at explicating the various causal pathways at work, but weak on the information they provide on the horizontal dimension - on how various events will be observed and measures. Both of these problems reflect both a lack of space to do both things and different relative priorities pursued within those constraints.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The Donor Committee for Enterprise Development (DCED) has produced a web page based Theory of Change to explain its way of&amp;nbsp; working, which I think points the way to reconciling these conflicting needs. At first glance here is what you see, when you visit &lt;a href=&quot;https://www.enterprise-development.org/what-works-and-why/evidence-framework/#1610541058742-3e60ff49-7341&quot; target=&quot;_blank&quot;&gt;this page of their website.&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgVjuA11oi9UJAt29jlIbAyO3ogyexocOmv1cqA3s4M6KnBk3OL-gEZq2v4va_nRJEcV1P5jC3aVIPPYQej1s5yMBUNMK5OmkbKO5Z0G_4MF2mlRKJBnfZVyKqd_UsqRZpVv_iZ/s826/DCED.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;786&quot; data-original-width=&quot;826&quot; height=&quot;461&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgVjuA11oi9UJAt29jlIbAyO3ogyexocOmv1cqA3s4M6KnBk3OL-gEZq2v4va_nRJEcV1P5jC3aVIPPYQej1s5yMBUNMK5OmkbKO5Z0G_4MF2mlRKJBnfZVyKqd_UsqRZpVv_iZ/w483-h461/DCED.png&quot; width=&quot;483&quot; /&gt;&lt;/span&gt;&lt;/a&gt;&lt;/div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The different causal pathways are quite visible, more so than within a standard LogFrame table format. But another common weakness of diagrammatic versions of Theories of Change is the lack of explanation of what is going on within each of these pathways. The DCED addressed this problem by allowing visitors to click on a link and be taken to another web page, where visitors get a detailed text description of the available evidence, plus any assumptions, about the causal process(es) that connect the events connected by the arrow.&lt;/span&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The one weakness in this DCED ToC diagram is the lack of detail about the horizontal dimension- how the various events described in the diagram will be observed./ measured and by who and when and where. But this is clearly resolvable by using the same approach with the links: enable users to click on any &lt;i&gt;event &lt;/i&gt;and be taken to a web page where this information is provided for that specific event. As shown below:&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhKKnbebLjLkrw5IsYAMvzjkuMtaOjQD2U0bLu0BRWf76k5dXe2crIUPaRTf5phYtH3XrJxDDliZJrZVgSHs-qFE5HaoX4rQFIf6e0Lvw2qjLQb1UXe0TO3oYn8z4FgNpcZgICC/s1339/DCED+3.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;786&quot; data-original-width=&quot;1339&quot; height=&quot;363&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhKKnbebLjLkrw5IsYAMvzjkuMtaOjQD2U0bLu0BRWf76k5dXe2crIUPaRTf5phYtH3XrJxDDliZJrZVgSHs-qFE5HaoX4rQFIf6e0Lvw2qjLQb1UXe0TO3oYn8z4FgNpcZgICC/w617-h363/DCED+3.png&quot; width=&quot;617&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/8869907095590335766/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2021/08/reconciling-need-for-both-horizontal.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/8869907095590335766'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/8869907095590335766'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2021/08/reconciling-need-for-both-horizontal.html' title='Reconciling the need for both horizontal and vertical dimensions in a Theory of Change diagram'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgVjuA11oi9UJAt29jlIbAyO3ogyexocOmv1cqA3s4M6KnBk3OL-gEZq2v4va_nRJEcV1P5jC3aVIPPYQej1s5yMBUNMK5OmkbKO5Z0G_4MF2mlRKJBnfZVyKqd_UsqRZpVv_iZ/s72-w483-h461-c/DCED.png" height="72" width="72"/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-2100166780663989397</id><published>2021-07-19T14:46:00.012+00:00</published><updated>2021-09-05T22:02:44.719+00:00</updated><title type='text'>Diversity and complexity? Where should we focus our attention?</title><content type='html'>&lt;p&gt;&amp;nbsp;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;This posting has been promoted by Michael Bamberger&#39;s recent two blog postings on &quot;&lt;a href=&quot;https://www.3ieimpact.org/blogs/building-complexity-development-evaluations&quot; target=&quot;_blank&quot;&gt;Building complexity into development evaluation&lt;/a&gt;&quot; on the 3ie website: &lt;i&gt;Evidence Matters: Towards equitable, inclusive and sustainable development&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;I started to make some comments underneath each of the two postings but have now decided to try to organise and extend my thoughts here.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;My starting point is an ongoing concern about how unproductive the discussion has been about&amp;nbsp; complexity (especially in relation to evaluation). Like an elephant giving birth to a mouse, has been my chosen metaphor in the past. There probably is some rhetorical overkill here, but it does express some of my felt frustration with the limited value of the now quite extended discussion.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Michael&#39;s blog postings have not allayed these concerns. My concerns start with the idea of measuring complexity, both how you do it and how measuring would in fact be useful. Measuring complexity is Michael&#39;s proposed first step in a &quot;&lt;i&gt;practical five-step approach to address complexity-responsive evaluation in a systematic way&lt;/i&gt;&quot; A lot of ink has already been spilled on the topic of measurement, which is the first of the five steps. A useful summary can be found in Melanie Mitchel&#39;s widely read&lt;a href=&quot;https://www.google.co.uk/books/edition/Complexity/sSgzHayrDBsC?hl=en&quot; target=&quot;_blank&quot;&gt; Complexity: A guided Tour (2009&lt;/a&gt;:94-114) and &lt;a href=&quot;https://www.sciencedirect.com/science/article/abs/pii/0003491688900942&quot; target=&quot;_blank&quot;&gt;Loyd, 1998&lt;/a&gt;, who counted at least 40 different ways. But I cant see any references back to any of these methods, suggesting that not much is being learned from past efforts, which is a pity.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Along with the challenge of how to do it is the question of why you would want to do it, ...how might it be useful? The second blog posting explains that &quot;&lt;i&gt;&amp;nbsp;In addition to providing stakeholders with an understanding of what complexity means for their program, the checklist also helps decide whether the program is sufficiently complex to merit the additional investment of time and resources required to conduct a complexity-focused evaluation&lt;/i&gt;&quot;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;The second of these outcomes might be a more observable consequence, so my first question here is where is the cut-off point in a checklist derived score that would at least inform such a decision, and how is that cut-off point justified.&amp;nbsp;&lt;a href=&quot;https://www.3ieimpact.org/sites/default/files/2021-07/complexity-blg-Annex1-Checklist_assessing_level_complexity.pdf&quot; target=&quot;_blank&quot;&gt;The checklist&lt;/a&gt; has 25 attribute questions spread over 4 dimensions. This has not yet been made clear.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;My next question is how do the results of this measurement exercise inform the next of the five steps:&amp;nbsp;&quot;&lt;i&gt;Breaking the project into evaluable components and identifying the units of analysis for the component evaluation&lt;/i&gt; &quot;. So far, I have not found any answers to this question either. PS 2021 07 29: Michael did reply to a Comment of mine raising the same issue, and suggested that high versus low scores on rated complexity might be one way.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Another concern which I have already written about in my comment on the blog postings is that&amp;nbsp;complexity seems to be very much “in the eye of the beholder”, i.e. depending on who you are and what you are looking for.  My partner sees much more complexity in the design and behaviour of moths and butterflies than I do.  A friend of mine sees much more complexity in the performance of classical music than I do.&amp;nbsp; Such observations prompt me to thinking that perhaps we should not put too much effort into trying to objectively measure complexity.  Rather, perhaps we should take a more ethnographic perspective on complexity – i.e. we should pay attention to where people are seeing complexity and where they are not, and what are the implications thereof.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;If we accept this suggestion it is still the case that the challenge of identifying complexity is still with us, but in a different form.  So, I have another suggestion, which is to pay much more attention to diversity, as an important and related concept to complexity.  As&lt;a href=&quot;https://www.google.co.uk/books/edition/Diversity_and_Complexity/Mi6zkXss14IC?hl=en&amp;amp;gbpv=1&amp;amp;printsec=frontcover&quot; target=&quot;_blank&quot;&gt; Scott Page&lt;/a&gt; has well described,&amp;nbsp;there is a close and complicated relationship between diversity and&amp;nbsp;complexity. Nevertheless, there are some practically useful points to note about the concept of diversity.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Firstly the presence of diversity is indicative of the absence of a common constraint, and the presence of many different causal influences. So can be treated as a proxy - indicating the presence of complex processes.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Secondly, there is extensive and more internally consistent and practically useful set of ways in which diversity can be measured.&amp;nbsp; These mainly have their origins in the studies of biodiversity but have a much wider applicability.&amp;nbsp; Diversity can also be measured in other spheres, in human relationships (using social network analysis tools) and how people see the world (using forms of ethnographic enquiry known as pile or card sorting).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Thirdly, diversity has some potentially global relevance as a value and as objective.&amp;nbsp; Diversity of behaviour can be indicative of choice and empowerment.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Fourthly, diversity can also be seen as an important independent variable as well, enabling adaptation and creativity.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;All this is not to say that diversity cannot also be problematic.&amp;nbsp; Some forms of diversity in the present could severely limit the extent of diversity in the future. For example, if within a population there was a wide range of different types of guns held by households and many different views on how and when they could legitimately be used.&amp;nbsp; At the more mundane level, within organisations different kinds of tasks may benefit from different levels of diversity within the groups addressing those tasks.&amp;nbsp; So diversity presents a useful and important problematic in a way that the concept of complexity does not. What forms of diversity we want to see, and see sustained over time, and how can they be enabled? Where do we want choice and where should be accept restriction?&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;Arguing for more attention to diversity, rather than complexity, does not mean there also needs to be a whole new school of evaluation developed around this idea (Get Brand X Evaluation, just hot off the press! Uhh... No). It is consistent with a number of ideas already found useful, including the idea of equifinality (An outcome can arise from multiple different causes) and multifinality ( A cause can have multiple different outcomes), and the idea of multiple conjunctural causation. It is also compatible with a social constructionist and interpretive perspective on social reality.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-family: verdana; font-size: medium;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/2100166780663989397/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2021/07/diversity-and-complexity-where-should.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2100166780663989397'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/2100166780663989397'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2021/07/diversity-and-complexity-where-should.html' title='Diversity and complexity? Where should we focus our attention?'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-1364759417078460298</id><published>2021-06-14T16:24:00.036+00:00</published><updated>2021-08-22T11:27:54.016+00:00</updated><title type='text'>Paired case comparisons as an alternative to a configurational analysis (QCA or otherwise)</title><content type='html'>&lt;p&gt;[Take care, this is still very much a working draft!] Criticisms and comments welcome though&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b&gt;The challenge&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;The other day I was asked for some advice on how to implement a QCA type of analysis within an evaluation that was already fairly circumscribed in its design.  Circumscribed both by the commissioner and by the team proposing to carry out the evaluation.  The commissioner had already indicated that they wanted a case study orientated approach and had even identified the maximum number of case studies that they wanted to see (ten) .&amp;nbsp; While the evaluation team could see the potential use of a QCA type analyses they were already committed to undertaking a process type evaluation, and did not want a QCA type analyses to dominate their approach.  In addition, it appeared that there already was a quite developed conceptual framework that included many different factors which might be contribute causes to the outcomes of interest.&lt;br /&gt;&lt;br /&gt;As is often the case, there seemed to be a shortage of cases and an excess of potentially explanatory variables.  In addition, there were doubts within the evaluation team as to whether a thorough QCA analysis would be possible or justifiable given the available resources and priorities.&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b&gt;Paired case comparisons as the alternative&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;My first suggestion to the evaluation team was to recognise that there is some middle ground between across-case analysis involving medium to large numbers of cases, and a within-case analysis.  As described by Rihoux and Ragin (2009)&amp;nbsp; a QCA analysis will use both, going back and forth, using one to inform the other, over a number of iterations..  The middle ground between these two options is case comparisons – particularly &lt;i&gt;comparisons of pairs of cases.&lt;/i&gt;  Although in the situation described above there will be a maximum of 10 cases that can be explored, the number of pairs of these cases that can be compared is still quite big (45).&amp;nbsp; With these sort of numbers&amp;nbsp;some sort of strategy is necessary for making choices about the types of pairs of cases that will be compared.  Fortunately there is already a large literature on case selection.  My favourite summary is the one by&amp;nbsp; Gerring, J., &amp;amp; Cojocaru, L. (2015). &lt;a href=&quot;https://authorzilla.com/yrdm1/case-selection-a-diversity-of-methods-and-criteria-bu-blogs.html&quot; target=&quot;_blank&quot;&gt;Case-Selection: A Diversity of Methods and Criteria.&amp;nbsp;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;My suggested approach was&amp;nbsp;to use what is known as the &lt;a href=&quot;https://en.wikipedia.org/wiki/Confusion_matrix&quot; target=&quot;_blank&quot;&gt;Confusion Matrix&lt;/a&gt; as the basis for structuring the choice of cases to be compared.&amp;nbsp; A Confusion Matrix&amp;nbsp;is a simple truth table, showing a combination of two sets of possibilities (rows and columns), and the incidence of those possibilities (cell values). For example as follows:&lt;br /&gt;&lt;/p&gt;&lt;div class=&quot;separator&quot; style=&quot;clear: both; text-align: center;&quot;&gt;&lt;a href=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgauUzthP5Ae5c-ArtJmtcV-rVwZ1Umkq2hw6chK_IYFVFwvNAFWYgxqZZwB5J0Iz7ZnPcq2AvkDN2vLIQzHcjIquMeipGDIci5KkYmq-UxwVObdX3dYWkCXvGzeA3QwZg5YfmD/s499/cf1.png&quot; style=&quot;margin-left: 1em; margin-right: 1em;&quot;&gt;&lt;img border=&quot;0&quot; data-original-height=&quot;106&quot; data-original-width=&quot;499&quot; height=&quot;136&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgauUzthP5Ae5c-ArtJmtcV-rVwZ1Umkq2hw6chK_IYFVFwvNAFWYgxqZZwB5J0Iz7ZnPcq2AvkDN2vLIQzHcjIquMeipGDIci5KkYmq-UxwVObdX3dYWkCXvGzeA3QwZg5YfmD/w640-h136/cf1.png&quot; width=&quot;640&quot; /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Inside the Confusion Matrix are four types of cases:&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;ol style=&quot;text-align: left;&quot;&gt;&lt;li&gt;True Positives where there are cases with attributes that fit my theory and where the expected outcome is present&lt;/li&gt;&lt;li&gt;False Positives, where there are cases with attributes that fit my theory but where the expected outcome is absent&lt;br /&gt;&lt;/li&gt;&lt;li&gt;False Negatives, where there are cases which do not have attributes that fit my theory but where nevertheless the outcome is present&lt;br /&gt;&lt;/li&gt;&lt;li&gt;True Negatives, where there are cases which do not have attributes that fit my theory and where the outcome is absent as expected&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div&gt;Both QCA and supervised machine learning approaches are good at identifying individual (or packages of)&amp;nbsp; attributes which are good predictors of when outcomes are present or when they are absent – in other words where there are large number of True Positive and True Negative cases. And the incidence of exceptions: the False Positive and False Negatives. But this type of cross case-based led analysis do not seem to be available as an option to the evaluation team I have mentioned above.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b&gt;1. Starting with True Positives&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;So my suggestion has been to look at the 10 cases that they have at hand, and start by focusing in on those cases where the outcome is present (first column). Focus on the case that is most similar to others with the outcome present, because findings about this case may be more likely to apply to others. See below on measuring similarity) . When examining that case identify one or more attributes which is the most likely explanation for the outcome being present.  Note here that this initial theory is coming from a single within-case analysis, not a cross-case analysis.  The evaluation team will now have a single case in the category of True Positive.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b&gt;2. Comparing False Positives and True Positives&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The next step in the analysis is to identify cases which can be provisionally described as a False Positive. Start by finding a case which has the outcome absent. Does it have the same theory-relevant attributes as the True Positive? If so, retain it as a False Positive. Otherwise, move it to the True Negative category. Repeat this move for all remaining cases with the outcome absent. From among all those qualifying as False Positives, find one which is otherwise be&amp;nbsp;&lt;i&gt;as similar as possible&lt;/i&gt; in all its other attributes to the True Positive case.This type of analysis choice is called MSDO,&amp;nbsp;standing for &quot;most similar design, different outcome&quot; - see the de Meur reference below.&amp;nbsp; Also see below on how to measure this form of similarity.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The aim here is to find how the causal mechanisms at work differ. One way to explore this question is to look for an additional attribute that is present in the True Positive case but absent in the False Negative case, despite those cases otherwise being most similar.&amp;nbsp; Or, an attribute that is absent in the True Positive but present in the False Negative case. In the former case the missing case could be seen as a kind of &lt;i&gt;enabling&lt;/i&gt; factor, whereas in the latter case it could be seen as more like a &lt;i&gt;blocking &lt;/i&gt;factor. &amp;nbsp;If nether can be found by comparison of coded attributes of the cases then a more intensive examination of raw data on the case might still identify them, and lead to an updated/elaboration of theory behind the True Positive case. Alternately, that examination might suggest measurement error is the problem and that the False Positive case needs to be reclassified as True Positive.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;3. Comparing False Negatives and True Positives&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The third step in the analysis is to identify at least one most relevant case which can be described as a False Negative.&amp;nbsp; This False-Negative case should be one that is &lt;i&gt;as different as possible&lt;/i&gt; in all its attributes to the True Positive case. This type of analysis choice is called MDSO, standing for &quot;most different design, same outcome&quot;.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&amp;nbsp;The aim here should be to try to identify if the same or different causal mechanisms is at work,&amp;nbsp; when compared to that seen in the True Positive case. One way to explore this question is to look for one or more attributes that both the True Positive and False Negative case have in common, despite otherwise being &quot;most different&quot;. If found, and if associated with the causal theory in the True Positive case,&amp;nbsp; then the False Negative case can now be reclassed as a True Positive. The theory describing the now two True Positive cases can now be seen as provisionally &quot;necessary&quot;for the outcome, until another False Negative case is found and examined in a similar fashion.If the casual mechanism seems to be different then the case remains as a False Negative.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Both the second and third step comparisons described above will help : (a0 elaborate the details, and (b) establish the limits of the scope of the theory identified in step one. This suggested process makes use of the Confusion Matrix as a kind of very simple chess board, where pieces (aka cases) are introduced on to the board, one at a time, and then sometimes moved to other adjacent positions (depending on their relation to other pieces on the board).Or, the theory behind their chosen location is updated.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If there are only ten cases available to study, and these have an even distribution of outcomes present and absent, then this three step process of analysis could be reiterated five times i.e. once for each case where the outcome was present. Thus involving&amp;nbsp; up to 10 case comparisons, out of the 45 possible.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;font-size: medium;&quot;&gt;&lt;b&gt;Measuring similarity&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The above process depends on the ability to make systematic and transparent judgements about similarity. One way of doing this, which I have previously built into an Excel app called &lt;a href=&quot;https://evalc3.net/&quot; target=&quot;_blank&quot;&gt;EvalC3&lt;/a&gt;, is to start by describing each case with a string of binary coded attributes of the same kind as used in QCA, and in some forms of supervised machine learning. An example set of workings can be&amp;nbsp;&lt;a href=&quot;https://1drv.ms/x/s!Ake517EnBNEUgdNFk-Qs7cxKpnLbkg?e=CKlDu3&quot; target=&quot;_blank&quot;&gt;seen in this Excel sheet&lt;/a&gt;, showing&amp;nbsp; an imagined data set of 10 cases, with 10 different attributes and then the calculation and use of&amp;nbsp; Hamming Distance as the similarity measure to chose cases for the kinds of comparisons described above. That list of attributes and the Hamming distance measure, is likely to&amp;nbsp; need to be updated, as the investigation of False Positives and False Negatives proceeds.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Incidentally, the more attributes that have been coded per case, the more discriminating this kind of approach can become. In contrast to cross-case analysis where an increase in numbers of attributes per case is usually problematic&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;&lt;b&gt;Related sources&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For some of my earlier thoughts on case comparative analysis &lt;a href=&quot;https://evalc3.net/how-it-works/within-case-analysis/&quot; target=&quot;_blank&quot;&gt;see&amp;nbsp; here,&lt;/a&gt;&amp;nbsp;These were developed for use within the context of a cross-case analysis process. But the argument above is about how to proceed when the staring point is a within-case analysis.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;See also:&lt;/div&gt;&lt;div&gt;&lt;ul style=&quot;text-align: left;&quot;&gt;&lt;li&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;Nielsen, R. A. (2014).&amp;nbsp;&lt;/span&gt;&lt;i style=&quot;text-indent: -2em;&quot;&gt;&lt;a href=&quot;http://www.mit.edu/~rnielsen/Case%20Selection%20via%20Matching.pdf&quot; target=&quot;_blank&quot;&gt;Case Selection via Matching&lt;/a&gt;&lt;/i&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;de Meur, G., Bursens, P., &amp;amp; Gottcheiner, A. (2006). &lt;a href=&quot;http://link.springer.com/chapter/10.1007/0-387-28829-5_4&quot; target=&quot;_blank&quot;&gt;MSDO/MDSO Revisited for Public Policy Analysis&lt;/a&gt;. In B. Rihoux &amp;amp; H. Grimm (Eds.), Innovative Comparative Methods for Policy Analysis (pp. 67–94). Springer US.&amp;nbsp;&lt;/li&gt;&lt;li&gt;de Meur, G., &amp;amp; Gottcheiner, A. (2012).&lt;a href=&quot;https://dx.doi.org/10.4135/9781446249413.n12&quot; target=&quot;_blank&quot;&gt; The Logic and Assumptions of MDSO–MSDO Designs&lt;/a&gt;. In The SAGE Handbook of Case-Based Methods (pp. 208–221).&amp;nbsp;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;Rihoux, B., &amp;amp; Ragin, C. C. (Eds.). (2009). Configurational Comparative Methods: Qualitative Comparative Analysis (QCA) and Related Techniques. Sage. Pages 28-32 for a description of &lt;/span&gt;&lt;i style=&quot;text-indent: -2em;&quot;&gt;&quot;MSDO/MDSO: A systematic&amp;nbsp; procedure for matching cases and conditions&quot;.&amp;nbsp;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;span style=&quot;text-indent: -32px;&quot;&gt;Goertz, G. (2017)&lt;/span&gt;&lt;i style=&quot;text-indent: -32px;&quot;&gt;. Multimethod research, causal mechanisms, and case studies: An integrated approach.&lt;/i&gt;&lt;span style=&quot;text-indent: -32px;&quot;&gt; Princeton University Press.&lt;/span&gt;&lt;/li&gt;&lt;li style=&quot;text-indent: -32px;&quot;&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;&lt;p&gt;&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/1364759417078460298/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2021/06/case-comparisons-as-alternative-to.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/1364759417078460298'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/1364759417078460298'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2021/06/case-comparisons-as-alternative-to.html' title='Paired case comparisons as an alternative to a configurational analysis (QCA or otherwise)'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgauUzthP5Ae5c-ArtJmtcV-rVwZ1Umkq2hw6chK_IYFVFwvNAFWYgxqZZwB5J0Iz7ZnPcq2AvkDN2vLIQzHcjIquMeipGDIci5KkYmq-UxwVObdX3dYWkCXvGzeA3QwZg5YfmD/s72-w640-h136-c/cf1.png" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6719829.post-1178250385762675371</id><published>2021-05-24T16:47:00.006+00:00</published><updated>2021-05-25T07:35:52.332+00:00</updated><title type='text'>The potential use of Scenario Planning methods to help articulate a Theory of Change</title><content type='html'>&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Over the past few months&amp;nbsp;I have been engaged in discussions with other members of the Association of Professional Futurists (APF) Evaluation Task Force&amp;nbsp;about how activities and outcomes in the field of foresight/alternative futures/scenario planning can usefully be evaluated.&lt;br /&gt;&lt;br /&gt;Just recently the subject of Theories of Change has come up, and it struck me that there are at least three ways of looking at Theories of Change in this context:&lt;/p&gt;&lt;p&gt;The&amp;nbsp;first perspective: A particular scenario (i.e. an elaborated view of the future) can contain within it a particular theory of change.  One view of the future may imply that technological change will be the main driver of what happens.  Another might emphasise the major long-term causal influence of demographic change.&lt;/p&gt;&lt;p&gt;The second perspective: Those organising a scenario planning exercise are also likely to have either explicitly or implicitly or mixture of both a Theory of Change of how their exercise is expected to influence&amp;nbsp;on the participants, and the influence those participants will have on others.&lt;/p&gt;&lt;p&gt;The third perspective looks in the opposite direction and raises the possibility that in other settings a Theory of Change may contain a particular type of future scenario.  I&#39;m thinking here particularly of Theories of Change as used by organisations planning economic and/or social interventions in developed and developing economies.  This territory has been explored recently in a paper by&amp;nbsp;Derbyshire (2019),&amp;nbsp;titled &quot;&lt;span style=&quot;text-indent: -2em;&quot;&gt;&lt;a href=&quot;https://doi.org/10.1002/ffo2.1&quot; target=&quot;_blank&quot;&gt;Use of scenario planning as a theory-driven evaluation tool. &lt;/a&gt;&lt;/span&gt;&lt;a href=&quot;https://doi.org/10.1002/ffo2.1&quot; target=&quot;_blank&quot;&gt;&lt;i style=&quot;text-indent: -2em;&quot;&gt;FUTURES &amp;amp; FORESIGHT SCIENCE&lt;/i&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;, &lt;/span&gt;&lt;i style=&quot;text-indent: -2em;&quot;&gt;1&lt;/i&gt;&lt;/a&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;&lt;a href=&quot;https://doi.org/10.1002/ffo2.1&quot; target=&quot;_blank&quot;&gt;(1), 1–13&lt;/a&gt;.&amp;nbsp; In that paper he puts forward a good argument for the use of scenario planning methods as a way of developing improved Theories of Change.  Improved in a number of ways.&amp;nbsp; Firstly a much more detailed articulation of the causal processes involved.  Secondly, more adequate attention to risks and unintended consequences.  Thirdly, more adequate involvement of stakeholders in these two processes.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;Both the task force discussions and my revisiting of the paper by Derbyshire have prompted me to think about the potential use of &lt;a href=&quot;https://mscinnovations.wordpress.com/&quot; target=&quot;_blank&quot;&gt;a ParEvo exercise &lt;/a&gt;as a means of articulating the contents of a Theory of Change for a development intervention.  And to start to reach out to people who might be interested in testing such possibilities.  The following possibilities come to mind:&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;1.&amp;nbsp; A ParEvo exercise could be set up to explore what happens when X project is set up in Y circumstances with Z resources and expectations.&amp;nbsp; A description of this initial setting would form the seed paragraph(s)&amp;nbsp;of the ParEvo exercise.  The subsequent iterations would describe the various possible developments that took place over a series of calendar periods,&amp;nbsp;reflecting the expected lifespan of the intervention, and perhaps a limited period thereafter.  The participants would be, or act in the role of, different stakeholders in the intervention.  Commentators of the emerging storylines could be independent parties with different forms of expertise relevant to the intervention and its context.&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style=&quot;text-indent: -2em;&quot;&gt;2.&amp;nbsp; As with all previous ParEvo exercises to date, after the final iteration there would be an evaluation stage, completed by at least the participants and the commentators, but possibly also by others&amp;nbsp;in observer roles.&amp;nbsp; You can see a copy of a&lt;a href=&quot;https://mscinnovations.files.wordpress.com/2021/05/surveymonkey_304997640-evaluation-vs2.pdf&quot; target=&quot;_blank&quot;&gt; recent evaluation survey form here,&lt;/a&gt; to see&amp;nbsp;the types of evaluative judgements that would be sought from those involved and observing.&lt;/span&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;3.&amp;nbsp; .3.&amp;nbsp; There seemed to be at least two possible ways of using the storylines that have been generated, to inform the design of a Theory of Change.  One is to take&amp;nbsp;whole storylines as units of analysis.  For example, a storyline evaluated as both most likely and most desirable, by more participants than any other storyline, would seem an immediately&amp;nbsp;useful source of detailed information about a causal pathway that should go into a Theory of Change.  Other storylines identified as most likely but least desirable would warrant attention as &lt;i&gt;risks&lt;/i&gt; that also need to be built into a Theory Of Change, along with any potential means of preventing and/or mitigating those risks.  Other storylines identified as least likely but most desirable would warrant attention as &lt;i&gt;opportunities&lt;/i&gt;, also to be built into a Theory Of Change, along with means of enabling and exploiting those opportunities.&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;4. 34.&amp;nbsp; The second possible approach would&amp;nbsp;give less respect to the existing branch structure, and focus more on the contents of individual contributions i.e. paragraphs in the storylines.&amp;nbsp; Individual contributions could be sorted into categories familiar to those developing Theories of Change: activities, outputs, outputs, and impacts.&amp;nbsp; These could then be&amp;nbsp;recombined into one or more causal pathways that the participants thought&amp;nbsp;was both possible and desirable.&amp;nbsp; In effect, a kind of linear jigsaw puzzle.  If the four categories of event types were seen as being too rigid a schema&amp;nbsp;(a reasonable complaint!),&amp;nbsp;&amp;nbsp;but still an unfortunate necessity, they could be introduced after the recombination process, rather than before.  Either way, it probably would be useful to include another evaluation stage, making a comparative evaluation of the different&amp;nbsp;combinations of contributions that had been created.&amp;nbsp; Using the same metrics as are already being used with&amp;nbsp;existing ParEvo exercise.&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;More ideas will follow..&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;h3 style=&quot;text-align: left; text-indent: -32px;&quot;&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;The beginnings of a bibliography...&lt;/h3&gt;&lt;div&gt;&lt;div class=&quot;csl-bib-body&quot; style=&quot;line-height: 2; margin-left: 2em; text-indent: -2em;&quot;&gt;
  &lt;div class=&quot;csl-entry&quot;&gt;&lt;div class=&quot;csl-bib-body&quot; style=&quot;line-height: 2; margin-left: 2em; text-indent: -2em;&quot;&gt;
  &lt;div class=&quot;csl-entry&quot;&gt;Derbyshire, J. (2019). Use of scenario planning as a theory-driven evaluation tool. &lt;i&gt;FUTURES &amp;amp; FORESIGHT SCIENCE&lt;/i&gt;, &lt;i&gt;1&lt;/i&gt;(1), 1–13. &lt;a href=&quot;https://doi.org/10.1002/ffo2.1&quot;&gt;https://doi.org/10.1002/ffo2.1&lt;/a&gt;&lt;/div&gt;
  &lt;span class=&quot;Z3988&quot; title=&quot;url_ver=Z39.88-2004&amp;amp;ctx_ver=Z39.88-2004&amp;amp;rfr_id=info%3Asid%2Fzotero.org%3A2&amp;amp;rft_id=info%3Adoi%2F10.1002%2Fffo2.1&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.genre=article&amp;amp;rft.atitle=Use%20of%20scenario%20planning%20as%20a%20theory-driven%20evaluation%20tool&amp;amp;rft.jtitle=FUTURES%20%26%20FORESIGHT%20SCIENCE&amp;amp;rft.volume=1&amp;amp;rft.issue=1&amp;amp;rft.aufirst=James&amp;amp;rft.aulast=Derbyshire&amp;amp;rft.au=James%20Derbyshire&amp;amp;rft.date=2019&amp;amp;rft.pages=1-13&amp;amp;rft.spage=1&amp;amp;rft.epage=13&amp;amp;rft.issn=2573-5152&amp;amp;rft.language=en&quot;&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class=&quot;csl-entry&quot;&gt;&lt;div class=&quot;csl-bib-body&quot; style=&quot;line-height: 2; margin-left: 2em; text-indent: -2em;&quot;&gt;
  &lt;div class=&quot;csl-entry&quot;&gt;Ganguli, S. (2017). Using Scenario Planning to Surface Invisible Risks (SSIR). &lt;i&gt;Stanford Social Innovation Review&lt;/i&gt;. &lt;a href=&quot;https://ssir.org/articles/entry/using_scenario_planning_to_surface_invisible_risks&quot;&gt;https://ssir.org/articles/entry/using_scenario_planning_to_surface_invisible_risks&lt;/a&gt;&lt;/div&gt;
  &lt;span class=&quot;Z3988&quot; title=&quot;url_ver=Z39.88-2004&amp;amp;ctx_ver=Z39.88-2004&amp;amp;rfr_id=info%3Asid%2Fzotero.org%3A2&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.genre=article&amp;amp;rft.atitle=Using%20Scenario%20Planning%20to%20Surface%20Invisible%20Risks%20(SSIR)&amp;amp;rft.jtitle=Stanford%20Social%20Innovation%20Review&amp;amp;rft.aufirst=Sampriti&amp;amp;rft.aulast=Ganguli&amp;amp;rft.au=Sampriti%20Ganguli&amp;amp;rft.date=2017&amp;amp;rft.language=en-us&quot;&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;
  &lt;span class=&quot;Z3988&quot; title=&quot;url_ver=Z39.88-2004&amp;amp;ctx_ver=Z39.88-2004&amp;amp;rfr_id=info%3Asid%2Fzotero.org%3A2&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;amp;rft.type=webpage&amp;amp;rft.title=Using%20Scenario%20Planning%20to%20Surface%20Invisible%20Risks%20(SSIR)&amp;amp;rft.description=How%20to%20prepare%20your%20organization%20for%20the%20unexpected.&amp;amp;rft.identifier=https%3A%2F%2Fssir.org%2Farticles%2Fentry%2Fusing_scenario_planning_to_surface_invisible_risks&amp;amp;rft.language=en-us&quot;&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;&lt;br /&gt;&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;3&lt;/p&gt;&lt;p style=&quot;text-indent: -32px;&quot;&gt;3&lt;/p&gt;&lt;div class=&quot;blogger-post-footer&quot;&gt;from &quot;Rick on the Road&quot; at http://mandenews.blogspot.com/&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://mandenews.blogspot.com/feeds/1178250385762675371/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://mandenews.blogspot.com/2021/05/the-potential-use-of-scenario-planning.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/1178250385762675371'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6719829/posts/default/1178250385762675371'/><link rel='alternate' type='text/html' href='http://mandenews.blogspot.com/2021/05/the-potential-use-of-scenario-planning.html' title='The potential use of Scenario Planning methods to help articulate a Theory of Change'/><author><name>Rick Davies</name><uri>http://www.blogger.com/profile/07028422984421301184</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry></feed>