<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0"><channel><title>Rick On the Road</title><link>http://mandenews.blogspot.com/</link><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/RickOnTheRoad" /><description>Reflections on the monitoring and evaluation of development aid projects, programmes and policies, and development of organisation's capacity to do the same. This blog also functions as the &lt;b&gt;Editorial section of the MandE NEWS website&lt;/b&gt; - see the Links section below on the left.</description><language>en</language><managingEditor>noreply@blogger.com (Rick Davies)</managingEditor><lastBuildDate>Tue, 29 May 2012 00:50:05 PDT</lastBuildDate><generator>Blogger http://www.blogger.com</generator><openSearch:totalResults xmlns:openSearch="http://a9.com/-/spec/opensearchrss/1.0/">57</openSearch:totalResults><openSearch:startIndex xmlns:openSearch="http://a9.com/-/spec/opensearchrss/1.0/">1</openSearch:startIndex><openSearch:itemsPerPage xmlns:openSearch="http://a9.com/-/spec/opensearchrss/1.0/">25</openSearch:itemsPerPage><feedburner:info uri="rickontheroad" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:browserFriendly>This is an XML content feed. It is intended to be viewed in a newsreader or syndicated to another site.</feedburner:browserFriendly><item><title>A perspective on "Value for Money" relationships</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/9k53SbqfSo8/perspective-on-value-for-money.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Mon, 28 May 2012 06:43:02 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-5169228477832957058</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;Matrices can be a useful means of showing the results of different combinations of things In this matrix I show how three different performance attributes can be seen as the results of different combinations of change in unit costs and effectiveness.  &lt;br /&gt;&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-sqJxf7Ik7BY/T74bCEs3Y7I/AAAAAAAAGFk/vVzE5lL_qiM/s1600/VFM.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/-sqJxf7Ik7BY/T74bCEs3Y7I/AAAAAAAAGFk/vVzE5lL_qiM/s1600/VFM.bmp" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;Source: Department of Crude Measures&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-5169228477832957058?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-sqJxf7Ik7BY/T74bCEs3Y7I/AAAAAAAAGFk/vVzE5lL_qiM/s72-c/VFM.bmp" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">2</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2012/05/perspective-on-value-for-money.html</feedburner:origLink></item><item><title>Data mining algorithms as evaluation tools</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/5AbMTnSqdWw/data-mining-tools-and-evaluation.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Tue, 29 May 2012 00:50:05 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-8968326040027170850</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;div&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;For years now I have been in favour of theory-led evaluation approaches. Many of the previous postings on this website are evidence of this. But this post is about something quite different, about a particular form of &lt;a href="http://en.wikipedia.org/wiki/Data_mining"&gt;data mining&lt;/a&gt;, how to do it and how it might be useful. Some have argued that data mining is radically different from hypothesis-led research (or evaluation, for that matter). Others have argued that there are some important continuities and complimentarities (&lt;a href="http://theoryandscience.icaap.org/content/vol9.2/Chong.html" target="_blank"&gt;Yu, 2007&lt;/a&gt;)&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Recently I have started reading about different data mining &lt;a href="http://en.wikipedia.org/wiki/Algorithm"&gt;algorithms&lt;/a&gt;, especially the use of what are called classification trees and genetic algorithms (GAs). The latter was the subject of my &lt;a href="http://www.mandenews.blogspot.co.uk/2012/03/can-we-evolve-explanations-of-observed.html"&gt;recent post&lt;/a&gt;, about whether we could evolve models of development projects as well as design them. Genetic algorithms are software embodiments of the evolutionary algorithm (i.e. iterated variation, selection, retention) at work in the biological world. They are good for exploring large possibility spaces and for coming up with new solutions that may not be nearby to current practice.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;I had wondered if this idea could be connected to the use of &lt;a href="http://www.compasss.org/"&gt;Qualitative Comparative Analysis (QCA),&lt;/a&gt;a method of identifying configurations of attributes (e.g. of development projects) associated with a particular type of outcomes (e.g. reduced household poverty). QCA is a theory-led approach, which uses very basic forms of data about attributes (i.e. categorical), then describes configurations of these attributes using Boolean logic expressions, and analyses these with the help of software that can compare and manipulate these statements. The aim is to come up with a minimal number of simple “IF…THEN” type statements describing what sorts of conditions are associated with particular outcomes. This is potentially very useful for development aid managers who are often asking about “what works where in what circumstances”. (But before then there is the challenge of getting on top of the technical language required to be able to do QCA).&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;My initial thought&amp;nbsp; was whether genetic algorithms could be used evolve and test statements describing different configurations, as distinct from constructing them one by one on the basis of a current theory. This might lead to quicker resolution, and perhaps discoveries that had not been suggested by current theory.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;As described in my previous post, there is already a simple GA built into Excel, known as &lt;a href="http://blogs.office.com/b/microsoft-excel/archive/2009/09/21/new-and-improved-solver.aspx"&gt;Solver&lt;/a&gt;. What I could not work out was how to represent logic elements like AND, NOT, OR in such a way that Solver could vary them to create different statements representing different configurations of existing attributes. &amp;nbsp;In the process of trying to sort out this problem I discovered that there is a&amp;nbsp; whole literature on &lt;a href="https://www.google.co.uk/search?q=genetic+algorithm+rule+discovery&amp;amp;ie=utf-8&amp;amp;oe=utf-8&amp;amp;aq=t&amp;amp;rls=org.mozilla:en-GB:official&amp;amp;client=firefox-a#hl=en&amp;amp;client=firefox-a&amp;amp;rls=org.mozilla:en-GB%3Aofficial&amp;amp;sclient=psy-ab&amp;amp;q=genetic+algorithm+rule+discovery&amp;amp;oq=genetic+algorithm+rule+discovery&amp;amp;aq=f&amp;amp;aqi=&amp;amp;aql=&amp;amp;gs_l=serp.12...0l0l0l9129734l0l0l0l0l0l0l0l0ll0l0.frgbld.&amp;amp;pbx=1&amp;amp;fp=1&amp;amp;biw=1920&amp;amp;bih=930&amp;amp;bav=on.2,or." target="_blank"&gt;&lt;span style="color: blue;"&gt;GAs and rule discovery&lt;/span&gt;&lt;/a&gt;(rules as in IF-THEN statements). Around the same time, a technical adviser from &lt;a href="http://www.solver.com/"&gt;FrontlineSolver&lt;/a&gt; suggested I try a different approach to the automated search for association rules. This involved the use of &lt;a href="http://www-users.cs.umn.edu/%7Ekumar/dmbook/ch4.pdf"&gt;Classification Trees&lt;/a&gt;, a tool which has the merit of producing results which are readable by ordinary mortals, unlike the results of some other data mining methods.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;An example!&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;a href="http://mande.co.uk/blog/wp-content/uploads/2012/04/epm-data-only.xlsx"&gt;This Excel file&lt;/a&gt; contains a small data set, which has previously been used for QCA analysis. It contains 36 cases, each with 4 attributes and 1 outcome of interest. The cases relate to different ethnic minorities in countries across Europe and the extent to which there has been ethnic political mobilisation in their countries (being the outcome of interest). Both the attributes and outcomes are coded as either 0 or 1 meaning absent or present.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;With each case having up to four different attributes there could be 16 different combinations of attributes. A classification algorithm in &lt;a href="http://www.solver.com/xlminer/"&gt;XLMiner&lt;/a&gt; &amp;nbsp;software (and others like it) is able to automatically sort through these possibilities to find the simplest classification tree that can correctly point to where the different outcomes take place. &lt;a href="http://www.solver.com/xlminer/"&gt;XLMiner&lt;/a&gt; produced the following classification tree, which I have annotated and will walk through below.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://mande.co.uk/blog/wp-content/uploads/2012/04/epm-tree-2nd1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="560" src="http://mande.co.uk/blog/wp-content/uploads/2012/04/epm-tree-2nd1.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://mande.co.uk/blog/wp-content/uploads/2012/04/epm-tree.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;br /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;We start at the top with the attribute “large” referring to the size of the linguistic subnation within their own country. Those that are large have then been divided according to whether their subnational region is “growing” or not. Those that are not have then been divided into those who are relatively “wealthy” group within their nation and those who are not. The smaller linguistic subnations &amp;nbsp;have also been divided into those who are relatively wealthy group within their nation and those who are not, and those who are relatively wealthy are then divided into those whose subnational region speak and write in their own language or not. The square nodes at the end of each “branch” indicate the outcome associated with these combinations of conditions - where there has been ethnic political mobilisation (1) or not (0). Under each square node are the ethnic groups placed in that category. These results fit with the original data in Excel (right column).&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;This is my summary of the rules described by the classification tree:&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;IF &lt;/u&gt;a linguistic subnation’s economy is large &lt;u&gt;AND&lt;/u&gt; growing &lt;u&gt;THEN&lt;/u&gt; ethnic political mobilisation will be present [14 of 19 positive cases]&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;IF &lt;/u&gt;a linguistic subnation’s economy is large, &lt;u&gt;NOT&lt;/u&gt; growing &lt;u&gt;AND&lt;/u&gt; is relatively wealthy &lt;u&gt;THEN&lt;/u&gt; ethnic political mobilisation will be present [2 of 19 positive cases]&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;IF&lt;/u&gt; a linguistic subnation’s economy is &lt;u&gt;NOT &lt;/u&gt;large &lt;u&gt;AND&lt;/u&gt; is relatively wealthy &lt;u&gt;AND&lt;/u&gt; speaks and writes in its own language &lt;u&gt;THEN&lt;/u&gt; ethnic political mobilisation will be present [3 of 19 positive cases]&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;blockquote class="tr_bq"&gt;&lt;div&gt;Both QCA and classification trees have procedures for simplifying the association rules that are found. With classification  trees there is an automated “pruning” option that removes redundant parts. My  impression is that there are no redundant parts in the above tree, but I  may be wrong.&lt;/div&gt;&lt;/blockquote&gt;&lt;div&gt;These rules are, in realist evaluation terminology, describing three different configurations of &lt;i&gt;possible&lt;/i&gt; causal processes. I say "possible" because what we have above are associations. Like correlation co-effecients, they dont necessarily mean causation. However, they are at least candidate configurations of causal processes at work.&lt;/div&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;The origins of this data set and its coding are described in pages 137-149 of The Comparative Method: Moving Beyond Qualitative and Quantitative Strategies by Charles C. Ragin, viewable on &lt;a href="http://books.google.co.uk/books?id=mZi17vherScC&amp;amp;pg=PA136&amp;amp;lpg=PA136&amp;amp;dq=ethnic+political+mobilization+data+set+from+The+Comparative+Method&amp;amp;source=bl&amp;amp;ots=3rcB0uydnX&amp;amp;sig=FYVPIhALBciXkoBuTcyc1b_YmS8&amp;amp;hl=en&amp;amp;sa=X&amp;amp;ei=ZeCPT9-_N4SR0QXqm6WCAg&amp;amp;ved=0CCoQ6AEwAA#v=onepage&amp;amp;q=ethnic%20political%20mobilization%20data%20set%20from%20The%20Comparative%20Method&amp;amp;f=false"&gt;Google Books&lt;/a&gt;. Also discussed there is the QCA analysis of this data set and its implications for different theories of ethnic political mobilisation. My thanks to Charles Ragin for making the data set available.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;I think this type of analysis, by both QCA and classification tree algorithms, has considerable potential use in the evaluation of development programs. Because it uses nominal data the range of data sources that can be used is much wider than statistical methods that need ratio or interval scale data. Nominal data can either be derived from pre-existing more sophisticated data (by using cut-off points to create categories) or be collected as primary data, including by participatory processes such as card/pile sorting and ranking exercises. The results in the form of IF…THEN rules should be of practical use, if only in the first instance as a source of hypotheses needing further testing by more detailed inquiries.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;There are some fields of development work where large amounts of potentially useful, but rarely used, data is generated on a continuing basis such a microfinance services and to a less extent healthy and education services. Much of the demand for data mining capacity has come from industries that are finding themselves flooded with data, but lack the means to exploit it. This may well be the case with more development agencies in the future, as they make more use of interactive websites and mobile phone data collection methods and the like.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;For those who are interested, there is a range of software worth exploring in addition to the package I have mentioned above. See these lists: &lt;a href="http://www.kdnuggets.com/software/classification-decision-tree.html"&gt;A&lt;/a&gt;and &lt;a href="http://www.phiresearchlab.org/downloads/OpenSourceDataMining.pdf"&gt;B&lt;/a&gt;&amp;nbsp; I have a particular interest in &lt;a href="http://www.gatree.com/wordpress/"&gt;GATree&lt;/a&gt;, which uses genetic algorithm to evolve the best fitting classification tree, and to avoid the problem of being stuck in a “&lt;a href="http://en.wikipedia.org/wiki/Local_optimum"&gt;local optimum&lt;/a&gt;”. There is also another type of algorithm with the delightfull name of &lt;a href="http://en.wikipedia.org/wiki/Random_forest"&gt;Random Forests&lt;/a&gt;, which uses the “wisdom of crowds” principle to find the best fitting classification tree. But note the caveat: “Unlike decision trees, the classifications made by Random Forests are difficult for humans to interpret”. These and other algorithms are in use by participants in the &lt;a href="https://www.kaggle.com/competitions"&gt;Kaggle&lt;/a&gt; competitions online, which themselves could be considered as a kind of semi-automated meta-algorithm (i.e. an algorithm for finding useful algorithms). Lots to explore!&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;b&gt;PS&lt;/b&gt;: I have just found and tested another package, called &lt;a href="http://www.xlstat.com/en/" target="_blank"&gt;XLSTAT&lt;/a&gt;, that also generates classification trees. Here is a graphic showing the same result as found above, but in more detail. (Click on the image to enlarge it)&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-SlIotpmFscY/T5BKwRvpr9I/AAAAAAAAF-8/NY65phAy-V0/s1600/emp+tree+XLSTAT.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="288" src="http://1.bp.blogspot.com/-SlIotpmFscY/T5BKwRvpr9I/AAAAAAAAF-8/NY65phAy-V0/s640/emp+tree+XLSTAT.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b&gt;PS 29 April 2012&lt;/b&gt;: In QCA distinctions are made between a condition being "&lt;a href="http://en.wikipedia.org/wiki/Necessity_and_sufficiency" target="_blank"&gt;necessary" and or "sufficient" &lt;/a&gt;for the outcome to occur.&amp;nbsp; In the simplest setting a single condition can be a necessary and sufficient cause. In more complex settings a single condition may be a necessary part of a configuration of conditions which itself is sufficient but not necessary. For example a "growing" economy in the right branch of the first tree above. In classification trees the presence/absence of the necessary/sufficient conditions can easily be observed. If a condition appears in all "yes" branches of the tree (= different configurations) then it is "necessary". If a condition appears along with another in a given "yes" branch of&amp;nbsp; of a tree then it is not "sufficient". "Wealthy" is a condition that appears necessary but not sufficient&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS 4 May 2012&lt;/b&gt;: I have just discovered there is what looks like a very good open source datamining package called &lt;a href="http://sourceforge.net/projects/rapidminer/" target="_blank"&gt;RapidMiner,&lt;/a&gt; which comes with a &lt;a href="http://rapidminerresources.com/" target="_blank"&gt;whole stack of training videos&lt;/a&gt;, and a big support and development community&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS 29 May 2012:&lt;/b&gt;&lt;a href="http://dilbert.com/dyn/str_strip/000000000/00000000/0000000/000000/00000/6000/600/6646/6646.strip.gif"&gt; Pertinent comment from Dilbert &lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-8968326040027170850?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-SlIotpmFscY/T5BKwRvpr9I/AAAAAAAAF-8/NY65phAy-V0/s72-c/emp+tree+XLSTAT.png" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2012/04/data-mining-tools-and-evaluation.html</feedburner:origLink></item><item><title>Criteria for assessing the evaluability of Theories of Change</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/VrJifJdvoi4/criteria-for-assessing-evaluablity-of.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Mon, 30 Apr 2012 03:44:41 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-4836947793652117567</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Our team has recently begun work on an evaluability assessment of an agency's work in a particular policy area, covering many programs in many countries. Part of our brief is to examine the evaluability of the programs' Theory of Change (ToC).&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;In order to do this we clearly need to identify some criteria for assessing the evaluability of ToC. I initially identified five which I thought might be appropriate, and then put these out to the members of the &lt;a href="http://groups.yahoo.com/group/MandENEWS/"&gt;MandE NEWS email list&lt;/a&gt; for comment. Many comments were quickly forthcoming. In all, a total of 20 people responded in the space of two days (Thanks to Bali, Dwiagus, Denis, Bob, Helene, Mustapha, Justine, Claude, Alex, Alatunji, Isabel, Sven, Irene, Francis, Erik, Dinesh, Rebecca, John, Rajan and Nick).&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Caveats and clarifications&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;What I have presented below is my current perspective on the issue of evaluability criteria, as informed by these responses. It is not intended to be an objective and representative description of the responses (&lt;a href="http://mande.co.uk/blog/wp-content/uploads/2012/04/Evaluablity-of-TOC-criteria-email-list-members-comments-Vs2.pdf" target="_blank"&gt;Lookhere for a copy&lt;/a&gt; of all the comments received) (&lt;a href="http://mande.co.uk/blog/wp-content/uploads/2012/04/Evaluablity-of-TOC-criteria3.pdf" target="_blank"&gt;You can also download this posting as a pdf&lt;/a&gt;)&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The word "evaluable" needs some clarification. In the literature on evaluability assessments it has two meanings. The main one is that it is &lt;i style="mso-bidi-font-style: normal;"&gt;possible&lt;/i&gt; to evaluate something. For example, if the theory is clear and the data is available. The second meaning is more &lt;i style="mso-bidi-font-style: normal;"&gt;practically oriented&lt;/i&gt;. The theory may be clear and the data available, but the theory may be so implausible that it is simply not worth expending resources on its evaluation. Or there may be a perfectly good ToC, but if no one owns it apart from a consultant who visited the project six months ago, so it might be questionable whether expensive resources should be invested in its evaluation.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;We also need to distinguish between an evaluable ToC and a “good” ToC.&amp;nbsp; A ToC may be evaluable because the theory is clear and plausible, and relevant data is available. But as the program is implemented, or following its evaluation, it might be discovered that the ToC was wrong, that people or institutions don’t work the way the theory expected them to do so. It was a “bad” ToC. Alternately it is also possible that a ToC may turn out to be good, but the poor way it was initially expressed made it un-evaluable, until remedial changes were made.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&amp;nbsp;This brings us to a third clarification. &amp;nbsp;My minimalist definition of a ToC is quite simple: “&lt;i style="mso-bidi-font-style: normal;"&gt;the description of a sequence of events that is expected to lead to a particular desired outcome&lt;/i&gt;” Such a description could be in text, tables, diagrams or a combination of these. Falling within the scope of this definition we could of course find ToC that are evaluable and those that are not so evaluable.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;A possible list of criteria for assessing the evaluability of a Theory of Change (Version 2)&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpFirst" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l3 level1 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Understandable&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Do the individual readers of the ToC find it easy to understand?&amp;nbsp; Is the text understandable? If used, is the diagram clear?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Do different people interpret the ToC in the same way?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Do different documents give consistent representations of the same ToC?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l3 level1 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Verifiable&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Are the events described in a way that could be verified? This is the same territory as that of Objectively Verifiable Indicators (OVIs) and Means of Verification (MoVs) found in LogFrames&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l3 level1 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Testable&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Are there identifiable causal links between the events? Often there are not&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Are the linked events parts of an identifiable causal pathway?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l3 level1 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Explained&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Are there explanations of how the connections are expected to work? Connections are common, explanations of the causal process involved are much less so.&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Have the underlying assumptions been made explicit? (also duplicated below)&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l3 level1 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Complete&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Does what might be a long chain of events make a connection between the intervening agent with the intended beneficiaries (/target of their actions)? In a recent ToC that I have seen the ToC is quite detailed at the beneficiary end, but surprisingly vague and unspecific towards the agent’s end, even though that is where accountability might be more immediately expected.&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l3 level1 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Inclusive &lt;/u&gt;(a better a term is needed here)&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l3 level2 lfo2; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Does the ToC encompass the diversity of contexts it is meant to cover? In ToC covering whole portfolios of projects there could be a substantial diversity of contexts and interventions. Does the ToC provide room for these with sacrificing too much in terms of verifiability and testability” See &lt;a href="http://www.mandenews.blogspot.co.uk/2012/03/modular-theories-of-change-means-of.html"&gt;Modular Theories of Change: A means of coping with diversity and change?&lt;/a&gt; for some views how to respond to this challenge.&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l0 level1 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Justifiable&lt;/u&gt;(new)&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Is there evidence supporting the sequence of events in the ToC? Either from past studies, previous projects, and/or from a situation analysis/baseline study or the like which is part of the design/inception stage of the current project&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l0 level1 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Plausible &lt;/u&gt;(new)&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Where there is no prior evidence is the sequence of events plausible, given what is known about the intervention and the context? &lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Have the underlying assumptions been made explicit?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Have contextual factors been recognised as important mediating variables?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l0 level1 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Owned&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Can those responsible for contents of the ToC be identified?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;How widely owned is the ToC?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Do their views have any consequences? &lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 18.0pt; mso-add-space: auto; mso-list: l0 level1 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;u&gt;Embedded&lt;/u&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 54pt; text-indent: -18pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Are the contents of the ToC are also referred to in other documents that will help ensure that it is operationalized?&lt;/div&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Weighting&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;It was sensibly suggested that some criteria were more important than others. One argued that if you can establish that the causal links in a ToC are evidence based then ‘ownership will and shall follow’”.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;In individual evaluability assessments a simple sense of their relative priority may be sufficient. When comparisons need to be made of the evaluability of multiple programs, it may be necessary to think about weighted scoring mechanisms/checklists.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Purpose&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;It was suggested that the criteria used would depend on the purpose for which the ToC was created. An understanding of the Purpose could therefore inform the weighting given to the different criteria. &lt;/div&gt;&lt;div class="MsoNormal"&gt;Prior to consulting the email list members I had drafted a list of three possible purposes that could generate different kinds of evaluation questions, which an evaluability assessment would need to consider. They were:&lt;/div&gt;&lt;div class="MsoListParagraphCxSpFirst" style="margin-left: 38.5pt; mso-add-space: auto; mso-list: l2 level1 lfo3; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;If the purpose of the ToC was to &lt;b style="mso-bidi-font-weight: normal;"&gt;set direction&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 74.5pt; mso-add-space: auto; mso-list: l2 level2 lfo3; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Then we need to ask were programs designed accordingly?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 38.5pt; mso-add-space: auto; mso-list: l2 level1 lfo3; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;If the purpose of the ToC was to &lt;b style="mso-bidi-font-weight: normal;"&gt;make a prediction&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 74.5pt; mso-add-space: auto; mso-list: l2 level2 lfo3; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Then we need to ask if the programs subsequently turn out this way&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 38.5pt; mso-add-space: auto; mso-list: l2 level1 lfo3; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&amp;nbsp;If the purpose of the ToC was to &lt;b style="mso-bidi-font-weight: normal;"&gt;provide a summation&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 74.5pt; mso-add-space: auto; mso-list: l2 level2 lfo3; text-indent: -18.0pt;"&gt;&lt;span style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;o&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Then we need to ask if this is an accurate picture of what actually happened?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 74.5pt; text-indent: -18pt;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;One criticism of the inclusion of prediction was that most ToC are nothing like scientific models and because of this they are typically insufficient in their contents to generate any attributable predictions.&amp;nbsp; This may be true in the sense that scientific predictions aim to be generalisable, albeit subject to specific conditions e.g. that gravity behaves the same way in different parts of the universe. But most program ToC have much more location-specific predictions in mind, e.g. about the effects of a particular intervention in a particular place. There are interesting exceptions however, such as a ToC about a whole portfolio of programs, or a ToC about a whole policy area that might be operationalised through investment portfolios managed in a range of countries. There the criticism of incapacity may be more relevant.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The same critic proposed an alternate purpose to prediction, one where simplicity might be more of a virtue than a liability. ToC may aim to communicate or generate &lt;b style="mso-bidi-font-weight: normal;"&gt;insight&lt;/b&gt;, by focusing on the core of an idea that is driving or inspiring a program. If so, then evaluation question could focus on how the ToC has changed the users’ &lt;i style="mso-bidi-font-style: normal;"&gt;understanding&lt;/i&gt; of the issues involved. This question about effects could be extended to include the effects of participation in the process whereby the ToC was developed.&lt;br /&gt;&lt;blockquote class="tr_bq"&gt;&lt;div class="MsoNormal"&gt;PS: A similar point was made by another contributor, in a parallel related discussion on the KBF email list, who distinguished between two purposes:&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;i&gt;to model a situation to better understand it and programme around it&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;to simplify a complex situation to help explain it to others and persuade them of the logic of your proposed intervention (e.g. for funding)&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;...noting that “&lt;i&gt;in practice there is often a tradeoff between the explanatory and persuasive aspects of the underlying logic&lt;/i&gt;” &lt;/blockquote&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Issues arising about criteria&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The following issues were raised.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 17.85pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -17.85pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Process and Product:&lt;/b&gt; The list above is largely about the ToC product, not the process whereby it was created. Some argued there needed to be a participatory process of development to ensure the ToC was “&lt;i style="mso-bidi-font-style: normal;"&gt;aligned with the needs of beneficiaries and the national objectives&lt;/i&gt;”. However, others argued that that “&lt;i style="mso-bidi-font-style: normal;"&gt;ToC are not “development projects” that must be aligned with the Paris Declaration, but rather tools that must be rigorous, applied without ‘complaisance’&lt;/i&gt; “. The hoped for reality might lie in between, ToC typically are associated with specifically project interventions and the extent of their ownership is relevant to answering the practical aspects of evaluability. On the other hand, the rigour of their use as tools will affect their usefulness and whether they can be evaluated. The product-oriented criteria given above do include two criteria that may reflect the effects of a good development process. i.e. ownership and embeddedness.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 17.85pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -17.85pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Ownership&lt;/b&gt;: It was argued that ownership was not a criterion of good ToC, often the consensus in science has been proved wrong. But in the above list the criterion of ownership is relevant to whether the ToC is worth evaluating, it is not a criterion of value of the belief or understanding represented by the ToC. It could be argued that widely owned views of how a project is working are eminently worth evaluating, because of the risk that they are wrong.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;This approach might lead to the view that on the other hand ToC with few owners should not be evaluated. This view was in effect questioned by an example cited of an evaluator coming up with their alternative ToC, which was based on prior evaluations studies and research, in contrast to the politically motivated views of the official in charge of a program. &amp;nbsp;This brings us back to the criteria listed above, and the idea of weighting them according to context (ownership versus justifiability).&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Relevance&lt;/b&gt;: This proposed criterion begs the question of relevant to whom? Ownership of the ToC (voluntary or mandated) would seem to signify a degree of relevance.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Falsifiability&lt;/b&gt;: It was argued that this is the pre-eminent criteria of a good scientific theory, and one which needed more attention by development agencies when thinking about the ToC behind their interventions. The criteria in the list above address this to some extent by inquiring about the existence of clear causal links, along with good explanations for how they are expected to work. Perhaps “good” needs to be replaced by falsifiable, though I worry about setting the bar too high when most ToC I see barely manage to crawl. Many decent ToC do include multiple causal links. The more there are, the more vulnerable they are to disproof, because only one link needs to fail for ToC not to work. This could be seen as a crude measure of falsifiability. &lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Flexibility&lt;/b&gt;: Although it was suggested that ToC be flexible and adaptable this view is contentious, in that it seems to contradict the need for specificity (by being verifiable, testable, and explained) and thus its falsifiablity. However, there is no in principle reason why a ToC can’t be changed. If it is, it becomes a different ToC, subject to a separate evaluation. It is not the same one as before. The only point to note here is that the findings of the adapted version would not validate the content of the earlier version.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Lack of adaptability may also be a problem. It was suggested &amp;nbsp;evaluators should ask 'When has the ToC been reviewed and how has it been adapted in the light of implementation experience, M&amp;amp;E data, dialogue and consultation with stakeholders?” If the answer is not for a long time, then there may be doubts about its current relevance, which could be reflected in limited ownership.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Clarity of logic as well as evidence&lt;/b&gt;: One commentator suggested that it might be made clear whether a given cause is both “necessary and sufficient”, presumably as distinct from alternative combinations of these terms. &amp;nbsp;Necessity &lt;i style="mso-bidi-font-style: normal;"&gt;and &lt;/i&gt;sufficiency is a demanding criterion, and arguable whether which many programs would satisfy, or perhaps even should satisfy.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Simplicity&lt;/b&gt;: This suggested requirement (captured by Occam’s razor) is not as simple a requirement as it might sound.&amp;nbsp; It will always be in tension with its opposite (captured by Ashby’s Law of Requisite Variety), which is that a theory must also have sufficient internal complexity in order to describe the complexity of the events it is seeking to describe. Along the same lines some commentators asked whether there was enough detail provided, the lack of which can affect verifiability and testability. &amp;nbsp;Simplicity may win out as the more important criteria where a ToC is primarily intended as a communication tool.&lt;/div&gt;&lt;div class="MsoListParagraph" style="margin-bottom: 6.0pt; margin-left: 18.0pt; margin-right: 0cm; margin-top: 6.0pt; mso-list: l4 level1 lfo5; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Justifiability &lt;/b&gt;was highlighted as important.&lt;b style="mso-bidi-font-weight: normal;"&gt; Plausiblity&lt;/b&gt;was questioned “&lt;i style="mso-bidi-font-style: normal;"&gt;What that does really mean? If based on common sense then it is incompatible with being evidence based! If humanity had to rely on common sense, the earth would still be flat!!” &lt;/i&gt;Plausibility is clearly not a good evaluation finding. But it is a useful finding for an evaluability assessment. If a ToC is not plausible then it makes no sense to go any further with the design of an evaluation. Justifiablity is evidence of a good ToC, and is a judgement that might follow an evaluation. However, it might also be obvious before an evaluation, through an evaluability assessment, and lead to a decision that a further evaluation would not be useful.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;Informed sources mentioned by contributors&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Connell, J.P. &amp;amp; Kubisch, A.C. (1998) &lt;i style="mso-bidi-font-style: normal;"&gt;Applying a theory of change approach to the evaluation of comprehensive community initiatives: progress, prospects and problems&lt;/i&gt;, in: K. Fulbright-Anderson, A.C. Kubisch &amp;amp; J.P. Connell (Eds) New Approaches to Evaluating Community Initiatives. Volume 2: Theory, measurement and analysis (Queenstown, The Aspen Institute).&amp;nbsp; [courtesy of &amp;nbsp;&lt;a href="http://books.google.co.uk/books?hl=en&amp;amp;lr=&amp;amp;id=7bIu-i4mJOQC&amp;amp;oi=fnd&amp;amp;pg=PR7&amp;amp;dq=Evaluating+the+Complex.+R.+Schwartz,+K.+Forss+and+M.+Marra,&amp;amp;ots=VDM5i0BpBw&amp;amp;sig=bKon3WzEjehR2lGJ_EXPr5UQpgs#v=onepage&amp;amp;q&amp;amp;f=false"&gt;John Mayne&lt;/a&gt;]&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 18.0pt;"&gt;Connell and Kubisch suggest a number of attributes of a good theory of change. &lt;/div&gt;&lt;div class="MsoListParagraphCxSpFirst" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l5 level1 lfo4; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;It should be &lt;b style="mso-bidi-font-weight: normal;"&gt;plausible&lt;/b&gt;.&amp;nbsp; Does common sense or prior evidence suggest that the activities, if implemented, will lead to desired results?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l5 level1 lfo4; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;It should be &lt;b style="mso-bidi-font-weight: normal;"&gt;agreed&lt;/b&gt;.&amp;nbsp; Is there reasonable agreement with the theory of change as postulated?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l5 level1 lfo4; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;It should be &lt;b style="mso-bidi-font-weight: normal;"&gt;embedded&lt;/b&gt;.&amp;nbsp; Is the theory of change embedded in a broader social and economic context, where other factors and risks likely to influence the desired results are identified?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 54.0pt; mso-add-space: auto; mso-list: l5 level1 lfo4; text-indent: -18.0pt;"&gt;&lt;span style="font-family: Symbol;"&gt;·&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;It should be&lt;b style="mso-bidi-font-weight: normal;"&gt;testable&lt;/b&gt;.&amp;nbsp; Is the theory of change specific enough to measure its assumptions in credible and useful ways?&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 54pt; text-indent: -18pt;"&gt;&lt;br /&gt;&lt;/div&gt;Other sources that may be of interest&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;a href="http://www.eval.org/summerinstitute/06SIHandouts/SI06.Leviton.PL3.Final.pdf" target="_blank"&gt;Evaluability Assessments - Achieving Better Evaluations&lt;/a&gt; Nicola Dawkins, 2005 PowerPoint&lt;/li&gt;&lt;li&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="MsoNormal"&gt;PS 30 April 2012: See also HIVOS posting on "&lt;a href="http://www.hivos.nl/eng/Hivos-Knowledge-Programme/Themes/Theory-of-Change/Resources/8.-How-can-I-recognise-good-quality-ToC" target="_blank"&gt;How can I recognise a good quality Theory of Change?&lt;/a&gt;"&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-4836947793652117567?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2012/04/criteria-for-assessing-evaluablity-of.html</feedburner:origLink></item><item><title>Can we evolve explanations of observed outcomes?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/dc4F9nuQOHs/can-we-evolve-explanations-of-observed.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Wed, 11 Apr 2012 14:57:41 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-6906376128510397291</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;In mathematics and computer science, an optimization problem is the problem of finding the best solution from all feasible solutions. There are various techniques for doing so.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Science as a whole can be seen as an optimisation process, involving a search for explanations that have the best fit with observed reality.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;In evaluation we often have a similar task, of identifying what aspects of one or more project interventions best explains the observed outcomes of interest. For example, the effects of various kinds of improvements in health systems on rates of infant mortality.&amp;nbsp; This can done in two ways. One is by looking internally at the design of a project, at its expected workings and then trying to find evidence of whether it did so in practice. This is the territory of theory led-evaluation.&amp;nbsp; The other way is to look externally, at alternative explanations involving other influences, and to seek to test those. This is ostensibly good practice but not very common in reality, because it can be time consuming and to some extent inconclusive, in that there may always be other explanations not yet identified and thus untested. This is where randomised control trials (RCTs) come in. Randomised allocation of subjects between control and intervention groups nullifies the possible influence of other external causes.&amp;nbsp; Qualitative Comparative Analysis (QCA) takes a slightly different approach, searching for &lt;i style="mso-bidi-font-style: normal;"&gt;multiple&lt;/i&gt; possible configurations of conditions which are both necessary and sufficient to explain all observed outcomes (both positive and negative instances). &lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The value of theory led approaches, including QCA, is that the evaluator’s theories help the search for relevant data, amongst the myriad of possibly relevant design characteristics, and combinations thereof. The absence of a clear theory of change is often one reason why baseline surveys are so expansive in contents, but yet rarely used. Without a half way decent theory we can easily get lost. It is true that "There is nothing as practical as a good theory" (Kurt Lewin)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;The alternative to theory led approaches&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;There is however an alternative search process which does not require a prior theory, known as the &lt;a href="http://en.wikipedia.org/wiki/Evolutionary_algorithm" target="_blank"&gt;evolutionary algorithm&lt;/a&gt;, the kernel of the process of evolution. The evolutionary processes of variation, selection and retention, iterated many times over, have been able to solve many complex optimisation problems such as the design of &lt;a href="http://www.stuff.co.nz/science/6549057/Deep-study-of-the-gannet-collision-zone" target="_blank"&gt;a bird&lt;/a&gt; that can both fly long distances and dive deep in the sea for fish to eat. &lt;a href="http://en.wikipedia.org/wiki/Genetic_algorithm"&gt;Genetic algorithms&lt;/a&gt;(GA) are embodiments of the same kinds of process in software programs, in order to solve problems of interest to scientists and businesses. These are useful in two respects. One is the ability to search vary large combinatorial spaces very quickly. The other is that they can come up with solutions involving particular combinations of attributes that might not have been so obvious to a human observer.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Development projects have attributes that vary. These include both the context in which they operate and the mechanisms by which they seek to work. There are many possible combinations of these attributes, but only some of these are likely to be associated with achieving a positive impact on peoples’ live. If they were relatively common then implementing development aid projects would not be so difficult. The challenge is how to find the right combination of attributes. Trial and error by varying project designs and their implementaion on the ground is a good idea in principle, but in practice it is slow. There is also a huge amount of systemic memory loss, for various reasons including poor or non-existent communications between various iterations of a project design taking place in different locations.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Can we instead develop &lt;i style="mso-bidi-font-style: normal;"&gt;models&lt;/i&gt;of projects, which combine real data about the distribution of project attributes with variable views of their relative importance in order to generate an aggregate predicted result? This expected result can then be compared to an observed result (ideally from independent sources). &amp;nbsp;By varying the influence of the different attributes a range of predicted results can be generated, some of which may be more accurate than others. The best way to search this large space of possibilities is by using a GA. Fortunately Excel now includes a simple GA add-in, known as &lt;a href="http://www.easyexceltutorial.com/excel-solver.html" target="_blank"&gt;Solver&lt;/a&gt;.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The following spreadsheet shows a very basic example of what such a model could look like, using a totally fictitious data set. The projects and their observed scores on four attributes (A-D) are shown on the left. Below them is a set of weights, reflecting the possible importance of each attribute for the aggregate performance of the projects. The Expected Outcome score for each project is the sum of the score on each attribute x the weight for that score.&amp;nbsp; In other words the more a project has an important attribute (or combination of these) the higher will be its Expected Outcome score. That score is important only as a relative measure, relative to that of the other projects in the model.&lt;br /&gt;&lt;br /&gt;The Expected Outcome score for each project is then compared to an Observed Outcome measure (ideally converted to a comparable scale), and the difference is shown as the Prediction Error. On the bottom left an is aggregate measures of prediction error, the Standard Deviation. The original data can be found in &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2012/03/MCA+GA-vs1.xlsx" target="_blank"&gt;this Excel file&lt;/a&gt;.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&amp;nbsp;&lt;img alt="" height="262" src="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAloAAAD3CAIAAABKJG2kAAAgAElEQVR4nO19S0/jWLuu/8Tee7L3+QEMkACJI33SHh7pCJ0RowyYZLLFnGEYRGLCcIsx1IQpAwZHn0Q1qf7U3XzdXae51YVuiENCSJxwSUhcFAVVBTkD35bt5cR27GWv5efRo1IqF79rvevyrGWv90UaAAAAAEDmISVdAAAAAABIHpBDAAAAAIAcAgAAAADkEACAlOPu/qnVfai172VFBePmxfV9++7Lp4evSTd7AoAcAgCQUjx9e27cfpYVtaz039fu/jjrvD29AePjfrnzvnan6aLSefj+/JJ0F2AKyOEIPH17Vj9/7aiPoB/ef/n27ftz0o0GiIDvzy+V1qey0v/nyfXOfhNkybenN7Ki1tr3mVJEyKEnvjx9v7jC/ZkwzOC6EogcjduHstL/8biVuDZkkz99uJIVtX33JemOwA6QQzp690+yop42+r+eXP/jXTvxrskLf/pw9cdZp6z0K61PD0/fk25GgFd8evgqK+rvp9gXJkltj/j58VvS3YERIIcUPH17llvq+9rdDwdK4j2SR745bp02+rWr+xdsEYFQuO0/yoqKAZgsS0eKrKgd9Snp7sAIkEMK2ndfykr/zRGGYnj+/KEtK2pHfUy6MQEu0bj9/Ge9l3g3Bk8bfaXzkHR3YATIIQWV9qd31bvEOyLvzNRAAqJFrX1/dN6l9avNOUnH3Jrx5lpxcdv+Nfc7tk8XrN9q15zfHPqFoPR9wXENDeWwi7tK6MH3tbtmZkYx5NCJ788vsqK+Pb2Jq49mhsfn3Vr7Pun2BLhErX1/VOm4O9WrpQVD5/YW54uv1ooTk4Y0bhcnJEmSpImlPV0J9E9nFrebO2sLkiRJk8VX+6RObM5JkjQ5MzG/SXxhb3FSkiRpYnLG+sm+/Wr6dxZW9h2vXRec39wxL7i0Z7ye0WuhfUeXdvI6xqWkhRXyJ44yzBdf7Td3totzS5vWb7XvzBcNo5uUy2ol9DGKIYeZxrfvz7KiUs52Gx13YmnPs/cMX+uFWAluFyfIVfD4HFIGz4/0wexcko/iUaXDlxx+fvx2239sdh7AELztP36J7vCUlxyuzM/Y5VCTvYW5tearpeKKqZfE+7adkLZrNLq6/uXt4oTHF6x39m1XM1R5c3Fpj/J6uzgxv/lqaYa4iN2iqWT6+3uL88UV4jqk8Nt+Yq+R9tGrpYW5eeK3ZuF1t9AuS1YZcmgAcujEEDkkOiKxjqOsOslV2571BXvvpF/H1kd1HZqb15Z7mzvUdeLSgiXS5GJ5jfqrBdc6VFuEzkyYC1hKSaz5RZKIta1zPcurHL68DK57X7Qokfe1OzAoP17osdtRHbvwsTvcnJt0yOGM1W9dcqh/qg8NXTBW5o0hQPuC7R37DLAyr42XhZX9pv21dcGd/ebKvL081hcocrhIXIf4pv0nDoHX9oVLtvI45HCFflncLKUAcujEqN3hgrUCNRZcXqtOYug2d9aKi0tOObSvK4m1pElDUClLS+1NfZVHGV2vvH9FW9uShaeVhLbcpviBTznsqI+yoh5VOjjKGJo/HChHlU5U56e85JB4djjj2AUa6uV8f4gcmrd8JMcXtosT5s1Smhza7hXRXkvzmyvzkiXP5m0ex00mr+vsE0vPJdt37PvdvcVJ12XN2mlG5+kWnetdD0IOM40Ru8Pt4tzSHrngog8zY9W2Q35h3imHroWbq4/a5ZCyTtS+MFQO3b+irWdtN4goJaHJoaP8DvIih1pcDQ5PRcJ31btKSx0/CYO3HHrRuqU/7HEGGJCQw0xj1M1SXVTMgUeXQ3PV5pJD84G/ZF9j2taSawv6kLbLIWUN6JJDyiLR/SZlTTqzuKYvYK2SmMXYp8vh8OepvMih+vmrrKg/fbhKfOoRgFoqE/XzuAmgg8shGAshh5mGpxwypLVjC8oRp3U256I9mDOUvMjhDSK+o+MPB0ok90shhykh5DDTSIMcikFe5FA7RJO4u4Qh5FAkQg4zDchhVBRNDp2nan0zULRM0GicMJEzYekOQqAxXjmMNW7dp59jLcPw3sK8+pDDTANyGBUFk0PynLAtWoYa8WIFtxjRMsRDZUqIjhlz7RDdSCJnTFvzm8OCux2RP8ZNe/vZYyKinBqQE68c7tlDy4mlic2ZzWHVJFvK5l4yYspwCBEl5Toi4IoyMq7mvI5m3coSsOkKT3L3lmHBWs4eOCJeayxCDjMNyGFU5FoOzQNQxCmhvcVJfZqzyyEl4sUW3KIfd9IuSw/RcYW7WIeVxo+c2SGCZ4YGd7uLtzk3OTNnFZU8TuURkMNkd+gMZrc+LdprNyo2yREovO+xAzaipLx8bvMP5TpapIeeJcAWMr/vKqTRW4YEa+3Ye+DoeK0xCDnMNFjIITWhohWZNDTd4n7TFULrHVE78lL2iF2Pr9mPsK8teEVWOMi1HOrHa90LbXcuBdoRX1twCyGHXiE6rnCXYXIYInJmxwieGRrc7Y6c2ZwjFwR2OdyhBuQwkUNnMLsj3ml4DDvZUrakMJQj4s5j4R4+t/cN93WIwEdXeJWzkGZv8Q7W2rHL4eh4rTEIOcw0RoThu24o0W80UW7LuJLXUFIgWkOIuE/lkRHR9V/aHTzJaYjM67jvuvFFycdoX59qQ9ffAwyu5dBNQyAXVvad0TIjIl4IOdzxDtHR/b/kHcoSNnLGKrwV20MN7nZGzmgZ0ay8aObuyhGQw1IOqaHlLjkcVk27HHo1h1sOh/ncvxw6iuTqFURv8Q7WcvVA8+dejQI59APIoRNecui63WTc66DeaKKmjNm330Kh5FsiL6jfp/LKiGhZdPyX3LJQcnMUXU+Dmjaj9nsvO9YDpBlJkiaWinPa8ycf400wOUyGaYqc8UmeTpayO5nCa5YAyGGmMeRmqXG7yX6vg3qjybUOpdyKGSmH+nJvSEZE23/9yeGMbfFIk0NHBa2nEdvFOSNtlZ/7pZDDbJInOQRHEXKYaXjJoet2k3Vmz0/KmKByaN2nWnLfB7OS19j+676D55m50bj9RaRStArjvAtE/kWLGepzKSohh9kk5FAkQg4zDaFPljK9YwM5zCYhhyIRcphpCC2HTCmyHFL+/DotTHv0IWEBCTkUiZDDTANyGBV5lsO9xUkiAt286e16Nkz++XXjLrTtvO7Qk73Jt1EchByKRMhhpgE5jIo8y6EV4jJhHOh1nxym/Pl18hzTvo+TvSIScigSIYeZBuQwKnIthzvbxbml4uLS5uJ8cXFJi6x3nhym/Pn10XJoP9krIiGHIhFymGlocvgr5HBs8i2H+3uLkzOL281XSzPUP1w+R/3z65ocOs7rDjnZKyIhhyIRcphpvLwM5JaKoTg+T+q9y5vPSbfnaIQ9WRrimC6vsdiBCDkUiZDDrOPy5vOf9V7iHZFrlo4UWVFv+uNOiwyAQItoCTkUiZDDrKN3/yQr6u+nuF8ankfnXVlRH79+T7oxR+Om/ygr6usDJXGnCcDXB0okcnhxff++dpd4dcA/6z3IYdbRuH2QFXW/3CkdYZYMxjdHyrvqnayoHfUp6Wb0BW3184937cRdJwD/8a4tK2rvftymb3UfzprYsidMbXHDxT2eSAA5pOP5+UW7hyYr6ull/33tjiMen3ejVfHf/rr2afqsqcqKWmlFsD9ghqdvz7KCp8XR8KjSkRX16dvzmI3S/fSINUri/PlDNIsbXgA5HIbHr9876mOr+9Ds8MRKSz0670Y1JH48bsmKennz2Y/p9t2Xu/unb9/HnQ0ZQ7tf+sdZB7dMQ/P1gfLHWSeqzcT355dK69NJvYcWSbBBTy/75+1Pz88v4zcoF4AcCoiO+iQr6o/HrUhGxbtq97z96UXoEfHyMlA6D9rNAP+78HfVu58/RLl9eX2gHFQ6id9dCMGPF3ea99p3X6LqKp8evsqKenTe/QGKyJylI+V97U5W1M+P36JpTh7ASA5VuQwyY1+WK43eu2oEG0Rta9iuNROvFANeV+tKvV1vdHyy2riL8MjV6wP9meuF7wKkikq9fVu9iLZFWheK9rTi7enNzx/aP3248s+oloND+OZICVQkLvjLx6s/zjpnzb6s9H0OfDYiwgCQQzHZrjVlRf3l49WYA/5dtVtp9vqynHiNUsi+LDcub7QdzPj39LSnbq0LJfF6pYq31Qtt2RGCMSXT+PG49fb05qTeC1cqLlhrdDvnVZ9txEZEGAByKCb7slxtdE/Gi5785eNVdraGodmqNWVFPan33oxxfEl76ta8vEq8Oulkr3J+U623LhT/lBX17elNhCr484f2QaVzeqmr4EWjE6g8vPC2etGrVAK1DhsRYQDR5PBweUpP+5Fb9/rC7PIu/ecbOfqvzPftX9Au5XlBr6ux4k21Livq3hgbxJN6D1tD367un172wp2E/O2va1lR65e3iVdEJEYihz8cKHsfr47Ou2fNvqyostK/vLy5rjaCCobYZCMiDCCgHM4u76ryel7KbWn/nZ6Scrm8JEmSlN8gvyBJ0tRqqWy+np2WJEmytG0jJxGYXV5fnTZeT09JuXVTDo0vGNJYKsxO5/LW1Qhb2jVZyWSt0T297Ie7j6dtDW+q9cQHGxfsVqrVxl1Z6Qe9Qbf38UrbamDZESF7lco4cvjmSPn15Fp7lCsraqXZa15e3VTraCMq2YgIAwgoh+Tu8HB5Ssqtb+Wk/EZZ08jV5anZ5V39nY0c8Sl9P2cIqrU71K6pOnaHpcKscXG1VJidLhwaV6PbYsLOeVVW1N/+CvME5c96r9boJj7SOGJfluuNjqyoB75DGH/+0C4r/Vqji3k2WvYq5yHk8M1x6/fTa/OhYK3RVertznkt8eqknGxEhAEElEPy1uUhKX52OTQl00sOt3K2vaD5BdOE/WapdXG3HJI7wq2cJE0XDlk55PLy5rTRCxqVj61haCr1tqyoHy7uRoYH/HjcKiv9auOuVzlPvNiCUZNDn4d+f/pwtV+2PRRs15rdit+DJCAbEWGATMihedvTUi/iHeP11OpGYZa4WWrJYS6nv18qzJq/IuTQ2o+a91enC4cl42qELf2aDOWwc16Vlf5+OUDKldcHCraG4/C62pCV/ullf8hZ/zdHylmzX1H6mHbjoCaHQ/5q6WvtoWClg4eC45ONiDCAaHI4iut58ulgNqjU22Wl73+DqJ3swNZwHHbOa5Vm76zZp8a6lI6U08u+rPRxIy4meslh6Uj59eT6+LyrPxRU+s3Lq2s8FByPbESEAbImh1lkr1KRlb7PtG2vD5TTRu+i0Um82LyzV6nUGl33E6wfDhTt6VTkQeugyc55jZRDx0PBauMujqQBmSUbEWEAyGEmqIVh+cnToW0N/UfggkPojtN/faC8r97JinpdbSRePIF5W72QFXW/3LE/FOy28FAwBrIREQaAHGaCWtq299URf0BO2xpeXt4kXmCR2Cbi9LXbdC1kNoiZmhxqvLy8aeOhYJxkIyIMADnMCv2kbcPWMCbeVOsVRTuyoSr1duLlEZ59WW5dKHgoyIZsRIQBsi2HGzktNj9WKyMT5URELdhfkqTcFu0LI9O2lY6Us2bfa2to1cLj+sKQzKsQ4IdDkhCVCrPTheNKtdboKl5p2JLOYTQGzY5HnNwe/7JatFJon/NDv/NDqTA7Xdhyu9ce/ZUI2YgIA2RZDndXp6fyuam4R9QhGYwYp5WRFdHStnmdPt8vd4ZsDcloSwHmoNHtZQSkmuE0+VyOkmaIktKI+FSLzJmeIpp+3ZkgyZbDqJDXFxzW11QzF4Qk5ZcLs/q8SZYhFe4yM1E4Q5jspTXyWhRWp408TY7qOJ1G9VjhUN5d9XYazWPl9DiN5kBqIi2nQ7aWyaRauS3DCfa+Z/MAIZPuVlin5+2aDjbG2YgIA2RYDrVBVSrMxrzdsVZ/ccrhVs7XbsYrbVvpSCkr/SEppK1ZT4gluY/20uNQtVbTFwGlwqyUyw9NaURJQrSRM5venRGCTNpgXo382hYZPjtdOJTX8/YyJOsuItHSel6ayuesJE15R3VoSZ20j4Y4zdNjsrMJKN8kPLZlL0bi3czV32iJtFwO2bLlFbE6nle+EXIV7tUK7rxdQf3DRkQYILtySNz9i/d+qSEku/pyOL7q+OjEXmnb9ssdWekPOW6Qwd2hsSrX5dBcwueHpjSiJDxyT0nh5FC/jrMMaXKXUSlCgaTpwmrONt1b35/O5aeNr3k4bSw5JDy25coPlRJ6JdKiOmRMOXS3gucPg1SBjYgwQGbl0BKnuOd3NrvDkc8OTV5e3pw1bVH52tZw+BEPVrVInvZ10u7qtH4HyVrCD09pRPmUcJrXp1oOo2ljY7pMfI0mh1sbti8kSuvZocNRVg4md1oouaxqN/rM9IdeThvykbfTKB5zXCrpbkb2t+GJtMhab9k+JW6luvoe5RkNrRU883YFqQIbEWGAzMphdtmtONO2jdwagskeVQDBNJONiDAA5DCLJNO2/Xjcwul/EARDk42IMIBQcigB/vAv//pvJ7Wb4/Puzn7zUL49qd38y7/+W0y2Eh+rAnSwxCvOi+sSr2Y23cJGRBhANDkUxkrchlq1phZ0EWuSFB5nqLRVRwwfMqgFj44Swy1sRIQBIIcptRK3ob4sV5o97S99x5e5g8cZKm3VEcOHYsz7PJYZcugfTOVwZP6FYQcWyGihEhlaO0bDl2x/4DCu7hXKSlBDVrSc71gOLW1bO9DWsFSYDWKCxxkqmuo4Gt04uaefjw/iRjF8GGrUBAtM4tFRw8tMGdRWR9IP9I6cWCCH/sFaDm1/OF7PfeDKOjEi60dZLe0e0mJpwzR8qZCPWw5DWQlpaCMXKIbyf/7tP4Ncf3c1V9hazkEOfZFo9K2cFv2yns+tB3WjGD4MWIvAPY1TR/kqMzGozY40K0np6T9sRIQBEtsdmtF+7ghQSkYG2ZVLAnJI4e7qdLAMO8FMbOTyG+VDyKFPEo3uCFsM5EYxfBh3T+PUUT7KbBvUhhzuknliE3cLGxFhgER2h7bXVDk0b6hCDv1zKxc4600QE1Z2RP+R+DzOUJFVx93opUJ+eT2oG8XwYdw9jVNHjSyzY1C7dofa/YaE3cJGRBggeTmkJEQYnvWD+IlDEcM91fOzwmJvJaihcH+HIcRQwe7QF81Gn9YzgGv91VzmY3c4ktgdHtrzoZNzo56a3EeCScihf6TqZOl6frz8SWzGA7NRx8BQ3CZ4nKHSVh0xfChAZ+a0zJBD/0iVHHLQ8MyssDEEOUx/dcTwoQCdmdMyQw79A3KYUitsDEEO018dMXwoQGfmtMyQQ/8QTQ6BtIFN04vdwRKvOC+uS7ya2XQLGxFhANHkUBgrbAzFbYLHGSpt1RHDhwJ0Zk7LDDn0D8hhSq2wMQQ5TH91xPChAJ2Z0zJDDv2DyyRt5nUch4zDNDybXFDBrQQ1FCJJGwMTPM5QkVTH6SsiZ9vIUSCkD9PWmVNCX2V2JPwL2Jcgh/7BZ5I2uazK5a2cMyojeMOzyQUVxkq4fny4PBUoSVuYoRIkDxyPM1SE1bE1hz0qH3GHoxkw4yCPjvJbZkdKhyB9CXLoH9wmaaPttwI3PJtcUKGsBDekrRviTNIml4PmgeNxhoqoOq7msE1hu6u5bGX2ibunceqoIWXeytnuLnjL4Yi+BDn0j+Sz0oRK0raep91ICdjwbHJBhbQSsh8HzAYX1ETQPHA8zlBRVodsDvJ19nbYcfc0Th0Vwe5wVF+CHPpH8nIYIknblpGjbdwkbXpJWOSCint3aPokvvtLIfLA8ThDRVIdqzlyudnlXXuivizusOPuaZw6yv+zQ8lI+Be0L0EO/SNVJ0uRpI21obhN8DhDpa06YvhQgM7MaZkhh/6RKjnkoOGZWWFjCHKY/uqI4UMBOjOnZYYc+odocgikDWyaXuwOlnjFeXFd4tXMplvYiAgDiCaHwlhhYyhuEzzOUGmrjhg+FKAzc1pmyKF/QA5TaoWNIchh+qsjhg8F6Myclhly6B+ZzkrDJvlFOCtBDbGoiyM7Rgy+Sjn9V8fZHOZfbd2wv86MDwPXgk26qKQ5vMyUQW12npzfXgQ59I+MZ6XRS8Ui+UXAFBshDcWdyCPmuMaUM3B1jObYymmn4dfzuXXydXZ8GLAWbNJFJU9fZSYGtdl5JEny2Ysgh/7BZ1YaPfJm7Kw0jDK5lEOk2AhlKP5EHpDDUM1hzGK7q7nCKvHaT1oGMXwYrBZs0kWlgD7KbBvUZkea1eVwdC+CHPpH8mH4obLS6FeIJAyfwRQfIsVGCEMsEnlADkM1B3aHQWrBJl1UKjiyzI5Bjd1hrEheDsfJSjPms0MGmVzUsCk2ghpikcjDlg4jFl+lnMGfHRqZRPDsMHgtsDu0DeqNnK0j4dlhDEjVyVJkpWFtKG4TPM5QaauOGD4UoDNzWmbIoX+kSg45aHhmVtgYghymvzpi+FCAzsxpmSGH/iGaHAJpA5umF7uDJV5xXlyXeDWz6RY2IsIAosmhMFbYGIrbBI8zVNqqI4YPBejMnJYZcugfkMOUWmFjCHKY/uqI4UMBOjOnZYYc+geXWWn0/459sjREmpUQVthkpTGqE2MijxAV4XGGiqQ6FF+ZrWMelvbnRjF8GHdP49RRI8rsPoRsdp5lv70IcugfnGalWc9PF1ZzY8uhHDiQLnT3Cpr7JrghVok8sveX3MNXx/IV0TobBeFn+UhqwShdVIrdQglRNTuP714EOfQPLrPSaONkixs5DJP7JrAhRok8sviX3MNWh/CVrXX0MHPEHXqTWbqoVLtli5LAyOw8fnsR5NA/kg/DD56VxjNpRVrlkIEhRok8gia+4XGGiqo6hK9orVMqzCIrzXBmIAVS4N2h4Rm98/joRZBD/0heDkNkpTH6SjTPDqWY/0pDuNw34fpxrLvDEIlveJyhIqmOMysN0TpGf8CzQ0+ySReVBo4osz2ZkbVVMB4e4dlhtEjVyVJkpWFtKG4TPM5QaauOGD4UoDNzWmbIoX+kSg45aHhmVtgYghymvzpi+FCAzsxpmSGH/iGaHAJpA5umF7uDJV5xXlyXeDWz6RY2IsIAosmhMFbYGIrbBI8zVNqqI4YPBejMnJYZcugfkMOUWmFjCHKY/uqI4UMBOjOnZYYc+geXWWnM64x5spRN8gtGWWmYZNgJmviGxxkqkuq4Gl3PNaE3UBA3iuHDQLVAVhqdrqw0hGfsPSpRt7AREQbgMiuNl2qGa3hGyS+CJHMJaSj2GMrAiW94nKGirI7R6PYAmGBuFMOHIWqBrDRecYeHy1Oz0ynqP2xEhAF4zUoj0VZGwRueWfKLYMlcQhqKWw6DJ77hcYaKrjpWozv+rHkgN4rhw4C1QFaaskrPSqN7ZtV3EDDk0D+SD8MPnpXGoOudkA0ff/KLoMlcQhpKX+IbHmeoqKpDNrohfuv5XCGoG8XwYZhaICvNkKw05s0zZKWJDsnLYYisNF5JK4I2PJvkFyGSuYQxxCTDjhow8Q2PM1Qk1XFmpTFah3gIhN2hJ5GVRqdnVhopv+zsUQm6hY2IMECqTpYiKw1rQ3Gb4HGGSlt1xPChAJ2Z0zJDDv0jVXLIQcMzs8LGEOQw/dURw4cCdGZOyww59A/R5BBIG9g0vfAdLPG6c+G6xKuZTbewEREGEE0OhbHCxlDcJnicodJWHTF8KEBn5rTMkEP/gBym1AobQ5DD9FdHDB8K0Jk5LTPk0D8ghym1wsYQ5DD91RHDhwJ0Zk7LDDn0Dy6TtDlTXoVteKGStLnyOUVuIkRFeJyhoqmOI2ceNXYIgRYjHJj1JG2UEWf2K9+9CHLoH7wmaaNO+iEbPmD6NDZWghryjNiNui6BUmfxOENFVh0iilx3mvbORkH4WX7sWgROB8ipo3yV2TF1BOxFkEP/4DJJmxmLGkVWmsDp09hYCWqIls8p8roETp3F4wwVWXVsSVW0nD5GG00H2MeL4cOgtzqCpgPk1FE+yuyaOvR+5bcXQQ79I/msNCGStOlvlgqz9jRXIRo+RPo0NlaCGmK2OwyUOovHGSqy6hCOMjoA0Tqlwqy/lhLDh0FqESYdIKeOGllmytRBDkAfvQhy6B/Jy2GIJG22L4zR8OHSp7GxEthQ/M8OQ6TO4nGGiqY6Zs686SmtY5t9wHAjnh2OIHaHjszvZLY/A3h2GCVSdbIUSdpYG4rbBI8zVNqqI4YPBejMnJYZcugfqZJDDhqemRU2hiCH6a+OGD4UoDNzWmbIoX+IJodA2sCm6cXuYIlXnBfXJV7NbLqFjYgwgGhyKIwVNobiNsHjDJW26ojhQwE6M6dlhhz6B+QwpVbYGIIcpr86YvhQgM7MaZkhh/7BZVaaKOMOGSS/cCQoiccQiww7wQ+v8jhDRVMdZ6Pb8ygF6XVi+DB1nTkdHFFm14gjPEPPzJWIW9iICANwmZVGoxWSGL7hWSW/CBKrN5ahODPshAht5HGGiqw6tqw0ZB8L1uvE8GHaOnNKOLzMXiPucHlqdjpF/YeNiDAAl1lpVFlbXzvTowRueGbJLxjJYbwZdkIkvuFxhoqsOvYkbZK5lg/Y68TwYdo6c0roTw7JEadnhlr1Hc0MOfSP5MPwQ2Sl0d5394OADc8w+QUTOYw7ww52h2PvDtfzuULQXieGD9PWmVPCcLtD4mb76MEIOfSP5OUwTFYaeT1Pe64QruFj3x2aCUrizErDIsMOnh2GzUpj/pd4CITdoSfZpItKAwM8O9zIESmNpPyys0cl6BY2IsIAqTpZiqw0rA3FbYLHGSpt1RHDhwJ0Zk7LDDn0j1TJIQcNz8wKG0OQw/RXRwwfCtCZOS0z5NA/RJNDIG1g0/Rid7DEK86L6xKvZjbdwkZEGEA0ORTGChtDcZvgcYZKW3XE8KEAnZnTMkMO/QNymFIrbAxBDtNfHTF8KEBn5rTMkEP/4DIrjdfBs6ANzy75RfDcN0ENsahL8PQ6PM5QkVTH2RzmAenS6NcW+sQAABNESURBVFEgpA9T15nTQV9ldpzoDtiXIIf+wWVWGiseY/wkbXL5cHkq5uQXYXLfhKtO7HUJGEDJ4wwVZXXMvCobBUfrI9AigPfiM5E0/ZTZGX0YsC9BDv2D06w0WgS9c/EYvOH1FA/xJr8IlfsmuCEmdYEcBvg+mVdFz/lgzO+Zy+wTZhGZ7aw0Wzl9inPlpgnWlyCH/pF8GH6IrDT6m6XCrD2vR8iGj3eKD5n7JmR14pYryKHvL1PyqpQKs/oCP8C+RwwfBq0FstIQrqDlpvHdlyCH/pG8HIbIShPVs0MrxUP892Ti3h2yqEvw9Do8zlCRVMfqotPECk+/nxFs3yOGD0M9OxT/QbWvMpvPDnO5EH0JcugfqTpZiqw0rA3FbYLHGSpt1RHDhwJ0Zk7LDDn0j1TJIQcNz8wKG0OQw/RXRwwfCtCZOS0z5NA/RJNDIG1g0/Rid7DEK86L6xKvZjbdwkZEGEAoOQRBEAQZk42IMADkEARBEAxPNiLCAJBDEARBMDzZiAgDQA5BEATB8GQjIgwAOQRBEATDk42IMADkEARBEAxPNiLCAJBDEARBMDzZiAgDQA5BEATB8GQjIgwAOQRBEATDk42IMAAjOQQAAACANANyCAAAAACQQwAAAACAHAIAAADAAHIIAAAAAAPIIQAAAAAMIIcAAAAAMIAcAgAAAMAAcggAAAAAA8ghAAAAAAwghwAAAAAwgBwCAAAAwAByCAAAAAADyCEAAAAADCCHAAAAADCAHAIAAADAAHIIAAAAAAPIIQAAAAAMIIcAAAAAMIAcAgAAAMAAcggAAAAAA2ZyqMplEARBUDyyEREGgByCIAiC4clGRBgAcghGwK3lwmGpMDtdOGRr93C5sJV03UEw42QjIgwgnBxu5CRJkiQpvxH4h94/Wc/n1lW5rG4UVkshSxW4PG6WCrP+6zXcovap4zta7Yb80POj9dXlXU0R88u7qn9ddPuTNGG+HuJ2zeKYjgVBcAyyEREGEFAOzfl0K5fbksuHy7nVUlndKMxOmzK5uzotSVJuSy6r8npekiRpSvt0dnmd+Mj4dHpqVpdDTUW0S02tLuckSZpd3rVf3Lz+FGnX+GZOkiRpunBoXqREGJJyW7aykZfSXpBySHzTvKx5HX1ZMGUJifUd/VJEqYjyE+sJ3XX6akD/lctFulsOl6ckScpv7K7mSC3cXZ0mWqFUyJM/1/xZKsw6/EPUV3vT5vYS+ZN1uzkQBFmTjYgwgIBySMjAen56St896DK5u5orbBmz/OryrjHjm3O69ZFKTOJ2OTQmcU0kcoVD4uKH5q/Ij9z7sGXtHX3faRaDXgDtUvadkOOb2mXzOVt1rH0tabrk2h2S/6U5Kk9U323X8qH2esO2OzxczmnaNru8e7icM0q4vmronHbD01i1EMWmFk9ez+fWiZ/sQg5BMFmyEREGEFAOCeFZz2vbC9k2y6/mtL1gbsvYQZpf2CI+Uq1PbXpgymF+eZcqh8avKHKobaEkScrnbHJlFsOjABQ5JL9pXtbatLnk0DLtVL5gcugooUrKYamQJ++ayuabhdXl9dVcYXXZ5nxyWymRHh4th+ZPIIcgmDDZiAgDCCiH5j3ArdzUaqms/Uu+T3s9tbpRmJWkfI74iLiaNEoOqb+ybvQZj/285NAqxrL3pRzPyYiPyMua1xkmh1p5lo2HkXY5JC3OktUv0VwkE0I1rW2OpyTnTd2p1VL5cHnK7XzCJ1PO3WGJVjxSDqWp1ZJr+wuCIFuyEREGEE4OvRjJYZZELs4BdSEM9VvrQeas30MxxE+mp7LteRBMnmxEhAEyI4cgCIJgDGQjIgwglBxKQPqQ+FgFQTBWshERBhBNDoWxwsZQ3CYghyAoPNmICANADlNqhY0hyCEIgmOSjYgwQEbl0DqauOwKV4/DSok4Oxq1oXGsBDYR0F2Qw9EsFWbtB54dyRCSLyGYQprdxnhn2CAlpwXztDz1+Fug1FcG2YgIA2RUDs1eks8FzrsWxspGmARvwQyFshLYREB3QQ59UYvXJIJcnQGvYLbZOa9R3qdmKKQNUjJRBhnOa6ipV+orMu9VYXZaknLr1gsi99bg7//9t79J0n/9nY2axIcsy6GWP8yd/CwOK2bfislQSCvBTcR3/QzTJodapggi/ULixQMT4m31Qrm8qjR6sqJeVxvOL1Dk0GuQ7q5OG9tBUw6Nn3ulvrLln3LkxLD/qvz3/+JfCgeDLMuhHp5vdKxZ38vw8FZiMxTaSkgTMVw/08TuEDTYl+Xrar15eSUrfVlRy0q/3OzLSr9XqTi/7JLDEYOUmovKO/XVliuBlNevIIfBwKYnBX+qZyLWZ4daipy4DI1jJaiJoO6CHI6mmY582pZFD88OM8W+LF9XG43LG00Fz5r9o0rnl49XPx63ZEVt1dvOn9gT35OpqdyD1JgWiHT/kpFqSnYmosoT2aYkwoRDDslPB5DDQGDTpdhMvsym+MgN9SqV2+pFX5aZ1QVyCIJD2KtU2tVG/fJWVlRZUU8v+weVzk8frnb2mxrfVbuVZo8csykkGxFhAMhhSq1Ea6gvy61aU1t41hsdc3RBDkGQPXuV83atWWt0NBX8s97bL3d+PG6ZKqjxl49XsqK2a83ECzycbESEAUSTQ8CN//W//8/P/+9PWVHf1+7+OOvIivp/d//57//xP9hYT3ysgmBK2K1UWxdKtdHVVPCk3vv99PqNSwVN/lnv1RrdlG8NVchhUCTeYNlk57x60ehoN2F++ajfgfnnyXVZ6Veavc55NfESgqDw7JzXlHr7vNHTVPB97e63v67fHCleKmiOU1lRb6r1xMs/kmxEhAEgh2KyV6kol1faM/nfT69fH9jG3k8frs6a/YrSv61eJF5UEBSSZJiErKjH591/nlyXRqmgxtcHymmjd9HoJF4LP2QjIgwAORSN5GPCg0rHa/i9OW6dXvZlRW2745lAEAxFd5jE0Xl37+PVDwe+VNDkb39dy4rKy/0bNiLCAJBDoXhTrVcbd9oNGfeTeQdLR8pJvUc/xg2CoG96hUkEkkByYJaV/uXlTeL18kk2IsIAkENBaD4m/It4TOjnnszxeVdW1MblTfqf2INgqjgyTCIc98sdWen3KueJV9An2YgIA0AOuefwx4R+eFDpOAIwQBD0os8wiXB8c9wqK32Fqxs2bESEASCHHJN8TLhf9nxM6P9ZRbXR5WhNCoIsGTRMIhzfVbsVakq2FJONiDAA5JBXmo8J31XvIhmTex+vykq/ovR5eYAPggwYLkwiHLWUbOmPu3eQjYgwAOSQP5qPCf+s937+0I5wNP7jXfus2ZeVPhfRTiAYH8cJkwjNk3ov/SnZ3GQjIgwgoBxeVxv1Rke7hSgqz5r93/66jmNAvjlStACMEKw27i4vb+h/mw0E46SWj7d1oYzP8cMkwlFLyVZvdCKpBUt21MeY2Lt/+vL0nY1IDZjJIRs8P780bx+005XH5923pzdC8vfT61jH5w8HSohS/XHWeV+7Kyt9WVG7nx6T7gtAhqB+/lppfYpuramOEyYRmloCRdDNxu3np2/PDDqSUHLY7DzIivr7aSzbJtAPS0fKu+qdrKifHr4m3R2ATOC2/ygr6seLu18+XsX0SA9Mij99uHp7elNW+pXWJwaKKI4cqp+/yor6x1kn8SbMOF8fKCf13nn708tL0n0CEB0PT99lRT067ybe7cH4+Oa4ddZUG7ef4+5O4sjhTf9RVtQQUXdg5Pz15FpWVJY3/YFsotV9OGuqbJ7tgQlSiwR7iHlKEUcOL28+n9R7iTcbuGOcF1c/434pEC/OW5/eVe8S7/Bg3NSmlLv7p1i7kzhyWGvfv69hYKSCb44UyCHAAHg+kh3KitpR4z2jJ7wcbs7pf4Z2YWVtYW4tioYhrxP0mtvFCUmam1+YW/P+rfn+WnFxe+Q1N+fmN/VvahdfI16EqJHtfT8FoBByCLCBrKhvT2/s3W9vcdL429OTxVdRjXrHYDEva441bbBEbi58OV2D15pYfBfS/wxAzjlrQ+e3sIQcBgBVDl8tzZDSJUmSJM3oDaz9d7L4aq04MWm8v12ckCRJkiaW9oxxtbBCftm6jkQ0OfFNoidNTErSfJH4SB+ohBySP9SUe2Zi0iiA3qu8irc5J0nS5MzE/CZxKcn1Qiu/qzDWZfXv61WeL77ab+5sF+fMAujFnpmYXFjZb75aWtBKMre059VxIYcAG9Dk0Dn1E6PeMU71jr24be/2+66BY4qrc643xxo5nKWJyQVrzDqMauWZ37RekINxacHX5DO/qQ/D/c05bcaY33Qb0kplWCGG+TCfOAs5t9ZcmZfsF6E50PSDaWhyZoQVyGF8oMrhyvyMtbrRe/DmnNYFzeXPkvX+q6WiOeObHW7RnPdtC8C9xXl97Un5pmHO+REpM2sLE5PWp8Y3XYu4taHF2y4ackhI1HZxzrrg5qJh0f2Osbm01nG2i9t/NWf0+wmytJBDIDn4kUNi+NgGo/HfvcX54grR7WnDxDX8id2hPujI94cZ1RfojpW6flNHkxyjPO7Jh/i5OSolxzuLRHkoVob6xLmLsP9kiANfkevjUU6AHMYOP7tDUg5fLc3om7x5stlmzCXbyry2UdO2RMaX1yhySH7TMSCdH9nl0NgLLqzsN1fm7cs6eq8ii2fsKT3k0GbaXRgPOdzZLs4tbS4ubVKqsF2cWyouLm0uzhcXl4qvvDsu5BBgg0By6BiMxggyZ3O927uHCWX4+5ZD9+SwMm8OYXN3SI7fvcX54qLX5GP8yi2HTkNGCZ1Whvpkx1FI+5w5yoGecuhlBXIYF3w/O/Qhh9oG37gpOrG05xgP5vu2OxLGOxP24WF95JLDuXnbD3XTa+bTCA85tBdP8pDDkeW0vGF71ri3OOl8BmD8anNxckZbok4MXeVBDgE2CCSHzsFo+6+t2zuGSQA5JA8H0IyaNx6JO5AUOXzlMflYvzKem0gSrdjWWpawsm2fWKg+MbVQct0snbSVymmRlMNRToAcxo4oTpZaD+FDN5u1yYueERSPDSGHABt4yiEoHCGHAYBAi/QQcgiwAeQwO4QcBgDkMD2EHAJsADnMDiGHAQA5TA8hhwAbCCaHK0NPqCVI7Ux7smWAHAbAaDl0BKcHiEUNHr465CKBPvJJnwH4RiBjuOB6/4QcAmxAl0Myrs7X4LIHX434dOiXvYLf/aXUCBCHMDLUfUgYPvVSQ76wPyLOmA0hhwHgIYfUQFG97a0xY/uaI9h801f4qhGuawXOW1Gr5LAcEXdPHjbTT3OZF7dF9JOXcgfgDzU0OTO35h13HAUhhwAbUOWQCIq1h4STo0kfxbZcFq4Rt2B9um//smMAmqNpcsY6jD1ZfEUUgDy5bcwJZElcgfOU2HziypRsAAsr1szjDsO3DoLa5wczPJ+Y6NyJCOaLeh0hh1zAI+7QI1DU7C62KFEzANYebO4nfNUVjOi07opjdcfd26IkDZV1h8PbY2A3F8lTzsaLYYaIq3kmHIAcAjzA42bp3uKkPYmMazTZRrEe/+4x4oxP6f8lhtirpQUjp4fLnPecQIR+LZCB9u7YfEuNzOQhjplh0hlmTQu8doX5u3aHtEQEe5BDnuCRlcYjUNSuba6v2YPN/YSv0uTQdllXIDwl7p6ImXWHOrliYIn/uuRwmCEryIkWdww5BPjBsGeHtsWfO40GOfDJoF5yxBGZLvbd/3UMwM3Fpc2RcYruOcEhhzuOEHgyGNEdLe2YGWza7xWG7wzzd8shLV8H5JAr0G+WegWK2rXNFZBrDzb3E75KiXAfHbCv33o14u7JmFnqiPK06JLDYYbmrRss9ABeyCHACahyqI8jQ6gmrPyZnqNYosmhLdPFvud/bVH8bjm0xaR7p8LYN7XNMOFbDq1runaHjmB/15ygzx7OkwdDCgk55AIJnywd8cTeyqWUBUIOATYQ62Tp5txkSk+WRnDcb2xCDgMAgRbpIeQQYAOx5BAcRshhAEAO00PIIcAGkMPsEHIYAJDD9BByCLAB5DA7hBwGwMX1/ccLyGEq+Oa4BTkEGEBW1D/OOol3eJABIYcB0L77Ulb6ibcZuLPf3Pt4JSvq58dvSXcKQHBUWp+Oz7uJd3gwbmo3nLqfIIf+cHf/JCvqryfXibcc+L56V2mpz88vSXcKQHAonYfTRv/1gZJ4nwdjpbbCvv8S7wpbHDl8eRnUr+/LSv/nD+3EGy/LfHt6w2AdBwCDwUD9/BWPD4Xn6wPl9LJfu7p/iXmBLY4cDgaDp2/P1fYnWVHf1+7ent6AjLlf7vx12ZMVtdV9SLovAFmB0nmQFfWo0nlz3Ep84gaj5esD5ZePV9qs8vD0Pe6+JJQcDgaD5+eXm/7jefuTrKgge9av73GCBmCJ5+eX696XxHs+GB+r7U9sDiKIJocmvn1/fnj6DrJk3LcyAMALj1+/d9THVveh2QHF4W3/Uf38ldkpBGHlEAAAAAD8A3IIAAAAAJBDAAAAAIAcAgAAAMAAcggAAAAAA8ghAAAAAAwghwAAAAAwgBwCAAAAwAByCAAAAAADyCEAAAAADCCHAAAAADAYDP4/tGyuEeXM/nkAAAAASUVORK5CYII=" width="640" /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The initial weights were set at 25 for each attribute, in effect reflecting the absence of any view about which might be more important. With those weights, the SD of the Prediction Errors was 1.25 After 60,000+ iterations in the space of 1 minute the SD had been reduce down to 0.97. This was achieved with this new combination of weights: Attribute A:19, Attribute B: 0, Attribute C: 19, Attribute D: 61.The substantial error that remains can be considered as due to causal factors outside of the model (i.e. as is described by the list of attributes)&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftn1" name="_ftnref1" style="mso-footnote-id: ftn1;" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 11pt; line-height: 115%;"&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;It seems that it is also possible to find least appropriate solutions, i.e, those which make the least accurate Outcome Predictions. Using the GA set to find the maximum error, it was found that in the above example a 100% weighting given to Attribute A generated a SD of 1.87. This is the nearest that such an evolutionary approach comes to disproving a theory.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;GA deliver functional rather than logical proofs that certain explanations are better than others. Unlike logical proofs, they are not immortal. With more projects included in the model it is possible that there may be a fitter solution, which applies to this wider set. However, the original solution to the smaller set would still stand.&lt;br /&gt;&lt;br /&gt;Models of complex processes can sometimes be sensitive to starting conditions. Different results can be generated from initial settings that are very similar. This was not the case in this exercise, with widely different initial weighting’s evolving and converging on almost identical sets of final weightings e.g. 19, 0, 19, &amp;nbsp;62 versus 61) producing the same final error rate. This robustness is probably due to the absence of feedback loops in the model, which could be created where the weighted score of one attribute affected those of another.&amp;nbsp; That would a much more complex model, possibly worth exploring at another time. &lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Small changes in Attribute scores made a more noticable difference to the Prediction Error. In the above model varying Project 8’s score on attribute A from 3 to 4 increases the average error by 0.02. Changes in other cells varied in direction of their effects. In more realistic models with more kinds of attributes and more project cases the results are likely to be less sensitive to such small differences in attribute scores. &lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The heading of this post asks “Can we evolve explanations of observed outcomes?” My argument above suggests that in principle it should be possible. However there is a caveat. A set of weighted attributes that are associated with success might better be described as the &lt;i style="mso-bidi-font-style: normal;"&gt;ingredients&lt;/i&gt; of an explanation. Further investigative work would be needed to find out how those attributes actually interact together in real life.&amp;nbsp; Before then, it would be interesting to do some testing of this use of GAs on real project datasets.&lt;br /&gt;&lt;br /&gt;Your comments please... &lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="mso-element: footnote-list;"&gt;&lt;b&gt;&lt;span style="color: red;"&gt;PS 6 April 2012&lt;/span&gt;&lt;/b&gt;: I have just come across &lt;a href="https://www.kaggle.com/competitions" target="_blank"&gt;the Kaggle website&lt;/a&gt;. This site hosts competitions to solve various kinds of prediction problems (re both past and future events) using a data set available to all entrants, and gives prizes to the winner - who must provide not only their prediction but the algorithm that generated the prediction. Have a look. Perhaps we should outsource the prediction and testing of results of development projects via this website? :-) Though..., even to do this the project managers would still have a major task on hand: to gather and provide reliable data about implementation characteristics, as well as measures of observed outcomes... Though...this might be easier with some projects that generate lots of data, say micro-finance or education system projects.&lt;br /&gt;&lt;br /&gt;&amp;nbsp;&lt;a href="http://www.abc.net.au/catalyst/stories/3296837.htm" target="_blank"&gt;View this Australian TV video&lt;/a&gt;, explaining how the site works and some of its achievements so far. And the&lt;a href="http://www.fastcompany.com/anthony-goldbloom-kaggle-whos-next" target="_blank"&gt; Fast Company interview&lt;/a&gt; of the CEO&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span style="color: red;"&gt;PS 9 April 2012&lt;/span&gt;&lt;/b&gt;:&amp;nbsp; I have just discovered that there is a whole literature on the use of &lt;a href="https://www.google.co.uk/search?q=genetic+algorithm+rule+discovery&amp;amp;ie=utf-8&amp;amp;oe=utf-8&amp;amp;aq=t&amp;amp;rls=org.mozilla:en-GB:official&amp;amp;client=firefox-a#hl=en&amp;amp;client=firefox-a&amp;amp;rls=org.mozilla:en-GB%3Aofficial&amp;amp;sclient=psy-ab&amp;amp;q=genetic+algorithm+rule+discovery&amp;amp;oq=genetic+algorithm+rule+discovery&amp;amp;aq=f&amp;amp;aqi=&amp;amp;aql=&amp;amp;gs_l=serp.12...0l0l0l9129734l0l0l0l0l0l0l0l0ll0l0.frgbld.&amp;amp;pbx=1&amp;amp;bav=on.2,or.r_gc.r_pw.r_cp.r_qf.,cf.osb&amp;amp;fp=759c2bde6e044c68&amp;amp;biw=1920&amp;amp;bih=930" target="_blank"&gt;genetic algorithms for rule discovery&lt;/a&gt; "In a nutshell, the motivation for applying evolutionary algorithms to data mining is that evolutionary algorithms are robust search methods which perform a global search in the space of candidate solutions (rules or another form of knowledge representation)" &lt;a href="http://books.google.co.uk/books/about/Data_mining_and_knowledge_discovery_with.html?id=KkdZlfQJvbYC&amp;amp;redir_esc=y" target="_blank"&gt;(Freitas, 2002)&lt;/a&gt; The rules referred to are typcially "IF...THEN..."type statements&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;hr align="left" size="1" width="33%" /&gt;&lt;div id="ftn1" style="mso-element: footnote;"&gt;&lt;div class="MsoFootnoteText"&gt;&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftnref1" name="_ftn1" style="mso-footnote-id: ftn1;" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style="font-size: 9pt;"&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt; line-height: 115%;"&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size: 9pt;"&gt; Bear in mind that this example set of attribute scores and observed outcome measures is totally fictitious, so the inability to find a really good set of fitting attributes should not be surprising. In reality some sets of attributes will not be found co-existing because of their incompatibility e.g. corrupt project management plus highly committed staff&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-6906376128510397291?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2012/03/can-we-evolve-explanations-of-observed.html</feedburner:origLink></item><item><title>Modular Theories of Change: A means of coping with diversity and change?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/dhAAJUCYvfA/modular-theories-of-change-means-of.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Mon, 02 Apr 2012 00:39:02 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-5372395231733067674</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;Two weeks ago I attended a DFID workshop at which Price Waterhouse Coopers (PwC) consultants presented the results of their work, commissioned by DFID, on “Monitoring Results from Low Carbon Development”. LCD is one of three areas of investment by&lt;a href="http://www.decc.gov.uk/en/content/cms/tackling/international/icf/icf.aspx" target="_blank"&gt; International Climate Fund (ICF)&lt;/a&gt;. The ICF is “a £2.9bn financial contribution … provided by the UK Government to support action on climate change and development. Having started to disperse funds, a comprehensive results framework is now required to measure the impact of this investment, to enable learning to inform future programming, and to show value for money on every pound”&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The PwC consultants’ tasks included (a) consultation with HMG staff on the required functions of the LCD results framework; (b) a detailed analysis of potentially useful indicators through extensive consultations and research into the available data; and (c) exploration of opportunities to harmonise results and/or share methodologies and data collection with others. Their report documents the large amount of work that has been done, but also acknowledges that more work is still needed.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Following the workshop I sent in some comments on the PwC report, some of which I will focus on here because I think they might be of wider interest. There were three aspects of the PwC proposals that particularly interested me. One was the fact that they had managed to focus down on 28 indicators, and were proposing that set be limited even further, down to 20. Secondly, they had organised the indicators into a LogFrame type structure, but one which is covering two levels of performance in parallel (within countries and across countries), rather than in a sequence. Thirdly, they had advocated the use of &lt;a href="http://eprints.lse.ac.uk/12761/1/Multi-criteria_Analysis.pdf"&gt;Multi-Criteria Analysis (MCA)&lt;/a&gt; for the measurement of some of the more complex forms of change referred to in the Logframe.&amp;nbsp; MCA is similar in structure to the design of weighted checklists, which I have previously discussed &lt;a href="http://mandenews.blogspot.com/2007/09/checklists-as-mini-theories-of-change.html"&gt;here&lt;/a&gt;and &lt;a href="http://mande.co.uk/special-issues/weighted-checklists/"&gt;elsewhere&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Monitorable versus evaluable frameworks&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;As it stands the current LCD LogFrame is a potential means of &lt;i style="mso-bidi-font-style: normal;"&gt;monitoring&lt;/i&gt; important changes relating to low carbon development. But it is not yet sufficiently developed to enable an &lt;i style="mso-bidi-font-style: normal;"&gt;evaluation&lt;/i&gt; of the impact of efforts aimed at promoting low carbon development. This is because there is not yet sufficient clarity about the expected causal linkages between the various events described in the Logframe. It is the case that, as is required by DFID Logframes, weightings have been given to each of the four Outputs describing their expected impact on Outcome level changes. But the differences in weightings are modest (+/- 10%) and each of the Outputs describes a bundle of up to 5 more indicator-specific changes.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;Clarity about the expected causal linkages is an essential “evaluability” requirement. Impact evaluations in their current form seek to establish not only what changes occurred, but also their causes. Accounts of causation in turn need to include not only &lt;i style="mso-bidi-font-style: normal;"&gt;attribution&lt;/i&gt;(whether A can be said to have caused B) but also &lt;i style="mso-bidi-font-style: normal;"&gt;explanation &lt;/i&gt;(how A caused B). In order for the LCD results framework to be evaluable, someone needs “connect the dots” in some detail. That is, identify plausible explanations for how particular indicator-specific changes are expected influence each other. Once that is done, the LCD program could be said to have not just a set of indicators of change, but a Theory of Change about how the changes interact and function as a whole.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Indicator level changes as shared building blocks&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;There are two subsequent challenges here to developing an evaluable Theory of Change for LCD. One is the multiplicity of possible causal linkages. The second is the diversity of perspectives on which of these possible causal linkages really matter. With 28 different indicator-specific changes there are, at least hypothetically, many thousands of different possible combinations that could make up a given Theory of Change (i&lt;sup&gt;i&lt;/sup&gt;, where i = number of indicator specific changes). But, it can be well argued that “this is a feature, not a bug”. As the title of this blog suggests, the 28 indicators can be considered as equivalent to Lego building blocks. The same set (or parts thereof) can be combined in a multiplicity of ways, to construct very different ToC. The positive side to this picture is the flexibility and low cost. Different ToC can be constructed for different countries, but each one does not involve a whole new set of data collection requirements. In fact it is reasonable to expect that in each country the causal linkages between different changes may be quite different, because of the differences in the physical, demographic, cultural and economic context.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Documenting expecting causal linkages (how the blocks are put together)&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;There are other more practical challenges, relating to how to exploit this flexibility. How do you seek stakeholder views of the expected causal connections, without getting lost in a sea of possibilities? One approach I have used in Indonesia and in Vietnam involves the use of simple network matrices, in workshops involving donor agencies and/or the national partners associated with a given project. Two examples are shown below. These don’t need to be understood in detail (one is still in Vietnamese), it is their overall structure that matters.&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;A network matrix simply shows the entities that could be connected in the left column and top row. The convention is that each cell in the matrix provides data on whether that row entity is connected to that column entity (and it may also describe the nature of the connection)&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The Indonesian example shown below shows expected relationships between 16 Output indicators (left column) and 11 Purpose level indicators (top row) in a maternal health project. Workshop participants were asked to consider one Purpose level indicator at a time, and allocate 100 percentage points across the 16 Output indicators, with more percentage points = an Output having more expected impact on the Purpose indicator, relative to other Output indicators. Debate was encouraged between participants as figures were proposed for each cell down a column. Looking within the matrix we can see that for Purpose 3 it was agreed that Output indicator 1.1 would have the most impact. For some other Purpose level changes, impact was expected from a wider range of Outputs. The column on the right side sums up the relative expected impact of each Output, providing useful guidance on where monitoring attention might be most usefully focused.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-Z_z30jS4-Bs/T19ij6ICEZI/AAAAAAAAF24/rU8mCkInvQw/s1600/Indonesia+matrix.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="352" src="http://4.bp.blogspot.com/-Z_z30jS4-Bs/T19ij6ICEZI/AAAAAAAAF24/rU8mCkInvQw/s640/Indonesia+matrix.jpg" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;This exercise was completed in little over an hour. The matrix of results show one set of expected relationships amongst many thousands of other possible sets that could exist within the same list of indicators. The same kind of data can be collected on a larger scale via online surveys, where the options down each column are represented within a single multiple choice question. Matrices like these, obtained either from different individuals or different stakeholder groups, can be compared with each other to identify relationships (i.e. specific cells) where there is the most/least agreement, as well as which relationships are seen as most important, when all satkeholder views are added up. This information should then inform the focus of evaluations, allowing scarce attention and resources to be directed to the most critical relationships.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;The second example of a network matrix used to explicate a tacit ToC comes from Vietnam, and is shown below. In this example, a Ministry’s programmes are shown (unconventionally) across the top row and the country’s 5 year plan objectives are shown down the left column. Cell entries, discussed and proposed by workshop participants, show the relative expected causal contribution of each programme to each 5 year plan objective. Summary row on the bottom shows the aggregate expected contribution of each programme and the summary column on the right show the aggregate extent to which each 5 year plan objective was expected to be affected.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span style="color: black;"&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-SeSzGEi2cAo/T19ij8BTl1I/AAAAAAAAF24/JoZSf-o-ess/s1600/Vietnam+matrix.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="500" src="http://4.bp.blogspot.com/-SeSzGEi2cAo/T19ij8BTl1I/AAAAAAAAF24/JoZSf-o-ess/s640/Vietnam+matrix.jpg" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b&gt;&amp;nbsp;Modularity&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The modules referred to in the title of this blog can be seen as referring to two types of entities that can be used to construct many different kinds of ToC. One is the indicator-specific changes in the LCD Logframe, for example. By treating them as a standard set available for use by different stakeholders in different settings, we may gain flexibility at a low cost. The other is the grouping of indicator specific changes into categories (e.g. Outputs 1-2-3-4) and larger sets of categories (Outputs, Outcomes, Purpose). The existence of one or more nested types of entities is sometimes described as &lt;a href="http://en.wikipedia.org/wiki/Modularity_%28biology%29"&gt;modularity&lt;/a&gt;. In evolutionary theory it has been argued that modularity in design improves evolvablity. This can happen: (a) by allowing specific features to undergo changes without substantially altering the functionality of the entire system, (b) by allowing larger more structural changes to occur by recombining existing functional units.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;In the conceptual world of Logframes, and the like, this suggests that we may need to think of ToC being constructed at multiple levels of detail, by different sized modules. In the LCD Logframe impact weightings had already been assigned to each Output, indicating its relative expected contribution to the Outcomes as a whole. But the flexibility of ToC design at this level was seriously constrained by the structure of the representational device being used. In a Logframe Outputs are expected to influence Outcomes, but not the other way. Nor are they expected to influence each other, contra other more graphic based logic models. Similarly, both of the above network matrix exercises made use of existing modules and accepted the kinds of relationship that was expected between them (Outputs should influence Purpose level changes; Ministry Programmes should influence 5Year Plan objectives achievements).&amp;nbsp;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;b&gt;The value of multiple causal pathways with a ToC&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;More recently I have seen the ToC for a major area of DFID policy that will be coming under review. This is represented in diagramatic form, showing various kinds of events (including some nested categories of events), and also shows the expected causal relationships between these events. It was quite a complex diagram, perhaps too much so for those who like traffic-light level simplicities. However, what interested me the most is that subsequent versions have been used to show how two specific in-country programs fit within this generic ToC. This has been done by highlighting the particular events that make up one of the number of causal chains that can be found within the generic ToC. In doing so it appears to be successfully addressing a common problem with generic ToC - the inability to reflect the diversity of the programs that make up the policy area described by a generic ToC.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Shared causal pathways justify more evaluation attention &lt;/b&gt;&lt;br /&gt;&lt;br /&gt;This innovation points to an alternate and additional use of the matrices above. The cell numbers could refer to the numbers of constituent programs in a policy area (and/or which are funded by a single funding mechanism) that involve this particular causal link (i.e. between the row event and the column event). The higher this number, the more important it would be for evaluations to focus on that casual link - because the findings would have relevance across a number of programs in the policy area.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-5372395231733067674?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-Z_z30jS4-Bs/T19ij6ICEZI/AAAAAAAAF24/rU8mCkInvQw/s72-c/Indonesia+matrix.jpg" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2012/03/modular-theories-of-change-means-of.html</feedburner:origLink></item><item><title>Evaluation questions: Managing agency, bias and scale</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/4dPDPYZYC4k/evaluation-questions-managing-agency.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Thu, 16 Feb 2012 03:25:39 PST</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-3803591654789748578</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;span lang="EN-GB"&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;It is common to see in the Terms of Reference (ToRs) of an evaluation a list of evaluation questions. Or, at least a requirement that the evaluator develops such a list of questions as part of the evaluation plan. Such questions are typically fairly open-ended “how” and “whether” type questions. On the surface this approach makes sense. It gives some focus but leaves room for the unexpected and unknown.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;But perhaps there is an argument for a much more focused and pre-specified approach.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;span lang="EN-GB"&gt;Agency&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;There are two grounds on which such an argument could be made. One is that aid organisations implementing development programs have “&lt;a href="http://en.wikipedia.org/wiki/Agency_%28philosophy%29"&gt;agency&lt;/a&gt;”, i.e. they are expected to be able to assess the situation they are in and act on the basis of informed judgements. They are not just mechanical instruments for implementing a program, like a computer. Given this fact, one could argue that evaluations should not simply focus on the behaviour of an organisation and its consequences, but &lt;i style="mso-bidi-font-style: normal;"&gt;on the organisation’s knowledge&lt;/i&gt; of its behaviour and its consequences. If that knowledge is misinformed then the sustainability of any achievements may be seriously in doubt. Likewise, it may be less likely that unintended negative consequences of a program will be identified and responded to appropriately.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;One way to assess an organisation’s knowledge is to solicit their judgements about program outcomes in a form that can be tested by independent observation. For example, an organisation’s view on the percentage of households who have been lifted above the poverty line as a result of a livelihood intervention. An external evaluation could then gather independent data to test this judgement, or more realistically, audit the quality of the data and analysis that the organisation used to come to their judgement. In this latter case the role of the external evaluator is undertake a &lt;i style="mso-bidi-font-style: normal;"&gt;meta-evaluation&lt;/i&gt;, evaluating an organisation’s capacity by examining their judgements relating to key areas of expected program performance. This would require focused evaluation questions rather than open ended evaluation questions.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;span lang="EN-GB"&gt;Bias&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;The second argument is arises from a body of research and argument about the prevalence of what appears to be endemic bias in many fields of research: the under-reporting of negative findings (i.e. non-relationships) and the related tendency of positive findings to disappear over time. The evidence here makes salutary reading, especially the evidence from the field of medical research where research protocols are perhaps the most demanding of all (for good reason, given the stakes involved). Lehrer’s 2010 article in the New York Times “&lt;a href="http://www.newyorker.com/reporting/2010/12/13/101213fa_fact_lehrer"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;The Truth Wears Off: Is there something wrong with the scientific method&lt;/i&gt;?&lt;/a&gt;’&lt;/span&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;is a good introduction, and &lt;a href="http://www.theatlantic.com/magazine/print/2010/11/lies-damned-lies-and-medical-science/8269/"&gt;Ioannidis’ work&lt;/a&gt; (cited by Lehrer) provides the more in-depth analysis and evidence.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;One solution that has been proposed to the problem of under-reporting of negative findings is the establishment of trial registries, whereby plans for experiments would be lodged in advance, before their results are known. This is now established practice in some fields of research and has recently been proposed for the use of randomised control trials by development agencies&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftn1" name="_ftnref1" style="mso-footnote-id: ftn1;" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Arial&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 10pt;"&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;Trial registries can provide two lines of defence against bias. The first is to make visible all trials, regardless of whether they are deemed “successful” and get published, or not. The other defence is against inappropriate “data mining”&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftn2" name="_ftnref2" style="mso-footnote-id: ftn2;" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Arial&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 10pt;"&gt;[2]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;within individual trials. The risk is that researchers can examine so many possible correlations between independent and dependent variables that some positive correlations will appear by chance alone. This risk is greater where a study looks at more than one outcome measure and at several different sub-groups. Multiple outcome measures are likely to be used when examining the impact on complex phenomenon such a poverty levels or governance, for example. When there are many relationships being examined there is also the well known risk of publication bias, of the evaluator only reporting the significant results.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;These risks can be managed partly by the researchers themselves. &lt;a href="http://mande.co.uk/2011/uncategorized/walking-the-talk-the-need-for-a-trial-registry-for-development-interventions/"&gt;Rasmussen et al&lt;/a&gt; suggests that if the outcomes are assumed to be fully independent, statistical significance values should be divided by the number of tests. Other approaches involve constructing mean standardised outcomes across a family of outcome measures. However these do not deal with the problem of selective reporting of results. Rasmussen et al argue that this risk would be best dealt with through the use of trial registries, &lt;i style="mso-bidi-font-style: normal;"&gt;where relationships to be examined are recorded in advance&lt;/i&gt;. In other words, researchers would spell out the hypothesis or claim to be tested, rather than simply state an open ended question. Open ended questions invite cherry picking of results according to the researcher’s interests, especially when there are lot of them.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;As I have noted &lt;a href="http://mande.co.uk/2012/uncategorized/3ie-and-the-funding-of-impact-evaluations/"&gt;elsewhere&lt;/a&gt;, there are risks with this approach. One concern is that it might prevent evaluators from looking at the data and identifying new hypothesis that genuinely emerges as being of interest and worth testing.&amp;nbsp; However, registering hypotheses to be tested would not preclude this possibility. It should, however, make it evident when this is happening, and therefore encourage the evaluator to provide an explicit rationale for why additional hypotheses are being tested.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;span lang="EN-GB"&gt;Same again, on a larger scale&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;The problems of biased reporting re-appear when individual studies are aggregated. Ben Goldacre explains:&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 36.0pt;"&gt;&lt;span lang="EN-GB"&gt;“&lt;i style="mso-bidi-font-style: normal;"&gt;But individual experiments are not the end of the story. There is a second, crucial process in science, which is synthesising that evidence together to create a coherent picture.&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 36.0pt;"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;&lt;span lang="EN-GB"&gt;In the very recent past, this was done badly. In the 1980s, researchers such as &lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-GB"&gt;&lt;a href="http://www.bmj.com/content/309/6954/597.full"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;Celia Mulrow&lt;/i&gt;&lt;/a&gt;&lt;i style="mso-bidi-font-style: normal;"&gt; produced damning research showing that review articles in academic journals and textbooks, which everyone had trusted, actually presented a distorted and unrepresentative view, when compared with a systematic search of the academic literature. After struggling to exclude bias from every individual study, doctors and academics would then synthesise that evidence together with frightening arbitrariness.&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 36pt;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 36.0pt;"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;&lt;span lang="EN-GB"&gt;The science of "systematic reviews" that grew from this research is exactly that: a science. It's a series of reproducible methods for searching information, to ensure that your evidence synthesis is as free from bias as your individual experiments. You describe not just what you found, but how you looked, which research databases you used, what search terms you typed, and so on. This apparently obvious manoeuvre has revolutionised the science of medicine”&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 36pt;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;Reviews face the same risks as individual experiments and evaluations. They may be selectively published, and their individual methodologies may not adequately deal with the problem of selective reporting of the more interesting results – sometimes described as &lt;a href="http://www.guardian.co.uk/commentisfree/2011/sep/23/bad-science-ben-goldacre"&gt;cherry picking&lt;/a&gt;.&amp;nbsp; The development of review protocols and the registering of those prior to a review are an important means of reducing biased reporting, as they are with individual experiments. Systematic reviews are already a well established practice in the health sphere under the &lt;a href="http://www.thecochranelibrary.com/view/0/index.html"&gt;Cochrane Collaboration&lt;/a&gt; and in social policy under the &lt;a href="http://www.campbellcollaboration.org/"&gt;Campbell Collaboration&lt;/a&gt;. Recently a new health sector journal, &lt;a href="http://www.systematicreviewsjournal.com/"&gt;Systematic Reviews&lt;/a&gt;, has been established with the aim of ensuring that the results of all well-conducted systematic reviews are published, regardless of their outcome. The journal also aims to promote discussion of review methodologies, with the current issue including a paper on “&lt;a href="http://www.systematicreviewsjournal.com/content/1/1/10"&gt;Evidence summaries”, a rapid review approach&lt;/a&gt;. &lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;It is common place for large aid organisations to request synthesis studies of achievements across a range of programs, defined by geography (e.g. a country program) or subject matter (e.g. livelihood interventions). A synthesis study requires some meta-evaluation, of what evidence is of sufficient quality and what is not. These judgements inform both the sampling of sources and the weighing of evidence found within the selected sources.&amp;nbsp; Despite the prevalence of synthesis studies, I am not aware of much literature existing on appropriate methodologies for such reviews, at least within the sphere of development evaluation. [I would welcome corrections to this view]&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;However, there are signs that experiences elsewhere with systematic reviews are being attended to. In the development field &lt;a href="http://www.campbellcollaboration.org/international_development/index.php"&gt;The International Development Coordinating Group&lt;/a&gt; has been established, under the auspices of the Campbell Collaboration, with the aim of encouraging registration of review plans and protocols and then disseminating “&lt;i style="mso-bidi-font-style: normal;"&gt;systematic reviews of high policy-relevance with a dedicated focus on social and economic development interventions in low and middle income countries&lt;/i&gt;”. DFID and AusAID have funded &lt;a href="http://www.3ieimpact.org/systematicreviews/"&gt;3ie&lt;/a&gt; to commission a body of systematic reviews of what it identifies as rigorous impact evaluations, in a range of development fields. More recently an &lt;a href="http://www.odi.org.uk/resources/docs/7546.pdf"&gt;ODI Discussion Paper&lt;/a&gt;has reviewed some experiences with the implementation of systematic reviews. Associated with the publication of this paper was a useful &lt;a href="http://www.odi.org.uk/opinion/details.asp?id=6283&amp;amp;title=systematic-reviews-international-development-slrc"&gt;online discussion&lt;/a&gt;.&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;Three problems that were identified are of interest here. One is the difficulty of accessing source materials, especially evaluation reports many of which are not in the public domain, but should be. This problem is faced by all review methods, systematic and otherwise. This problem is now being addressed on multiple fronts, by individual organisation initiatives (e.g. 3ie and IDS evaluation databases) and by collective efforts such as the International Aid Transparency Initiative. The authors of the ODI paper note that “&lt;span class="product"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;there are no guarantees that systematic reviews, or rather the individuals conducting them, will successfully identify every relevant study, meaning that subsequent conclusions may only partially reflect the true evidence base.”&lt;/i&gt; While this is (for any type of review process) it is the transparency of the sample selection - via protocols, and the visibility of the review itself – via registries, which help make this problem manageable.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="product"&gt;&lt;span lang="EN-GB"&gt;The second problem, as seen by the authors, is that “Systematic reviews tend to privilege one kind of method over another, with full-blown randomised controlled trials (RCTs) often representing the ‘gold standard’ of methodology and in-depth qualitative evidence not really given the credit it deserves.” This does not have to be the case.&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;A systematic review &lt;a href="http://www.bmj.com/content/315/7109/672.full"&gt;has been usefully defined&lt;/a&gt; as&lt;i style="mso-bidi-font-style: normal;"&gt; “an overview of primary studies which contains an explicit statement of objectives, materials, and methods and has been conducted according to explicit and reproducible methodology&lt;/i&gt;” Replicability is key and this requires &lt;i style="mso-bidi-font-style: normal;"&gt;systematic and transparent process&lt;/i&gt;relating to sampling and analysis. This should be evident in protocols.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;A third problem was identified by 3ie, in &lt;a href="http://www.3ieimpact.org/userfiles/doc/SR_blog.pdf?utm_source=IDCG&amp;amp;utm_medium=Website&amp;amp;utm_campaign=1%2F12"&gt;their commentary&lt;/a&gt; on the Discussion Paper. This relates directly to the initial focus of this blog, the argument for more focused evaluation questions. They comment that:&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-left: 36.0pt;"&gt;&lt;span lang="EN-GB"&gt;“&lt;i style="mso-bidi-font-style: normal;"&gt;Even with plenty of data available, making systematic reviews work for international development requires applying the methodology to clearly defined research questions on issues where a review seems sensible. This is one of the key lessons to emerge from recent applications of the methodology. A review in medicine will often ask a narrow question such as the Cochrane Collaboration’s recent review on the inefficacy of oseltamivir (tamiflu) for preventing and treating influenza. Many of the review questions development researchers have attempted to answer in recent systematic reviews seem too broad, which inevitably leads to challenges. There is a trade-off between depth and breath, but if our goal is to build a sustainable community of practice around credible, high quality reviews we should be favouring depth of analysis where a trade-off needs to be made&lt;/i&gt;.”&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="mso-element: footnote-list;"&gt;&lt;br /&gt;&lt;hr align="left" size="1" width="33%" /&gt;&lt;div id="ftn1" style="mso-element: footnote;"&gt;&lt;div class="MsoFootnoteText"&gt;&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftnref1" name="_ftn1" style="mso-footnote-id: ftn1;" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt; By the head of DFID EvD in 2011 and by Rasmussen et al, see below.&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div id="ftn2" style="mso-element: footnote;"&gt;&lt;div class="MsoFootnoteText"&gt;&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftnref2" name="_ftn2" style="mso-footnote-id: ftn2;" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt;[2]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt; See Ole Dahl Rasmussen, Nikolaj Malchow-Møller, Thomas Barnebeck Andersen,&lt;i style="mso-bidi-font-style: normal;"&gt; Walking the talk: the need for a trial registry for development interventions, &lt;/i&gt;&amp;nbsp;available via &lt;/span&gt;&lt;span lang="EN-GB"&gt;&lt;a href="http://mande.co.uk/2011/uncategorized/walking-the-talk-the-need-for-a-trial-registry-for-development-interventions/"&gt;&lt;span style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt;http://mande.co.uk/2011/uncategorized/walking-the-talk-the-need-for-a-trial-registry-for-development-interventions/&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9pt;"&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-3803591654789748578?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2012/02/evaluation-questions-managing-agency.html</feedburner:origLink></item><item><title>Evaluation quality standards: Theories in need of testing?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/Nk-1lSU7Sw0/evaluation-quality-standards-theories.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Tue, 25 Oct 2011 14:26:25 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-9044554625934379561</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;Since the beginning of this year I have been part of a DFID funded exercise which has the aim of “Developing a broader range of rigorous designs and methods for impact evaluations” Part of the brief has been to develop draft quality standards, to help identify “the difference between appropriate, high quality use of the approach and inappropriate/ poor quality use”&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;A quick search of what already exists suggests that there is no shortage of quality standards. Those relevant to development projects have been listed &lt;a href="http://mande.co.uk/2011/lists/evaluation-quality-standards/on-evaluation-quality-standards-a-list/"&gt;online here&lt;/a&gt;. They include:&lt;/span&gt;&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;span lang="EN-GB" style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Standards agreed by multiple organisations, e.g. OECD-DAC and various national evaluation societies. The former are of interest to aid organisations where as the latter are of more interest to evaluators.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB" style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Standards developed for use within individual organisations, e.g. DFID and EuropeAID&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB" style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Methodology specific standards, e.g. those relating to randomised and other kinds of experimental methods, and qualitative research &lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;In addition there is a much larger body of academic literature on the use and mis-use of various more specific methods.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;A scan of the criteria I have listed shows that a variety of types of evaluation criteria are used, including:&lt;/span&gt;&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;span lang="EN-GB" style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Process criteria, where the focus is on how evaluations are done. e.g. relevance, timeliness, accessibility, inclusiveness&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB" style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Normative criteria, where the focus is on principles of behaviour e.g. independence, impartiality, ethicality&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB" style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Technical criteria, where the focus is on attributes of the methods used e.g. reliability and validity&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;Somewhat surprisingly, technical criteria like reliability and validity are in the minority, being two of at least 20 OECD-DAC criteria. The more encompassing topic of Evaluation Design is only one of the 17 main topics in the DFID Quality Assurance template for revising draft evaluations. There are three possible reasons why this is so: (a) Process attributes may be more important, in terms of their effects on what happens to an evaluation, during and after its production, (b) It is hard to identify generic quality criteria for a diversity of evaluation methodologies, (c) Lists have no size limits. For example, the DFID QA template has 85 subsidiary questions under 17 main topics.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;Given these circumstances what is the best way forward, of addressing the need for quality standards for “a broader range of rigorous designs and methods for impact evaluations”? The first step might be to develop specific guidance which can be packed in separate notes on particular evaluation designs and methods. The primary problem may be simple lack of knowledge about the methods available; knowing how to choose between them may be in fact “a problem we would like to have”, which needs to be addressed &lt;i&gt;after&lt;/i&gt; people at least know something about the alternative methods. The Asian Development Bank has addressed this issue through its “Knowledge Solutions” series of publications.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;The second step that could be taken would be to develop more generic guidance that can be incorporated into the existing quality standards. Our initial proposal focused on developing some additional design focused quality standards that could be used with some reliability across different users. But perhaps this is a side issue. Finding out what quality criteria really matter, may be more important. However, there seems to be very little evidence on what quality attributes matter. In 2008 Forss et al carried out a study: “&lt;a href="http://www.oecd.org/dataoecd/23/36/41390724.pdf"&gt;Are Sida Evaluations Good Enough? An Assessment of 34 Evaluation Reports”&lt;/a&gt; The authors gathered and analysed empirical data on 40 different quality attributes of evaluation reports published between 2003 and 2005. Despite suggestions made, the report was not required to examine the relationship between these attributes and the subsequent use of the evaluations. Yet, the insufficient use of evaluations has been a long standing concern to evaluators and to those funding evaluations.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;There are at least 4 different hypotheses that would be worth testing in future versions of the SIDA study that did look at evaluation quality and usage:&lt;/span&gt;&lt;/div&gt;&lt;ol style="text-align: left;"&gt;&lt;li&gt;&lt;span lang="EN-GB"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Quality is largely irrelevant, what matters is how the evaluation results are communicated. &lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Quality matters, especially the use of a rigorous methodology, which is able to address attribution issues&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Quality matters, especially the use of participatory processes that engage stakeholders&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span lang="EN-GB"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Quality matters, but it is a multi-dimensional issue. The more dimensions are addressed, the more likely that the evaluation results will be used.&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;The first is in effect the null hypothesis, and one which needs to be taken seriously. The second hypothesis seems to be the position taken by 3ie and other advocates of RCTs and their next-best substitutes. It could be described as the killer assumption being made by RCT advocates that is yet to be tested. The third could be the position of some of the supporters of the “&lt;a href="http://www.ids.ac.uk/go/news/development-professionals-launch-big-push-back-to-counter-audit-culture"&gt;Big Push Back&lt;/a&gt;” against inappropriate demands for performance reporting. The fourth is the view present in the OECD-DAC evaluation standards, which can be read as a narrative theory of change about how a complex of evaluation quality features will lead to evaluation use, strengthened accountability, contribute to learning and improved development outcomes. I have taken the liberty of identifying the various possible causal connections in that theory of change in this network diagram below. As noted above, one interesting feature is that the attributes of reliability and validity are only one part of a much bigger picture.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-SVplRkWYalY/TqU-tK9GqxI/AAAAAAAAEgU/Eh5cC7Vqdl8/s1600/OECD+DAC+Quality+standards+as+a+ToC2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="280" src="http://4.bp.blogspot.com/-SVplRkWYalY/TqU-tK9GqxI/AAAAAAAAEgU/Eh5cC7Vqdl8/s320/OECD+DAC+Quality+standards+as+a+ToC2.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="text-align: center;"&gt;&lt;span lang="EN-GB"&gt;[Click on image to view a larger version of the diagram]&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div align="center" class="MsoNormal" style="text-align: center;"&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;span lang="EN-GB"&gt;While we wait for the evidence…&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;We should consider transparency as a pre-eminent quality criterion, which would be applicable across all types of evaluation designs. It is a meta-quality, enabling judgments about other qualities. It also addresses the issue of robustness, which was of concern to DFID. The more explicit and articulated an evaluation design is, the more vulnerable it will be to criticism and identification of error. Robust designs will be those that&amp;nbsp; can survive this process. This view connects to wider ideas in the philosophy of science about the importance of &lt;a href="http://en.wikipedia.org/wiki/Falsifiability"&gt;falsifiablity&lt;/a&gt; as a quality of scientific theories (promoted by Popper and others).&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span lang="EN-GB"&gt;Transparency might be expected at both a macro and micro level. At the macro level, we might ask these types of quality assurance questions:&lt;/span&gt;&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;Before the evaluation&lt;/span&gt;: Has an evaluation plan been lodged, which includes the hypotheses to be tested? Doing so will help reduce selective reporting and opportunistic data mining &lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;After the evaluation: &lt;/span&gt;Is the evaluation report available? Is the raw data available for re-analysis using the same or different methods?&lt;/li&gt;&lt;/ul&gt;&lt;div class="MsoNormal"&gt;Substantial progress is now being made with the availability of evaluation reports. Some bilateral agencies are considering the use of evaluation/trial registries, which are increasingly commonplace in some field of research. However, availability of raw data seems likely to remain the most challenging requirement for many evaluators.&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;At the micro-level, more transparency could be expected in the particular contents of evaluation plans and reports. The DFID Quality Assurance templates seem to be most operationalised set of evaluation quality standards available at present. The following types of questions could be considered for inclusion in those templates:&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Is it clear how specific features of the project/program influenced the evaluation design?&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Have rejected evaluation design choices been explained?&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Have terms like impact been clearly defined? &lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;What kinds of impact were examined?&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Where attribution is claimed is there also a plausible explanations of the causal processes at work?&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Have distinctions been made between causes which are necessary, sufficient or neither (but still contributory)?&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;Are there assessments of what would have happened without the intervention?&lt;/li&gt;&lt;/ul&gt;&lt;div class="MsoNormal"&gt;This approach seems to have some support in other spheres of evaluation work, not associated with development aid: “&lt;i style="mso-bidi-font-style: normal;"&gt;The transparency, or clarity, in the reporting of individual studies is key” &lt;/i&gt;&lt;a href="http://www.cdc.gov/trendstatement/"&gt;TREND statement, 2004&lt;/a&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;In summary, three main recommendations have been made above:&lt;/div&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Develop technical guidance notes, separate from additional quality criteria&lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Identify specific areas where transparency of evaluation designs and methods is essential, for possible inclusion in DFID QA templates, and the like &lt;/li&gt;&lt;li&gt;&lt;span style="font-family: Symbol;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt;&lt;/span&gt;&lt;/span&gt;Seek and use opportunities to test out the relevance of different evaluation criteria, in terms of&amp;nbsp; their effects on evaluation use&lt;/li&gt;&lt;/ul&gt;&lt;div class="MsoNormal"&gt;PS: This text was the basis of one of the presentations to DFID staff (and others)&amp;nbsp;in a workshop  on 7&lt;sup&gt;th&lt;/sup&gt;October 2011 on the subject of &lt;span lang="EN-GB"&gt;“Developing a broader range of rigorous designs and methods for impact evaluations” The views expressed above are my own and should not be taken to reflect the views of either DFID or others involved in the exercise.&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-9044554625934379561?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-SVplRkWYalY/TqU-tK9GqxI/AAAAAAAAEgU/Eh5cC7Vqdl8/s72-c/OECD+DAC+Quality+standards+as+a+ToC2.jpg" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2011/10/evaluation-quality-standards-theories.html</feedburner:origLink></item><item><title>Relative rather than absolute counterfactuals: A more useful alternative?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/b9HinQdN8sY/relative-rather-than-absolute.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Thu, 08 Sep 2011 02:17:37 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-1809521461596104153</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The&lt;i&gt; basic&lt;/i&gt; design of a randomised control trial (RCT) involves comparisons of two groups: an intervention (or “treatment”) group and a control group, at two points of time, before an intervention begins and after the intervention ends. The expectation (hypothesis) is that there will be a bigger change on an agreed impact measure in the intervention group than in the control group. This hypothesis can be tested by comparing the average change in the impact status of members of the two groups, and applying a statistical test to establish that this difference was unlikely to be a chance finding (e.g. less than 5% probability of being a chance difference). The two groups are made comparable by randomly assigning participants to both groups. The types of comparisons involved are shown in this fictional example below.&lt;br /&gt;&lt;br /&gt;&lt;table border="1" cellpadding="0" cellspacing="0"&gt;	&lt;tbody&gt;&lt;tr&gt;			&lt;td style="width: 180px;"&gt;&lt;br /&gt;&lt;/td&gt;			&lt;td style="width: 192px;"&gt;A.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Intervention group&lt;/td&gt;			&lt;td style="width: 190px;"&gt;B.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Control group&lt;/td&gt;		&lt;/tr&gt;&lt;tr&gt;			&lt;td style="width: 180px;"&gt;Before intervention&lt;/td&gt;			&lt;td style="width: 192px;"&gt;Average income per household = $1000 year.&lt;br /&gt;N = 500&lt;/td&gt;			&lt;td style="width: 190px;"&gt;Average income per household = $1000 year N=500&lt;/td&gt;		&lt;/tr&gt;&lt;tr&gt;			&lt;td style="width: 180px;"&gt;After intervention&lt;/td&gt;			&lt;td style="width: 192px;"&gt;Average income per household = $1500 year.&lt;br /&gt;N = 500&lt;/td&gt;			&lt;td style="width: 190px;"&gt;Average income per household = $1200 year N=500&lt;/td&gt;		&lt;/tr&gt;&lt;tr&gt;			&lt;td rowspan="2" style="width: 180px;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: x-small;"&gt;PS: See Comment 3 below re this table]&lt;/span&gt;&lt;/td&gt;			&lt;td style="width: 192px;"&gt;Difference over time = $500&lt;/td&gt;			&lt;td style="width: 190px;"&gt;Difference over time = $200&lt;/td&gt;		&lt;/tr&gt;&lt;tr&gt;			&lt;td colspan="2" style="width: 382px;"&gt;&lt;div align="center"&gt;Difference between changes in A and B = £300&lt;/div&gt;&lt;/td&gt;		&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;This method allows a comparison with what could be called an &lt;i&gt;absolute&lt;/i&gt; counterfactual: what would have happened if there was &lt;i&gt;no&lt;/i&gt; intervention. &lt;br /&gt;&lt;br /&gt;Note that only the impact indicator is measured, there is no measurement of the intervention. This is because the intervention is assumed to be the same across all participants in the intervention group. This assumption is reasonable with some development interventions, such as those involving financial or medical activities (e.g. cash transfers or de-worming). Some information based interventions, using radio programs or the distribution of booklets, can also be assumed to be available to all participants in a standardised form. Where delivery is standardised it makes sense to measure the average impacts on the intervention and control group, because significant variations in impact are not expected to arise from the intervention.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Alternate views&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;There are however many development interventions where delivery is not expected to be standardised and where the opposite is the case, that delivery is expected to be customised. Here the agent delivering the intervention is expected to have &lt;i&gt;some &lt;/i&gt;autonomy and to use that autonomy to the benefit of the participants. Examples of such agents would include community development workers, agricultural extension workers, teachers, nurses, midwives, nurses, doctors, plus all their supervisors. On a more collective level would be providers of training to such groups working in different locations. Also included would be almost all forms of technical assistance provided by development agencies.&lt;br /&gt;&lt;br /&gt;In these settings measurement of the intervention, as well as the actual impact, will be essential before any conclusions can be drawn about attribution – the extent to which the intervention caused the observed impacts. Let us temporarily assume that it will be possible to come up with a measurement of the degree to which an intervention has been successfully implemented, a quality measure of some kind. It might be very crude, such as number of days an extension worker has spent in villages they are responsible for, or it might be a more sophisticated index combining multiple attributes of quality (e.g. &lt;a href="http://mande.co.uk/special-issues/weighted-checklists/"&gt;weighted checklists&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Data on implementation quality and observed impact (i.e. an After minus a Before measure) can now be brought together in a two dimensional scatter plot. In this exercise there is no longer a control group, just an intervention group where implementation has been variable but measured. This provides an opportunity to explore the &lt;i&gt;relative &lt;/i&gt;counterfactual, what would have happened if implementation was less successful, and less successful still, etc. In this situation we could hypothesise that if the intervention did cause the observed impacts then there would be a statistically significant &lt;i&gt;correlation&lt;/i&gt; between the quality of implementation and observed impact. In place of an absolute counterfactual obtained via the use of control group, where there was no intervention we have relative counterfactuals, in the form of participants exposed to interventions of different qualities. In place of an average, we have a correlation.&lt;br /&gt;&lt;br /&gt;There are a number of advantages to this approach. Firstly, with the same amount of evaluation funds available, the number of intervention cases that can be measured can be doubled, because a control group is no longer being used. In addition to obtaining (or not) a statistically significant &lt;i&gt;correlation, &lt;/i&gt;we can also identify the &lt;i&gt;strength &lt;/i&gt;of the relationship between the intervention and the impact. This will be visible in the slope of the regression line. A steep slope&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftn1" name="_ftnref1" title=""&gt;[1]&lt;/a&gt; would imply that small improvements in implementation can make big improvements in observed impacts and vice versa. If a non-lineal relationship is found then the shape of a best fitting regression line might also be informative, about where improvements will generate more versus less improvement.&lt;br /&gt;&lt;br /&gt;Another typical feature of scatter plots is outliers. There may be some participants (individuals or groups of) who have received a high quality intervention, but where the impact has been modest, i.e. a negative outlier. Conversely, there may be some participants who have received a poor quality intervention, but where the impact has been impressive, i.e. a positive outlier. These are both important learning opportunities, which could be explored via the use of in-depth cases studies . But ideally these case studies would be informed by some theory, directing us where to look.&lt;br /&gt;&lt;br /&gt;Evaluators sometimes talk about implementation failure versus theory failure. In her &lt;a href="http://genuineevaluation.com/intention-to-treat-and-checking-for-implementation-failure-and-differential-effects/"&gt;Genuine Evaluation blog&lt;/a&gt;Patricia Rogers gives an interesting example from Ghana, involving the distribution of Vitamin A tablets to women in order to reduce pregnancy related mortality rates. Contrary to previous findings, there was no significant impact. But as Patricia noted, the researchers appeared to have failed to measure compliance i.e. whether all the women actually took the tables given to them! This appears to be a serious case of implementation failure, in that the implementers could have designed a delivery mechanism that ensured compliance. Theory failure would be where our understanding of how Vitamin A affects women’s health appears to be faulty, because expected impacts do not materialise, &lt;i&gt;after &lt;/i&gt;women have taken the prescribed medication.&lt;br /&gt;&lt;br /&gt;In the argument developed so far, we have already proposed measuring quality of implementation, rather than making any assumptions about how it is happening. However, it is still possible that we might face “implementation &lt;i&gt;measurement&lt;/i&gt; failure”. In other words, there may be some aspect of the implementation process that was not captured by the measure used, and which was causally connected to the conspicuous impact, or lack thereof. &amp;nbsp;A case study, looking at the implementation process in the outlier cases might help us identify the missing dimension. Re-measurement of implementation success incorporating this dimension might produce a higher correlation result. If it did not, then we might &lt;i&gt;by default&lt;/i&gt; then have a good reason to believe we are now dealing with theory failure, i.e. a lack of understanding of how an intervention has its impact. Again, case studies of the outliers could help generate hypotheses about these. Testing these out is likely to be more expensive than testing alternate views on implementation processes because data will be less readily at hand. For reasons of economy and practicality implementation failure should be our first suspect.&lt;br /&gt;&lt;br /&gt;In addition to having informative outliers to explore, the use of a scatter plot enables us to identify another potential outcome not readily available via the use of control groups, where the focus is on averages. In some programmes poor implementation may not simply lead to no impact (i.e. no difference between the average impact of control and intervention groups). Poor implementation may lead to &lt;i&gt;negative impacts&lt;/i&gt;. For example, a poorly managed savings and credit programme may lead to increased indebtedness in some communities. In a standard comparison between intervention and control groups this type of failure would usually need to be present in a large of cases before it became visible in a net negative average impact. In a scatter plot any negative cases would be immediately visible, including their relationship to implementation quality.&lt;br /&gt;&lt;br /&gt;To summarise so far, the assumption about standardised delivery of an intervention does not fit the reality of many development programmes. Replacing assumptions by measurement will provide a much richer picture of the relationship between an intervention and the expected impacts. Overall impact can still be measured, by using a correlation coefficient. In addition we can see the potential for greater impact present in existing implementation practice (the slope of the regression line). We can also find outliers that can help improve our understanding of implementation and impact process. We can also quickly identify negative impacts, as well as the absence of any impact.&lt;br /&gt;&lt;br /&gt;Perhaps more important still, the exploration of internal differences in implementation means that the autonomy of development agents can be valued and encouraged. Local experimentation might then generate more useful outliers, and not be seen simply as statistical noise. This is experimentation with a small e, of the kind advocated by Chris Blattman in &lt;a href="http://www.chrisblattman.com/documents/policy/2011.ImpactEvaluation3.DFID_talk.pdf"&gt;his presentation&lt;/a&gt;to DFID on 1&lt;sup&gt;st&lt;/sup&gt; September 2011, and of a kind long advocated by most competent NGOs.&lt;br /&gt;&lt;br /&gt;Given this discussion is about counterfactuals, it might be worth considering what would happen if this implementation measurement based approach was not used, where an intervention is being delivered in a non-standard way. One example is a quasi-experimental evaluation of an agricultural project in Tanzania, described in Oxfam GB‘s paper on its Global Performance Framework&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftn2" name="_ftnref2" title=""&gt;[2]&lt;/a&gt; . “&lt;i&gt;Oxfam is working with local partners in four districts of Shinyanga Region, Tanzania, to support over 4,000 smallholder&amp;nbsp; farmers (54% of whom are women) to enhance their production and marketing of local chicken and rice. To promote group cohesion and solidarity, the producers are encouraged to form themselves into savings and internal lending communities. They are also provided with specialised training and marketing supporting, including forming linkages with buyers through the establishment of collection centres&lt;/i&gt;.” This is a classic case where the staff of the partner organisations would need to exercise considerable judgement about how to best help each community. It is unlikely that each community was given a standard package of assistance, without any deliberate customisations nor any unintentional quality variations along the way. Nevertheless, the evaluation chose to measure the impact of the partner’s activities on changes in household incomes and women’s decision making power, by comparing the intervention group with a control group. Results of the two groups were described in terms of &lt;i&gt;“% of targeted households living on more than £1.00 per day per capita&lt;/i&gt;”, and &lt;i&gt;% of supported women are meaningfully involved in household decision making”&lt;/i&gt;. In using these measures to make comparisons Oxfam GB has effectively treated quality differences in the extension work as noise to be ignored, rather than as valuable information to be analysed. In the process they have unintentionally devalued the work of their partners.&lt;br /&gt;&lt;br /&gt;A similar problem can be found elsewhere in the same document where Oxfam GB describes their new set of global outcome indicators. The Livelihood Support indicator is:&lt;i&gt; % of targeted households living on more than £1.00 per day per capita &lt;/i&gt;(as used in the Tanzania example). In four of the six global indicators the unit of analysis are people, the ultimate intended beneficiaries of Oxfam GB’s work. However, the problem is that in most cases Oxfam GB does not work directly with such people. Instead Oxfam GB typically works with local NGOs who in turn work with such groups. In claiming to have increased the % of targeted households living on more than £1.00 per day per capita Oxfam GB is again obscuring through simplification the fact that it is those partners who are responsible for these achievements. Instead, I would argue that the unit of analysis many of Oxfam GB’s global outcome indicators should be the &lt;i&gt;behaviour and performance of its partners&lt;/i&gt;. Its global indicator for Livelihood Support should read something like this: “&lt;i&gt;x % of Oxfam GB partners working on rural livelihoods have managed to double the proportion of targeted households living on more than £1.00 per day per capita&lt;/i&gt;” Credit should be given to where credit is due.&amp;nbsp; However, these kinds of claims will only be possible if and where Oxfam GB encourages partners to measure their implementation performance as well as changes taking place in the communities they are working with, and then to analyse the relationship between both measures.&lt;br /&gt;&lt;br /&gt;Ironically, the suggestion to measure implementation sounds rather unfashionable and regressive, because we are often reading how in the past aid organisations used to focus too much on outputs and that now they need to focus more on impacts. But in practice it is not an either/or question. What we need is both, and both done well. Not something quickly produced by the Department of Rough Measures.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS 4th September 2011:&lt;/b&gt; I forgot to discuss the issue of whether any form of randomisation would be useful where relative counterfactuals are being explored. In an absolute counterfactual experiment the recipients’ membership of control versus intervention groups is randomised. In a relative counterfactual “experiment” all participants will receive an intervention so there is no need to randomly assign participants to control versus intervention groups. But randomisation could be used to decide which staff worked with which participants (/vice versa). For example, where a single extension worker is assigned to a given community. But this would be less easily where a whole group of staff e.g. in a local health centre or local school, are responsible for the surrounding community.&lt;br /&gt;&lt;br /&gt;Even where randomisation of staff was possible this would not prevent the impact of external factors influencing the impact of the intervention. It could be argued that the groups experiencing least impact and the poorest quality implementation were doing so, because of the influence of an independent cause (e.g. geographical isolation) that is not present amongst the groups experiencing bigger impacts and better quality implementation. Geographical isolation is a common exterbal influence in many rural development projects, one which is likely to make implementation of a livelihood initiative more difficult as well as making it more difficult for the participants to realise any benefits e.g. through sales of new produce at a regional market. Other external influences may affect the impact but not the intervention e.g. subsequent changes in market prices for produce. However, identifying the significance of external influences should be relatively easy, by making statistical tests of the difference in their prevalence in the high and low impact groups. This does of course require being able to identify potential external influences whereas as with randomised control trials (RCTs) no knowledge of other possible causes is needed (their influence is assumed to be equally distributed between control and intervention groups). However, this requirement could be considered as a "feature" rather than a "bug", because exploration of the role of other causal factors could inform and help improve implementation. On the other hand, the randomisation of control and intervention groups could encourage management's neglect of the role of other causal factors. There are clearly trade-offs here between competing evaluation quality criteria of rigour and utility.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;hr align="left" size="1" width="33%" /&gt;&lt;div id="ftn1"&gt;&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftnref1" name="_ftn1" title=""&gt;[1]&lt;/a&gt;i.e. with observed impact on the y axis and intervention quality on the x axis&lt;/div&gt;&lt;div id="ftn2"&gt;&lt;a href="http://www.blogger.com/blogger.g?blogID=6719829#_ftnref2" name="_ftn2" title=""&gt;[2]&lt;/a&gt;&lt;i&gt;&lt;a href="http://www.3ieimpact.org/admin/pdfs_papers/WP%2013_Final.pdf"&gt;Can we obtain the required rigour without randomisation? Oxfam GB’s non-experimental&lt;/a&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;a href="http://www.3ieimpact.org/admin/pdfs_papers/WP%2013_Final.pdf"&gt;Global Performance Framework&lt;/a&gt;&lt;/i&gt;&lt;i&gt;. &lt;/i&gt;Karl Hughes, Claire Hutchings. August 2011&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;hr align="left" size="1" width="33%" /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-1809521461596104153?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">5</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2011/09/relative-rather-than-absolute.html</feedburner:origLink></item><item><title>Evaluation methods looking for projects or projects seeking appropriate evaluation methods?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/fAOkSGRrgmo/evaluation-methods-looking-for-projects.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Thu, 15 Dec 2011 15:20:34 PST</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-1515982254776638892</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;A few months ago I carried out a brief desk review of &lt;a href="http://www.3ieimpact.org/"&gt;3ie&lt;/a&gt;'s approach to funding impact evaluations, for AusAID's &lt;a href="http://www.ode.ausaid.gov.au/"&gt;Office of Development Effectiveness&lt;/a&gt;. One question that I did not address was &lt;span lang="EN-AU" style="font-family: &amp;quot;Courier New&amp;quot;;"&gt;&lt;span style="font: 7pt &amp;quot;Times New Roman&amp;quot;;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-AU"&gt;"&lt;i&gt;Broadly, are there other organisations providing complementary approaches to 3ie for promoting quality evaluation to fill the evidence gap in international development?&lt;/i&gt;"&lt;br /&gt;&lt;br /&gt;While &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2011/12/Discussion-Paper-3ie-and-the-funding-of-impact-eval-FINAL.pdf" target="_blank"&gt;my report&lt;/a&gt; for 3ie examined the pros and cons of 3ie's use of experimental methods as a preferred evaluation design, it did not look at the question of appropriate institutional structures for supporting better evaluations. Yet, you could argue that choices made about institutional structures could have more consequences than those involving the specifics of particular evaluation methods. The question quoted above seems to contain a tacit assumption about institutional arrangements, i.e that improvements in  evaluation can best be promoted by funding externally located specialists centres of  expertise, like 3ie. This kind of assumption seems questionable, for two sets of reasons that I explain below. One is to do with the results they generate, the other concerns the neglected potential of an alternative.&lt;br /&gt;&lt;br /&gt;In the "3ie" (&lt;a href="http://www.3ieimpact.org/openwindow/"&gt;Open Window&lt;/a&gt;) model anyone can submit a proposal for an evaluation of a project implemented by any  organisation.  This approach is conducive to 'cherry picking' of evaluable (by experimental methods)  projects and the collection of evaluations representing a miscellany of types of projects - about which it will be hard to generate useful generalisations. Plus an unknown number of other projects possibly being left unevaluated, because they dont fit the prevailing method preferences.&lt;br /&gt;&lt;br /&gt;In the alternative  scenario the funding of evaluations would not be outsourced to any specialist centre(s). Instead, an agency like DFID would identify a &lt;i&gt;portfolio&lt;/i&gt; of projects needing evaluation. For example, those initiatives focusing on climate change adaptation. DFID would call for proposals for their evaluation and then screen those proposals, largely as it does now, but perhaps seeking a wider range of bidders.&lt;br /&gt;&lt;br /&gt;Unlike the present process, it would then offer funding to the bidders who had provided, say the best 50% of, the proposals to develop those proposals further in more detail. At present there is no financial incentive to do so, and any time and money already spent on developing proposals is unlikely to be recompensed, because only one bidder will get the contract.&lt;br /&gt;&lt;br /&gt;The expected result of this "proposal development" funding would be revised and  expanded proposals that outlined the bidder's proposed methodology in  considerable detail, in  something like an&amp;nbsp; inception report. All the bidders  involved at this  stage would need access to a given set of project documents and at least  one  collective meeting with the project holders.&lt;br /&gt;&lt;br /&gt;The revised proposals would  then be assessed by DFID, but with a much greater weighting towards the  technical content of the proposal than exists at present. These  second level assessment would benefit from the involvement of external  specialists, as in the 3ie model. DFID Evaluation Department already does this in the case of some evaluations through the use of a quality assurance panel.The best proposal would then be funded as normal, and the evaluation then carried out. &lt;br /&gt;&lt;br /&gt;Both the winning and losing technical proposals would  then be put in the public domain via the DFID website in order to  encourage cross fertilisation of ideas, external critiquing and public accountability. This is not the case at present. All bidders operate in isolation. There are no opportunities to learn from each other. The same appears to be the case with 3ie, the full text of technical proposals are not publicly available (even of those who were successful). Making the proposals public would mean that the proposal development funding had not been wasted, even where the proposals were not successful.&lt;br /&gt;&lt;br /&gt;In summary, with the "external centre of expertise" model there is a risk that methodological preferences are the driving force behind what gets evaluated.&amp;nbsp; The alternative is a portfolio-of-projects led approach, where interim funding support is used to generate a diversity of improved evaluation proposals, which are later made accessible by all and which can then inform future proposals.&lt;br /&gt;&lt;br /&gt;A meta-evaluation might be useful to test the efficacy of this project-led  approach. Other matched kinds of projects also needing evaluation   could continue to be funded by the pre-existing mechanisms (e.g. in-country   DFID offices). Pair comparisons could later be made of the quality of   the evaluations that were subsequently produced by the two different   mechanisms.Although it is likely there would be multiple points of difference, it should be possible for DFID, and any other stakeholders, to prioritise their relative importance, and come to an overall judgement of which has been most useful.&lt;br /&gt;&lt;br /&gt;PS: 3ie seems to be heading in this direction, to some extent.&amp;nbsp; 3ie now have a &lt;a href="http://www.3ieimpact.org/userfiles/doc/3ie_Policy_Window_Call_for_Proposals_Fiji.pdf"&gt;Policy Window&lt;/a&gt;  where they have recently sought applications for the evaluation of  projects belonging to  a specific portfolio ("Poverty Reduction  Interventions in Fiji" implemented by the Government of Fiji). Funding  is available to cover costs of the successful&amp;nbsp; bidder (only) to visit  Fiji "to develop a scope of work to be included in a subsequent Request  for Proposal (RFP) to conduct the impact evaluation".Subject to 3ie's  approval of the developed proposal 3ie will then fund the implementation  of the evaluation by the bidder.The success of this approach will be worth watching, especially its ability to ensure the evaluation of the whole portfolio of projects (which is likely to depend on 3ie having some flexiblity about the methodologies used). However, I am perhaps making a risky assumption here, that the&amp;nbsp; projects within the portfolio to be evaluated have not already been pre-selected on the grounds of their suitability to 3ie's preferred approach.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span lang="EN-AU"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span lang="EN-AU"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span lang="EN-AU"&gt;PS: I have been reading the [Malawi] CIVIL SOCIETY GOVERNANCE FUND - &lt;br /&gt;TECHNICAL SPECIFICATION REFERENCE DOCUMENT FOR POTENTIAL SERVICE PROVIDERS. In the section on the role of the Independent Evaluation Agent, it is stated that the agent will be responsible for "&lt;i&gt;The commissioning and coordination of randomised control trials for two large projects funded during the first or second year of granting.&lt;/i&gt;" This specification appears to have been made prior to the funding of any projects. So, will the fund managers feel obliged to find and fund two large projects that will be evaluable by RCTs? Fascinating, in a bizarre kind of way.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-1515982254776638892?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2011/08/evaluation-methods-looking-for-projects.html</feedburner:origLink></item><item><title>Models and reality: Dialogue through simulation</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/uXQ9og9rq4E/models-and-limitations-of-models.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Tue, 24 May 2011 00:32:11 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-8114296753797518048</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;br /&gt;I have been finalising preparations for a training workshop on network visualisation as an evaluation tool. In the process I came across this "&lt;a href="http://www.oecd.org/secure/pdfDocument/0,2834,en_21571361_34047972_38339123_1_1_1_1,00.pdf"&gt;Causality Map for the Enhanced Evaluation Framework&lt;/a&gt;", for Budget Support activities.&lt;br /&gt;&lt;br /&gt;&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-gtjAj0w8maw/TZ8DBxCTwiI/AAAAAAAACN4/kPRZ9Dk2KP4/s1600/budget+support+model.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="323" src="http://2.bp.blogspot.com/-gtjAj0w8maw/TZ8DBxCTwiI/AAAAAAAACN4/kPRZ9Dk2KP4/s640/budget+support+model.JPG" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;On the surface this diagram seems realistic, budget support is a complex process. However,&amp;nbsp; I will probably use this diagram to highlight what is often missing in models of development interventions. Like many others, it lacks any feedback loops, and as such it is a model that is a long way from the reality it is trying to represent in summary form. Using a distinction being used more widely these days (&lt;a href="http://mandenews.blogspot.com/2010/08/test3.html"&gt;and despite my reservations about it&lt;/a&gt;), I think this model qualifies as complicated but not complex. If you were to assign a numerical value to each node and to each connecting relationship, the value that would be generated at the end of the process (on the right) would always be the same.&lt;br /&gt;&lt;br /&gt;The picture changes radically as soon as you include feedback loops, which is much easier to do when you use network rather than chain models (and where you give up using one dimension in the above type of diagram to represent the passage of time). Here below is my &lt;i&gt;very&lt;/i&gt; simple example. This model represents five actors. They all start with a self-esteem rating of 1, but their subsequent self-esteem depends on the influence of the others they are connected to (represented by positive or negative link values, [randomly allocated]) and the self-esteem of those others. &lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://mande.co.uk/blog/wp-content/uploads/2011/05/self-esteem-network1.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="356" src="http://mande.co.uk/blog/wp-content/uploads/2011/05/self-esteem-network1.bmp" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;You can see what happens when self-esteem values are recalculated to take into account those each actor is connected to, in &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2011/05/Iteration-of-nodes-values-over-time-NEW.xlsx"&gt;this Excel file&lt;/a&gt; (best viewed with the &lt;a href="http://nodexl.codeplex.com/releases/view/65666"&gt;NodeXL plugin&lt;/a&gt;). After ten iterations, Actor 0 has the highest self-esteem, and Actor 2 has the lowest. After 20 iterations Actor 2 has the highest self-esteem and Actor 1 has the lowest. After 30 iterations Actor 1 has the highest self-esteem and Actor O has the lowest. With more and more iterations the self-esteem of the actors involved might stabilise at a particular set of values, or it might repeat past patterns already seen, or maybe not.&lt;br /&gt;&lt;br /&gt;There are two important points to be made here. The first is the dramatic affect of introducing feedback loops, in even the simplest of models. The aggregate results are not easily predictable, but they can be simulated. The second is that nature of the impact that is seen even in this very small complex system is a matter of the time period under examination. Impact seen at iteration 10 is different from iteration 20 and different again at iteration 30. In the words of the proponents of systems perspectives on evaluation, what is seen depends on the "perspective" that is chosen (&lt;a href="http://www.sup.org/book.cgi?id=18331"&gt;Williams and Hummelbrunner, 2009&lt;/a&gt;).&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;PS 1&lt;/b&gt;: &lt;a href="http://www.bwpi.manchester.ac.uk/resources/Working-Papers/bwpi-wp-7309.pdf"&gt;Michael Woolcock &lt;/a&gt;has written about the need to pay more attention to a related issue, captured by the term "impact trajectory". He argues that: "...in virtually all sectors, the development community has a weak (or at best implicit or assumed) understanding of the shape of the impact trajectories associated with its projects, and even less understanding of how these trajectories vary for different kinds of project operating in different contexts, at different scales and with varying degrees of implementation effectiveness; more forcefully, I argue that the weakness of this knowledge greatly compromises our capacity to make accurate statements about project impacts, irrespective of whether they are inspired by ‘demand’ or ‘supply’ side imperatives, and even if they have been subject to the most deftly implemented randomised trial"&lt;/blockquote&gt;&lt;blockquote style="font-family: inherit;"&gt;&lt;blockquote&gt;&lt;span lang="EN-GB" style="font-size: small;"&gt;PS1.1: Some examples:I  recall that it has been argued that there is a big impact on households when  they first join savings and credit groups, but the continuing impact  drops down to a much more modest level thereafter. On the other hand,  the impact of girls completing primary school may be the greatest when it reaches through to the  next generation, to the number of their children and their survival rates.&amp;nbsp;&lt;/span&gt; &lt;/blockquote&gt;&lt;/blockquote&gt;There is one downside to my actors' self-esteem model, which is its almost excessive sensitivity. Small changes to any of the node or link values can&amp;nbsp; significantly change the longer term impacts. This is because this simple model of a social system has no buffers or "&lt;a href="http://www.management-aims.com/PapersMgmt/82Donada.pdf"&gt;slack&lt;/a&gt;". Buffers could be in the form of accumulated attributes of the actors (like an individual's self-confidence arising from their lifetime experience or a firm's accumulated inventory) and also provided via the wider context (like individuals having access to a wider network of friends or firms having alternate sources of suppliers) . This model could clearly be improved upon.&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;PS 2&lt;/b&gt;: I came across this quote by Duncan Watts, in a commentary on his latest book "&lt;a href="http://everythingisobvious.com/essay/can-a-butterfly-flapping-its-wings-on-facebook-stir-a-revolution-in-the-middle-east/"&gt;Everything is obvious&lt;/a&gt;" - "&lt;i&gt;when people base their decisions in part on what other people are deciding, collective outcomes become highly unpredictable&lt;/i&gt;" That is exactly what is happening in the self-esteem model above.Duncan Watts has &lt;a href="http://everythingisobvious.com/papers-publications/"&gt;written extensively on network&lt;/a&gt;s.&lt;/blockquote&gt;Here below is another unidirectional causal model, available on the &lt;a href="http://www.enterprise-development.org/page/the-evidence-framework"&gt;Donor Committee for Enterprise Development website&lt;/a&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-KpS_62aijVE/TZ8O2RKsFoI/AAAAAAAACOA/b6hjsNPHthI/s1600/enterprise+development+model.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="475" src="http://4.bp.blogspot.com/-KpS_62aijVE/TZ8O2RKsFoI/AAAAAAAACOA/b6hjsNPHthI/s640/enterprise+development+model.JPG" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;What I like about this example is that visitors to the website can click on the links (but not in the copy I have made above) and be taken to other pages where they will be given a detailed account of the nature of the causal processes represented by those links. This is exactly what the web was designed for. Visitors can also click on any of the boxes at the bottom and find out more about the activities that input into the whole process.&lt;br /&gt;&lt;br /&gt;The inclusion of a feedback loop in this diagram would not be too difficult to imagine. For example, from perhaps the top box back to one of the earlier boxes e.g New firms start / register. This positive feedback loop would quickly produce escalating results further up the diagram. Ideally, we would recognise that this type of outcome (simple continuous escalation) does not fit very well with our perception of what happens in reality. That awareness would then lead to further improvements to the model, which generated more realistic behaviors.&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;PS 24 May 2011&lt;/b&gt;:&amp;nbsp; In their feminist perspective on monitoring and evaluation &lt;a href="http://mande.co.uk/2011/uncategorized/capturing-change-in-women%E2%80%99s-realities-a-critical-overview-of-current-monitoring-evaluation-frameworks-and-approaches/"&gt;Batliwala and Pittman&lt;/a&gt; have suggested that we need "&lt;i&gt;&lt;b&gt;to develop a “theory of constraints” to accompany our “theory of change” in any given context...&lt;/b&gt;" &lt;/i&gt;They noted that "&lt;i&gt;…  most tools do not allow for tracking negative change, reversals,  backlash, unexpected change, and other processes that push back or shift  the direction of a positive change trajectory. How do we create tools  that can capture this “two steps forward, one step back” phenomenon that  many activists and organizations acknowledge as a reality and in which  large amounts of learning lay hidden? In women’s rights work, this is  vital because as soon as advances seriously challenge patriarchal or  other social power structures, there are often significant reactions and  setbacks. These are not, ironically, always indicative of failure or  lack of effectiveness, but exactly the opposite— this is evidence that  the process was working and was creating resistance from the status quo  as a result&lt;/i&gt; .”&lt;/blockquote&gt;&lt;div align="left" class="MsoNormal" style="margin: 0cm 0cm 0.0001pt; text-align: left; text-indent: 0cm;"&gt;But it is early days. Many development programs do not yet even have a decent unidirectional causal model of what they are trying to do. In this context, the inclusion of any sort of feedback loop would be a major improvement. &lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;As shown above, the next step that can be taken is to run &lt;a href="http://www.google.com/search?client=gmail&amp;amp;rls=gm&amp;amp;q=define%3A%20simulation"&gt;&lt;i&gt;simulations&lt;/i&gt;&lt;/a&gt; of those improved models by inserting numerical values in the links and functions/equations in nodes of those models. In the examples above we can see how simulations can help improve models by showing how their results &lt;i&gt;do not fit&lt;/i&gt; with our own observations of reality. Perhaps in the future they will be seen as a useful form of pre-test, worth carrying out at the earliest stages of an evaluation. &lt;/span&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;b&gt;PS 3&lt;/b&gt;: This blog was prompted by comments to me by Elliot Stern on the relevance of modeling and simulation to evaluation, on which I hope he has more to say.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS 4&lt;/b&gt; I am struggling through Manuel DeLanda's &lt;a href="http://www.amazon.com/Philosophy-Simulation-Emergence-Synthetic-Reason/dp/1441170286"&gt;Philosophy and Simulation: The emergence of synthetic reason&lt;/a&gt; (2011), which being about simulations, relates to the contents of this post.&lt;/blockquote&gt;&lt;blockquote&gt;&lt;b&gt;PS 5&lt;/b&gt;: I have just scanned Funnell and Roger's very useful new book, &lt;a href="http://www.amazon.co.uk/Purposeful-Program-Theory-Effective-Theories/dp/0470478578/ref=sr_1_1?ie=UTF8&amp;amp;s=books&amp;amp;qid=1303303408&amp;amp;sr=8-1"&gt;Purposeful Program Theory&lt;/a&gt;" and found 67 unidirectional models but only 15 models that have one or more feedback loops (that is, 23%). This is quite dissapointing. So is the advice on the use of feedback loops: "&lt;i&gt;We advise against using so many feedback loops that the logic becomes meaningless. When feedback loops are incorporated, a balance needs to be struck between including all of them and (because everything is related to everything else) and capturing some important ones. Showing that everything leads to everything else can make an outcome chain very difficult to understand - what we call a spagetti junction model. Neverthless some feedback loops can be critical to the success of a program and should be included ...&lt;/i&gt;"p187&lt;br /&gt;&lt;blockquote&gt;Given the scarcity of models with feedback links even in this book, the risk of having too many feedback loops sounds like "a&amp;nbsp; problem we would like to have" And I am not sure why an excess of feedback links should be any more of a probability than an excess of forward links. The concern about the understandability of models with feedback loops is however more reasonable,&amp;nbsp; for reasons I have outlined above. When you introduce feedback loops what were either simple and complicated models start to exhibit complex behavior. Welcome to something that is a bit closer to the real world.&lt;/blockquote&gt;&lt;/blockquote&gt;&lt;blockquote&gt;&lt;b&gt;PS6&lt;/b&gt;:&amp;nbsp; "As change makers, we should not try to design a better world. We should make better feedback loops", the text of the last slide in Owen Barder's presentation "&lt;a href="http://media.owen.org/Evolution/player.html"&gt;Development Complexity and Evolution&lt;/a&gt;"&lt;/blockquote&gt;&lt;blockquote&gt;&lt;b&gt;PS7&lt;/b&gt;: In Funnell and Roger's book (page 486) they describe how the Intergovernmental Panel on Climate Change (IPCC) " recognised that controlled experimentation with the climate system in which the hypothesised agents of change are systematically varied in order to determine the climate's sensitivity to these agents...[is] clearly not possible"&amp;nbsp; Different models of climate change were developed with different assumptions about possible contributing factors. "The observed patterns of warming, including greater warming over land than over the ocean, and their changes over time, are &lt;i&gt;simulated &lt;/i&gt;only by the models that include anthropogenic forcing. No coupled global climate change model that has used natural forcing only has reproduced the continental warming trends in individual continents (except Antarctica) over the second half of the 20th centrury" (IPCC, 2001, p39)&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-8114296753797518048?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-gtjAj0w8maw/TZ8DBxCTwiI/AAAAAAAACN4/kPRZ9Dk2KP4/s72-c/budget+support+model.JPG" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2011/04/models-and-limitations-of-models.html</feedburner:origLink></item><item><title>A submission to the UK Independent Commission for Aid Impact (ICAI)</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/iZHPbByKSAI/submission-to-uk-independent-commission.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Thu, 24 Mar 2011 15:49:16 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-2838389082562731452</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;Background: &lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;blockquote style="color: black; font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;"&lt;b&gt;&lt;a href="http://icai.independent.gov.uk/consultation/"&gt;ICAI&lt;/a&gt;&lt;/b&gt; is holding a public consultation to understand which areas of UK  overseas aid stakeholder groups and the public believe the Commission  should report on in its first three years. &lt;b style="font-weight: normal;"&gt;The consultation will run for 12 weeks from the 14th January until the 7th April 2011. &lt;/b&gt;&lt;/span&gt; &lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&lt;a href="http://www.surveymonkey.com/s/LLZK2NN"&gt;Click here to respond to the  consultation&lt;/a&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;If you would like to read further information on ICAI, the consultation and the aid budget, please click on &lt;b&gt;‘&lt;a href="http://icai.independent.gov.uk/consultation/consultation-document/"&gt;Consultation document&lt;/a&gt;’&lt;/b&gt;.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;If you would like to consider the questions in detail before responding, please click on &lt;b&gt;‘&lt;a href="http://icai.independent.gov.uk/consultation/downloadable-questions/"&gt;Downloadable questions&lt;/a&gt;’&lt;/b&gt;. You will need to access the online response form to respond".&lt;/span&gt;&lt;/blockquote&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;Two initial comments&lt;/b&gt;:&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;1. An online survey is a fairly narrow approach to a public consultation. There are many other options, even if the ICAI is limited to those that can take place online, rather than face to face&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;2. The focus of the consultation is also narrow, i.e. "&lt;i&gt;which areas of UK  overseas aid stakeholder groups and the public believe the Commission  should report on in its first three years&lt;/i&gt;". Equally important is &lt;i&gt;how&lt;/i&gt; those areas of aid should be reported on.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;On widening the process of consultation&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;1. All pages on the ICAI website should have associated Comment facilities, which visitors can make use of. More and more websites are being constructed with blog-type features such as these, because website managers these days expect to be &lt;i&gt;interacting&lt;/i&gt; with their audience, not simply &lt;i&gt;broadcasting&lt;/i&gt;. Built into such a facilitwould be an assumption that there will be an &lt;i&gt;ongoing&lt;/i&gt; process of consultation, not a once off event.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;2. The raw data results of the current online survey should be made publicly available, not just summaries. This is already possible, with minimal extra work required, because the survey provider (SurveyMonkey.com) is able to provide a public link to the survey results, with multiple options regarding reading, filtering and downloading the data. The more people who can access the data, the more value that might be obtained from it.&amp;nbsp; However, the most important reason for doing so is that the ICAI should be seen to be &lt;i&gt;maximally transparent&lt;/i&gt; in its operations. Transparency will help build trust and confidence in the work and judgements of the commission.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;3. Although late in the day, the ICAI should edit the Consultation page to include an invitation to people to submit their own submissions using their own words and structures.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;4. The ICAI website should include an option for visitors to sign up for email notification of any changes to the website, including the main content pages and any comments made on those pages by visitors.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;5. The ICAI should be open about what it will be open about. It should develop a policy on transparency and place that policy on its website. Disclosure policies are now commonplace for many large international organisations like the World Bank and IMF, and transparency in regard to international aid is high on the agenda of many governments, including the UK. Having such a policy does mean everything about the workings of the ICAI must be made public, but it &lt;/span&gt;&lt;span style="font-size: small;"&gt;would &lt;/span&gt;&lt;span style="font-size: small;"&gt;typically require a default assumption of openness along with specified procedures and conditions relating to when and where information will not be disclosed.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;On widening the content of consultation&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&amp;nbsp;1. The ICAI should be aware, if not already, that there continues to be intense debate about the best ways of assessing the value of international aid. This debate exists because of the&amp;nbsp; multiplicity of purposes behind aid programs, many and varied types of aid, the enormous diversity of contexts where it is provided, the wide range of people and organisations involved in its delivery, as well as some genuinely difficult issues of measurement and analysis. There are no simple and universally applicable solutions.&amp;nbsp;Value for Money provides only a partial view of aid impact, and is only partially measurable. Randomise Control Trials (RCTs) can be useful for simple replicable interventions in comparable conditions, but many aid interventions are complex. The best immediate response in these circumstances is for the ICAI to be &lt;i&gt;maximally transparent&lt;/i&gt; about the methods being used to assess aid interventions, and to be open to the wider debate.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;A good starting point would be for the ICAI to make public: (a) the Terms of Reference for the "Contracted Out Service Provider" who will do the assessment work for the ICAI, and (b) the tendered proposal put forward by the winning bidder. Both of these documents refer to ways and means of doing the required work. It is also expected that there will be periodic reviews of the work of the winning bidder. The ToRs and reports of those reviews should also be publicly disclosed on the ICAI website. Finally, all the&amp;nbsp; "evaluations, reviews and investigations" to be carried out by the winning bidder on behalf of the ICAI should be publicly disclosed, as hopefully has already been agreed.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;2. It would be useful, if only to help ensure that the ICAI itself delivers Value for Money", if the ICAI could clarify not only how its role will differ from that of the DFID Evaluation Department and multi-agency initiatives like 3IE, but also how it will cooperate with them to exploit any complementarities and possible synergies in their work.Complete and utter independence could lead to wasteful duplication. In the worst case various wheels could be reinvented.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;For example, the OECD's Development Assistance Committee (DAC) has over the years developed a widely agree set of &lt;a href="http://www.oecd.org/document/22/0,2340,en_2649_34435_2086550_1_1_1_1,00.html"&gt;evaluation criteria&lt;/a&gt;, however these are nowhere to be seen in the ICAI's ToR for the "Contracted Out Service Provider". Instead, Value For Money receives repeated attention, and its definition is sourced to the National Audit Office (NAO). However, the NAO is a member of the &lt;a href="http://www.improvementnetwork.gov.uk/imp/core/page.do?pageId=1068398"&gt;Improvement Network&lt;/a&gt;, which has provided a wider perspective on Value for Money. Their website notes that effectiveness is part of Value for Money, as well as efficiency and economy. Commenting on effectiveness they note:&lt;/span&gt;&lt;/div&gt;&lt;blockquote style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;&lt;/b&gt;&lt;b&gt;"Effectiveness&lt;/b&gt; is a measure of the impact that has been  achieved, which can be either quantitative or qualitative....Outcomes should be equitable across  communities, so effectiveness measures should include aspects of equity,  as well as quality. Sustainability is also an increasingly important  aspect of effectiveness." &lt;/span&gt;&lt;/blockquote&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;Sustainability is a DAC evaluation criteria which has been around for a decade or more.&lt;/span&gt; &lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;------------------------------------------------------------------------------------------------&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;A link to this blog posting has been emailed to &lt;a href="mailto:c-robathan@icai.independent.gov.uk"&gt;c-robathan@icai.independent.gov.uk&lt;/a&gt;.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;Other readers of this blog might like to do the same, with their own views.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;PS: See Alex Jacob's March 22nd submission to the ICAI: &lt;/span&gt;&lt;a href="http://ngoperformance.org/2011/03/22/advice-for-the-new-aid-watchdog/" rel="bookmark" title="Advice for the new aid watchdog"&gt;Advice for the new aid&amp;nbsp;watchdog&lt;/a&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-2838389082562731452?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2011/03/submission-to-uk-independent-commission.html</feedburner:origLink></item><item><title>Counter-factuals and counter-theories</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/oDRiFUdbFzU/counter-factuals-and-counter-theories.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Mon, 06 Dec 2010 09:08:34 PST</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-7346518758753165049</guid><description>Thinking about the counter-factual means thinking about something that did not happen. So consider a project involving the provision of savings and credit services, with the expectation of reducing levels of poverty amongst the participating households. The counter-factual is the situation where the savings and credit services were not provided. This can either be imagined, or monitored through the use of a control group, which is a group of similar households in a similar context.&lt;br /&gt;&lt;br /&gt;In the course of 20 years work on monitoring and evaluation of development aid projects I have only come across one good opportunity to analyse changes in household poverty levels through the comparison of participating and non-participating households (i.e. the so called double difference method: comparing participants and non-participants, before and after the intervention). This was in Can Loc District, Ha Tinh province, in Vietnam. In 1996 ActionAid Vietnam began a savings and credit program in Can Loc.  In 1997 I helped them design and implement a baseline survey of almost 600 households, being a 10% sample of the population in three communes of Can Loc District, covering participants and non-participants in the savings and credit services (which reached about 25% of all households). This was done using the &lt;a href="http://mande.co.uk/special-issues/the-basic-necessities-survey/"&gt;Basic Necessities Survey&lt;/a&gt; (BNS) , an instrument that I have described in detail elsewhere.&lt;br /&gt;&lt;br /&gt;A few years later the responsibility for the project was handed over to a Vietnamese NGO called the Pro-Poor Centre (PPC), which had been formed by ex-Action Aid staff who used to work in Ha Tinh. They continued to manage the savings and credit program over the following years. In 2006, nine years after the baseline survey, an ex-ActionAid staff member who was now working for a foundation in Hanoi, held discussions with the PPC about doing a follow up survey of the Ha Tinh households. I was brought in to assist the re-use of the same BNS instrument as in 1996. At this stage the main interest was simply to see how much households' situations had improved over the nine year period, a period of rapid economic growth throughout much of Vietnam.&lt;br /&gt;&lt;br /&gt;The survey went ahead, and was implemented with particular care and diligence by the PPC staff. A copy of the 2006 survey report &lt;a href="http://www.mande.co.uk/docs/The%202006%20Basic%20Necessities%20Survey%20Final%20Report%2020%20July%202007.doc"&gt;can be found here&lt;/a&gt; (See pages 23-25 especially). Fortunately the PPC had carefully kept hard copy records of the 1996 baseline survey (including the sample frame) and I had also kept digital copies of the data. This meant it was possible to make a number of comparisons:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Of households poverty status in 2006 compared      to 1997&lt;/li&gt;&lt;li&gt;Of changes in the poverty status of households      who were and were not participating in the saving and credit program      during these periods i.e&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Those who had never participated&lt;/li&gt;&lt;li&gt;Those were in (in 1997) but       dropped out (by 2006)&lt;/li&gt;&lt;li&gt;Those who were not in (in 1997)       but joined later (before 2007)&lt;/li&gt;&lt;li&gt;Those who were always in (in 1997       and 2006)&lt;/li&gt;&lt;/ol&gt;&lt;ul&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;Somewhat to my surprise, I found what seemed an ideal set of results. Poverty levels had dropped the most in the 4th group ("always members"), then almost as much in the 2nd group ("ex-members"), less in the 3rd group ("new members") and least in the 1st group ("never members"). The 3rd group might have been expected to have changed less because over the years the project had expanded its coverage to include the less poor, reaching 43% of all households by 2006.&lt;br /&gt;&lt;br /&gt;However, the project's focus on the poorest was also a problem. The members of the savings and credit program had not been randomly chosen, so the control group was not really a control group. They were not comparable. (and I had not heard of, nor still know, how to use the &lt;a href="http://en.wikipedia.org/wiki/Propensity_score_matching"&gt;propensity score matching &lt;/a&gt;method)&lt;br /&gt;&lt;br /&gt;The alternative to considering the effects of a counter-factual (i.e. a non-intervention) is, I guess, what could be called “counter-theoretical” That is, an &lt;i&gt;alternative theory of what has happened, with the existing intervention.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;My counter-theoretical centered on the idea of dependency ratios - poor families typically have high dependency ratios (i.e. many young children, relatively few adults). As families age this ratio will change, with dependent children growing up and becoming more able bodied and able to take on workloads and or generate income. Even without the access to a savings and credit program, this demographic fact alone might have explained why the participating families did better over the nine year period. It could also explain why the 2nd group did almost as well, if they were selected on the same basis of being the poorest, but had been participants for the shorter period of time.&lt;br /&gt;&lt;br /&gt;What I could have and should have done, was go back to the PPC and see what data they had on the family structure of the interviewed households. It is quite likely they would have the relevant data: ages of all family members, given their close involvement with the community. Unfortunately at that time there was not much interest in the impact assessment aspect of the survey, by either the foundation, the PPC or ActionAid, and their support was necessary for any further analysis. Perhaps I  gave up too quickly…&lt;br /&gt;&lt;br /&gt;Nevertheless, reflection on this experience makes me wonder how often it would be well worthwhile, in the absence of good control group data, giving more attention to identifying and testing “counter-theoreticals” &lt;i&gt;about the existing intervention&lt;/i&gt;, as part of a more rigorous process of coming to conclusions about impacts.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS1 3rd November 2010&lt;/b&gt;: I have since recalled that as part of the 2006 survey I met with the staff of ActionAid in Hanoi to explain the survey process and to solicit from them their views on the likely causes of any improvements. The &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2008/04/Cause-of-Improvement-in-Can-Loc.pdf"&gt;attached file&lt;/a&gt; shows two lists, one relating to ActionAid interventions in the district, and the other relating to interventions in the same district by other organisations, including government. Micro-finance was at the top of the list of the ActionAid interventions seen as likely causes of change, but there were 7 others, as well as 12 non-ActionAid interventions that were possible causes. This raises the spectre of 12 possible alternative hypotheses, let alone various combinations of these. One approach I subsequently toyed with for generating composite predictions in this kind of multiple-location/multiple-intervention situation was the "&lt;a href="http://mande.co.uk/special-issues/prediction-matrices/"&gt;Prediction Matrix&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS2 3rd November 2010&lt;/b&gt;: The current edition of Evaluation (16(4), 2010 has an article by Nicoletta Stame, titled " &lt;i&gt;&lt;a href="http://evi.sagepub.com/content/16/4/371.abstract"&gt;What doesn’t work? Three Failures, Many Answers&lt;/a&gt;"&lt;/i&gt; which includes a section on "&lt;i&gt;&lt;b&gt;Rival Explanations&lt;/b&gt;" &lt;/i&gt;which I have taken the liberty of copy and pasting below:&lt;br /&gt;&lt;blockquote&gt;"The link between complexity and causation has been at the centre of  evaluation theory ever since and has nurtured thinking about 'plausible  rival hypotheses' (Campbell, 1969). Although it was originally treated  as a methodological problem of validity, it has recently been revisited  from the substantive perspective of programme theory. Commenting on  Campbell's interest in 'reforms', that are by definition 'complex social  change', Yin contrasts two strategies of Campbell: that of the  experimental design and that of using rival explanations. He concludes  that the second - as Campbell himself came to admit in Campbell (199a) -  is better suited to complex interventions (that are changing and  multifaceted), as it is with the complex case studies that have been  Yin's turf for a long times (Yin, 2000: 242). The use of rival  explanations is common in other crafts journalism, detective work,  forensic science and astronomy), where 'the investigator defines the  most compelling explanations, tests them by fairly collecting data that  can support or refute them, and - given sufficiently consistent and  clear evidence - concludes that one explanation but not the others is  the most acceptable' (Yin, 2000: 243). These crafts are empirical: their  advantage is that while a 'whole host of societal changes may be  amenable to empirical investigation', especially those where stakes are  currently the highest, they are 'freed from having to impose an  experimental design' ('the broader and in fact more common use of rival  explanations covers real-life, not craft, rivals', Yin, 2000:248).  Nonetheless, rival explanations are by no means alien to evaluation, as  is shown by how Campbell himself has offered Pawson good arguments for  criticizing the way systematic reviews are conducted (Pawson, 2006)."&lt;/blockquote&gt;&lt;blockquote&gt;"The problem that remains is how to identify rival explanations. From a  methodological starting point, Yin says that 'evaluation literature  offers virtually no guidance on how to identify and define real-life  rivals'. He proposes a typology of real-life rivals, that can variously  relate to targeted interventions, to implementation, to theory to  external conditions; and proposes examples of how to deal with them  taken from such fields as decline in crime rates, support for industrial  development, technological innovations, etc. However, Yin appears to  overlook something that had indeed fascinated theory-based evaluation  since its first appearance: the possible existence of different theories  to explain the working of a programme, and the need to choose among  them in order to test them. And - as Patton (1989: 377) has advised - it  should be noted that in this way it would be possible to engage  stakeholders in conceptualizing their own programme’s theories of  action. Nevertheless, Yin’s contribution in its explicitness and  methodological “correctness” is an important step forward."&lt;/blockquote&gt;&lt;blockquote&gt;"Weiss responded to Win’s provocative stance. In an article entitled  “What to do until the random assigners come”, she locates Yin’s  contribution as the next step beyond Campbell’s ideas about plausible  rival hypotheses: “where Campbell focused primarily on rival  explanations stemming from methodological artifacts, Yin proposes to  identify substantive rival explanations” (Weiss, 2002: 217). She  describes the process whereby the evaluator “looks around and collects  whatever information and qualitative data are relevant to the issue at  hand” (2002: 219), in order to see “whether any [other factor, such as  other programs or policies, environmental conditions, social and  cultural conditions] could have brought about the kinds of outcomes that  the target program was trying to affect”, thus setting up systematic  inquires into the situation. Weiss concludes that alternative means to  random assignment in order to solve the causality dilemma can be a “a  combination of Theory-Based Evaluation and Ruling-Out” (the rival  explanation)."&lt;/blockquote&gt;I recommend the whole article...&lt;br /&gt;&lt;br /&gt;PS3 6th December. On re-reading this post, especially Nicoletta's quote, I wondered about the potential usefullness of the "&lt;a href="http://mande.co.uk/special-issues/evolving-storylines-a-participatory-design-process/"&gt;Evolving Storyline&lt;/a&gt;s" method I developed some years ago. It could be used as a means of developing a small range of alternative histories of a project, that could then each be subject to some testing (by focusing on the most vulnerable point in each story)&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-7346518758753165049?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2010/10/counter-factuals-and-counter-theories.html</feedburner:origLink></item><item><title>Do we need a Minimum Level of Failure (MLF)?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/NT6xE4MQiPI/do-we-need-minimal-level-of-failure-mlf.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Tue, 24 May 2011 11:32:51 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-1890236059624698450</guid><description>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;This week I am attending the 2010 European Evaluation Society conference in Prague. Today I have been involved in a number of interesting discussions, including how to commission better evaluations and the potential and perils of randomised control trials (RCTs). This has prompted me to resurrect an idea I have previously raised partly in jest, but which I now think deserves more serious consideration.&lt;br /&gt;&lt;br /&gt;Background first: RCTs have been promoted as an important means of improving the effectiveness of development aid projects. But there are also concerns that RCTs will become a dominating orthodoxy, driving out the use of other approaches to impact assessment, and in the worst case, discouraging investment in development projects which are not evaluable through the use of RCTs.&lt;br /&gt;&lt;br /&gt;In my &lt;a href="http://www.mande.co.uk/docs/thesis.htm"&gt;PhD thesis&lt;/a&gt; many years ago I looked at organisational learning through the lense of an evolutionary epistemology. That school of thought sees evolution (through the re-iteration of variation, selection and retention) as a kind of learning process, and human learning as a sub-set of that process. As I explain below, that view of the process of learning has some relevance to the current debate on how to improve aid effectiveness. It is also worth acknowledging the results of that process - evolution has been very effective in developing some extremely complex and sophisticated lifeforms, against which intentionally designed aid projects pale in comparison.&lt;br /&gt;&lt;br /&gt;The point to be made: A common misconception is evolution is about the “survival of the fittest”. In fact this phrase, coined by &lt;a href="http://en.wikipedia.org/wiki/Herbert_Spencer"&gt;Herbert Spencer&lt;/a&gt;, is significantly misleading. Biological evolution is NOT about the survival of the fittest, but the non-survival of the least fit. This process leaves room for some diversity amongst those that survive, and it is this diversity that enables further evolution. The lesson here is that the process of evolution is not about picking winners according to some global standard of fitness, but about culling of failures based on their lack of fitness to local circumstances.&lt;br /&gt;&lt;br /&gt;This leads me to my own “modest proposal” for another route to improved aid effectiveness, which is an alternative to the widespread use of RCTs and the replication of the kinds of projects found to be effective via that means. This would be to build a widening consensus about the need for a defined “Minimum Level of Failure” (MLF) within the portfolio of activities funded or implemented by aid agencies. A MLF could be something like a 10% of projects by value. Participating agencies would committ to publicly declaring this proportion of their projects as failed. Each of these agencies would also need to show: (a) how in their particular operating context they have defined these as failures, and (b) what steps they will take to avoid the replication of these failures in the future. There would be no need for a global consensus on evaluation methods, or a hegemony of methods arising through less democratic processes.&amp;nbsp; PS: Using the current M&amp;amp;E terminology, the consensus would need to be on the desired outcomes, not on the activities needed to achieve them. &lt;br /&gt;&lt;br /&gt;I can of course anticipate, if not already hear, some protests about how unrealistic this proposal is. Let us hear these protests, especially in public. Any agency that did so would probably be implying, if not explicitly arguing, that such a failure rate would be unacceptable, because public monies and poor people’s lives are at stake. However making such a de facto claim of a 90%+ rate of success would be a seriously high risk activity, becaus it would be very vulnerable to disproof, probably through journalistic inquiry alone. For anyone involved with development aid programmes, a brief moment’s reflection would suggest that the reality of aid effectiveness is very different, and that a 10% failure rate is probably way too optimistic and in real life failures are much more common.&lt;br /&gt;&lt;br /&gt;Perhaps the protesting agencies might be better advised to consider the upside of a achieving a minimum level of failure. If taken seriously establishing a norm of a minimal level of failure could help get the public at large, along with journalists and politicians, past the shock-horror of failure itself and into the more interesting territory of why some projects fail. It could also help raise the level of risk tolerance, and enable the exploration of more innovative approaches to the uses of aid. Both of these developments would be in addition to a progressive improvement on the average performance of development projects resulting from a periodic culling of the worst performers.&lt;br /&gt;&lt;br /&gt;It is possible that advocates of specific methods like RCTs (as the route to improved aid effectiveness) might also have some criticisms of the MLF proposal. They could argue that these methods will generate plenty of evidence of what does not work, and perhaps that evidence should be privileged. But the problem with this method-led solution is that there is already a body of evidence from a number of fields of scientific research that negative findings are widely under-reported. People like to publish positive findings. This may not be a big risk while RCTs are funded by one or two major actors, but it will become a systemic risk as the number of actors involved increases.&amp;nbsp; There needs to be an explicit and public focus on failure.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span style="color: red;"&gt;Actual data on failure rates &lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;PS: 15th October 2010: Four days ago I posted below some information on the success and failure rates of&amp;nbsp; DFID projects. I have re-stated and re-edited that information here with additional comments:&lt;br /&gt;&lt;br /&gt;There is some interesting data on failure within the DFID system, most  notably the most recent review of Project Completion Reports (PCRs), undertaken in 2005. See the “&lt;a href="http://www.dfid.gov.uk/Documents/publications1/evaluation/ev664.pdf" rel="nofollow"&gt;An Analysis of Projects and programmes in Prism 2000-2005&lt;/a&gt;”report available on the DFID website. The  percentage (68%) of projects “defined as ‘completely’ or ‘largely’  achieving their Goals (Rated 1 or 2)” was given at the beginning of the Executive Summary, but information about failures was less prominent. Under section   “8. Lessons from Project Failures” on page 61 it is stated “There are  only 23 projects [out of 453] within the sample that are rated as  failing to meet their objectives (i.e. 4 or 5) &lt;i&gt;and which have significant lessons&lt;/i&gt;”  (italics added). This is equivalent to about 5% of the sampled projects.&lt;br /&gt;&lt;br /&gt;More importantly are the 20% or so rated 3 = &lt;i&gt;Likely to be partly achieved&lt;/i&gt;  (see page 64). It could be argued that those with a rating of 3 should  also be included as failures, since their objectives are only likely to  be partly achieved, versus largely achieved in the case of rating 2. In  other words a successful project should be defined as one likely to  achieve more than 50% of its Output and Purpose objectives.  Others are  failures. This interpretation seems to be supported by a comment sent to me (whose author will remain anonymous): " "&lt;i&gt;My understanding is that projects  with scores of less than 2 are under real pressure and maybe quickly  closed down unless they improve rapidly. I have certainly "felt the pressure"  from&amp;nbsp;projects to score them&amp;nbsp;2 rather than 3. That said I have not  buckled to the pressure!&lt;/i&gt;"&lt;br /&gt;&lt;br /&gt;I think the fact that DFID at least has a performance scoring system (for all its faults), that it has done this analysis of the project scores, and that it has made the results public, probably puts it well ahead of many other aid agencies. I would like to hear about any other agencies who have done anything like this, along with comments on the strengths and weaknesses of what they have done. I would also like to see DFID repeat the 2005 exercise at the end of this year, this time with more discussion on the projects rated 3 = &lt;i&gt;Likely to be partly achieved&lt;/i&gt;, and what subsequently happened to these projects.&lt;br /&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;PS 2nd November 2010&lt;/b&gt;: See&lt;a href="http://www.developmenthorizons.com/2010/10/from-one-extreme-to-another.html"&gt; Lawrence Hadad's reference here&lt;/a&gt; to the same DFID set of statistics here, recently quoted/misused on the One Show&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b&gt;PS 3rd November 2010:&amp;nbsp; &lt;/b&gt;Thanks to &lt;span style="font-size: small;"&gt;Yu-Lan van Alphen&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;;"&gt;&lt;/span&gt;, Programmamanager, Stichting DOEN, Amsterdam, for this book reference:&lt;/span&gt; &lt;a href="http://www.nytimes.com/2010/06/11/books/11book.html?_r=1"&gt;Kathryn Schulz – "Being Wrong",&amp;nbsp; reviewed &lt;/a&gt;in the NYT. It sounds like a good read.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS 14th February 2011&lt;/b&gt;: Computer programs are intolerant of programming errors. So, computer programmers tried to avoid them at all costs, not always successfully. Doing so becomes a much bigger challenge as software grows in size and complexity. Now some programmers are trying a different approach, that involves recognising that there will always be programming errors. For more, see "Let It Crash" Programming" by Craig Stunz at &lt;a href="http://blogs.teamb.com/craigstuntz/2008/05/19/37819/%20"&gt;http://blogs.teamb.com/craigstuntz/2008/05/19/37819/&amp;nbsp;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;PS 15th February 2011:&lt;/b&gt; &lt;span style="font-size: small;"&gt;"&lt;a href="http://www.healthnewsreview.org/blog/2011/02/why-negative-studies-are-good-for-health-journalism-and-where-to-find-them.html"&gt;Why negative studies are good for health journalism, and where to find them&lt;/a&gt;" "&lt;/span&gt;&lt;br /&gt;&lt;div style="font-family: inherit;"&gt;&lt;span style="font-size: small;"&gt;&lt;i&gt;This is a guest column by &lt;a href="http://twitter.com/ivanoransky" target="_blank"&gt;Ivan Oransky, MD&lt;/a&gt;, who is executive editor of Reuters Health and blogs at &lt;a href="http://embargowatch.wordpress.com/" target="_blank"&gt;Embargo Watch&lt;/a&gt; and &lt;a href="http://www.retractionwatch.wordpress.com/" target="_blank"&gt;Retraction Watch&lt;/a&gt;.&lt;/i&gt;One of the things that makes evaluating medical evidence difficult is  knowing whether what's being published actually reflects reality. Are  the studies we read a good representation of scientific truth, or are  they full of cherry-picked data that help sell drugs or skew policy  decisions?..."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;PS: 21 February 2011&lt;/b&gt;: See also the &lt;a href="http://www.admittingfailure.com/search-failures/"&gt;Admitting Failure website&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: small;"&gt;&lt;b&gt;PS: 23 April 2011&lt;/b&gt;. See today's Bad Science column in the Guardian by Ben Goldacre, titled "&lt;/span&gt;&lt;a href="http://www.badscience.net/2011/04/i-foresee-that-nobody-will-do-anything-about-this-problem/"&gt;I foresee that nobody will do anything about this problem&lt;/a&gt;", on the difficulty of getting negative findings published&lt;/div&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;div class="MsoListParagraph" style="font-family: inherit; margin-left: 0cm;"&gt;&lt;b&gt;PS: 23 May 2011 &lt;/b&gt;.The above analysis of DFID project ratings focuses on the recognition of failures that have already occurred. It is also possible, and important, to take steps to ensure that failures are possible to be recognised in the first place. A project that has no clear theory of change will be difficult to evaluate and thus difficult to classify as a success or failure. The most common means of describing a development project’s theory of change is probably via a LogFrame representation. Within a reasonably well constructed LogFrame representation there is a sequence of “if…and…then…” statements, spelling out what is expected to happen as the project is implemented and takes effect. While there may be positive developments in a project’s Goal level indicators, there also needs to be associated evidence that the expected chain of causation leading to that Goal has also taken place as expected. It is not uncommon, in my experience, to find that while the expected outcomes have occurred, the outputs that were meant to contribute to those outcomes were not successfully delivered. In this situation the project cannot claim to be successful. There is however a more generic point to be made here.&lt;span style="mso-spacerun: yes;"&gt;&amp;nbsp; &lt;/span&gt;The more detailed a project’s ToC is the more vulnerable it will be to disproof. Any one of the many expected causal links could be found to have not worked as expected. However, if these linkages have not been disproved, then the stronger the project’s claims will be to have contributed to any expected and observed changes. Willingness to allow failure to be identified strengthens the claim of any success that is observed. This seems an important observation in the case of projects where there is no possibility of making comparisons with a control group where there was no intervention. In those circumstances a ToC should be as detailed and articulate as possible.&lt;/div&gt;&lt;div class="MsoListParagraph" style="font-family: inherit; margin-left: 0cm;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoListParagraph" style="font-family: inherit; margin-left: 0cm;"&gt;&lt;b&gt;PS: 23 May 2011 &lt;/b&gt;:Articulating more disprovable theories of change may sound like a good idea, but it could be argued that this requirement risks locking aid agencies into a static view of the world they are working in, and one which is developed quite early in their intervention. In many settings, for example in humanitarian emergencies and highly politicised environments, aid agencies often have to revisit, revise and adapt their views of what is happening and how they should best respond. The best that might be expected in these circumstances is that those agencies are able to construct a detailed (and disprovable) &lt;i style="mso-bidi-font-style: normal;"&gt;history&lt;/i&gt; of what happened. &lt;span style="mso-spacerun: yes;"&gt;&amp;nbsp;&lt;/span&gt;This could actually produce better (i.e. more disprovable) results. There is some research evidence which shows that people find it easier to imagine events in some detail when they are situated in the past than to imagine the same kind of events taking place in the future&lt;a href="http://www.blogger.com/post-edit.g?blogID=6719829&amp;amp;postID=1890236059624698450#_edn1" name="_ednref1" style="mso-endnote-id: edn1;" title=""&gt;&lt;span class="MsoEndnoteReference"&gt;&lt;span style="mso-special-character: footnote;"&gt;&lt;span class="MsoEndnoteReference"&gt;&lt;span style="font-size: 10pt;"&gt;[i]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;.&lt;/div&gt;&lt;div class="MsoListParagraph" style="font-family: inherit; margin-left: 0cm;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="mso-element: endnote-list;"&gt;&lt;br clear="all" /&gt;  &lt;hr align="left" size="1" width="33%" /&gt;    &lt;div id="edn1" style="mso-element: endnote;"&gt;  &lt;div class="MsoEndnoteText"&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=6719829&amp;amp;postID=1890236059624698450#_ednref1" name="_edn1" style="mso-endnote-id: edn1;" title=""&gt;&lt;span class="MsoEndnoteReference"&gt;&lt;span style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9.0pt; mso-ascii-theme-font: minor-latin; mso-hansi-theme-font: minor-latin;"&gt;&lt;span style="mso-special-character: footnote;"&gt;&lt;span class="MsoEndnoteReference"&gt;&lt;span style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9.0pt; mso-ansi-language: EN-US; mso-ascii-theme-font: minor-latin; mso-bidi-font-family: &amp;quot;Times New Roman&amp;quot;; mso-bidi-language: AR-SA; mso-fareast-font-family: &amp;quot;Arial Unicode MS&amp;quot;; mso-fareast-language: JA; mso-hansi-theme-font: minor-latin;"&gt;[i]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9.0pt; mso-ascii-theme-font: minor-latin; mso-hansi-theme-font: minor-latin;"&gt; Bavelas, J.B. (1973) &lt;i style="mso-bidi-font-style: normal;"&gt;Effects of the temporal context of information&lt;/i&gt;, Psychological Reports, 35, 695-698, cited in &lt;i style="mso-bidi-font-style: normal;"&gt;Dance with Chance&lt;/i&gt;, by Makridakis, Hogarth and Gaba, 2009, page189.&lt;/span&gt;&lt;span lang="EN-GB" style="font-family: &amp;quot;Calibri&amp;quot;,&amp;quot;sans-serif&amp;quot;; font-size: 9.0pt; mso-ansi-language: EN-GB; mso-ascii-theme-font: minor-latin; mso-hansi-theme-font: minor-latin;"&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-1890236059624698450?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">7</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2010/10/do-we-need-minimal-level-of-failure-mlf.html</feedburner:origLink></item><item><title>Meta-narratives, evaluation and complexity</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/jBoImOQpKSY/meta-narratives-evaluation-and.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Mon, 25 Oct 2010 02:03:42 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-868941310771553282</guid><description>&lt;div class="MsoNormal"&gt; &lt;span lang="EN-GB"&gt;A meta-narrative is &lt;/span&gt;&lt;a href="http://en.wikipedia.org/wiki/Metanarrative"&gt;&lt;span lang="EN-GB"&gt;a story about stories&lt;/span&gt;&lt;/a&gt;&lt;span lang="EN-GB"&gt;. Some evaluations take this form, especially those using participatory approaches to obtain qualitative data from a diversity of sources. Even more conventional expert-led evaluations have an element of storytelling to them as they attempt to weave information obtained from various sources, often opportunistically, into a coherent and plausible overall picture of what happened, and what might happen in future.&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;span lang="EN-GB"&gt;Recently I have come across two examples of evaluations that were very much about creating a story about stories. They raised interesting questions about method: how can it be done well?&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;b&gt;&lt;span lang="EN-GB"&gt;Stories about Culture&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;b&gt;&lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;span lang="EN-GB"&gt;The first evaluation was of a multiplicity of small arts projects in developing countries, funded by &lt;/span&gt;&lt;a href="http://www.doen.nl/web/show/id=44928"&gt;&lt;span lang="EN-GB"&gt;DOEN&lt;/span&gt;&lt;/a&gt;&lt;span lang="EN-GB"&gt;, a Dutch funding agency. Claudia Fontes used the &lt;a href="http://mande.co.uk/special-issues/most-significant-change-msc/"&gt;Most Significant Change technique&lt;/a&gt; to elicit and analyse 95 stories from a sample of different kinds of participants in these projects. The aim was to identify what DOEN’s cultural intervention meant to the primary stakeholders. What particularly interested me was one part of the MSC process, which can be a useful step when faced with a large number of stories. This involved the participants categorising the stories into different groupings, according to their commonalities. It was from each of these groupings that the participants then went on to select, through intensive discussion, what they saw as the most significant changes of all. In one country five categories of stories were identified: Personal Development and Growth, Professional Development, Exposure, Change Of Perception And Attitude Towards Art And Artists, and Validation Of Self-Expression. Later on, at the report writing stage, Claudia looked at the contents of these groupings, especially the MSC stories within each, and produced an interpretation of how these groups of stories linked together. In other words, a meta-narrative.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt; &lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA" style="margin-left: 35.45pt;"&gt; &lt;span style="font-size: 10pt;"&gt;“For the primary stakeholders in XXXX these categories of change relate to each other in that the &lt;i&gt;personal&lt;b&gt; &lt;/b&gt;and professional development&lt;/i&gt; of artists and other professionals who support the artists’ work results in a &lt;i&gt;validation of the self-expression&lt;/i&gt; of direct (artists) and indirect (public in general) users. This process of affirmation and recovery of ownership of &lt;i&gt;self-expression&lt;/i&gt; contributes in turn to a change in &lt;i&gt;society’s perception of art and artists&lt;/i&gt; with the potential to make the whole cycle of change sustainable for the sector. Strategies of &lt;i&gt;exposure&lt;/i&gt; have a key role in contributing across these changes, and towards the profiling of the sector in general” (italics added)&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;span style="font-size: 11pt;"&gt;&lt;o:p&gt;&amp;nbsp;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; In commenting on the report I suggested that in future it might be possible and useful to take a participatory approach to the same task of producing a meta-narrative. Faced with the five groupings (and knowledge of their contents) each participant could be asked to identify expected causal connections between the different groupings, and give some explanation of these views. This can be done through a &lt;a href="http://mande.co.uk/special-issues/participatory-aggregation-of-qualitative-information-paqi/"&gt;simple card sorting exercise&lt;/a&gt;. The results from multiple participants can then be aggregated, and the result will take the form of a network of relationships between groupings, some being stronger than others (stronger in the sense of more participants’ highlighting a particular causal linkage). This emergent structure can then be visualized using &lt;a href="http://mande.co.uk/2008/media/software/social-network-analysis-software-a-list/"&gt;network software&lt;/a&gt;. Once visualised in this manner, the structure could be the subject of discussion, and perhaps some revisions.&amp;nbsp; One important virtue of this kind of process is that it will not necessarily produce a single dominant narrative. Minority and majority views will be discernable. And using network visualization software, the potential complexity would be manageable. Network views can be filtered on multiple variables, such as strength of the causal linkages.&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;b&gt;Stories about Conflict&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;b&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; The second evaluation was done by Lundy and McGovern, of &lt;a href="http://www.community-relations.org.uk/fs/doc/shared-space-issue-1-d-lundy-mcgovern.pdf"&gt;Community –based approaches to Post-Conflict “Truth telling” in Northern Ireland”&lt;/a&gt; I was sent this and other related papers by &lt;a href="http://www.incore.ulst.ac.uk/cgi-bin/staff/staffupdate.pl?task=INCORE"&gt;Ken Bush&lt;/a&gt;, who is exploring methods for evaluating story-telling as a peace building methodology. His draft conceptual framework notes that a “survey of the literature highlighted the lack of an agreed and effective evaluation tool for story-telling in peace-building despite the near universality of the practice and the huge monetary investment by the EU and others in story-telling projects.”&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; Lundy and McGovern’s paper is a good read, because it explores the many important complications of storytelling in a conflicted society. Not only important issues like appropriate sampling of story tellers, but how the story telling project intentions were framed, and the how the results were presented. The primary product of the project was a publication called “&lt;a href="http://www.amazon.co.uk/Ardoyne-Untold-Truth-Commemoration-Project/dp/1900960176"&gt;Ardoyne: The Untold Truth&lt;/a&gt;”, containing testimonies based on 300 interviews.&amp;nbsp; The purpose of Lundy and McGovern’s assessment of the project was “to assess the impacts and benefits of community based “truth telling”. This was done by interviewing 50 people from five different stakeholder groups. The results were then written up in their paper.&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; What we have here is daunting in its complexity: (a) There are the “original” stories, as compiled in the book, (b) then the stories of people’s reactions to these stories and how they were collected and disseminated, (c) and then the authors’ own story about how they collected these stories and&amp;nbsp; their interpretation of them as a whole. And of course behind all this we have the complex (as colloquially used) context of Northern Ireland!&lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;When reading what might be called Lundy and McGovern’s meta-meta-narrative (i.e. the interpreted results of the interviews) I looked for information on how sources were cited. These are the sorts of phrases I found: “according to respondents”, “many”, ”there was evidence”, “most”, “the vast majority”, “It was felt”, “respondents”, “in the main”, “for many”, “many people”, “there was a very strong opinion”, “it was felt”, “there was a consensus”. “for the majority of participants”, “without exception”, “many interviews”, “overwhelmingly”, “for others”, “some”, “for these respondents”, “one of the most frequently mentioned”, “it was further suggested”, “most respondents”, “the view”, “it was further suggested”, in general respondents were of the view that”, “the experience of those involved…would seem to suggest”, “some respondents”, “the overwhelming majority”, “responses from Union representatives were”, “for some”, “a representative of the community sector”, “that said, others were”, “by another interviewee”. “it was”, and “a significant section of mainly nationalist interviewees”.&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; I list these here with some hesitation, knowing how often during evaluations I have resorted to using the same vocabulary, when faced with making sense of many different comments by different sources, in a limited period of time. However there are important issues here, made even more important by how often we have to deal with situations like this. How people see things, like their reactions to the Ardoyn stories, matters. How many people see things in a given way matters, who those groups of people are matters, and how the views of different groups overlap also matters. In Lundy and McGovern’s paper we only get glimpses of this kind of underlying social structure. We sometimes get a sense of majority or minority and occasionally which particular group holds a view, and sometimes that a group sharing one view also thinks that…&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; How could it be done differently? The views of a set of respondents can be summarised in a "two-mode" matrix, with respondents listed in the rows and the descriptions of views listed in columns, and cell values indicating what is known about a person’s views on a listed issue. For example, agreement/disagreement, degree of agreement, or not known. By itself this data is not easy to analyse, other than through frequency counts (e.g. # of people supporting x view, or # views expressed by x person). But it is possible to convert this data into two different kinds of one-mode matrix, showing: (&lt;i&gt;a&lt;/i&gt;) how different people are connected to each other (by their shared views) and, (&lt;i&gt;b&lt;/i&gt;) how different views are connected to each other (by the same people holding those views). The networks structure of the data in these matrices can be seen and further manipulated using &lt;a href="http://mande.co.uk/2008/media/software/social-network-analysis-software-a-list/"&gt;network visualization software&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; As in many evaluations, Lundy and McGovern were constrained by a confidentiality commitment. Individuals can be anonymised by being categorized into types of people, but this may have its limits if the number of respondents is small and the identity of participants is known to others (if not their specific views). This means the potential to make use of the first kind of network visualization (i.e. &lt;i&gt;a&lt;/i&gt;) may be limited, even if the network visualization showed the relationships between &lt;i&gt;types&lt;/i&gt; of respondents. However, the second type (i.e. &lt;i&gt;b&lt;/i&gt;) should remain an option. To recap, this would show a network of opinions, some strongly linked to many others because they were often shared by the other respondents, others with weaker links to fewer others because they were shared with few, if any, other respondents. The next step would be the development of a narrative, commentary explaining the highlights of the network structure. This would usefully focus on the contents of the different clusters of opinions, and the nature of any bridges between them, especially of the clusters expressed contrasting views.&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; There are two significant hurdles in front of this approach. Typically not all respondents will express views on all topics, and the number who do will vary across topics. One option would be to filter out the views with the least number of respondents. The other, which I have never tried, would be interesting to explore. That would be to build in a supplementary question in interviews, along the lines of&amp;nbsp; “&lt;i&gt;…and how many people do you think would feel the same way as you on this issue?”. &lt;/i&gt;Their answers would be important in themselves, possibly affecting how the same people might act on their own views. But the same answers could also provide a weighting mechanism for views in an otherwise small sub-sample.&lt;br /&gt;&lt;br /&gt;The second hurdle is that the network description of the relationships between the participants views is a snapshot in time. But an evaluation usually requires comparison, with a prior state. This is a problem if the&amp;nbsp; questions asked by Lundy and McGovern were about current opinions. but if they were about &lt;i&gt;changes&lt;/i&gt; in people's views it would not be.&lt;br /&gt;&lt;br /&gt;Lets return to the layer below, the stories collected in the original “Ardoyne: The Untold Truth” publications. Stories beget stories. The telling of one can prompt the telling of another. If stories can be seen as linked in this way, then as the number number of stories recounted grows we could end up with a &lt;a href="http://www.google.com/search?client=gmail&amp;amp;rls=gm&amp;amp;q=%22network%20of%20stories%22"&gt;network of stories&lt;/a&gt;. Some stories&amp;nbsp; in that network may be told more often than others, because they are connected to many others, in the minds of the storytellers. These stories might be what complexity science people call "attractors" Although storytellers may start off telling various different stories, their is a likelihood many of them will end up telling this particular story, because of its connectedness, its position in the network.&amp;nbsp; If these stories are negative, in the sense of provoking antipathy towards others in the same community, then this type of structure may be of concern. Ideally the attractors, the highly connected stories in the network would be positive stories, encouraging peace and cooperation with others. This network structure of stories could be explored by an evaluator asking questions like "What other stories does this story most remind you off? or, "Which of these stories does that story most remind you of?" Or versions thereof. When comparing changes over time the evaluator's focus would then be on the changing contents of the strongly connected versus weakly connected stories.&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; In this discussion above I have outlined how a network approach could help us construct various types of aggregated (network) views of multiple stories. Because they are built up out of the views of individuals, it would be possible to see where there were varying degrees of agreement within those structures. They would not be biased towards a single (excluding others) narrative, a concern of many people using story-telling approaches including some of the originators of the term &lt;a href="http://en.wikipedia.org/wiki/Metanarrative"&gt;meta-narrative&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;b&gt;And complex histories&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;b&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; My final comments relate to another form of story-telling, that is grand narrative as done by historians. Yesterday I read with interest Niall Ferguson’s &lt;a href="http://www.informationclearinghouse.info/article24874.htm"&gt;Complexity and Collapse: Empires on the Edge of Chaos&lt;/a&gt;(originally in &lt;a href="http://www.foreignaffairs.com/articles/65987/niall-ferguson/complexity-and-collapse"&gt;Foreign Affairs&lt;/a&gt;). In this article Niall describes the ways some historians have sought to explain the rise and fall in empires, in terms of sequences of events taking place over long periods. In his view they suffer from what Nassim Taleb calls "the narrative fallacy": they construct psychologically satisfying stories on the principle of post hoc, ergo propter hoc ("after this, therefore because of this”). That is, the propensity to over-explain major historical events, to create a long and coherent story where in fact there was none. His alternate view is couched in terms of complexity theory ideas. Given the complexity of modern societies “In reality, the proximate triggers of a crisis are often sufficient to explain the sudden shift from a good equilibrium to a bad mess.” He then qualifies the notion of equilibrium: “a complex economy is characterized by the interaction of dispersed agents, a lack of central control, multiple levels of organization, continual adaptation, incessant creation of new market niches, and the absence of general equilibrium.” Within those systems small changes can have catastrophic (i.e. non-linear) effects, because of the nature of the connectivity involved. Ferguson then goes onto recount examples of the rapidity of decline in some major empires.&lt;br /&gt;&lt;br /&gt;One point which he does not make, but which&amp;nbsp; I think is implicit in his discription of how change can happen in complex systems is that more than one type of small change can trigger the same kind of large scale change. Consider the &lt;a href="http://www.blogger.com/goog_1589961627"&gt;assissination of &lt;/a&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://en.wikipedia.org/wiki/Assassination_of_Archduke_Franz_Ferdinand_of_Austria"&gt;Archduke Franz Ferdinand of Austria&lt;/a&gt; in Sarajevo in June 1914. Would World War 1 not have happened if that event took place? Not speaking as a historian..my guess is that there are quiet a few other events that could have triggered the start of a war thereafter.&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;br /&gt;&lt;/div&gt;&lt;div class="BodyAA"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;/div&gt;Niall Ferguson complexity based view is in a sense a technocrat’s objection to grand narratives, but perhaps also another kind of grand narrative in its own right. Nevertheless his view does seem to have practical relevance to the writing of evaluation stories: it highlights the need for caution about excessive internal coherence in any story of change and its causes. A network view of causal relationships between types of events, constructed by participants with differing views, might help mitigate against this risk, when it needs to be reduced to a text description.&lt;br /&gt;&lt;br /&gt;PS: "&lt;i&gt;In recent years, however,  advancements in cognitive neuroscience have suggested that memories unfold  across multiple areas of the cortex simultaneously, like a richly  interconnected  network of stories, rather than an archive of static files&lt;/i&gt;." in &lt;a href="http://www.wired.com/wired/archive/10.04/sacks_pr.html"&gt;The Fully Immersive Mind of Oliver Sacks&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;PS 25 October 2010. Please also see&amp;nbsp; &lt;a href="http://mande.co.uk/special-issues/participatory-aggregation-of-qualitative-information-paqi/#self-categorised"&gt;&lt;span style="font-size: small;"&gt;&lt;span style="color: red;"&gt;Networks of self-categorised stories&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-868941310771553282?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">3</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2010/08/meta-narratives-evaluation-and.html</feedburner:origLink></item><item><title>Cynefin Framework versus Stacey Matrix  versus network perspectives</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/yrl1aYnuDDI/test3.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Wed, 25 Aug 2010 00:52:34 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-2234597720524152199</guid><description>&lt;span style="font-size: large;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;br /&gt;&lt;span style="font-size: large;"&gt;Cynefin&lt;/span&gt;&lt;br /&gt;&lt;div align="center" style="text-align: center;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_QeRS3Af8hns/TG6mj699jqI/AAAAAAAAB0A/4TQ5Nq2aFsM/s1600/cynefin.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/_QeRS3Af8hns/TG6mj699jqI/AAAAAAAAB0A/4TQ5Nq2aFsM/s320/cynefin.bmp" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;Lots of people seem to like the &lt;a href="http://en.wikipedia.org/wiki/Cynefin"&gt;Cynefin Framework&lt;/a&gt;. &lt;a href="http://www.clearhorizon.com.au/discussion/rigorous-evaluation-practice-that-embraces-complexity/"&gt;Jess Dart and Patricia Rogers&lt;/a&gt; are some of my friends and colleagues of mine who have expressed a liking for it. It was one of the subjects of discussion in the recent &lt;a href="http://evaluationrevisited.wordpress.com/"&gt;Evaluation Revisited &lt;/a&gt;conference in Utrecht in May. Why don’t I like it? There are three reasons...&lt;br /&gt;&lt;br /&gt;Usually matrix classifications of possible states are based on the intersection of two dimensions. They can provide good value because combining two dimensions to generate four (or more) possible states is a compact and efficient way of describing things. Matrix classifications have &lt;i&gt;parsimony&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;But whenever I look at descriptions of the Cynefin Framework I can never see, or identify, what the two dimensions are which give the framework its 2 x 2 structure, and from which the four states are generated. If they were more evident I might be able to use them to identify which of the four states best described the particular conditions I was facing at a given time. But up to now I just have to make a best guess, based on the description of each state. PS: I have been told by someone recently that Dave Snowden says this is not a 2x2 matrix, but if so, why is presented like one?&lt;br /&gt;&lt;br /&gt;My second concern is the nature of the connection between this fourfold classification and other research on complexity, beyond the field of management studies and consultancy work. IMHO, I don’t think there is much in the way of a theoretical or empirical basis for it, especially when Dave’s fifth state of “disorder”, is placed in the centre. This may be the reason why the two axes of the matrix I mentioned above have not been specified, ...because they have not yet been found.&lt;br /&gt;&lt;br /&gt;My third concern is that I don’t think the fourfold classification has much discriminatory power. Most the situations I face, as an evaluator, could probably be described as complex. I don’t see many really chaotic ones, like gyrating stock markets or changeable weather patterns, nor do I see many that could be described as simple, or just complicated. Except perhaps when dealing with single person’s task, not involving interactions with others. Given the prevalence of complex situations, I would prefer to see a matrix that helped me discriminate&lt;i&gt; between different forms of complexity&lt;/i&gt;, and their possible consequences.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: large;"&gt;Stacey &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div align="center" style="text-align: center;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_QeRS3Af8hns/TG6mtjHJ9BI/AAAAAAAAB0I/Yy3Nz4Rmyx0/s1600/stacey1.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_QeRS3Af8hns/TG6mtjHJ9BI/AAAAAAAAB0I/Yy3Nz4Rmyx0/s320/stacey1.bmp" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;This brings me to Stacey's matrix, which does have two identifiable dimensions shown above: certainty (i.e. the predictability of events) and the degree of agreement over those events. Years before I had heard of "Stacey's matrix"" I had found the same kind of 2 x 2 matrix a useful means of describing four different kinds of possible development outcomes which had different&amp;nbsp; implications for what sort of M&amp;amp;E tools would be most relevant. For example, by definition you cannot use predefined indicators to monitor unpredictable outcomes (regardless of whether we agree or disagree on their significance). However methods like MSC can be used to monitor these kinds of change. And a good case could be made for more attention to the use of historian's skills, especially to respond to unexpected events that are of dispute meaning. More recently I argued that &lt;a href="http://mande.co.uk/special-issues/weighted-checklists/"&gt;weighted checklists&lt;/a&gt; are probably the most suitable for tracking outcomes that are predictable but where there is not necessary any agreement about their significance. A &lt;a href="http://books.google.com/books?id=gd_RvUbSWnsC&amp;amp;lpg=PP1&amp;amp;ots=pP4zzQhXTg&amp;amp;dq=patton%20%22developmental%20evaluation%22&amp;amp;pg=PA85#v=snippet&amp;amp;q=utility&amp;amp;f=false"&gt;quote from Patton&lt;/a&gt; could be hijacked and used here "&lt;i&gt;These distinctions help with situation recognition&amp;nbsp; so that an evaluation approach can be selected that is appropriate to a particular situation and intervention, thereby increasing the likely utility -and actual use- of the evaluation&lt;/i&gt;" (page 85, Developmental Evaluation)&lt;br /&gt;&lt;br /&gt;From what I have read I think Ralph Stacey also produced the following more detailed version of his matrix:&lt;br /&gt;&lt;br /&gt;&lt;div align="center" style="text-align: center;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_QeRS3Af8hns/TG6m5wcXrQI/AAAAAAAAB0Q/vjQD39kZu6M/s1600/stacey2.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_QeRS3Af8hns/TG6m5wcXrQI/AAAAAAAAB0Q/vjQD39kZu6M/s320/stacey2.bmp" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;This has then been simplified by &lt;a href="http://www.plexusinstitute.org/edgeware/archive/think/main_aides3.html"&gt;Brenda Zimmerman&lt;/a&gt;, as follows&lt;br /&gt;&lt;br /&gt;&lt;div align="center" style="text-align: center;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_QeRS3Af8hns/TG6nBqyscKI/AAAAAAAAB0Y/QFr69XSd-tU/s1600/zimmerman.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_QeRS3Af8hns/TG6nBqyscKI/AAAAAAAAB0Y/QFr69XSd-tU/s320/zimmerman.bmp" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;In this version simple, complicated complex and anarchy (chaos) are in effect part of a continuum, involving different mixes of agreement and certainty. Interestingly, from my point of view, the category taking up the most space in the matrix is that of complexity, echoing my gut level feeling expressed above. This feeling was supported when I read Patton's three examples of simple, complicated and complex (page92, ibid), based on Zimmerman. The simple and complicated examples were both about making &lt;i&gt;materials&lt;/i&gt; do what you wanted (cake mix and rocket components), whereas the complex example was about child rearing i.e. getting &lt;i&gt;people&lt;/i&gt; to do what you wanted. More interesting still, the complex example was raising a couple of children in&amp;nbsp; family, in other words a &lt;i&gt;small&lt;/i&gt; group of people.So anything involving more people is probably going to be a whole lot more complex. PS: And interestingly along the same lines, the difference between simple and complicated was a physical task involving &lt;i&gt;one&lt;/i&gt; person (following a recipe) and one involving &lt;i&gt;large numbers&lt;/i&gt; of people (sending a rocket into space)&lt;br /&gt;&lt;br /&gt;Another take on this is given by &lt;a href="http://informalcoalitions.typepad.com/informal_coalitions/2009/08/staceys-certainty-agreement-matrix-and-levels-of-complexity.html"&gt;Chris Rodgers&lt;/a&gt; comments on Stacey’s views:&lt;br /&gt;&lt;blockquote&gt;&lt;div class="MsoNormal"&gt;&lt;i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;“&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;Although the framework, which Stacey had developed in the mid-1990s, regularly crops up in blogs, on websites and during presentations, he no longer sees it as valid and useful.&amp;nbsp; His comment explains why this is the case, and the implications that this has for his current view of complexity and organizational dynamics.&amp;nbsp; In essence, he argues that&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;ul type="disc"&gt;&lt;li class="MsoNormal"&gt;&lt;i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;life      is complex all the time, not just on those occasions which can be      characterized as being “far from certainty” and “far from agreement” …&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li class="MsoNormal"&gt;&lt;i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;this      is because change and stability are inextricably intertwined in the      everyday conversational life of the organization …&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li class="MsoNormal"&gt;&lt;i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;which      means that, even in the most ordinary of situations, something unexpected      might happen that generates far-reaching and unexpected outcomes …&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li class="MsoNormal"&gt;&lt;i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;and      so, from this perspective, there are no “levels of complexity” …&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li class="MsoNormal"&gt;&lt;i&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;nor      levels in human action that might usefully be thought of as a “system”.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;Well maybe,… but this is beginning to sound a bit too much like the utterances of a Zen master to me. Like Rodgers, I hope we can still make &lt;i&gt;some&lt;/i&gt; kind of useful distinctions re complexity.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size: large;"&gt;Back to Snowden&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Which brings me back to a more recent statement by Dave Snowden, which to me seems more useful than his earlier Cynefin Framework. In his presentation at the Gurteen Knowledge Cafe, in early 2009, &lt;a href="http://conradiator.wordpress.com/2009/02/26/dave-snowden-on-managing-complexity/"&gt;as reported by Conrad Taylor&lt;/a&gt;, "Dave presented &lt;i&gt;three&lt;/i&gt; system models: ordered, chaotic and complex. By ‘system’ he means networks that have coherence, though that need not imply sharp boundaries. ‘Agents’ are defined as anything which &lt;i&gt;acts&lt;/i&gt; within a system. An agent could be an individual person,or a grouping; an idea can also be an agent, for example the myth-structures which largely determine how we make decisions within the communities and societies within which we live."  &lt;br /&gt;&lt;ul style="margin-top: 0cm;" type="disc"&gt;&lt;li class="MsoNormal"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;"Ordered systems&lt;/span&gt;&lt;/b&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt; are ones in which the      actions of agents are constrained by the system, making the behavior of      the agents predictable. Most management theory is predicated on this view      of the organisation."&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul style="margin-top: 0cm;" type="disc"&gt;&lt;li class="MsoNormal"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;Chaotic systems&lt;/span&gt;&lt;/b&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt; are ones in which the      agents are unconstrained and independent of each other. This is the domain      of statistical analysis and probability. We have tended to assume that      markets are chaotic; but this has been a simplistic view."&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul style="margin-top: 0cm;" type="disc"&gt;&lt;li class="MsoNormal"&gt;&lt;b&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt;"Complex systems&lt;/span&gt;&lt;/b&gt;&lt;span style="font-family: &amp;quot;Times New Roman&amp;quot;,&amp;quot;serif&amp;quot;; font-size: 12pt;"&gt; are ones in which the      agents are &lt;i&gt;lightly&lt;/i&gt; constrained by the system, and through their      mutual interactions with each other and with the system environment, the      agents also modify the system. As a result, the system and its agents      ‘co-evolve’. This, in fact, is a better model for understanding markets,      and organisations.”&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;This conceptualization is simpler (i.e. has more economy) and seems more connected with prior research on complexity. My favorite relevant quote here is Stuart &amp;nbsp;Kauffman’s book: &lt;a href="http://www.amazon.com/At-Home-Universe-Self-Organization-Complexity/dp/0195111303"&gt;At home in the Universe: The search for the laws of complexity&lt;/a&gt; (p86-92) where he describes the behavior of electronic models of networks of actors (with on/off behavior states for each actor) moving from simple to complex to chaotic patterns, &lt;i&gt;depending on the number of connections between them. &lt;/i&gt;As I read it, few connections generate ordered (stable) network behavior, many connections generate chaotic (apparently unrepeating) behavior, and medium numbers (where N actors = N connections) generate complex cyclical behavior. (&lt;a href="http://fias.uni-frankfurt.de/%7Ewilladsen/RBN/"&gt;See more on Boolean networks&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;This relates back to conversation that I had with Dave Snowden in 2009 about the value of a network perspective on complexity, in which he said (as I remember) that relationships within networks can be seen as constraints. So, as i see it, in order to differentiate forms of complexity we should be looking at the nature of the specific networks in which actors are involved: Their number, the structure of relationships, and perhaps the extent to which the actors have own individual autonomy i.e. responses which are not specific to particular relationships (an attribute not granted to “actors” in the electronic model described).&lt;br /&gt;&lt;br /&gt;My feeling is that with this approach it might even be possible to link this kind of analysis back to Stacey’s 2x2 matrix. Predictability might be primarily a function of connectedness, and therefore more problematic in larger networks where the number of possible connections is much higher. The possibility of agreement, Stacey’s second dimension, might be further dependent the extent to which actors’ have some individual autonomy within a given network structure.&lt;br /&gt;&lt;br /&gt;To be continued…&lt;br /&gt;&lt;br /&gt;PS1:Michael Quinn Patton's book on &lt;a href="http://books.google.com/books?id=s5okv_bZ8EQC&amp;amp;pg=PA351&amp;amp;lpg=PA351&amp;amp;dq=patton+%22developmental+evaluation%22+%22google+books%22&amp;amp;source=bl&amp;amp;ots=CKt-VSXQ0L&amp;amp;sig=bfMLYyRLITne8_syLNyDm-W0hNU&amp;amp;hl=en&amp;amp;ei=pE9yTKu7Hcb74AbC6ajwCg&amp;amp;sa=X&amp;amp;oi=book_result&amp;amp;ct=result&amp;amp;resnum=4&amp;amp;ved=0CB4Q6AEwAw#v=onepage&amp;amp;q&amp;amp;f=false"&gt;Developmental Evaluation&lt;/a&gt; has a whole chapter on "Distinguishing Simple, Complicated, and Complex". However, I was surprised to find that despite the book's focus on complexity, there was not a single reference in the Index to "networks". There was one example of a network model (Exhibit 5.3) , contrasted with a Linear Program Logic Model..." (Exhibit 5.2) in the chapter on Systems Thinking and Complexity Concepts. [I will elaborate further &lt;a href="http://mande.co.uk/?p=2230"&gt;here&lt;/a&gt;]&lt;br /&gt;&lt;br /&gt;Regarding the simple, complicated and complex, on page 95 Michael describes these as "sensitising concepts, not operational measurements" This worried me a bit, but it is an idea with a history (&lt;a href="http://www.google.com/webhp?sourceid=gmail#sourceid=gmail&amp;amp;hl=en&amp;amp;source=hp&amp;amp;q=%22sensitising+concepts%22&amp;amp;aq=f&amp;amp;aqi=g-sx3&amp;amp;aql=&amp;amp;oq=&amp;amp;gs_rfai=&amp;amp;pbx=1&amp;amp;fp=93c3c78db929eee0"&gt;Look here&lt;/a&gt; for other views on this idea). But he then says "The purpose of making such distinctions is driven by the utility of situation recognition and responsiveness. For evaluation this means matching the evaluation to the nature of the situation" That makes sense to me, and is how have I tried to use the simple version of the Stacey Matrix (using dimensions only). However, Michael then goes on to provide, perhaps unintentionally, evidence of how useless these distinctions are in this respect, at least in their current form. He describes working with a group of 20 experienced teachers to design an evaluation of an innovative reading program. "They disagreed intensely about the state of knowledge concerning how children learn to read..Different preferences for evaluation flowed from different definitions of the situation. We ultimately agreed on a mixed methods design that incorporated aspects of both sets of preferences". Further on in the same chapter, Bob Williams is quoted reporting the same kind of result (i.e conflicting interpretations), in a discussion with health sector workers. PS 25/8/2010 - Perhaps I need to clarify here - in both cases participants could not agree on whether the situation under discussion was simple, complicated or complex, and thus these distinctions could not inform their choices of what to do. As I read it, in the first case the mixed method choice was a compromise, not an informed choice.&lt;br /&gt;&lt;br /&gt;PS2: I have also just pulled Melanie Mitchell's "&lt;a href="http://books.google.com/books?id=sSgzHayrDBsC&amp;amp;lpg=PP1&amp;amp;ots=eMc840Ig6j&amp;amp;dq=Melanie%20Mitchell%27s%20%22Complexity%3A%20A%20Guided%20Tour&amp;amp;pg=PR7#v=twopage&amp;amp;q&amp;amp;f=false"&gt;&lt;i&gt;Complexity: A Guided Tour&lt;/i&gt;&lt;/a&gt;" off the shelf, and re-scanned her Chapter 7 on "&lt;i&gt;Defining and&amp;nbsp; Measuring Complexity&lt;/i&gt;". She notes that about &lt;b&gt;40&lt;/b&gt; different measures of complexity have been proposed by different people. Her conclusion, 17 pages later, is that "The diversity of measures that have been proposed indicates that the notions of complexity that we're trying to get at have many different interacting dimensions and probably cant be captured by a single measurement scale" This is not a very helpfull conclusion. But I noticed that she does cite earlier what seem to be three &lt;i&gt;categories&lt;/i&gt; of measures that cover many of the 40 or so measures: These are: 1. how hard the  object or process is to describe?, 2. How had it is to create?, and 3.  What is its degree of organisation?&lt;br /&gt;&lt;br /&gt;PS3: I have followed up John Caddell's advice to read &lt;a href="http://www.storycoloredglasses.com/2010/06/confluence.html"&gt;a blog post by Cynthia Kurtz&lt;/a&gt; (a co-author of the IBM Systems Journal paper on Cynefin) recalling some of the early work around the framework. In that post was the following version of the Cynefin Framework included in the oft-mentioned "&lt;a href="http://alumni.media.mit.edu/%7Ebrooks/storybiz/kurtz.pdf"&gt;The new dynamics of strategy: Sense-making in a complex and complicated world&lt;/a&gt;" published in the IBM SYSTEMS JOURNAL, VOL 42, NO 3, 2003.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_QeRS3Af8hns/TG-_UV4sd9I/AAAAAAAAB0g/3mJSDKa2rXs/s1600/connection+strengths.bmp" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_QeRS3Af8hns/TG-_UV4sd9I/AAAAAAAAB0g/3mJSDKa2rXs/s320/connection+strengths.bmp" /&gt;&lt;/a&gt;&lt;/div&gt;In her explanation of the origins of this version she says it had two axes: "the degree of imposed order" and "the degree of  self-organization." This I found interesting because these dimension have the potential to be measurable. If they are measureable, then the actual behavior of four identified systems could be compared. And we could then ask "Does their behavior differ in ways that have consequences for managers or evaluators?" I have previously speculated that there might be &lt;a href="http://en.wikipedia.org/wiki/Social_network"&gt;network measures&lt;/a&gt; that could describe these two measures: network density and network centrality. Network centrality could be the x axis, being low on the left and high on the right, and network density could be the y axis, low on the bottom and high on the top. How well the differences in these four types of network structures might capture our day-to-day notion of complexity is not yet clear to me. As mentioned way above, density does seem to be linked to differences between simple, complex and chaotic behavior. Maybe differences in centrality moderate/magnify the consequences of different levels of network density?&lt;br /&gt;&lt;br /&gt;PLEASE NOTE: To make a Comment, or to read the Comments already made on this post, click on the &lt;i&gt;Leave a Comment&lt;/i&gt;&amp;nbsp; link below, or &lt;a href="http://mandenews.blogspot.com/2010/08/test3.html#comments"&gt;directly here&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-2234597720524152199?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_QeRS3Af8hns/TG6mj699jqI/AAAAAAAAB0A/4TQ5Nq2aFsM/s72-c/cynefin.bmp" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">13</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2010/08/test3.html</feedburner:origLink></item><item><title>Evaluating a composite Theory of Change (ToC)</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/APZzD9JWx2E/evaluating-composite-theory-of-change.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Sun, 25 Apr 2010 03:30:52 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-8488695106349302862</guid><description>&lt;p&gt;&lt;/P&gt;One NGO I have been working with has a project that is being implemented via partner organisations in four different countries. There is one over-arching LogFrame capturing the generic ToC, but the situation in each country is quite different. So country specific LogFrames were developed to recognise that fact. However, for convenience of reporting to the back donor the progress report format has been based on the contents of the generic LogFrame. When it comes to the Mid-Term Review more attention will need to be paid to the country specific LogFrames. But then how will the four MTR results be systematically aggregated into one synthesis report?&lt;br /&gt;&lt;br /&gt;Other colleagues of mine have had to review a funding mechanism involving 30 or more project partners. The diversity of activities on the ground there is even greater. Rather than focus on the original LogFrame that describes the purpose of the funding mechanism they got all the project staff and local partners together to construct a retrospective ToC that fitted all the funded activities. That is how they have dealt with the micro-macro compatibility issue.&lt;br /&gt;&lt;br /&gt;How then have they evaluated the 30 projects in terms of this composite ToC? Well, because the ToC was reconstructed retrospectively there was no set of readily available monitoring data, for example as based on indicators in a LogFrame. Instead, the main source of evidence has been the qualitative data gathered from interviews with a large and diverse number of stakeholders. This approach has not met with approval in some quarters, which then prompted me to think how you could solve this problem. I should start by saying I do like the idea of a retrospectively constructed ToC, so long as there is also some accountability for its transition from any prior form such as a LogFrame.&lt;br /&gt;&lt;br /&gt;My first thought was to treat the 30 projects as “units of evidence” and to try to classify them as having achieved, or not achieved, each stage in the ToC. The ToC was in a graphic form, with multiple events happening at different stages, and with lines representing causal links connecting the various events.  The problem would then be how to classify each project. One possible means would be to get stakeholders to “success rank” the projects in relation to a given outcome event in the ToC, and then identify a cut-off point in the success ranking representing an acceptable level of achievement. This cut-off point would need to be explained and justified. To start with the focus of these success ranking exercises should be on those events most central to the ToC, i.e. those with many incoming and outgoing causal links. If there was sufficient time we would end up with a percentage (of projects) measure for each outcome, that could if necessary be weighted by scale of expenditure.&lt;br /&gt;&lt;br /&gt;We could then turn our attention to examining what happened to the expected causal relationships between the events in the ToC. At this stage it might be possible to do simple 2 x 2 cross tabulations of the relationships between related pairs of events in the ToC, by counting numbers of projects that had achieved both of the events, achieved one but not the other, and achieved none at all. A Chi Square test could tell us how significant the observed relationship was.&lt;br /&gt;&lt;br /&gt;There is however a potential difficulty with the next step in the process, which is to look at the larger picture of how all of the events are causally linked together to make the whole ToC work as expected. Event A may be linked to Event B by project X (and others) achieving both, and Event B may be linked to Event C by project Y (and others) achieving both. But if project X and Y are in different locations then the connection between Event A and C would seem very questionable, because the two Event Bs probably also happened in two different locations. In the worst case we could end up with a ToC where many of the individual casual links were working as expected, but where there was no evidence of the whole set of causal links working together. At the least we would have to investigate the relationships between the projects that created the larger causal pathway. In the imagined example above, we would need to look at projects X and Y and how their achievements of Event B inter-related.&lt;br /&gt;&lt;br /&gt;This is, and sounds, complex. One means of simplification would be to identify and focus on the most important causal pathway in the ToC. In my colleagues’ ToC there were at least two major pathways and a number of minor variations. Identifying points of failure in the network of events would be one way forward. This could be done in two stages: 1. Success ranking might show that some outcome events in the ToC were not satisfactorily achieved by any projects. In that case the pathway it belonged to would be broken, 2. Chi square tests might show that although some pairs of events both took place, there was no significant association between them (in the form of the number of projects achieving both). Again the pathway they were part of would be broken.&lt;br /&gt;&lt;br /&gt;This reflection on causal pathways make me think that the challenge of having to sort out “the attribution problem” for the final event in the ToC might be what could be called “a problem we would like to have”. It would presume that we have already established a plausible pathway of influence of our own activities. The discussion above suggests it may not be so easy in some cases.&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-8488695106349302862?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">3</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2010/04/evaluating-composite-theory-of-change.html</feedburner:origLink></item><item><title>Reflections on Dave Snowden’s presentations on sense-making and complexity</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/ys_kq6uvfts/reflections-on-dave-snowdens.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Thu, 03 Dec 2009 06:55:14 PST</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-4283570203328024090</guid><description>&lt;a style="font-family: trebuchet ms;" href="http://www.pap.wur.nl/UK/newsagenda/news/Innovation_Dialogue_how_to_be_strategic_in_the_face_of_complexity.htm"&gt;... at the Wageningen Innovation Dialogue, 30 November -1&lt;sup&gt;st&lt;/sup&gt; December 2009&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;From my point of view, one of the most interesting and important challenges is &lt;i style=""&gt;h&lt;/i&gt;&lt;i style=""&gt;&lt;span style="" lang="EN-GB"&gt;ow to create useful representations &lt;/span&gt;&lt;/i&gt; of large, complex, dynamic structures, especially as seen by participants in those structures. For example, multi stakeholder processes in operation at national and international levels. Behind this view is an assumption, that if we have better representations then this will provide us with more informed choices about how to respond to that complexity. Note that the key word here is respond to, not manage. The scale of ambition is more modest. Management of complexity only seems feasible when it is on a small scale, such as the children’s play group example cited by Dave Snowden (DS).&lt;br /&gt;&lt;br /&gt;I have had a long standing interest in one particular set of tools that can be used for producing representations of complex structures. These are social network analysis (SNA) methods and associated software. During the workshop &lt;a href="http://www.networkingaction.net/2.html"&gt;Steve Waddell&lt;/a&gt; provided a good introduction to SNA and related tools.&lt;br /&gt;&lt;br /&gt;DS’s presentations on the sense-making approach provided a useful complementary perspective. This was all about making use of large sets of qualitative data, of a kind that cannot be easily used by SNA tools. Many of this data was about people’s voices, values and concerns, all in the form of fairly unstructured&lt;span style=""&gt;  &lt;/span&gt;and impromptu responses to questions asked by their peers (who were trained to do so). These are called “micro-narratives” (MNs).&lt;br /&gt;&lt;br /&gt;DS’s &lt;a href="http://www.cognitive-edge.com/sensemaker_suite.php"&gt;sense-making process (and associated software)&lt;/a&gt; is innovative in at least three respects. Firstly, in terms of the huge scale. Up to 30,000 items of text collected and analysed in one application.In many cases this would be more like a census than a sample survey. I have never heard of qualitative data being collected on this scale before. Nor as promptly, including the time spent on analysis, in the case of the Pakistan example. Secondly, and related to this, is the sophistication and apparent user friendliness of the bespoke software and hardware that was used.&lt;br /&gt;&lt;br /&gt;More interesting, and more important, was the decision to ask respondents to “self-signify” the qualitative information they had provided. This was done by asking the respondents to describe their own MNs by using two different kinds of scales, to rate the presence of different attributes already identified by the researchers as being of concern. The consequence of respondents providing this meta-data was that all the MNs could be given a location in a three dimensional space. In fact a number of different kinds of three dimensional spaces, if many self-signifiers were used. Within that space it was then possible for the researcher to look for clusters of MNs. Of special interest were clusters of MNs that were outliers, i.e. those that were not part of the centre of the overall distribution of MNs.&lt;br /&gt;&lt;br /&gt;There are echoes here of the expectation that the collection and analysis of &lt;a href="http://mande.co.uk/special-issues/most-significant-change-msc/"&gt;Most Significant Change (MSC)&lt;/a&gt; stories will help organisations identify “the edges of experience”, which they wanted to see more examples of in future (if positive), or less (if negative). The difference is DS's use of quantitative data to make these outliers more identifiable, in a transparent manner.&lt;br /&gt;&lt;br /&gt;As far as I understand it, an additional purpose of using self-signifiers to identify clusters of MNs is to prevent premature completion of the process of interpretation by the researcher, and thus to strengthen the trustworthiness of the analysis that is made.&lt;br /&gt;&lt;br /&gt;On the first day of the workshop I had two reservations about the approach that had been described. The first was about the “fitness landscape” that was drawn within the three dimensional space. How was it constructed, and why it was needed, this was unclear to me. My understanding now is that this surface is a mathematical projection from the 30,000 data points in that 3-D space (in the Pakistan example). A bit like a regression line in a 2D graph. One advantage of this constructed landscape is that it enable observers to have a clearer understanding of how these numerous MNs relate to each other on the three dimensions. When they are simply dots hanging in space this is much more difficult to do so.&lt;br /&gt;&lt;br /&gt;I also wondered why “peak” locations were designated as peaks, and not troughs, and vice versa. This seems to be a matter of researcher choice. This seems okay, if the landscape has no more significance than a visual aid, as suggested above. But in some complexity studies peaks in landscapes are presented as unstable locations, and troughs as stable points, acting as “attractors”.&lt;span style=""&gt;  &lt;/span&gt;Is it likely that any pole of any of the self-signifying scales will show this type of behaviour? If not, might it be better not to talk about fitness landscapes, or at least be very careful about not giving them more apparent significance than they merit? A related claim seems to have been made when DS said “Fitness landscapes show people where change is possible”. But is this really the case? I can’t see how it can be, unless desirable/undesirable attributes are built into the self-signifying scales chosen to create the 3D space. There is a risk that the technical language that is being used imputes more independent analytic capacity than  the software has in reality.&lt;br /&gt;&lt;br /&gt;The other concern I had was about who chooses the scales used to self-signify? I should say that I do think it is okay to derive these from a relevant academic field, or from the concerns of the client for the research. But might it provide an even more independent structuring of the MN data, if these scales were somehow also derived from the respondents themselves? On reflection, there seems to be no way of doing this when the sense- maker approach is applied on a large scale.&lt;br /&gt;&lt;br /&gt;But on a much smaller scale I think there may be ways of doing this, by using a reiterated process of inquiry, rather than a once off process. I can provide an example by using data &lt;a href="http://bushfirecrc.com/search/downloads/BushfireCRC_Gerry%20Elsworth.pdf"&gt;borrowed&lt;/a&gt; from a stakeholder consultation process held in rural Australia a few years ago. In the first stage respondents generated the equivalent of MNs. In this case they were short statements about how they expected a new fire prevention programme to help them and their community. These statements were in effect informal “objectives”, written in ordinary day-to-day language, on small filing cards. In the next stage the same individual stakeholders were each asked to sort these statements into a number of groups (of their own choosing), each group describing a different &lt;i style=""&gt;kind&lt;/i&gt; of expectation. Each of these groups was then labelled by the respondent who created it. The data from these card-sorting exercises was then aggregated into a single cards x cards matrix, where each cell value described how often the row card had been placed in the same group as the column card.&lt;br /&gt;&lt;br /&gt;Here the card sorting exercise was in effect another means of self-signifying. It was generating meta-data, statements (group labels) about the statements (individual expectations). Unlike the tripolar and bipolar scales used in David’s sense-making approach, it did not enable a 3D space to be generated where all the 30 statements could be given a specific location. However, the cards x cards matrix was a data set that many SNA software tools can easily use to construct a network diagram, which is a 2D presentation of complex structures. The structure that was generated is shown below. Each node is a card, each link between two cards represents the fact that those two cards were placed in the same group one or more times (shown by line thickness). Clusters of cards all linked to each other were all placed in the same group one or more times.When using one software package (Visualyzer), a “mouseover” on any node can be used to show not only the original card contents (the expectation), but also the labels of the one or more groups that the card was later placed in.In this adapted use of self-signifiers the process of grouping cards helps add additional qualitative information and meaning to that already there in the card contents.&lt;br /&gt;&lt;br /&gt;As well as being able to identify respondent defined clusters of statements, we can also sometimes see links between these clusters. The links are like a more skeletal version of the landscape surface discussed above. The “peaks” of that landscape are the nodes connected by strong links (i.e. the two cards were placed the same groups multiple times). These can be made easier to identify by applying a filter to screen out the weaker links. This is the metaphorical equivalent of raising the sea level, and covering the lower levels of the landscape.&lt;br /&gt;&lt;br /&gt;The virtue of this network approach to analysing MNs is its very participative nature. Its limitation is its modest scalability. The literature on sorting methods suggests an upper limit of between 50 or so cards (I will investigate this further).While this is much less than 30,000, many structured stakeholder consultation processes  can involve a smaller numbers of participants than this.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_QeRS3Af8hns/SxZmwxEW3XI/AAAAAAAABcs/ZoCW1eRDkNk/s1600-h/stronglinks2.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 400px; height: 305px;" src="http://2.bp.blogspot.com/_QeRS3Af8hns/SxZmwxEW3XI/AAAAAAAABcs/ZoCW1eRDkNk/s400/stronglinks2.jpg" alt="" id="BLOGGER_PHOTO_ID_5410624990323596658" border="0" /&gt;&lt;/a&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-weight: bold;"&gt;Key&lt;/span&gt;: Numbers represent the IDs of each card. Links indicate that the two cards were placed in the same group, one or more times.  Thicker links = placed in the same group more often. Yellow nodes = most conspicuous cliques of cards (all often co-occuring).This image shows the strongest links only(i.e. above the average number. The mouseover function is not available for this image copy.&lt;/blockquote&gt;&lt;br /&gt;My final set of comments are about some of the risks and possible limitations of DS’s sense-making approach. The first concern is about transparency of method. To newcomers, the complexity terminology that is used when introducing the method was challenging, to say the least. At worst I wonder whether it is an unnecessary obstruction, and whether a shorter route to understanding the method would exist, if less complexity sciences terminology was used. The proprietary nature of the associated software is also a related concern to me, though I have been told that there is an intention to make an open source version available. Open source means open to critique and open to improvement, through collective effort, which is what the progress of science is ideally all about. The extensive use of complexity science terms also seems to make the approach vulnerable to corruption and possible ridicule, as people decided to “pick and mix” the bits and pieces of complexity ideas they are interested in, without understanding the basics of the whole idea of complexity.&lt;br /&gt;&lt;br /&gt;Another issue is commensurate benefits. After seeing the scale of the data gathering involved, and the sophistication of the software used, both of which are impressive, I did also wonder whether the benefits obtained from the analysis were commensurate with the costs and efforts that had been invested, at least in the examples we were told about. Other concerns are not exclusive to the sense-making approach. What about the stories not told? Perhaps with almost census like coverage of some groups of concern this is less of a concern than with other large scale ethnographic inquiries. What about unexpected stories? Is the search for outliers leading to the discovery of views which are a surprise to clients of the research, and of possible consequence to their plans on how to relate to the respondents in the future? And are these surprises  enough in number, or are they dramatic enough, to counterbalance the resource invested to find them?&lt;br /&gt;&lt;div style="text-align: center;"&gt;------------------&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt; “&lt;em&gt;At the heart of all major discoveries in the physical sciences is the discovery of novel methods of representation&lt;/em&gt;” &lt;a href="http://en.wikipedia.org/wiki/Stephen_Toulmin#The_Evolutionary_Model"&gt;Steven Toulmin&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-4283570203328024090?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_QeRS3Af8hns/SxZmwxEW3XI/AAAAAAAABcs/ZoCW1eRDkNk/s72-c/stronglinks2.jpg" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">3</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2009/12/reflections-on-dave-snowdens.html</feedburner:origLink></item><item><title>On the poverty of baselines and targets...</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/x4ty_4B8yr0/on-poverty-of-baselines-and-targets.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Sat, 31 Oct 2009 08:18:18 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-7048192715872725831</guid><description>I have been surprised to see how demanding DFID has become on the subject of baseline data. On page 13 of the new &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2009/06/logical-framework.pdf"&gt;DFID Guidance&lt;/a&gt; (on using the new formatted Logical Framework)  it is stated that ” &lt;span style="font-style: italic;"&gt;All projects should have baseline data at all levels before they are approved. In exceptional circumstances, projects may be approved without baseline data at Output level..&lt;/span&gt;." Closer to the ground I have witnessed an UK NGO being pressed by DFID-appointed managers of a funding mechanism to deliver the required baseline data. This is despite the fact that the NGO's project will be implemented in a number of countries over a period of years, not all at once.&lt;br /&gt;&lt;br /&gt;Meanwhile, in Uganda and Indonesia, I am watching two projects coming to an end. Both had baseline data collected shortly after they started. Neither is showing any signs of intending to do a re-survey at the end of the project period. Is anyone bothered? Not that I can see. Including DFID, who is a donor supporting one of the projects. And in both cases  baseline surveys were expensive investments.To make matters worse, in one country the project performance targets were set &lt;span style="font-style: italic;"&gt;before&lt;/span&gt; the baseline study, and in the other they have never really been agreed on.&lt;br /&gt;&lt;br /&gt;I have just completed the final review of one project. We have diligently compared progress made on a set of indicators, against all the original targets. There are of course the usual problems of weak and missing data, and questionable causal links with project interventions. But what bothers me more is how outdated and ill-fitting some of these initial performance measures are. And how little justice this mode of assessment seems to be doing to what the project has been able to do since it started, especially the flexibility of its response in the face of the changing needs of the main partner organisation. Of even greater concern is the fact that this project is being implemented in a large number of districts, in a  country that has been going through a significant process of decentralisation. Each district's capacities and needs are different, and not surprisingly the project's activities and results have varied from district to district. There is fact no one &lt;span style="font-style: italic;"&gt;single&lt;/span&gt; project. Yet our review process, like many others, has in effect treated these district variations as "noise", obscuring what were expected to be region-wide trends over time.&lt;br /&gt;&lt;br /&gt;I am now working on some ideas of how to do things differently in my next project review, in the same country. This time the focus will be more on&lt;span style="font-style: italic;"&gt; internal comparisons&lt;/span&gt;: (a) between locations, (b) between time periods during the project period.&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-7048192715872725831?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">4</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2009/10/on-poverty-of-baselines-and-targets.html</feedburner:origLink></item><item><title>Why we should make economists work harder</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/ihg7lSHI5ys/why-we-should-make-economists-work.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Tue, 27 Oct 2009 03:51:31 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-7644225976090834387</guid><description>"&lt;a href="http://blogs.ft.com/undercover/2009/10/want-to-help-then-make-life-harder-for-the-aid-agencies/"&gt;Why we should make life harder for aid agencies&lt;/a&gt;" is the title of an article by Tim Harford ("The Undercover Economist") in last weekend's Financial Times magazine section.&lt;br /&gt;&lt;br /&gt;I agree with the sentiment, but not with the analysis.  I was expecting better, given what I have read of Tim in the past.&lt;br /&gt;&lt;br /&gt;Tim's article starts with the problem of how can we, as individual donors, be sure that our aid goes in the right direction and have the expected impact. The next problem, as seen by Tim, is that aid agencies are bureacracies. The solution is competition via a more open market. From within this perspective recent efforts at aid "harmonisation" are viewed by Tim with suspicion, and seen as almost the equivalent to establishing a cartel.&lt;br /&gt;&lt;br /&gt;He then asks could agencies be made to compete , not only with each other, but even with private companies, to get funding from donor  organisations. And could money (or rather vouchers) be given directly to aid recipients to spend, redeemable for services provided by a range of charities and aid agencies. These ideas he seeas as "radical" and possibly "far fetched" More immediately, he suggests we could "start by asking simple questions about where aid comes from, where it goes, how effective it is and how much is lost to administration – or worse."&lt;br /&gt;&lt;br /&gt;I hope Tim will be pleasantly surprised to find his ideas are not seen as radical or far fetched, and in fact have been in play now for quite come time. What  Tim really needs to do (apart from more homework before writing articles like this) is to start questioning the assumptions behind his analysis of the nature and benefits of competition amongst aid agencies.&lt;br /&gt;&lt;br /&gt;1. In most ordinary markets the &lt;span style="font-style: italic;"&gt;purchaser&lt;/span&gt; and &lt;span style="font-style: italic;"&gt;user&lt;/span&gt; are one and the same person. The purchaser and user of aid agency services are different parties, seperated by continents and cultures.&lt;br /&gt;&lt;br /&gt;2. In between them is not a single supplier, but a large and complex international aid supply network. See &lt;a href="http://evaluatingkatine.wordpress.com/2009/09/21/the-worst-question-to-ask-about-charity/"&gt;my map&lt;/a&gt; of one of the simpler aid supply networks, in a Guardian funded development project in Uganda (map is at the end of the article)&lt;br /&gt;&lt;br /&gt;3. The quality of the product/service being provided is much more difficult to assess than that found in many goods and services markets in the UK. Measurement of poverty reduction is a field of its own, &lt;span style="font-style: italic;"&gt;improved governance&lt;/span&gt; is another order of magnitude more difficult to assess, but nevertheless a common development objective. There are some more measuable oucomes, such as those captured via the Millenium Development Goals e.g. reduced maternal mortality. But these usually require changes in the performance of institutions e.g. national health services. These sorts of change are not simple to measure, let alone achieve. Aid agencies can avoid this challenge by directly supplying health services to poor communities, but they will then fail on another performance metric: sustainability&lt;br /&gt;&lt;br /&gt;Tim's idea of vouchers (above) could best be described as quaint. It is now common place for aid agencies in humanitarian emergencies to give cash handouts to families in need, not just vouchers. So they can buy what they need from anyone, not simply "a range of charities and aid agencies" Cash transfers are also being tested for their usefullness in development programmes, where there is no emergency present.&lt;br /&gt;&lt;br /&gt;Competition between aid agencies is happening all over the place. DFID has, for years, invited tenders from a wide range of organisations to implement its aid programmes. See their &lt;a href="http://www.dfid.gov.uk/Working-with-DFID/Procurement/Current-contract-opportunities1/"&gt;Current Contract Opportunities page &lt;/a&gt;But what difference is this making, that is the question. By contracting out work to others DFID moves its own "overhead costs" off its own books, onto others. But the overheads are still there. In fact they are multiplied, because in order to win contracts multiple organsiations invest substantial amounts of time and effort into producing complex  documents, but only one wins. Those loosing bids are not products that can easily be sold to other possible buyers, like unused factory stock. Instead the costs of their non-use is figured in to the subsequent bids, including those that win.&lt;br /&gt;&lt;br /&gt;So, costs will have gone up, but what about effectiveness? If that has improved, then the increased costs would be justifiable. The problem is, as touched upon above, it is very diffifuclt to measure the effectiveness of  many contracted-out projects, because of the scale and complexity of the changes they are trying to achieve.&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-7644225976090834387?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2009/10/why-we-should-make-economists-work.html</feedburner:origLink></item><item><title>Constructing longer term perspectives</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/hQ7nXYNJofo/constructing-long-term-perspectives.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Wed, 23 Sep 2009 01:07:53 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-3134777387899530926</guid><description>A few weeks ago a friend asked me for help with ideas for a presentation that needed to be made on "&lt;span style="font-style: italic;"&gt;challenges for the international development sector...&lt;/span&gt;"&lt;br /&gt;&lt;br /&gt;Not an easy task, where do you start? But I knuckled down and did some reflection. Work I am doing on DFID and AusAID funded projects in Indonesia ended up as the source of some ideas that may be useful. I have been working on these since late 2005 and the work continues until early 2010.&lt;br /&gt;&lt;br /&gt;My short reply to my friend was as follows:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;How to ensure that development interventions are designed and implemented within a long term perspective, that extends way beyond the typical 3-5 year planning cycles&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;There is a massive contradiction between the short term nature of project designs and what most people know about how long development can take (both technological and social).&lt;br /&gt;&lt;br /&gt;If project planning cycles cannot be lengthened (e.g. because of goverment budget cycles and election cycles) then how can we make sure that these planning cycles are better linked up, into a more coherent longer term intervention? This is n easy task when there is constant staff turnover both within government and aid agencies. Strategy papers by themselves are not much use, because they have their own continuity problems, no new boss wants to simply say, yes, we will do more of same. Everyone wants to re-write the strategy in their own image&lt;br /&gt;&lt;br /&gt;In Jakarta in October we will be holding an end-of-project review workshop for the DFID funded, GTZ implemented, AusAID monitored, government owned SISKES project (maternal and neonatal health). One of the two workshop objectives is&lt;br /&gt;&lt;span style="font-style: italic;"&gt;&lt;br /&gt;To engage participants in a longer term perspective on MNH development, that exceeds the typical 3-5 year project lifespan&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-style: italic;"&gt;By looking back, on developments since the beginning of the decade&lt;/span&gt;&lt;span style="font-style: italic;"&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-style: italic;"&gt;By looking forward to up to four years in the future&lt;/span&gt;. &lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; We will be including some people associated with a new AusAID MNH project in one of the 2 districts that GTZ are pulling out of. Plus a caste of thousands (well, 52 other participants so far).&lt;br /&gt;&lt;br /&gt;One of the workshop exercises will be to engage participants in predicting trends in key service provision indicators over the next four years, based on their knowledge aquired through the SISKES project, and other sources. And then analysing the implications of these expected trends for the incoming projects, including the new AusAID MNH project&lt;br /&gt;&lt;br /&gt;There is also a need for more connecting events at the design stage as well, where various stakeholders from prior and parallel related developments are brought in to inform planning decisions, or at least the choices to be considered. Often the consultants on design missions are about the only bridges to the past. &lt;/blockquote&gt;&lt;br /&gt;For other people's efforts to promote really long term thinking, see the &lt;a href="http://www.longnow.org/"&gt;Long Now Foundation&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-3134777387899530926?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2009/09/constructing-long-term-perspectives.html</feedburner:origLink></item><item><title>Bibliographic Timelines</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/GU-75z0wmuM/bibliographic-timelines.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Fri, 14 Aug 2009 06:25:58 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-986803334321101375</guid><description>It is a simple idea, but one that looks useful&lt;br /&gt;&lt;br /&gt;During a recent mid-term review of AMREF's &lt;a href="http://www.guardian.co.uk/katine"&gt;Katine Community Partnerships Project&lt;/a&gt;, I started to create a bibliography of project related documents, with a difference. Normally documents listed in a bibliography are structured in alphabetical order, by the authors' name. This helps you find the document if you know the authors name, but not much more.&lt;br /&gt;&lt;br /&gt;This time I listed all the project documents in time order, by the year and the month when they were produced, starting with the oldest. In the text of most reports referenced documents are usually referred to by their author and date, so it is still easy to find cited documents in this chronologically ordered list. The added advantage of this "bibliographic timeline" is that it also gives you (the reader and/or writer) a quick sense of the history of the project. Most document titles make some reference to the event they are describing (e.g. baseline studies, needs assessments, workplans, annual reports, etc), so by scanning down the bibliography you can quickly get a rough sense of the sequence of activities that have taken place. Even though there may be a time lag between an event and when it is documented (say in the next month).&lt;br /&gt;&lt;br /&gt;I have attached below a graphic image of the "bibliographic timeline" that was produced this way. Click on the image to get more detail.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_QeRS3Af8hns/SoAllTmY8BI/AAAAAAAABZc/NvqZYxRxAJE/s1600-h/bibliographic+timetline.JPG"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 200px; height: 400px;" src="http://4.bp.blogspot.com/_QeRS3Af8hns/SoAllTmY8BI/AAAAAAAABZc/NvqZYxRxAJE/s400/bibliographic+timetline.JPG" alt="" id="BLOGGER_PHOTO_ID_5368332078672310290" border="0"&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-986803334321101375?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/_QeRS3Af8hns/SoAllTmY8BI/AAAAAAAABZc/NvqZYxRxAJE/s72-c/bibliographic+timetline.JPG" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2009/08/bibliographic-timelines.html</feedburner:origLink></item><item><title>Comments on the draft DFID evaluation policy</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/rbT-dnwiAfM/comments-on-draft-dfid-evaluation.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Mon, 12 Jan 2009 05:59:10 PST</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-5298983772598918663</guid><description>&lt;p&gt;&lt;br /&gt;DFID and &lt;a href="http://iacdi.independent.gov.uk/" onclick="javascript:pageTracker._trackPageview ('/outbound/iacdi.independent.gov.uk');"&gt;Independent Advisory Committee on Development Impact&lt;/a&gt;  have sought public comments on two documents: the &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2008/12/dfid-evaluation-policy-consultation-draft-2.pdf"&gt;Draft Evaluation Policy&lt;/a&gt; and the &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2008/12/dfid-topic-list-consultation-doc-draft.pdf"&gt;Evaluation Topic List&lt;/a&gt;. More information on the public consultation process can be found at &lt;a href="http://mande.co.uk/2008/topic-bibliographies/bilateral-agencies/new-dfid-policy-on-evaluation/"&gt;MandE NEWS&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-weight: normal;font-size:100%;" &gt;Comments can be emailed to&lt;/span&gt; &lt;/strong&gt;&lt;a href="mailto:evaluationfeedback@dfid.gov.uk"&gt;evaluationfeedback@dfid.gov.uk&lt;/a&gt; Here below are two sets of comments that I have sent in:&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;1. The need for a meta-evaluation of the results of the decentralised evaluation policy&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;In the List of Potential Evaluation Topics, readers are invited to comment on “any topics you consider very important that we have not listed here”.&lt;/p&gt; &lt;p&gt;One gap which I noted was the lack of any reference to meta-evaluation of the many evaluation activities carried out within the country programmes.&lt;br /&gt;&lt;/p&gt; &lt;p&gt;However, the draft Evaluation Policy mentioned above makes eleven references to the role of “decentralised evaluation”. DFID’s decentralised evaluations “are those commissioned by our staff responsible for managing DFID’s programmes, policies and partnerships, normally in collaboration with their development partners”&lt;/p&gt; &lt;p&gt;The references to decentralised evaluations covered the following areas:&lt;br /&gt;- increased use of decentralised evaluation as one of the 4 major priorities for developing the evaluation function in DFID. p.11&lt;br /&gt;-  sustaining a strong culture of decentralised evaluation across the Department.   p.16&lt;br /&gt;- strengthening its advisory and quality support role for decentralised evaluations p.17&lt;br /&gt;- quality assurance of decentralised evaluations. p.4, p.16&lt;br /&gt;- helping to set standards, providing support and advice, and reporting on quality. p4&lt;/p&gt; &lt;p&gt;But there are no references to a systematic or periodic meta-evaluation of decentralised evaluations. This seems like a major omission. Authority for evaluation has been decentralised, and advisory support and guidance will be provided, but there is no evident complementary mechanism for assessing the results.&lt;span style="font-style: italic;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;span style="font-style: italic;"&gt;&lt;span style="font-weight: bold;"&gt;PS&lt;/span&gt;: &lt;span style="font-weight: bold;"&gt;meta-evaluations are not the same as synthesis studies&lt;/span&gt;&lt;/span&gt;. A synthesis study looks at the findings across a number of evaluations, a meta-evaluation looks at the evaluation methods used by a number of evaluations. Most organisations, including DFID, already do quite a few sythesis studies.&lt;br /&gt;&lt;span style="font-style: italic;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;blockquote&gt;&lt;/blockquote&gt;&lt;/span&gt;&lt;/span&gt;&lt;p&gt;&lt;strong&gt;2. The need for consultation on evaluation criteria, not just what should be evaluated&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;There needs to be some debate not just about what is to be evaluated, but on what criteria? &lt;/p&gt; &lt;p&gt;So far, during the present consultation, the question of what to evaluate has been subject of a separate DFID paper (the &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2008/12/dfid-topic-list-consultation-doc-draft.pdf"&gt;Evaluation Topic List&lt;/a&gt;) but the question of what criteria has only warranted a short section in an annex to the &lt;a href="http://mande.co.uk/blog/wp-content/uploads/2008/12/dfid-evaluation-policy-consultation-draft-2.pdf"&gt;draft policy paper&lt;/a&gt;. In that annex DFID list “the internationally-agreed evaluation criteria …[that] will be applied to DFID evaluations. They appropriately note that while “It will not be appropriate to investigate every criterion in depth in every evaluation. DFID evaluators will be requested to provide an explanation of the criteria they have chosen (or not) to cover”. The listed criteria are 1. Relevance, 2. Effectiveness, 3. Effeciency, 4. Impact, 5. Sustainability, 6. Coverage, and 7. Coherence.&lt;/p&gt; &lt;p&gt;Elsewhere on this blog I have argued for the inclusion of two additional criteria to the traditional DAC 5 (1-5 above).These are &lt;em&gt;equity&lt;/em&gt; and &lt;em&gt;transparency&lt;/em&gt;&lt;/p&gt; &lt;p&gt;It could be argued that criteria 6 (coverage) already covers equity. However the choice of words can be important. Coverage is an apparently technical term, but equity is explicitly about a value: fairness, of process and outcome. DFID’s desire to eliminate of poverty is a statement about values. Values should be clearly stated, not hidden or assumed.&lt;/p&gt; &lt;p&gt;Transparency is not covered at all. Yet transparency is basic to the whole process of evaluation, especially when viewed in a wider context. Without access to information the ability of stakeholders in development programmes to evaluate performance on any of these criteria will be extremely limited. The importance of access to information was emphasised by the United Nations General Assembly in its first session in 1946, which states: “&lt;em&gt;Freedom of information is a fundamental human right and … the touchstone of all the freedoms to which the UN is consecrated.&lt;/em&gt;” (Resolution 59)&lt;/p&gt; &lt;p&gt;More recently DFID was one of the founding signatories to the &lt;a href="http://mande.co.uk/2008/topic-bibliographies/transparency/international-aid-transparency-initiative/" rel="nofollow"&gt;International Aid Transparency Initiative&lt;/a&gt;, publicised at the August 2008 High Level Forum on Aid Effectiveness in Accra, Ghana. &lt;/p&gt; &lt;p&gt;Given this recent statement of position by DFID transparency should clearly be included as an evaluation criteria on the DFID list. If this proposal raises concerns about the list becoming too lengthy, one could argue that it should certainly have higher priority than the newly proposed criteria 7 (coherence). In fact, perhaps it should be criteria number 1, ahead of relevance and all other criteria.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-5298983772598918663?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2008/12/comments-on-draft-dfid-evaluation.html</feedburner:origLink></item><item><title>An aid bubble? - Interpreting aid trends</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/CqOCxPui4So/aid-bubble.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Sat, 27 Jun 2009 02:03:54 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-2723330482730713076</guid><description>&lt;div style="text-align: center;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_QeRS3Af8hns/SIiDJ0q6Y7I/AAAAAAAABAA/7qXZ_Puze34/s1600-h/dfid+spends.bmp"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://4.bp.blogspot.com/_QeRS3Af8hns/SIiDJ0q6Y7I/AAAAAAAABAA/7qXZ_Puze34/s400/dfid+spends.bmp" alt="" id="BLOGGER_PHOTO_ID_5226571572344153010" border="0" /&gt;&lt;/a&gt;(unattributed source)&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;img src="file:///C:/DOCUME%7E1/Rick/LOCALS%7E1/Temp/moz-screenshot-7.jpg" alt="" /&gt;This graph (&lt;a href="http://www.slideshare.net/rickjdavies/strengthening-independent-evaluation-in-international-development-the-uks-approach"&gt;from a DFID presentation&lt;/a&gt;) shows what many people have already heard, that the volume of aid given by DFID will continue to increase, but that the amount of money being spent on administering that aid will plateau. Does this divergence mean:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;DFID has discovered a new means of effectively giving development aid that requires less and less administrative overhead each year?&lt;/li&gt;&lt;li&gt;There is a huge amount of slack capacity within DFID that can safely be pared away for years without hindering its effectiveness?&lt;/li&gt;&lt;li&gt;This graph is prima facie evidence of an impending aid bubble that is highly likely to burst in the next few years, as one or more mal-administered or corrupted aid programs are publicly exposed, to the discredit of both the good and the bad?&lt;/li&gt;&lt;li&gt;Yes, it does mean there will be more mal-administered and corrupted aid programs in the future, but not many more people will be worried about this than have been in the past?&lt;/li&gt;&lt;li&gt;This is a good example of where there is a pressing need for an ex-ante impact assessment of a budgeting strategy ( if ever there was one)?&lt;/li&gt;&lt;li&gt;&lt;!--[if !supportLists]--&gt;&lt;span style="font-family:Wingdings;"&gt;&lt;span style=";font-family:&amp;quot;;font-size:7;"  &gt;&lt;/span&gt;&lt;/span&gt;The category “Admin budget” is meaningless and in fact the real costs of administering aid have not been adequately disaggregated in this graph. &lt;/li&gt;&lt;li&gt;Or...?&lt;br /&gt;&lt;/li&gt;&lt;style&gt; &lt;!--  /* Font Definitions */ @font-face  {font-family:Wingdings;  panose-1:5 0 0 0 0 0 0 0 0 0;  mso-font-charset:2;  mso-generic-font-family:auto;  mso-font-pitch:variable;  mso-font-signature:0 268435456 0 0 -2147483648 0;}  /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal  {mso-style-parent:"";  margin:0cm;  margin-bottom:.0001pt;  mso-pagination:widow-orphan;  font-size:11.0pt;  mso-bidi-font-size:12.0pt;  font-family:Arial;  mso-fareast-font-family:"Times New Roman";  mso-bidi-font-family:"Times New Roman";  layout-grid-mode:line;} @page Section1  {size:612.0pt 792.0pt;  margin:72.0pt 90.0pt 72.0pt 90.0pt;  mso-header-margin:36.0pt;  mso-footer-margin:36.0pt;  mso-paper-source:0;} div.Section1  {page:Section1;}  /* List Definitions */ @list l0  {mso-list-id:1997957412;  mso-list-type:hybrid;  mso-list-template-ids:-1378443264 67698693 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;} @list l0:level1  {mso-level-number-format:bullet;  mso-level-text:;  mso-level-tab-stop:18.0pt;  mso-level-number-position:left;  margin-left:18.0pt;  text-indent:-18.0pt;  font-family:Wingdings;} ol  {margin-bottom:0cm;} ul  {margin-bottom:0cm;} --&gt; &lt;/style&gt;&lt;span style=";font-family:Arial;font-size:11;"  &gt;&lt;/span&gt;&lt;/ul&gt;&lt;br /&gt;You can record your opinion, by posting a Comment below, or registering your vote on this anonymous opinion poll.&lt;br /&gt;&lt;!-- Start Bravenet.com Service Code --&gt;&lt;br /&gt;&lt;!-- The following line of code must be on one line, it cannot wrap // --&gt;&lt;br /&gt;&lt;div align="center"&gt;&lt;script type="text/javascript" src="http://pub27.bravenet.com/minipoll/show.php?usernum=2241665034&amp;amp;cpv=2"&gt;&lt;br /&gt;&lt;/script&gt;&lt;/div&gt;&lt;br /&gt;&lt;!-- End Bravenet.com Service Code --&gt;&lt;br /&gt;My initial preference would be for the fifth option, even though it is probably unlikely that the results would have much impact on decisions that have already been made.&lt;br /&gt;&lt;br /&gt;But on reflection option one may not be so impossible as it seems. DFID may well give more and more of its aid through third parties (multilaterals, and international programmes of different kinds). When it does this those organisations' administration costs will not appear on the DFID books as administration costs, but as aid given. And those organisations can in turn use the same device to manage the apparent levels of their own administration costs, by funding other parties, such as national NGOs.&lt;br /&gt;&lt;br /&gt;The cumulative outcome of this re-iterated strategy may well be very perverse, adding up to a bigger proportion of aid being spent on administration than would be the case if the orginal donor had been more directly engaged and been willing to show higher admin costs in its own budget. All this is speculative though. What it does suggest however, is the possible relevance of a "whole supply chain" approach to the evaluation of the costs of different forms of aid. Unlike private sector supply chains, the total cost of delivered aid is not evident in what the beneficiary pays for the final product.&lt;br /&gt;&lt;br /&gt;Perhaps these issues could be pursued by the new &lt;a href="http://www.dfid.gov.uk/aboutdfid/evaluation-iacdi-info.asp"&gt;Independent Advisory Committee on Development Impact&lt;/a&gt;?&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-2723330482730713076?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/_QeRS3Af8hns/SIiDJ0q6Y7I/AAAAAAAABAA/7qXZ_Puze34/s72-c/dfid+spends.bmp" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">5</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2007/07/aid-bubble.html</feedburner:origLink></item><item><title>Aid organisations as self-interested businesses?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/r_3NryAEuRw/aid-organisations-as-self-interested.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Thu, 27 Mar 2008 12:07:01 PDT</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-4798756627296287926</guid><description>&lt;p style="font-family: arial;" face="arial" class="MsoNormal"&gt;This posting has been prompted by a letter I received recently. A client I am working with (evaluating their project and that of another donor) wanted me to sign a confidentiality agreement. While it did not seem excessively restrictive, in terms of general intent it was the very opposite of what I have been trying to encourage this and other donors to do with information about their projects. Increasingly over the past few years I have been pushing for more transparency, not less. The rationale being that the whole aid process would benefit by being more accountable to the public at large, not just to donors or the project manager’s immediate partners and intended beneficiaries. Some of my clients have taken this approach seriously and used their websites to make a whole range of project documents publicly available (See &lt;a href="http://www.g-rap.org/"&gt;G-rap&lt;/a&gt; and PETRRA). Others have agreed in principle but seem to have made little progress in practice.&lt;/p&gt;  &lt;p class="MsoNormal"  style="font-family:arial;"&gt;&lt;!--[if !supportEmptyParas]--&gt; &lt;!--[endif]--&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal" face="arial"&gt;Parallel to this effort I have been trying to persuade donors and project managers that achieving specific development objectives is not enough For example, increased levels of health service usage, or increased farmers’ incomes.&lt;span style=""&gt;  &lt;/span&gt;It is also essential that knowledge be accumulated, and made available, about how these objectives were achieved, and what factors made the difference between higher and lower levels of achievement. Without this knowledge the existing achievements are less likely to be sustainable, and they are certainly unlikely to be replicable. Given the scale of most development problems, sustainability and replicability of achievements is absolutely essential. Measuring sustainability and replicability is not easy. But identifying the availability of relevant knowledge should be possible.&lt;/p&gt;  &lt;p style="font-family: arial;" class="MsoNormal"&gt;&lt;!--[if !supportEmptyParas]--&gt; &lt;!--[endif]--&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p style="font-family: arial;" class="MsoNormal"&gt;If information was made publicly available on how specific developments were achieved then a project can be considered to have created a &lt;i&gt;public good&lt;/i&gt;, that others can use. The more usable that knowledge is, the more value that public good is. Businesses do not often do this, though putting usable knowledge in the public domain is becoming more common in the world of software and internet services&lt;a style="" href="http://www.blogger.com/post-create.g?blogID=6719829#_ftn1" name="_ftnref1" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style=""&gt;&lt;!--[if !supportFootnotes]--&gt;[1]&lt;!--[endif]--&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;. Businesses usually have a commercial self interest in keeping secret the key parts of their business processes that would enable others to compete with them in providing the same goods or services. The production of public goods could therefore be seen as a way of differentiating the degree to which aid organisations (of varying kinds) are operating as self-interested businesses versus more public interested organisations. Whether they make and distribute a profit could be considered a secondary matter.&lt;/p&gt;  &lt;p style="font-family: arial;" class="MsoNormal"&gt;&lt;!--[if !supportEmptyParas]--&gt; &lt;!--[endif]--&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;span style=";font-family:arial;font-size:11;"  &gt;If the production of public goods is accepted as an important defining feature of good aid organisations then more attention to the quality of those goods, and how it could be improved, would be justified. Some might argue that a lot of information products produced by aid organisations are often more like advertising and public relations materials, and better described as “vapourware”&lt;a style="" href="http://www.blogger.com/post-create.g?blogID=6719829#_ftn2" name="_ftnref2" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style=""&gt;&lt;!--[if !supportFootnotes]--&gt;[2]&lt;!--[endif]--&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;. One means of improving the quality of potential public goods would be increased transparency. So we can see not only the final information product (e.g. a book, web page, video, etc), but the drafts and the debate that surrounded their development, and the background data. Not simply as a final package, but during the process. The public could then become engaged, though comment and feedback, in the process of producing the public good(s). This type of semi-open production process is increasingly common in some areas of business (see &lt;a href="http://web.mit.edu/evhippel/www/books.htm"&gt;"Democratizing Innovation", 2005&lt;/a&gt; and &lt;a href="http://en.wikipedia.org/wiki/User_innovation"&gt;Wikipedia entry&lt;/a&gt;).&lt;span style=""&gt;  &lt;/span&gt;In aid organisations this approach could be realised in fairly simple forms through the use of websites to host draft documents, and the use of online open forums and email lists to promote awareness and discussion of those documents. This is not rocket science. But nor is it yet common practice on the scale it should be.&lt;br /&gt;&lt;/span&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;In my argument above transparency has two rationales. One is &lt;span style="font-style: italic;"&gt;pragmatic&lt;/span&gt;, tranparency could help improve the knowledge that is available about how best to have an impact. On the other hand, when visibly put into practice, transparency may also function as an important &lt;i style="font-style: italic;"&gt;signal&lt;/i&gt;&lt;span style="font-style: italic;"&gt; of intentions&lt;/span&gt;, helping us differentiate &lt;/span&gt;organisations that are more public interested from those that are more self-interested.&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; &lt;br /&gt; &lt;div face="arial"&gt;&lt;!--[if !supportFootnotes]--&gt;&lt;br /&gt;&lt;hr align="left" size="1" width="33%"&gt;  &lt;!--[endif]--&gt;  &lt;div style="font-family: arial;" id="ftn1"&gt;  &lt;p class="MsoFootnoteText"&gt;&lt;a style="" href="http://www.blogger.com/post-create.g?blogID=6719829#_ftnref1" name="_ftn1" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style=""&gt;&lt;!--[if !supportFootnotes]--&gt;[1]&lt;!--[endif]--&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; For example, in the form of open source products, free internet services and services that inter-operable with those provided by others.&lt;/p&gt;  &lt;/div&gt;  &lt;div style="font-family: arial;" id="ftn2"&gt;  &lt;p class="MsoFootnoteText"&gt;&lt;a style="" href="http://www.blogger.com/post-create.g?blogID=6719829#_ftnref2" name="_ftn2" title=""&gt;&lt;span class="MsoFootnoteReference"&gt;&lt;span style=""&gt;&lt;!--[if !supportFootnotes]--&gt;[2]&lt;!--[endif]--&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; Software products that have a name, and promotional materials, but not much in the way of contents that will actually make them work and deliver what they promise&lt;/p&gt;  &lt;/div&gt;  &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-4798756627296287926?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">11</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2008/03/aid-organisations-as-self-interested.html</feedburner:origLink></item><item><title>Social Frameworks: An improvement on the Logical Framework?</title><link>http://feedproxy.google.com/~r/RickOnTheRoad/~3/sncyEklLBXs/social-frameworks-improvement-on.html</link><author>noreply@blogger.com (Rick Davies)</author><pubDate>Wed, 10 Dec 2008 11:49:12 PST</pubDate><guid isPermaLink="false">tag:blogger.com,1999:blog-6719829.post-3057797796303428604</guid><description>&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;span style="" lang="EN-GB"&gt;Over the last week there have been quite a few email exchanges on the MandE NEWS email list about how to distinguish results from outcomes, results from impact, inputs from outputs, outcomes from impacts, etc. These are the various terms used to describe different levels of a Logical Framework description of a development intervention (in some of the variations of the Logical Framework used by some development agencies). This debate is not new; it comes and goes, and has appeared within most development organisations at some stage or another. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;There are two causes of this confusion of nomenclature, in my view.&lt;span style=""&gt;  &lt;/span&gt;One is that the Logical Framework describes a sequence of causally linked events happening over time. Time flows, it has no natural punctuation marks that can be used to distinguish &lt;/span&gt;&lt;span style="" lang="EN-GB"&gt;and categorise &lt;/span&gt;&lt;span style="" lang="EN-GB"&gt;stages of a process. It is not possible to “carve nature at the joints” when dealing with time. So any introduction of stage categories like inputs, activities, outputs, outcomes, impacts etc.,  is artificial, and requires a consensus amongst the users of these terms, if they are to be useful. Within organisations that can be achieved, across organisations it is usually much more difficult.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;The second cause is a widespread confusion between two types of hierarchy. The Logical Framework is supposed to represent a temporal hierarchy, of events taking place through time. Here A is supposed to lead to B, which is supposed to lead to C, etc. However some organisations mix in a different kind of hierarchy, when they introduce terms like “components”, and “sub-objectives”. This is a hierarchy of inclusion, where A, B, and C are part of X, and X, Y, and Z are in turn part of some larger entity. So the upper levels of this hierarchy are not the outcomes of lower level activities, but simply wider generalisations or descriptions of types of things described at the lower levels. I have seen this sort of terminology in some UNICEF Logical Frameworks in &lt;st1:country-region st="on"&gt;&lt;st1:place st="on"&gt;Indonesia&lt;/st1:place&gt;&lt;/st1:country-region&gt;, mixed in with Purpose and Goal statements that are part of a temporal hierarchy.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="font-size:130%;"&gt;A Social Framework?&lt;/span&gt;&lt;br /&gt;&lt;span style="" lang="EN-GB"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;I have been experimenting with an alternative, which does not “throw the baby out with the bathwater”. It could be called a Social Framework, rather than a Logical Framework, because it emphasises people and their relationships, rather than more abstract events and processes.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Let us start with the same tabular structure as the Logical Framework, but then introduce some significant changes. Each row of the narrative column (found on the left side of the Logical Framework) can be used to describe different types of actors (usually organisations or groups, rather than individuals). Actors in adjacent rows will be linked to each other by relationships that already  exist, or which will be developed. The overall result is that the table describes a pathway of expected influence, from the actor in the bottom row up to the actor in the top row. &lt;span style="font-style: italic;"&gt;The causal mechanisms are the relationships that link the actors. &lt;/span&gt;&lt;span&gt;However, as in real life, this process of influence is unlikely to be one-directional.  Both parties linked by a relationship may affect each other. For example a UK donor NGO may earn lessons from its southern partner, as well as being an important conduit of funding for that southern partner.&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;In the &lt;a href="http://www.guardian.co.uk/katine/about"&gt;Katine project in &lt;st1:country-region st="on"&gt;&lt;st1:place st="on"&gt;Uganda&lt;/st1:place&gt;&lt;/st1:country-region&gt;&lt;/a&gt; this pathway consists of UK &lt;i style=""&gt;&lt;u&gt;donors&lt;/u&gt;&lt;/i&gt; who fund &lt;i style=""&gt;&lt;u&gt;AMREF&lt;/u&gt;&lt;/i&gt; who help develop the capacity of &lt;i style=""&gt;&lt;u&gt;local organisations&lt;/u&gt;&lt;/i&gt;, who provide services to&lt;u&gt; &lt;i style=""&gt;local households&lt;/i&gt;&lt;/u&gt;. You can see this "pathway to the poor" in the table below. However, you will see &lt;span style=""&gt; &lt;/span&gt;I have introduced an extra row, so I can differentiate between the internal workings of AMREF Katine as an organisational &lt;span style="font-style: italic;"&gt;actor&lt;/span&gt;, and AMREF’s &lt;span style="font-style: italic;"&gt;relationships&lt;/span&gt; with local organisations. As shown in the second table that follows the first, I could do the same with the other actors in the pathway. But one may not always want this degree of comprehensive detail. Nevertheless, note this basic point: actors and their relationships with each other are the basis of the Social Framework. &lt;span style=""&gt; &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Simple version of the pathway&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;      &lt;table class="MsoTableGrid" style="border: medium none ; margin-left: 5.4pt; border-collapse: collapse;" border="1" cellpadding="0" cellspacing="0"&gt;  &lt;tbody&gt;&lt;tr style=""&gt;   &lt;td style="border: 1pt solid windowtext; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: solid solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Local   households&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Local   organisations&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Relationship&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;AMREF’s   &lt;span style="font-style: italic;"&gt;relationship&lt;/span&gt; with local organisations&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;AMREF’s   internal functioning&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Donors&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt; &lt;/tbody&gt;&lt;/table&gt;    &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;More detailed version of the pathway&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;  &lt;/p&gt;  &lt;table class="MsoTableGrid" style="border: medium none ; margin-left: 5.4pt; border-collapse: collapse;" border="1" cellpadding="0" cellspacing="0"&gt;  &lt;tbody&gt;&lt;tr style=""&gt;   &lt;td style="border: 1pt solid windowtext; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: solid solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Local   households&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Relationship&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Local   organisations’ &lt;span style="font-style: italic;"&gt;relationship&lt;/span&gt; with local households&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Local   organisations’ internal functioning&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Relationship&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;AMREF’s   &lt;span style="font-style: italic;"&gt;relationship&lt;/span&gt; with local organisations&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;AMREF’s   internal functioning&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Relationship&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Donors’   &lt;span style="font-style: italic;"&gt;relationships&lt;/span&gt; with AMREF &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 83.2pt;" valign="top" width="111"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Actor&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 312.8pt;" valign="top" width="417"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Donors’   internal functioning&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt; &lt;/tbody&gt;&lt;/table&gt;    &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;It is not difficult to see some correspondence between these levels (especially in the first table) and the Logical Framework categories of Inputs, Activities, Outputs, Purpose and Goal. But talking in terms of specific categories of actors is much more tangible and communicable, especially across cultures. So, lets say goodbye to inputs, activities, outputs, etc, for the time being.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Moving on to the next column in the traditional Logical Framework, the Objectively Verifiable Indicators (OVIs), there is no reason why they cannot be used in this more Social Framework. Indicators could be identified for expected internal changes in each actor and for expected changes in their relationships with other adjacent actors in the Social Framework. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Moving on to the next column, the Means of Verificationn (MoV), this column function can also be retained, describing where and how information will be available about the expected changes. In addition, I suggest taking a more social view of this question. The MoV could describe &lt;i style=""&gt;who is expected to know&lt;/i&gt; about the changes described by the OVIs in the same row, because of their interests or responsibilities in this area. &lt;/span&gt;&lt;span style="" lang="EN-GB"&gt;For example, the household row may have an indicator about households increased access to clean drinking water. In the OVI column in the same row, reference could be made to the Village Water Committee as a body who should know about changes of this type. &lt;/span&gt;&lt;span style="" lang="EN-GB"&gt;Their knowledge and then their responses have implications for the sustainability of any improvements in water supply. This actor-oriented view implies the need for participatory approach, built around what people can and should be able to do in the way of monitoring.  What is not needed is lists of disembodied items of information that might be found in a report or database somewhere.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Moving on to the next column, in the traditional Logical Framework we normally find Assumptions that refer to other factors that can influence the causal connection between events  happening in adjacent rows. In the Social Framework I would suggest that this column describe assumptions about &lt;span style="font-style: italic;"&gt;other actors&lt;/span&gt;, and the kind of influence that they are expected to have on the actor(s) described in the narrative row of this column (and vice versa).&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;span style="" lang="EN-GB"&gt;The work of other NGOs in the same location may involve relationships with some of the actors in the pathways described in the Katine Social Framework. For example, the same government body, or the same community group. This could be flagged by a commentary in the Assumptions column This design flexibility contrasts with the rigidity of nested Logical Frameworks, where it is only possible to represent convergence of plans (Leading to pyramid like structures, with lots of things happening at the base, all converging on a few things at the top).&lt;/span&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;/p&gt;&lt;span style="font-weight: bold;"&gt;The Social Framework&lt;/span&gt;&lt;br /&gt;&lt;p class="MsoNormal"&gt;  &lt;/p&gt;This table below is a rough draft of what a Social Framework might look like for part of the Katine project in Uganda. This project is described in detail on the Guardian website.&lt;br /&gt;&lt;br /&gt;&lt;table class="MsoTableGrid" style="border: medium none ; margin-left: 5.4pt; border-collapse: collapse;" border="1" cellpadding="0" cellspacing="0"&gt;  &lt;tbody&gt;&lt;tr style=""&gt;   &lt;td style="border: 1pt solid windowtext; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;Narrative description&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;- of   expected changes in a pathway of influence&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: solid solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;Objectively Verifiable Indicators (OVIs),&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;- evidence of expected change&lt;br /&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: solid solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;Means of Verification (MoV),&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;- who   should know about the OVIs&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: solid solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;Assumptions &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;b style=""&gt;&lt;span style="" lang="EN-GB"&gt;- &lt;/span&gt;&lt;/b&gt;&lt;span style="" lang="EN-GB"&gt;about these and other actors&lt;b style=""&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes  in&lt;/span&gt;&lt;span style="" lang="EN-GB"&gt; &lt;span style="font-weight: bold;"&gt;local   households&lt;/span&gt;&lt;/span&gt;&lt;span style="" lang="EN-GB"&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g. indicators of access to safe drinking water, children's primary school participation,  food sufficiency&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g. village water committee, school management committee, village administration&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;e.g. the insurgents will not return again, force relocations and destroy property&lt;br /&gt;&lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes in &lt;span style="font-weight: bold;"&gt;local organisations’   relationship with local households&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g. speed of repair of broken standpipes&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g. village water committee, village administration&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g. that local organisations will provide services equitably&lt;br /&gt;&lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes in &lt;span style="font-weight: bold;"&gt;local   organisations’ internal functioning&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g scores of weighted checklists re health clinic functioning&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;e.g. Health Unit Management Committee (HUMC)&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt; e.g. that District Health Service will support implementation of HUMC recommendations&lt;br /&gt;&lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes in &lt;span style="font-weight: bold;"&gt;AMREF’s   relationship with local organisations&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;e.g. AMREF will identify other NGOs who are also working with local organisations, cordinate plans with them and learn lessons re those groups&lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes in &lt;span style="font-weight: bold;"&gt;AMREF’s   internal functioning&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;e.g. AMREF HQ will devolve right make public statements re the project&lt;br /&gt;&lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes in &lt;span style="font-weight: bold;"&gt;donors’   relationships with AMREF&lt;/span&gt; &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; background: rgb(204, 255, 204) none repeat scroll 0% 50%; width: 109.35pt; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;e.g Existing donors will not prevent AMREF from seeking additional funding from other donors&lt;br /&gt;&lt;/td&gt;  &lt;/tr&gt;  &lt;tr style=""&gt;   &lt;td style="border-style: none solid solid; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Expected changes in &lt;span style="font-weight: bold;"&gt;donors’   internal functioning&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;   &lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;/td&gt;   &lt;td style="border-style: none solid solid none; padding: 0cm 5.4pt; width: 109.35pt;" valign="top" width="146"&gt;   &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;e.g. Donors will be able to agree on desired outcomes of their relationship with AMREF&lt;br /&gt;&lt;/td&gt;  &lt;/tr&gt; &lt;/tbody&gt;&lt;/table&gt;  &lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;span style="" lang="EN-GB"&gt;In complex development programmes people have tried to develop hierarchically nested Logical Frameworks, to show how different parts of a complex program connect to each other. But examples of these are not easy to find, despite the fact that there are many complex programmes in existence. In my experience, creating nested Logical Frameworks is not easy, and this may be the explanation for their scarcity. &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;Connecting up Social Frameworks to describe a more complex picture should be easier, because they have a modular structure. Each row describing an actor is in effect like a building block. These building blocks can be combined in different sequences. So, in addition to the pathway in the table above,  a parallel Katine project pathway of influence could be &lt;span style="font-style: italic;"&gt;Donors &lt;-&gt; AMREF&lt;/span&gt; &lt;-&gt; Ministry of Health &lt;-&gt; District Health Services &lt;-&gt; &lt;i style=""&gt;Local organisations &lt;-&gt; Local Households&lt;/i&gt; (actors in italics being part of other influence pathways already documented). This pathway could address the need for a parallel process of policy influencing at the national level, based on AMREF’s experience with local organisations in Katine.  This pathway branches off then re-converges with the original pathway (See simple diagram version below)&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_QeRS3Af8hns/R-RGpQoSp1I/AAAAAAAAAzY/U4yIcykMPwY/s1600-h/socia+frameworks.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://2.bp.blogspot.com/_QeRS3Af8hns/R-RGpQoSp1I/AAAAAAAAAzY/U4yIcykMPwY/s320/socia+frameworks.jpg" alt="" id="BLOGGER_PHOTO_ID_5180343146035455826" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt; As noted briefly above, each of the relationships connecting the actors in any part of the pathway is likely to involve &lt;span style="font-style: italic;"&gt;two way communications and influence&lt;/span&gt;, unlike the one way causality in the Logical Framework. So messages can come back from households, via local organisations to AMREF, then go off to the Ministry of Health. Useful indicators in the AMREF row, could therefore include such developments as improved knowledge about the impact of central government policies on Katine households&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="font-size:100%;"&gt;PS: I have now updated the ideas describe above in a posting on MandE NEWS called &lt;a href="http://mande.co.uk/2008/topic-bibliographies/networksanalysisandevaluation/the-social-framework-as-an-alternative-to-the-logical-framework/" rel="bookmark" title="Permanent Link: The Social Framework as an alternative to the Logical Framework"&gt;The Social Framework as an alternative to the Logical Framework &lt;/a&gt;This is where all future developments of the idea can be found. So, please visit.&lt;a href="http://mande.co.uk/2008/topic-bibliographies/networksanalysisandevaluation/the-social-framework-as-an-alternative-to-the-logical-framework/" rel="bookmark" title="Permanent Link: The Social Framework as an alternative to the Logical Framework"&gt;&lt;br /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;from "Rick on the Road" at http://mandenews.blogspot.com/&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6719829-3057797796303428604?l=mandenews.blogspot.com' alt='' /&gt;&lt;/div&gt;</description><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_QeRS3Af8hns/R-RGpQoSp1I/AAAAAAAAAzY/U4yIcykMPwY/s72-c/socia+frameworks.jpg" height="72" width="72" /><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">6</thr:total><feedburner:origLink>http://mandenews.blogspot.com/2008/02/social-frameworks-improvement-on.html</feedburner:origLink></item></channel></rss>

