{"id":349,"date":"2011-09-20T19:56:38","date_gmt":"2011-09-20T17:56:38","guid":{"rendered":"http:\/\/blog.asym.dk\/?p=349"},"modified":"2011-09-20T19:56:38","modified_gmt":"2011-09-20T17:56:38","slug":"the-many-are-smarter-than-the-few-how-crowds-can-forecast-quality","status":"publish","type":"post","link":"http:\/\/www.asym.dk\/index.php\/2011\/09\/20\/the-many-are-smarter-than-the-few-how-crowds-can-forecast-quality\/","title":{"rendered":"The many are smarter than the few: How crowds can forecast quality"},"content":{"rendered":"<p>This is a blog post which I&#8217;ve had underway since early May. It is about a new way of assessing quality. Let&#8217;s start with how we normally work:<br \/>\nTesters usually work alone or in small teams checking and exploring functionality, finding bugs, issues, and other artifacts. These artifacts do not by themselves say anything about the quality of the whole product, instead they document things which are relevant to quality.<br \/>\nIn this blog, I&#8217;ll propose a different kind of testing, one which is organised in a way which is radically different from traditional testing &#8211; and which can produce a quite different type of result.<br \/>\nMy inspiration is the 2004 book by <a title=\"Link to James Surowiecki's blog on 'The New Yorker'\" href=\"http:\/\/www.newyorker.com\/online\/blogs\/jamessurowiecki\" target=\"_blank\" rel=\"noopener noreferrer\">James Surowiecki<\/a>: &#8216;<a title=\"Page on the book at Wikipedia\" href=\"http:\/\/en.wikipedia.org\/wiki\/The_Wisdom_of_Crowds\" target=\"_blank\" rel=\"noopener noreferrer\">The Wisdom of Crowds<\/a>&#8216; with the subtitle &#8216;Why the many are smarter than the few&#8217;. In the book, Surowiecki presents a thought provoking fact: That while some individuals are very good problem solvers or excellent forecasters, a diverse crowd of ordinary people can always do better than any single individual.<br \/>\nSurowiecki explains this in a very convincing manner and the book is an enlighting read. I find Surowiecki&#8217;s thoughts a welcome diversion from what most seems to be concerned about these days: The performance of the individual. Too often, we forget that most good solutions are not invented or implemented by any single person, but by groups of people. And that the performance of teams often depend more on the composition of the team than on the individuals in it.<br \/>\nAs a tester, I enjoy working alone as well as in teams, but reading Surowiecki&#8217;s book made me think of ways to apply his thoughts to make quality assessments of a different kind than those traditional testing can make.<br \/>\n<a href=\"http:\/\/asymaps.files.wordpress.com\/2011\/09\/dsc_4330.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-353\" title=\"James Surowiecki: The Wisdom of Crowds\" src=\"http:\/\/asymaps.files.wordpress.com\/2011\/09\/dsc_4330.jpg\" alt=\"James Surowiecki: The Wisdom of Crowds\" width=\"500\" height=\"617\" srcset=\"http:\/\/www.asym.dk\/wp-content\/uploads\/2011\/09\/dsc_4330.jpg 800w, http:\/\/www.asym.dk\/wp-content\/uploads\/2011\/09\/dsc_4330-243x300.jpg 243w, http:\/\/www.asym.dk\/wp-content\/uploads\/2011\/09\/dsc_4330-768x948.jpg 768w\" sizes=\"(max-width: 500px) 100vw, 500px\" \/><\/a><br \/>\nLet me start the description with an example of a question which traditional testing cannot easily answer, but which I think a new kind of assessment can:<br \/>\nA client approaches us with a product which is still under development and therefore not yet on the market. The client tells us that he needs a holistic quality assessment of the product and he asks us to provide the answer to a very simple question: <em>Will it be a good product? <\/em><br \/>\nThough I can produce a wealth of information about a product I&#8217;m testing, answering this question is not possible by ordinary testing alone. I may be able to make up an opinion about the product based on my testing, and I can communicate this to my client, but it will always be a fundamentally <em>subjective <\/em>view.<br \/>\nAnd there is no practival way of assessing whether my view of the product matches that of the <em>collective intelligence <\/em>of the population of users of the future product. An expert in the field of forecasting product successes may do better than me, but in principle he may be just as wrong as I am \u2013 and the worst thing is that we will not know whether he&#8217;s right or wrong.<br \/>\nHumans are actually very good at coming up with answers to open ended questions: Quality is something that everyone tends to have an opinion about! But while a single human <em>can <\/em>(and according to Surowiecki <em>will<\/em>) make interpretation errors, Surowiecki points out that in a crowd, the errors will be evened out. Aggregated opinions can be a very reliable prediction of the quality of the finished product.<br \/>\nThe crowd does not have to be a team of experts. Surowiecki points out that rather than focusing on maximizing the individual members&#8217; domain knowledge and level of experience, the crowd should be put together to be as diverse as possible.<br \/>\nObviously we have to supply some information about the product to the group \u2013 they can&#8217;t make up their minds about quality without knowing <em>something<\/em> about the product. Collecting information has to be done by someone and provided to group members. This is an important task which a &#8216;moderator&#8217; has to do.<br \/>\nIn the ideal situation, we will provide all available information the group: Prototypes, design documents, concept descriptions, ideas, diagrams \u2013 even code! The idea is to allow each individual member of the crowd use his own heuristic making up his mind about the question.<br \/>\nBut that won&#8217;t work in practice. Asking all group members to read everything is just not effective. Besides, the documentation could lead them in wrong directions: They will focus on the most easily accessible parts and will avoid information for which they have to work a little to get to it.<br \/>\nSo the moderator will have to make a &#8216;flat&#8217; (as opposed to hiearachical) binder of different information from the product. What should it contain?<br \/>\nWhen I was learning sketching and drawing, I was introduced to the problem of drawing leaves on a tree or hair on an animal. I was taught a trick, which is to draw every 100<sup>th<\/sup> leave or every 10,000<sup>th<\/sup> hair accurately. It will then look correct to the viewer.<br \/>\nI suggest making the &#8216;information collection&#8217; in the same way: Pick <em>some <\/em>documents, <em>some <\/em>diagrams, <em>some <\/em>code, <em>some <\/em>tests. Or even, pick <em>some pages <\/em>from <em>some<\/em> documents.<br \/>\nThe idea is that the crowd members actually doesn&#8217;t need to see everytning \u2013 they only need enough to formulate an opinion. And then they should see different things, so we&#8217;re most certain that they will form different opinions about the system.<br \/>\nHow about questions \u2013 what questions should we ask? We will have to ask them in a way so answers can be aggregated into a combined result. We may want ask them to give a score, which can then be averaged or in other ways analysed.<br \/>\nSurowiecki points out some important pitfalls that should be avoided. I&#8217;ll focus on what is often referred to as <em>collective thinking<\/em>. This is what happens when a group of people turns out to be &#8216;dumber&#8217; than the individual members. A bullet proof way to get people to think in collectives is to let some members of the crowd influence other members: E.g. by putting a very charismatic person in the role of chairman or manager for the group. Surowiecki refers to several examples of how group thinking has lead to wrong decisions, and it is obvious that if we want to make an assessment which can be trusted, we have to avoid it. By all means.<br \/>\nSo &#8216;voting&#8217; should be secret, and we should generally prevent members from communicating with each other. If we do allow them to communicate, we should moderate the communication to ensure that individual members are not allowed to influence the opinions of other members.<br \/>\nIs crowd involvement in testing a new thing? I think so. I don&#8217;t think the concept has been described before.<br \/>\nOn the other hand, many beta test programs have traits of it.<br \/>\nBut where the crowd based quality assessments (or forecasts) can take place at any point in the development process, beta testing by definition takes place on an almost finished version of the product. And beta test programs produce the same types of results as ordinary testing: Bugs and other artifacts.<br \/>\nHolistic crowd testing is not an efficient bug factory. Its power is its ability to answer holistic questions about a product under development.<br \/>\nI&#8217;d like to set up a workshop in a forthcoming software testing conference where the idea can undergo further development. Let me know if you&#8217;re interested in participating, and I&#8217;ll let you know when and where.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This is a blog post which I&#8217;ve had underway since early May. It is about a new way of assessing quality. Let&#8217;s start with how we normally work: Testers usually work alone or in small teams checking and exploring functionality, finding bugs, issues, and other artifacts. These artifacts do not by themselves say anything about [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[29,67,74],"_links":{"self":[{"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/posts\/349"}],"collection":[{"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/comments?post=349"}],"version-history":[{"count":0,"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/posts\/349\/revisions"}],"wp:attachment":[{"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/media?parent=349"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/categories?post=349"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.asym.dk\/index.php\/wp-json\/wp\/v2\/tags?post=349"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}