08/31/2017 16:31:54
Paul McKeigue now applies the method described in Part 1 of his guest blog to the events in Ghouta (2013) and Khan Sheikhoun (2017). Based on extensive research, a false flag hypothesis for each event is spelled out in some detail. Photographic evidence referred to is not included in this blog but is available in the public domain. Readers are advised that these hypotheses involve harrowing and disturbing considerations.
In the last post I outlined the development of the mathematical and philosophical basis for using probability calculus to evaluate evidence.
The framework of probability calculus implies that:-
That's as far as we can go without using numbers. As the objective of these posts is to show you how to evaluate evidence for yourself using simple back-of-the envelope calculations, I'll recapitulate how to do this.
If you have learned to think of probabilities as objective properties of physical systems, the modern subjectivist interpretation of probability as quantifying degree of belief may be hard to accept. Classical probability theory was based on situations like coin-tossing and throwing dice, where probabilities are imposed by physical symmetries. However the rules for updating subjective probabilities in the light of evidence apply even when there are no such symmetries. One way to elicit your subjective probability of an event is as the price you would offer or accept for a ticket that will pay out £1 if the event occurs, and nothing if the event does not occur. If the prices you specify over various combinations of events are not consistent with probability theory, someone else can construct a "Dutch book" against you – a set of bets that guarantees that they will gain and you will lose.
Although probabilities are subjective, they are not plucked out of nowhere: they have to be consistent with what you already know, and with what you do not know. Usually there is some information that can be used to set conditional probabilities. For instance, in the examples discussed later, where some victims were injured after they were supposedly rescued, information on the frequencies of accidental injuries of these types in different settings can help us to specify the conditional probability of this observation given a hypothesis under which such injuries could only be explained as accidental. Where people differ in their assessment of the strength of evidence contributed by the same observations, eliciting these conditional probabilities will establish where their judgements differ.
We don't need to get too hung up on the subjectivity of assessing how well a hypothesis would have predicted the observations. In the examples discussed in these posts, serious errors have arisen not because people's assessments of these conditional probabilities were inconsistent with the available information, but because:-
In the context of the Syrian conflict, it is difficult for independent-minded journalists and academics to propose explanations that differ from the official line without being heavily criticized for "speculating" or "conspiracy theorizing". However you cannot evaluate evidence for a hypothesis without specifying alternative hypotheses and computing the likelihoods of these hypotheses given the observations: this inevitably requires you to "speculate".
In the discussion below I have linked to the sources of the observations used, but I have not embedded any images as the horrifying nature of some of these images would distract from the formalism of the argument. I am not appealing to your emotions but to your ability to use logic to evaluate evidence for yourself.
At the end of the last post I listed four alternative hypotheses about the Ghouta event:-
The problem is to compute the likelihoods of these four hypotheses given the "dog did not bark" observation that no images of search and rescue operations were released:-
Under H1, H2 or H3, it is unlikely that no such images would have been made available. But how unlikely? To assess this, we have to envisage the scenario under H1. Procedures for urban search and rescue are well established. After each home has been searched, it is marked to record how many live victims were rescued and how many dead victims were found. If in eastern Ghouta the area affected covered only one square kilometre of housing, with 50 homes per hectare, there would have been 5000 homes to search. With at least 400 fatalities, we expect at least as many living but incapacitated individuals to have needed rescuing. The immediate priority would have been to rescue the living, leaving bodies to be removed later. Even if the operation began in the middle of the night soon after the alleged attack, we would expect it to have continued after daybreak.
Of at least 150 videos uploaded , badged as coming from 18 different media outlets, not one shows this search and rescue operation. Most show victims at morgues, hospitals or improvised medical stations: dead and living victims appear to have arrived at these medical stations in the middle of the night. Most people will agree that the probability of this observation given H1 is low: I'll assign a value of 0.05. It is possible to calculate a number for this conditional probability based on some assumptions about the probability that the output of a single media outlet will include a search and rescue image, but I don't claim that this is more than a (rather conservative) subjective judgement.
Under H4, the conditional probability of this observation is high – it would have been difficult to stage such operations without the cooperation of large numbers of civilians. We can set a value of 0.8 for the probability that no search and rescue operations would be uploaded. This gives a likelihood ratio of about 20: a weight of evidence of 4.3 bits favouring H4 over H1. There are other related observations that should be taken into account: for instance:
Each of these observations would add one or two bits to the weight of evidence favouring H4 over H1, giving a total weight of evidence of about 7 bits given the related observations of no images of search and rescue, that the only such images uploaded were fraudulent, that bodies of victims apparently arrived without delay, and that these victims were fully clothed. Even if your prior odds favouring H1 over H4 were 1000 to 1, this weight of evidence would reduce the odds to about 3 to 1. There are other "dog did not bark" observations of non-occurrence of expected events related to the Ghouta incident that support H4 over H1: for instance under H1 we would expect to see interviews with bereaved survivors who would be able to document, with family photos, that they were relatives of victims seen dead in morgues.
We could proceed to evaluate the weights of evidence for other independent observations that have been made on the Ghouta event, and add them up. However there is one single observation for which I assess the weight of evidence favouring H4 over H1 to be so large as to overwhelm anything else.
Some of the most harrowing images from the Ghouta incident were from a building identified as an old tuberculosis hospital in the suburb of Kafr Batna, in which living and dead victims were shown in a basement room, and and at least 80 dead victims were laid out in a sunlit ground floor room (the "Sun Morgue"). A detailed study of the videos and still images from this site has been released online. This includes a detailed reconstruction of the fate of one victim in the Sun Morgue (pages 184-201). A short video summarizing this reconstruction has been released. The sequence of the videos and still images can be reconstructed from sun angles and from the order in which bodies are laid out and removed. The reconstruction shows that a heavily built male (given the code M-015 in this study) was brought into the morgue and laid on the floor apparently dead with no sign of bleeding. In later images M-015 had clenched his fists to grip his shirt, was bleeding from the neck, and a folded blanket had been been placed under his head. In subsequent images the flow of bright red blood had continued, eventually saturating the blanket and spilling on the floor. At the end, when most of the bodies had been removed, the blood-soaked blanket remained. These images show that M-015 was not dead when brought into the morgue (dead people do not clench fists or bleed profusely). The only plausible interpretation of this image sequence is that M-015's throat was cut when the morgue workers realized he was still alive.
I'll now try to compute the likelihoods of H4 and H1 given this observation. Under H1 it is possible that a victim would be mistakenly declared dead and begin stirring in the morgue, but it's almost impossible to explain why subsequent videos showed the victim bleeding bright red blood from the neck, or why the reaction of the emergency workers to someone who was obviously alive and bleeding profusely was to place a blanket under his neck and leave him to die.
The least implausible explanation I can come up with under H1 is that M-015 began stirring in the morgue, that somehow this led to an accident in which he was stabbed in the neck, and that the morgue staff, having no idea how to deal with this and afraid to report the accident, simply placed a blanket under his neck and left him to die. The probability of a patient being accidentally stabbed in the neck in a hospital setting, even in a chaotic response to a major incident, is extremely low. On the basis that I found no reports of such accidents in a brief search, I'll put the risk of at least one such accident in Ghouta at less than 10-5. A botched medical procedure, such as an attempted insertion of a central venous catheter via the neck, is not a plausible explanation for the bleeding as there would have been no indication for such a procedure as the first response to someone apparently recovering consciousness, and no space around the patient was cleared to facilitate medical intervention. Based on a probability of 10-5 that a victim waking up in the morgue would be accidentally stabbed in the neck and a probability of 0.01 (given that under hypothesis H1 the morgue staff are genuine emergency workers) that the reaction of the staff to this accident would be to leave him to die, I compute the likelihood of H1 given this observation as 10-7. Maybe readers can come up with an better explanation of how this sequence of images could have occurred given hypothesis H1.
Under H4, which postulates that the Ghouta victims were massacred captives most likely killed in gas chambers and that the morgue staff were playing an active part in this operation, such an observation is not unexpected. The probability that in a massacre of more than 400 people at least one victim would survive the gas chamber and that those removing the bodies would fail to detect this is high (0.5). It is to be expected that they would kill such an individual as soon as he began stirring, and it is probable that the method chosen would be throat cutting rather than shooting or strangling (0.5). It's also probable (0.4) that such an incriminating sequence of images would not be detected by those responsible for editing and uploading the videos and stills. Multiplying these conditional probabilities together gives a likelihood of 0.1. As we'll see it won't make much difference to the weight of evidence if these numbers vary by a factor of 2 or so. The weight of evidence is dominated by the very low likelihood of H1.
On this basis I evaluate the likelihood ratio favouring H4 over H1,given the observation of what appears to be a murder in the Kafr Batna morgue, as a million to one: a weight of evidence of 20 bits.
We now turn to evaluating the evidence for alternative explanations of the alleged chemical attack on 4 April 2017 in Khan Sheikhoun. With Ghouta as a precedent, we can begin by defining just two alternative hypotheses:
As before, your prior beliefs about which of these two alternative hypotheses is correct need not prevent you from evaluating the weight of evidence favouring one hypothesis over the other. You may believe that H1 is highly implausible on the basis that the Syrian government had no motive to carry out such an attack, or you may believe that H2 is highly implausible on the basis that such an elaborate deception operation is beyond the capability of the Syrian opposition and their foreign allies. To evaluate the evidence favouring H1 over H2, you have to assess, for each hypothesis in turn, what you would expect to observe if that hypothesis were true. This means that you have to put yourself first in the shoes of a Syrian general planning a chemical attack on the town, and then in the shoes of an opposition commander planning a massacre that would implicate the Syrian government.
For hypothesis H2 we have to envisage how a clever and ruthless al-Qaeda commander, perhaps working with foreign help, would plan such an operation. Although it is disturbing to have to work through this, I'll now state, as neutrally as I can, how I would expect such an operation to be planned.
In this framework, we can begin by evaluating the "mountain" of evidence – eyewitness reports, footage, crater, positive tests for sarin – that Monbiot invoked. Most of these observations were similar to that from Ghouta: purported eyewitnesses of the attack were made available for interview, images showing victims in morgues or improvised treatment facilities were uploaded, and samples tested positive for sarin. The likelihood of H1 given these observations is close to 1. Under H2, which specifies that Khan Sheikhoun was a repeat of Ghouta, we expect such observations so the likelihood of H2 also is close to 1. The weight of evidence favouring H1 over H2 given these observations is therefore close to zero. You may have a strong prior belief that H2 is implausible, but that does not influence the likelihood ratio favouring H1 over H2.
How does hypothesis H2 account for this mountain of evidence so easily? A key requirement for a successful deception operation is to create what look like many independent sources of evidence, even though they are all in fact generated by the operation. This principle was brilliantly applied by the legendary Naval Intelligence Division in the deception operations that led German commanders to expect Allied landings in 1943 in Greece rather than Sicily, and in 1944 in Calais rather than Normandy. Thus under H2, if the planners are competent, we expect to see videos badged with the logos of different opposition media agencies and uploaded separately, even though they may all originate from a central video editing operation.
At this point you may reasonably ask: if H2 can so easily account for this mountain of evidence, what possible observations could give a likelihood ratio strongly favouring H1 over H2? Such observations are those that would be expected under H1, but very difficult to generate under H2. For instance if H1 were true, any of the following observations might be expected to contribute evidence favouring H1 over H2:-
It's also useful to list, before going further, what possible observations might be expected to contribute evidence favouring H2 over H1 if H2 were true:-
With this in mind, we can evaluate the weight of evidence contributed by five observations that have been summarized here
Observation Prob (obs given H1) Prob (obs given H2) Likelihood ratio H2 / H1 Weight of evidence (bits) favouring H2 over H1 An individual claiming to be a bereaved survivor was made available for interview, with photos showing him with two children later seen as victims. The lack of photos of his wife was attributed to loss of the family photo album in an airstrike on the family home. 0.002 0.04 20 4.3 There are no videos of victims being rescued in their homes, or bodies being recovered 0.05 0.8 16 4 The flight track of the Syrian jet shown by the Pentagon (single east-west pass just south of the town) is incompatible with the track of the three explosions (north-south axis over the northern part of town) and the alleged impact site of the chemical munition 0.01 0.8 80 6.3 The alleged impact site of the chemical munition is upwind of where the casualties were reported (by the rebels) to have occurred. 0.02 0.5 25 4.6 In the images released by the rebels, several of the children who are seen dead have head and neck injuries. Reconstruction of sequences and matching of identities shows that in two of these children the head injuries were received after they had been supposedly rescued by the White Helmets 0.01 0.2 20 4.3 Total 23.5
From this evaluation, I assess the total weight of evidence favouring H2 over H1 as about 23 bits, giving a likelihood ratio of about 8 million to 1. This might be described as a mountain of evidence.
Although these assignments of the conditional probablities of the observations given H1 and H2 entail subjective judgements on my part, it should be possible for people with different prior odds to reach consensus on these conditional probabilities, and thus on the likelihood ratios. You may be able to improve on and correct my judgements of the conditional probabilities, using additional information. For instance:-
From this evaluation of the likelihoods of alternative explanations of the alleged chemical attacks in Ghouta in 2013 and Khan Sheikhoun in 2017 given some key observations, I assess that the evidence favouring the hypothesis of a managed massacre of captives over the hypothesis of a regime chemical attack is overwhelming (at least 20 bits) both for the Ghouta attack and for Khan Sheikhoun. The calculations and subjective judgements on which these assessments are based are set out above. The evaluation of weights of evidence does not depend on prior beliefs about which hypotheses are plausible. To modify this conclusion about the weight of evidence, you have either to identify additional observations which would have been predicted better by the regime chemical attack hypothesis than by the managed massacre hypothesis, or to criticize and revise my assessments of the conditional probabilities of the observations listed above given each of these two hypotheses. I've suggested above some ways in which additional information could be used to revise these conditional probabilities. If you believe that either the managed massacre hypothesis or the regime attack hypothesis is implausible, I am not disagreeing with you: priors are subjective. However for your beliefs to be logically consistent, your priors must be updated by the weight of evidence according to Bayes' theorem.
The strength of the evidence favouring the managed massacre hypothesis over the regime chemical attack hypothesis has quite radical implications for the credibility of western media, western governments and international agencies such as OPCW; you may reasonably ask "how could they have got it so wrong?".