<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.0.2" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>mrry</title>
	<link>http://www.mrry.co.uk/blog</link>
	<description>Derek Murray's weblog</description>
	<pubDate>Mon, 21 Jul 2008 00:18:04 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.2</generator>
	<language>en</language>
			<item>
		<title>Conspiracy Theories</title>
		<link>http://www.mrry.co.uk/blog/2008/07/21/conspiracy-theories/</link>
		<comments>http://www.mrry.co.uk/blog/2008/07/21/conspiracy-theories/#comments</comments>
		<pubDate>Mon, 21 Jul 2008 00:18:04 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uncategorized</category>
	<category>Meta</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/07/21/conspiracy-theories/</guid>
		<description><![CDATA[My last post concerned the conspiracy theories that surround the collapse of 7 World Trade Center on the 11th of September, 2001. In that post, I tried to provide an objective rationale for why the controlled demolition hypothesis should not be believed, owing to its unfalsifiability. The truth is that this and other 9/11 conspiracy [...]]]></description>
			<content:encoded><![CDATA[<p>My <a href="http://www.mrry.co.uk/blog/2008/07/21/the-911-delusion/">last post</a> concerned the <a href="http://en.wikipedia.org/wiki/Controlled_demolition_hypothesis_for_the_collapse_of_the_World_Trade_Center#World_Trade_Center_Seven">conspiracy theories that surround the collapse of 7 World Trade Center</a> on the 11th of September, 2001. In that post, I tried to provide an objective rationale for why the controlled demolition hypothesis should not be believed, owing to its unfalsifiability. The truth is that this and other <a href="http://en.wikipedia.org/wiki/9/11_conspiracy_theories">9/11 conspiracy theories</a> provoke an almost visceral response in me. I am pretty certain that I&#8217;m not the only person who feels this way.</p>
<p>Right now, it&#8217;s pretty obvious that I don&#8217;t believe in the conspiracy theories (at least, the ones in which the US government or one of its agencies &#8220;made it happen on purpose&#8221;). However, I am no great supporter of the present US administration, and my political leanings (if transposed to America) would be somewhere to the left of the Democratic Party. Why then am I inclined to give Bush and his aides the benefit of the doubt? Of course it&#8217;s their sheer ineptitude: one need only look at the prosecution of the Iraq war for a rich seam of evidence.</p>
<p>But that doesn&#8217;t explain why I am so viscerally affected by the conspiracy theories: after all, I might be an atheist on the balance of probabilities, but I have no problem with people who have religious faith.</p>
<p>I think part of it is cognitive dissonance: we are raised to trust the government, and the idea that a government could be responsible for an atrocity like 9/11 is utterly incompatible with that preconception. I&#8217;ve already rationalised away the conspiracy theories, but perhaps not everyone would do the same.</p>
<p>Let&#8217;s assume that Democrats are more likely to believe and perpetuate the 9/11 conspiracy theories; and that Republicans and independents are more likely to recoil from them. There are photos of <a href="http://hotair.com/archives/2007/04/30/photo-of-the-day-the-democrats-truther-problem/">conspiracy theorist banners at Obama rallies</a>. This is perfect ammunition for the Republicans, who can associate the Democrats with their &#8220;lunatic fringe&#8221; and exploit the cognitive dissonance in their base and the independents.</p>
<p>Here&#8217;s a conspiracy theory for you: Karl Rove sowed the seeds for the &#8220;9/11 Truth&#8221; movement in a deliberate attempt to discredit the Democrats and make them unelectable in the near future. Or maybe just to distract everyone from the true scandals of the Bush administration: tens of thousands of dead civilians in Iraq, thousands of dead soldiers, domestic spying on US citizens, the erosion of habeas corpus, inaction over global warming and the near collapse of the economy.</p>
<p>There are plenty of things that we still don&#8217;t know about 9/11, and we should as a matter of course seek the truth. We should discover the real reasons that the buildings fell in order to apply the lessons learned to future construction. But in finding the truth, we must retain an open mind, and not resort to intellectual dishonesty or partisanship.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/07/21/conspiracy-theories/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>The 9/11 Delusion</title>
		<link>http://www.mrry.co.uk/blog/2008/07/21/the-911-delusion/</link>
		<comments>http://www.mrry.co.uk/blog/2008/07/21/the-911-delusion/#comments</comments>
		<pubDate>Sun, 20 Jul 2008 23:27:47 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uncategorized</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/07/21/the-911-delusion/</guid>
		<description><![CDATA[When I saw in last Monday&#8217;s Guardian that Charlie Brooker was taking aim at 9/11 conspiracy theories, I hoped that he&#8217;d use his wide audience to present a logically watertight argument, in an entertainingly acerbic register. And buried within his piece was the quite probable suggestion that the paperwork alone would be impossible to conceal. [...]]]></description>
			<content:encoded><![CDATA[<p>When I saw in last Monday&#8217;s Guardian that Charlie Brooker was <a href="http://www.guardian.co.uk/commentisfree/2008/jul/14/september11.usa">taking aim at 9/11 conspiracy theories</a>, I hoped that he&#8217;d use his wide audience to present a logically watertight argument, in an entertainingly acerbic register. And buried within his piece was the quite probable suggestion that the paperwork alone would be impossible to conceal. Unfortunately, because he&#8217;s evidently paid by the ad hominem, he also said that every conspiracy theorist might as well believe that he is the Emperor of Pluto, and unleashed a firestorm in the <a href="http://www.guardian.co.uk/commentisfree/2008/jul/14/september11.usa?commentpage=1">online comments</a>. By opening up too many fronts in this debate, he left himself open to attacks, even <a href="http://www.guardian.co.uk/commentisfree/2008/jul/17/september11">from other Guardian commentators</a>.</p>
<p><a id="more-35"></a>Let us consider a single event from 9/11: the collapse of 7 World Trade Center, a 47-storey skyscraper which stood across Vesey Street from the main World Trade Center site. It collapsed at 5:21pm that day, but, unlike 1 and 2 World Trade Center (the north and south towers, respectively), it was not struck by an aeroplane.</p>
<p>As yet, the official report on the collapse has not been published: the <a href="http://wtc.nist.gov/progress_report_june04/appendixl.pdf">working hypothesis</a> is that fire and/or debris caused a critical supporting column to fail, ultimately leading to the collapse of the entire building.</p>
<p>A widely held conspiracy theory is that the building was <a href="http://en.wikipedia.org/wiki/Controlled_demolition_hypothesis_for_the_collapse_of_the_World_Trade_Center#World_Trade_Center_Seven">destroyed by controlled demolition</a>.</p>
<p>At present, there is no definitive evidence that proves either hypothesis: why then should we prefer one over the other? The conspiracy hypothesis can be proven, but not disproven. Any &#8220;proof&#8221; of the official hypothesis can be probabilistic at best, as it will be based on simulated physics with limited precision and an inherent (albeit possibly small) uncertainty. And that uncertainty leaves the door open for the possibility that <a href="http://www.prisonplanet.com/011904wtc7.html">Larry Silverstein ordered the demolition</a>, or that <a href="http://www.journalof911studies.com/articles/Why%20Indeed%20Did%20the%20WTC%20Buildings%20Completely%20Collapse%20Jones%20Thermite%20World%20Trade%20Center%20J24.pdf">thermite was used to hide the typical hallmarks of a controlled demolition</a>, amongst other possibilities.</p>
<p>On the other hand, you can disprove the official hypothesis in a simple manner. Simply find one person who was involved in the conspiracy to admit his involvement. This could be one of the cabal who planned the atrocity, one of the secret services who executed it, one of the demolition experts who planted the explosives in the building, one of the building workers who saw explosives being installed, one of the emergency services who ordered the building&#8217;s evacuation or one of the news media who apparently reported the collapse before it happened.</p>
<p>Conspiracies do happen, but they can only succeed when the number of participants is limited: otherwise human fallibility makes it vanishingly improbable that the whole endeavour can remain a secret. And that is why, unless somebody comes forward and claims responsibility or provides legally credible witness evidence, I cannot (and we should not) believe that 7 World Trade Center was destroyed in a controlled explosion.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/07/21/the-911-delusion/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Glastonbury 2008</title>
		<link>http://www.mrry.co.uk/blog/2008/07/01/glastonbury-2008/</link>
		<comments>http://www.mrry.co.uk/blog/2008/07/01/glastonbury-2008/#comments</comments>
		<pubDate>Tue, 01 Jul 2008 10:34:32 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Travel</category>
	<category>Personal</category>
	<category>Music</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/07/01/glastonbury-2008/</guid>
		<description><![CDATA[Cut Copy. The Levellers. The Subways. Get Cape. Wear Cape. Fly. Vampire Weekend. Ben Folds. The Duke Spirit. Edwyn Collins. John Cale. Franz Ferdinand. Kings Of Leon. Fanfarlo. Cruel Folk. Mik Artistik&#8217;s Ego Trip. British Sea Power. The Raconteurs. The Last Shadow Puppets. Amy Winehouse. Buddy Guy. The Proclaimers. Mary Bourke. Attila The Stockbroker. Dynamo&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>Cut Copy. The Levellers. The Subways. Get Cape. Wear Cape. Fly. Vampire Weekend. Ben Folds. The Duke Spirit. Edwyn Collins. John Cale. Franz Ferdinand. Kings Of Leon. Fanfarlo. Cruel Folk. Mik Artistik&#8217;s Ego Trip. British Sea Power. The Raconteurs. The Last Shadow Puppets. Amy Winehouse. Buddy Guy. The Proclaimers. Mary Bourke. Attila The Stockbroker. Dynamo&#8217;s Rhythm Aces. John Mayer. Scouting For Girls. Mark Ronson. Goldfrapp. Leonard Cohen. The National.</p>
<p>D x.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/07/01/glastonbury-2008/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>At least he&#8217;s got a bag for life</title>
		<link>http://www.mrry.co.uk/blog/2008/05/25/at-least-hes-got-a-bag-for-life/</link>
		<comments>http://www.mrry.co.uk/blog/2008/05/25/at-least-hes-got-a-bag-for-life/#comments</comments>
		<pubDate>Sun, 25 May 2008 14:36:19 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uncategorized</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/05/25/at-least-hes-got-a-bag-for-life/</guid>
		<description><![CDATA[The law of unintended consequences often comes shopping with me.
I do try to be environmentally-conscious—a task which is somewhat undermined by my air miles balance—and using fewer carrier bags seems like a good start. So I was cheered to see that Marks &#038; Spencer&#8217;s have started charging for bags (although the emotional blackmail handed down [...]]]></description>
			<content:encoded><![CDATA[<p>The law of unintended consequences often comes shopping with me.</p>
<p><a id="more-33"></a>I do try to be environmentally-conscious—a task which is somewhat undermined by my air miles balance—and using fewer carrier bags seems like a good start. So I was cheered to see that Marks &#038; Spencer&#8217;s have started charging for bags (although the emotional blackmail handed down by the checkout assistants may prove even more effective).</p>
<p>However, I&#8217;m rarely so organised that I have a bag with me when I nip into M&#038;S for a Chicken Tikka ready meal. &#8220;Do you need a bag?&#8221; Well, yes, I&#8217;m afraid so.</p>
<p>The choice that remains is between the 5p traditional poly bag, and the mighty 10p bag for life. Now I hope it will not be too bourgeois of me to say that the difference between having 5p, 10p or nothing at all is negligible. So I usually plump for the bag for life, with the added satisfaction that &#8220;this bag is for life, it&#8217;ll last forever, and I&#8217;m saving the environment.&#8221;</p>
<p>But as I survey the pile of unreused bags littering my bedroom, I can&#8217;t help but wonder at the real cost. These bags are bigger and sturdier than their cheaper counterparts: how much more oil do they require to manufacture, and how much longer do they take to biodegrade?</p>
<p>Is it perhaps to time to increase greatly the charge for bags? If a bag cost £1, I&#8217;d make damn sure that I remembered to bring one with me&#8230;.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/05/25/at-least-hes-got-a-bag-for-life/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>My name is not Bob</title>
		<link>http://www.mrry.co.uk/blog/2008/05/18/my-name-is-not-bob/</link>
		<comments>http://www.mrry.co.uk/blog/2008/05/18/my-name-is-not-bob/#comments</comments>
		<pubDate>Sun, 18 May 2008 00:26:14 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uncategorized</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/05/18/my-name-is-not-bob/</guid>
		<description><![CDATA[This weekend finds me back in Glasgow to visit my parents, and I&#8217;ve spent much of the afternoon clearing out a desk that I&#8217;d used since 1996. Filled with memories, old tickets and trinkets, my first football match, my first gig, my first trip to London on my own, my trip to Cambridge for interview. [...]]]></description>
			<content:encoded><![CDATA[<p>This weekend finds me back in Glasgow to visit my parents, and I&#8217;ve spent much of the afternoon clearing out a desk that I&#8217;d used since 1996. Filled with memories, old tickets and trinkets, my first football match, my first gig, my first trip to London on my own, my trip to Cambridge for interview. Filled with old school work and school reports (Computing – &#8220;I am pleased with my grades, and I like computers.&#8221; R.E. – &#8220;I am pleased with my progress in the short course, and look forward to its completion.&#8221;), and the surprising insight that I apparently had a &#8220;particular ability at Volleyball.&#8221; Filled with greetings cards from old friends, people I barely remember, and people I&#8217;d rather forget.</p>
<p><a id="more-32"></a>I also found some notes I wrote for a couple of websites that, sadly, never saw the light of day. One was an elaborate satire on my high school, let down somewhat by the sophomoric title &#8220;<a title="Williamwood High School" href="http://www.williamwood.e-renfrew.sch.uk/">Willywood</a>&#8220;. The other was for a pseudonymous blog intended to have been published during <a href="http://www.mrry.co.uk/articles/1802/">my time in Guildford</a>. I had developed &#8220;Bob&#8221; as somewhat affected and prolix character, who is aghast to find himself living in squalor in the south-east of England&#8217;s dullest dormitory town. I feel compelled to quote from the prologue:</p>
<blockquote><p>Like two-thirds of a well-written American sitcome, two storylines intertwine to explain why I am here, writing this.</p>
<p>I&#8217;ll dispense with the Big Band-to-present montage that sat so awkwardly in the otherwise excellent Charlie Kaufman-penned <em>Adaptation</em>, and simply say that I got a job in Guildford.</p>
<p>I figured that I would stay at the University of Surrey. They clearly don&#8217;t value ten weeks&#8217; rent, and wouldn&#8217;t accommodate me beyond late August.</p>
<p>I figured that I would ask my employer – with its established placement scheme – for some help. They sent an out-of-date brochure for B&#038;Bs.</p>
<p>I figured that I would visit Guildford and look round at rooms for rent. In all, I saw three. I plumped for the least bad.</p>
<hr />Unbeknownst to me, Mick and Kylie had let all three rooms in their 1970&#8217;s mid-terrace house. To celebrate the deal, they took the new tenants for a night out in Guildford. Words were spoken, things turned source, Kylie got in a fight with one of them, Mick weighed in to defend her honour, and an advert was placed the next day – Three rooms to let.</p>
<p>This I learned on my first night in the house, as they took me for a night out in Guildford.</p></blockquote>
<p>I&#8217;d kind-of like to see where I might have taken this: there were plenty of horrifying anecdotes about that summer. How far away it seems now&#8230;.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/05/18/my-name-is-not-bob/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Strangers on a plane</title>
		<link>http://www.mrry.co.uk/blog/2008/05/16/strangers-on-a-plane/</link>
		<comments>http://www.mrry.co.uk/blog/2008/05/16/strangers-on-a-plane/#comments</comments>
		<pubDate>Fri, 16 May 2008 14:27:10 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uncategorized</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/05/16/strangers-on-a-plane/</guid>
		<description><![CDATA[The other day, I was getting off a plane from Istanbul back to Stansted, and retrieving my Duty-Free carry-on, when a fellow passenger accosted me:
&#8220;I think that&#8217;s my bag,&#8221; he said.
&#8220;I&#8217;m fairly sure it&#8217;s not.&#8221;
&#8220;Does it have Turkish Delight in it?&#8221;
&#8220;Well, yes&#8230;.&#8221;

]]></description>
			<content:encoded><![CDATA[<p>The other day, I was getting off a plane from Istanbul back to Stansted, and retrieving my Duty-Free carry-on, when a fellow passenger accosted me:</p>
<p>&#8220;I think that&#8217;s my bag,&#8221; he said.</p>
<p>&#8220;I&#8217;m fairly sure it&#8217;s not.&#8221;</p>
<p>&#8220;Does it have Turkish Delight in it?&#8221;</p>
<p>&#8220;Well, yes&#8230;.&#8221;
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/05/16/strangers-on-a-plane/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Civic Pride</title>
		<link>http://www.mrry.co.uk/blog/2008/05/05/civic-pride/</link>
		<comments>http://www.mrry.co.uk/blog/2008/05/05/civic-pride/#comments</comments>
		<pubDate>Mon, 05 May 2008 10:30:58 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uncategorized</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/05/05/civic-pride/</guid>
		<description><![CDATA[Growing up in Glasgow, I was exposed to more than my fair share of internecine rivalries: when I was more serious about blogging, I planned a grand series of posts cataloguing every single one of them. Easy, I thought, there&#8217;s the other football team, the other side of the river, the suburbs, the other city, [...]]]></description>
			<content:encoded><![CDATA[<p>Growing up in Glasgow, I was exposed to more than my fair share of internecine rivalries: when I was more serious about blogging, I planned a grand series of posts cataloguing every single one of them. Easy, I thought, there&#8217;s the <a href="http://en.wikipedia.org/wiki/Old_Firm">other football team</a>, the other side of the river, the suburbs, the <a href="http://en.wikipedia.org/wiki/Edinburgh">other city</a>, and don&#8217;t even get me started on the English.</p>
<p><a id="more-30"></a>Though these rivalries have existed for years, the internet has given them a new lease of life, in the hands of pure, unabashed saddos. Usenet has been the site of a <a href="http://groups.google.com/group/alt.airports.uk.glasgow/browse_thread/thread/11ad497e76334c9a#">proxy war</a>, pitting <a href="http://groups.google.com/group/alt.airports.uk.glasgow/topics">Glasgow <em>Airport</em></a> against its <a href="http://groups.google.com/group/alt.airports.uk.edinburgh/topics?lnk=rgh">rival in Edinburgh</a>. The <a href="http://en.wikipedia.org/w/index.php?title=Glasgow&#038;action=history">revision history</a> for the <a href="http://en.wikipedia.org/w/index.php?title=Glasgow&#038;action=history">Glasgow article on Wikipedia</a> shows frequent changes in the population column, as well as <a href="http://en.wikipedia.org/w/index.php?title=Glasgow&#038;diff=209774696&#038;oldid=209774130">the recent deletion</a> of the comment, &#8220;<span class="diffchange diffchange-inline">Glasgow is the largest city in Scotland and all the more mysteriously is not the capital. Edinburh the second largest city in Scotland is the capital</span>.&#8221; I can even remember reading one message board post where the author had counted the number of A-roads in each city and concluded that, having more, Glasgow was the more important.<br />
In the interests of full disclosure, I must admit that I did once subscribe to both alt.airports.uk.glasgow and the edit feed for the city&#8217;s Wikipedia article. But having lived first in Edinburgh, now in England, I&#8217;ve become more laid back in my civic pride. Until today, that is.</p>
<p>A <a href="http://news.bbc.co.uk/1/hi/uk/7382750.stm">BBC news article</a> reported on fears during the Cold War that a nuclear strike would have a grave effect on the nation&#8217;s tea supply. It mentioned that, for the purposes of planning:</p>
<blockquote><p>the Ministry of Food listed London, Birmingham, Merseyside, Manchester and Clydeside [Glasgow] as H-bomb targets.</p>
<p>Tyneside, Teesside, Leeds, Sheffield, Hull, Derby, Purfleet in Essex, Southampton, Portsmouth, Bristol, Plymouth, Cardiff, Coventry and Belfast were named as A-bomb targets.</p></blockquote>
<p>And all I could think was, &#8216;Ha, take that, Edinburgh&#8230;.&#8217;
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/05/05/civic-pride/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>Trade</title>
		<link>http://www.mrry.co.uk/blog/2008/03/04/trade/</link>
		<comments>http://www.mrry.co.uk/blog/2008/03/04/trade/#comments</comments>
		<pubDate>Tue, 04 Mar 2008 16:35:29 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Politics</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2008/03/04/trade/</guid>
		<description><![CDATA[I can&#8217;t sleep, and I&#8217;ve got NAFTA on my mind. I won&#8217;t claim that the two are correlated, but when did that ever stop someone writing a blog post?
I am a little worried, however. Like just about everyone to whom I&#8217;ve talked, I&#8217;m hoping for a Democrat in the White House next year, and I&#8217;ve [...]]]></description>
			<content:encoded><![CDATA[<p>I can&#8217;t sleep, and I&#8217;ve got NAFTA on my mind. I won&#8217;t claim that the two are correlated, but when did that ever stop someone writing a blog post?</p>
<p><a id="more-29"></a>I am a little worried, however. Like just about everyone to whom I&#8217;ve talked, I&#8217;m hoping for a Democrat in the White House next year, and I&#8217;ve been following the primaries intently. But some of the recent pronouncements on trade have troubled me a little. So I&#8217;m hoping that, by writing something so naïve and ill-informed as this, someone will come out of the woodwork and tell me where I&#8217;m going wrong. Okay, here goes.</p>
<p>Let&#8217;s start by assuming that we&#8217;re all liberals. We probably want Barack Obama to be the next president, but maybe we&#8217;d rather it was Hilary Clinton. Either way, for the purpose of this discussion, it doesn&#8217;t matter.</p>
<p>We want things to be better for people in developing countries. To a first approximation, this could mean raising the standard of living in these countries by alleviating poverty. One way of doing this would be to spend more money in these countries. (Is this not why we&#8217;re all drinking fair-trade coffee, and eating fair-trade chocolate? Actually, scratch that, as I&#8217;m not convinced that the fair-trade movement doesn&#8217;t suffer from the exact same problems as what I&#8217;m about to describe. And sorry for the double-negative there.)</p>
<p>We are more likely to spend money in a country if we have free trade with that country, than if prices for goods from that country are inflated by duties or tariffs. We ensure free (or, perhaps more accurately, &#8220;freer&#8221;) trade by making free trade agreements with other countries.</p>
<p>So it&#8217;s heartening that, sometimes at least, both Obama [<a href="http://www.nytimes.com/2008/03/04/us/politics/04nafta.html?ex=1362373200&#038;en=95e274b55a8e3fe3&#038;ei=5124&#038;partner=permalink&#038;exprod=permalink">1</a>] and Clinton [<a href="http://www.ndol.org/ndol_ci.cfm?kaid=106&#038;subid=122&#038;contentid=250750">2</a>] have supported NAFTA, which enables free trade between the US, Canada and Mexico.</p>
<p>Except, there&#8217;s a problem: free trade can cost jobs, especially manufacturing jobs in countries where the standard of living is so high that it is uneconomical to pay workers a living wage, when the same output could be obtained at a fraction of the price from one of our trading partners.</p>
<p>Especially jobs in traditionally blue-collar states like Ohio, where voters go to the polls today. And, therefore, both Clinton [<a href="http://facts.hillaryhub.com/archive/?id=6019">3</a>] and Obama [<a href="http://www.barackobama.com/2008/02/24/remarks_for_senator_barack_oba_1.php">4</a>] have attacked the other for supporting NAFTA, at the expense of American jobs.</p>
<p>So I have three questions:</p>
<ul>
<li>Which of the above assumptions is incorrect?</li>
<li>I&#8217;d like to believe in Obama&#8217;s message of hope and change, but does this change stop at US border?</li>
<li>If, as is now being reported, at least one of the candidates is engaging in disingenuous political posturing when making these statements [<a href="http://www.nytimes.com/2008/03/04/us/politics/04nafta.html?ex=1362373200&#038;en=95e274b55a8e3fe3&#038;ei=5124&#038;partner=permalink&#038;exprod=permalink">1</a>] (and I&#8217;d be surprised if his opponent <em>wasn&#8217;t</em> also doing the same), then why should we believe in the rest of the rhetoric that goes along with it?</li>
</ul>
<p>Now, I&#8217;m not an idealist, but I don&#8217;t particularly enjoy being cynical. Indeed, I hope I&#8217;ve made a mistake somewhere above, and someone will come along and correct me. However, it seems that, in an election that will probably be won on the strength of aspirational oratory, we should be especially critical of what is said.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2008/03/04/trade/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>SOSP 2007: Day 3</title>
		<link>http://www.mrry.co.uk/blog/2007/10/17/sosp-2007-day-3/</link>
		<comments>http://www.mrry.co.uk/blog/2007/10/17/sosp-2007-day-3/#comments</comments>
		<pubDate>Wed, 17 Oct 2007 15:58:51 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uni</category>
	<category>Technology</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2007/10/17/sosp-2007-day-3/</guid>
		<description><![CDATA[Indulge me for a moment. For some reason, the powers that be at SIGOPS like to hold SOSP at remote locations, which in recent years have included Bretton Woods, Banff, and Brighton. So I&#8217;ve not been the only one to point out that this year&#8217;s location, Skamania Lodge in southern Washington state, bears something of [...]]]></description>
			<content:encoded><![CDATA[<p>Indulge me for a moment. For some reason, the powers that be at SIGOPS like to hold SOSP at remote locations, which in recent years have included <a href="http://www.sosp.org/1983">Bretton Woods</a>, <a href="http://www.sosp.org/2001">Banff</a>, and <a href="http://www.sosp.org/2005">Brighton</a>. So I&#8217;ve not been the only one to point out that this year&#8217;s location, <a href="http://www.skamania.com/">Skamania Lodge</a> in southern Washington state, bears something of a resemblance to the infamous <a href="http://en.wikipedia.org/wiki/Overlook_Hotel">Overlook Hotel</a> in Stephen King&#8217;s <em>The Shining</em>. A little Wikipedia surfing last night led me to discover that the basis for the Overlook is Timberline Lodge, <a href="http://maps.google.com/maps?f=d&#038;hl=en&#038;geocode=&#038;time=&#038;date=&#038;ttype=&#038;saddr=Skamania+Lodge+Dr,+Stevenson,+WA+98648&#038;daddr=Timberline+Hwy,+Government+Camp,+Clackamas,+Oregon+97028,+United+States&#038;sll=45.305803,-121.731262&#038;sspn=0.063023,0.138702&#038;ie=UTF8&#038;z=10&#038;om=1">just across the river in Oregon</a>. Imagine my surprise as I watched the local news in our own hotel (which owes more than a little to <a href="http://www.mrry.co.uk/blog/en.wikipedia.org/wiki/Bates_Motel">Hitchcock</a>), and the weather forecaster cut to a shot of the Timberline Lodge, where apparently the snow has just started falling for the season. Ah well, at least they&#8217;ve got a security camera up there these days.</p>
<p><a id="more-28"></a></p>
<h2>Storage</h2>
<h3>DejaView: A Personal Virtual Computer Recorder</h3>
<ul>
<li>The MEMEX Vision: We lack tools to store all of our books, records and communications quickly and with great flexibility. (Vannevar Bush.) We need to archive, search, view and manipulate what we have seen. Web/desktop search is inadequate: no record of what we have seen but not saved.</li>
<li>DejaView: PVCR that provides complete, transparent, fast recording of the desktop computing experience. Tivo for the desktop. Records display like a video (playback, browse, ff, rw); and text and context to use as an index.</li>
<li>Display recording: transparent, efficient and full-fidelity. Uses a virtual display driver that can redirect the display anywhere: to the user (on the screen) and to persistent storage. Has a standard device interface, intercepting updates at the low level and logs all display updates.</li>
<li>Text/context recording: naïve approach would be do to OCR on the framebuffer, which is too slow and inaccurate. Instead leverage accessibility infrastructure (screen reader technology), which is integrated in most standard GUI toolkits. Also provides useful contextual information: name/type of application, window in focus, special properties (e.g. menu text).</li>
<li>Execution recording: be able to revive execution state at any time, including the entire desktop session, not just a single process, and it needs to be fast enough to save frequently without degrading user experience. Use a virtual execution environment that decouples the desktop from the underlying OS by interposing on the syscall API. Only saves the desktop state and not the entire OS, which cuts down on the size of the checkpoint. Requires some downtime for the application during saving and snapshotting the filesystem, which must be optimised for interactivity. Checkpoint rate must be limited to make the overhead manageable. Also avoid taking pointless checkpoints (like when the desktop clock changes). A checkpoint per second is sufficient granularity.</li>
<li>On revival, use UnionFS to put a CoW R/W layer on top of the R/O checkpointed file system. Can therefore fork history.</li>
<li>Evaluation: based on real implementation - overhead, impact on interactivity and storage requirements; and access to data (latency etc.). Ran a bunch of benchmarks, based on normal workloads, like web browsing, video playback, catting a large file to screen, doing a kernel build, or something in Matlab. Also evaluated based on real desktop usage by grad students.</li>
<li>Little overhead for display or execution recording; 100% overhead for text recording; 125% for full recording when web browsing (due to a bug in Firefox&#8217;s accessibility). All other use cases have much less overhead. Checkpoint latency: 5ms to 22ms downtime; total checkpoint time from 80 to 200ms. Storage growth is very low for the real usage scenario (lower than a PVR with equivalent display resolution). Browse and search latency no more than 200ms, which is good enough for interactive use. Playback speedup about 200x for real usage. Session revive takes 1.5 to just over 4s.</li>
</ul>
<h3>Improving File System Reliability with I/O Shepherding</h3>
<ul>
<li>Storage systems are becoming complex, which can lead to complex failures. Want to manage disk and individual block failures. Addressed by checksumming, parity, mirroring, versioning, etc. However, these techniques are insufficient, and poorly understood (unlike performance and consistency). There is no good strategy in commodity FSs, only coarse-grained policies.</li>
<li>Reliability treatments are diffuse and inflexible. Different apps require different policies (e.g. desktop workload versus web servers).</li>
<li>I/O shepherd: localised, flexible policies for different types of storage. e.g. Mirroring for archival data, checksumming for scientific data, or different levels of protection based on the quality of the drive. These policies can be composed.</li>
<li>I/O shepherding layer interposed between the FS and the disk subsystem. Includes >=1 policy tables, linking to policy code, policy primitives and policy metadata.</li>
<li>Policy table: for specifying policies. Different kinds of block types and volumes, and levels of reliability and importance. So we have a write policy and read policy for each block type (like a function pointer); and a policy table for each volume (or mount point?).</li>
<li>Policy metadata: need remapping, mirroring and sanity checking. Must be integrated with the FS. Also need I/O Shepherd Maps, to identify e.g. mirror locations.</li>
<li>Primitives and Code: want to make reliability management simple. Primitives like &#8220;Checksum&#8221;, &#8220;Parity&#8221;, &#8220;Sanity Check&#8221;, &#8220;Allocate Near&#8221;, &#8220;Allocate Far&#8221;. These are then composed into a full policy, in the policy code.</li>
<li>Challenge is to keep the new data and metadata consistent in the presence of crashes. Need consistency management.</li>
<li>CrookFS: ext3 with shepherding capabilities. 900LOC changes to the OS, and 3500LOC for the shepherding infrastructure. Small overhead. Integrates reliability policy with journalling: execute policies during checkpoint.</li>
<li>However, we can&#8217;t run e.g. remapping during checkpointing, because the checkpoint might fail, leaving us in an inconsistent state. If we keep the remap table in memory, the remap succeeds, but the program crashes before we can sync the table, we have the disk in an inconsistent state. At present, there is no consistency for any checkpoint recovery that changes state. Consistent reliability with the current journalling system is impossible!</li>
<li>Thus we need &#8220;chained transactions&#8221;, which says that only after a chained transaction (updating the remap table) can the previous transaction (doing the remapping) be released. This fixes the flaw in journalling. It is repeatable across crashes, as long as we have idempotent policies.</li>
<li>Evaluation: [see the paper].</li>
</ul>
<h3>Generalized File System Dependencies</h3>
<ul>
<li>A new architecture for constructing file systems, using the &#8220;generalised dependency abstraction&#8221;.</li>
<li>e.g. for consistency, we must keep FS consistent after every write, by, e.g. journalling. However must tradeoff durability features against performance. A file system must pick one tradeoff, but why can it not be extensible? This is difficult because it&#8217;s difficult to implement, complicated due to caches, and because correctness must be maintained.</li>
<li>What is a simple, general mechanism for implementing any consistency model? The &#8220;patch&#8221; abstraction in their implementation, Featherstitch.</li>
<li>Featherstitch contributions: the patch and patchgroup abstractions (explicit write-before abstractions, and FS-agnostic). Replaces Linux&#8217;s FS and buffer cache layer with implementations of ext2 and UFS. Journalling, WAFL and soft updates implemented just by using patch arrangements.</li>
<li>Patches: a disk data change and any dependencies it has on other disk data changes. Includes some undo data. The dependency says which data must be written by which others. Benefits of this are to separate write-before specification and enforcement, and makes these relationships explicit. Inspired by soft updates, but don&#8217;t enforce a particular FS and consistency model.</li>
<li>Example of rename: involves adding a directory entry and removing the old one. So we could write the source with the removed entry, then write the new entry. But what if we crash in between?</li>
<li>Soft updates: inc ref count for the file inode, then add the new dirent, then remove the old dirent, then dec the ref count. However, this has a cyclic dependency for block updates. But, in patches, this is not a cycle. So we can do the increment patch, then do the other patches, with no cycle.</li>
<li>Also showed how this would be done with journalling and WAFL: somewhat more complicated.</li>
<li>Patchgroups: arise from application-defined consistency requirements. Commonly just syscall to the buffer cache (fsync, sync), or depend on the underlying FS. Instead, the patch interface is extended to user space, using patchgroups, which enables the specification of write-before requirements between syscalls. Can make things more efficient by not doing a bunch of syncs.</li>
<li>Patch optimisations: massive dependency graph between patches even when allocating a few blocks for a new file, and so patches must have a lightning-fast implementation. Main primary overhead is unused undo data (150% overhead on some benchmarks). How do we detect where this is unused? Theorem is that undo data is only necessary when there are block-level cycles. So we should specify all dependencies when creating a patch.</li>
<li>Next: the patch data structures take a large amount of memory. Therefore try to merge them, i.e. if they are on the same block (hard patch merging) and when they overlap. There are several other optimisations in the paper!</li>
<li>Evaluation: measure optimisation effectiveness, compare with ext2 and ext3, check consistency, and run a benchmark (UW IMAP).</li>
<li>The optimisations are successful in reducing the number of patches, system time and undo data overhead.</li>
<li>Comparison with Linux: Featherstitch is between 19 and 26% slower at the PostMark benchmark. It&#8217;s faster on other benchmarks, where differences in the block allocation strategy outweigh the overhead.</li>
<li>Consistency correctness: under random crashes, is the FS consistent. Works on Soft updates, Journalling as expected.</li>
<li>Benchmark: moving 1000 messages from one IMAP folder to another. Patchgroups are much faster than using fsync.</li>
</ul>
<h2>Operating System Security</h2>
<h3>Information Flow Control For Standard OS Abstractions</h3>
<ul>
<li>Problem: vulnerabilities in websites can lead to exploits. Solution: decentralised information flow control (DIFC). Example: admin wants to secure web app that handles secret and non-secret data. Use a &#8220;declassifier&#8221; which decides who gets to see what data. User authenticates to the declassifier then passes all requests through it. Even a bug in the webapp cannot leak data, because the declassifier makes the final decision on who can see what. So why is nobody using this, if it&#8217;s so good?</li>
<li>Too hard to write the declassifier, requires smart people to develop all aspects of the system. Label systems are complex. Programming with DIFC can lead to unexpected behaviour. And it prevents the reuse of existing code, such as commodity operating systems.</li>
<li>Unexpected behaviour: unreliable communication between two processes of different security types; mysterious failures when a secret process elevates the security of another process, and therefore prevents it from continuing with a task such as writing to an unsecure file (so that it can&#8217;t leak data from the secret process).</li>
<li>Solution: Flume to solve DIFC problems (user-level DIFC on Linux with a simple label system, and glue between Unix API and labels). Then develop an application and evaluate it. Goal: want to be able to install this on existing machines using a apt-get.</li>
<li>Uses system call delegation which forwards system calls to a user-level Flume reference monitor.</li>
<li>Three process classes: confined (all syscalls go through refmon), Flume-oblivious (with no access to secret data), and unconfined/mediators (a mix of the two).</li>
<li>Simple label system: each process gets a secrecy label which summarises the categories of data that a process is assumed to have seen. (e.g. Financial Documents, HR Documents.) Single category is a tag; set of tags is a label. Any process can add any tag to its label, but it cannot remove it without special privileges. A process can create a new tag, and is given the ability to declassify it (and hence remove it from its secrecy label).</li>
<li>Communication rule: process p can send to q if p&#8217;s label is a subset of q&#8217;s label.</li>
<li>Endpoints: intermediaries between processes for communication (per file-descriptor?), which can be labelled independently from the processes, and which declassify data. This is only possible if the owning process is able to declassify a tag in the secrecy label of the endpoint (better specified in set notation). A process can have many endpoints. This addresses some of the mysterious failures by eagerly revealing errors (which would violate the endpoint invariant).</li>
<li>Example application is the Python-based MoinMoin wiki (100KLOC). 43 instances of the same check to see if the current user is able to access a bit of data, which have led to bugs, and plugins further complicate this. Instead add a 1KLOC declassifier. TCB is server + declassifier. Untrusted code is the Wiki plus any plugins.</li>
<li>Apache is Flume-oblivious. Declassifier is a mediator. MoinMoin is confined, i.e. must syscall through the refmon.</li>
<li>Results: Flume allows the adoption of existing Unix software: just 1% of MoinMoin had to be changed (no change in Python interpreter or Apache). By using Flume, two ACL bypass bugs were solved automatically. Performance is within a factor of 2.</li>
<li>Limitations: bigger TCB than HiStar or Asbestos (Linux stack + 22KLOC refmon). Disk quotas are a covert channel.</li>
<li>Q: On the example showing how endpoints are made, and secrecy labels may be changed, what if a malicious plugin attempts to perform declassification? This isn&#8217;t a problem, because, if the secrecy were critical to other applications, it wouldn&#8217;t be able to declassify the tag (because the tag would have been created by another application). Untrusted software would not have the elevated privilege to declassify tags.</li>
<li>Q: Can the described policies be implemented inside SELinux? For a single application, probably yes, but the policies would become complex if it were possible to add applications to the change. Fluke simplifies system administration and transfers these decisions to the content provider.</li>
</ul>
<h3>SecVisor: A Tiny Hypervisor to Provide Lifetime Kernel Code Integrity for Commodity OSes</h3>
<ul>
<li>Kernel rootkits: malware inserted in OS kernels to avoid detection by user-level scanners. Becoming increasingly common in the wild. DMA-based attacks are becoming common. Current (user-space) security tools are insufficient because they assume kernel integrity, and cannot find all attacks (as they are detection-based).</li>
<li>Aim: a hypervisor that prevents injected code from executing at kernel privilege. Permit only user-approved code to execute a kernel privilege, based on a user-specified approval policy. Goals: security and ease of porting (not performance).</li>
<li>~1100LOC hypervisor, that operates at a ring below the kernel, but doesn&#8217;t attempt to do anything with hardware. Enforces approved code execution in kernel mode, which holds over system lifetime.</li>
<li>Assumes: attacker can perform all attacks except HW attacks against CPU and memory. Can modify memory contents, perform DMA writes and modify system firmware. Can also have knowledge of 0-day kernel exploits. Single CPU with SVM or VT. Executed in 32-bit mode without the use of self-modifying code. No vulnerabilities in SecVisor (going to formally verify that).</li>
<li>Require: constrained instruction pointer, which should be within approved code regions whenever CPU is in kernel mode. Approved code regions must be immutable, and so cannot be modified by an attacker.</li>
<li>Constrained IP: kernel mode entry should set IP to being within approved regions. IP must remain within approved regions as long as we stay in kernel mode. Each kernel mode exit should set the CPU privilege level to user mode.</li>
<li>Kernel Entry: exception/interrupt happens, looks up handler in the interrupt vector, and jumps to that IP in kernel mode. So all entries in the vector must point to approved code.</li>
<li>During execution: place write XOR execute protection over kernel memory. But the problem is that the kernel and application share the same address space, so what if the attacker attempts to jump to user-space code, maybe by performing a buffer overrun. Solution: mark all application memory as non-executable (NX) on kernel entry.</li>
<li>Intercepting user-to-kernel switch: all CPU entry pointers point to approved code; mark approved code regions as NX during user mode execution; all user-to-kernel switches raise exceptions.</li>
<li>Kernel exit: intercept all kernel-to-user switches by using the exception that is raised by trying to execute NX code in user space after the switch back to user space.</li>
<li>Immutable approved code: memory may be written by software executing on CPU or by DMA writes by peripherals. Approved code must be marked read-only. IOMMU protects from DMA writes.</li>
<li>Implementation of memory protection: set protection independent of OS, using shadow (or nested) page tables. DMA exclusion vector to protect approved code.</li>
<li>[I don&#8217;t think I need to include the explanation of shadow page tables.]</li>
<li>Shadow PT is used to set memory protections. Approved code regions are set read-only in the SPT, and DEV prevents DMA writes. We also need to prevent aliasing of pages that are in the approved regions to non-approved VAs.</li>
<li>Check entry pointers to ensure they contain VAs of approved code. GDT, LDT and IDT must be protected by shadowing these (to protect them from DMA writes).</li>
<li>Overhead from intercepting all switches, and the shadow PT synchronisation and kernel PT checks. I/O-intensive workloads will perform poorly.</li>
<li>Specint benchmarks show that SecVisor is about 2 to 5% slower than Xen (except for the gcc test, where it is much slower). Some optimisation has been done in the latest version.</li>
<li>Q: Many past kernel exploits change the uid of user processes to overwrite kernel memory; does SecVisor address this? It doesn&#8217;t protect the integrity of kernel data structures. This is future work.</li>
<li>Q: Performance effect of nested page tables? Shadow overhead goes away, but you still have to check and protect the kernel page tables.</li>
<li>Q: Knowing everything you know about the system, how would you attack it as an intruder? Try to attack the TCB, but it is very small, so we don&#8217;t expect that this will be a problem.</li>
<li>Q: Deal with loadable modules? Paper discusses this: you could have a hash-based approval policy, and SecVisor performs code relocation on behalf of the kernel.</li>
<li>Q: How to protect the stable storage off which SecVisor is loaded? Use skinit, along with trusted computing techniques.</li>
</ul>
<h3>Secure Virtual Architecture: A Safe Execution Environment for Commodity Operating Systems</h3>
<ul>
<li>Safe execution environment: such as that provided by Java or C#. Array indexing within bounds, no use of uninitialised variables, type safe operations, no dangling pointers, control flow follows semantics, etc.</li>
<li>Commodity OSs do not use these kind of languages.</li>
<li>Using as secure exec env would provide security, novel design opportunities and the ability to develop new solutions to higher-level security challenges (such as information flow policies, for example).</li>
<li>Secure virtual architecture: interpose a compiler-based VM between a commodity OS and hardware. The VM provides a virtual instruction set architecture.</li>
<li>Contributions: a compiler-based VM that can host a commodity OS. First to provide security guarantees for a complete commodity OS.</li>
<li>Safety-checking compiler: similar to a C compiler which translates C into the virtual ISA (application level). Generates bytecode rather than native code. Bytecode has a type system which enables alias analysis in the VM.</li>
<li>VM contains a safety verifier (which ensures that the compiler has done the right thing; may insert runtime checks), native code generator, and OS and memory safety runtime libraries. The compiler is outside the TCB; only the verifier and code generator are inside.</li>
<li>Virtual ISA based on LLVM. Added OS-neutral operations (SVA-OS) that remove difficult to analyze assembly code and encapsulates privileged operations. Porting to SVA-OS is like porting to a new arch.</li>
<li>Goals: maximise safety guarantees, minimise reliance on OS correctness, minimise changes to the kernel, and retain original kernel memory allocators.</li>
<li>Guarantees: just like Java or C#, but type safety only for a subset of objects (due to the use of C code), and dangling pointers are harmless (rather than never used). These limitations do no compromise the other guarantees.</li>
<li>Checks: load/store, bounds, illegal free and indirect call. Transforms: stack to heap promotion to deal with dangling pointers, and initialising memory. Need to track object bounds: use object lookups. Naïvely, the security verifier would add code to register all allocations in the VM metadata. All uses of these objects would have to follow a lookup to see if the access is within bounds. Can improve this by alias analysis, which groups objects into logical partitions, reducing overhead from between 400 and 1100% to between 10 and 30%.</li>
<li>Implementation: ported Linux to the SVA ISA, compiled using LLVM. SVA-OS is a runtime library linked into the kernel. Guarantees safety of everything apart from the kernel memory management code and [blink and you&#8217;ll miss it, see the paper]. Around 5KLOC had to be changed in Linux.</li>
<li>Memory safety overhead is less than 59% (for a web server benchmark). Latency is bad with many small transfers. 80% of exploits caught (the non-caught one was in a library that isn&#8217;t currently ported to the VM).</li>
<li>Several performance improvements are possible, by changing the code, using smarter checks and smarter static analysis.</li>
<li>Q: To what extent does the analysis equate to type safety? Do not guarantee that pointers point to the correct object, but that it is a valid object.</li>
<li>Q: Must the number of memory pools be statically allocated? No. What if memory pools are allocated at run-time? Even then, its use is statically identifiable.</li>
<li>Q: Is the overhead really small enough to make this approach suitable for use today? Improving the overheads is future work.</li>
</ul>
<p>And, with that, I&#8217;m off to Portland for an afternoon of shopping in a city where £1 buys $2.03, and there&#8217;s no sales tax. See you when I get back!
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2007/10/17/sosp-2007-day-3/feed/</wfw:commentRSS>
		</item>
		<item>
		<title>SOSP 2007: Day 2</title>
		<link>http://www.mrry.co.uk/blog/2007/10/16/sosp-2007-day-2/</link>
		<comments>http://www.mrry.co.uk/blog/2007/10/16/sosp-2007-day-2/#comments</comments>
		<pubDate>Tue, 16 Oct 2007 16:24:00 +0000</pubDate>
		<dc:creator>Derek Murray</dc:creator>
		
	<category>Uni</category>
	<category>Technology</category>
		<guid isPermaLink="false">http://www.mrry.co.uk/blog/2007/10/16/sosp-2007-day-2/</guid>
		<description><![CDATA[I&#8217;ll skip doing a review of the banquet last night, so it&#8217;s on with the papers!

Software Robustness
Bouncer: Securing Software by Blocking Bad Input

Filters block bad input before it is processed. Low overhead and no false positives. Programs can keep running even when under attack.
Performs symbolic execution of programs to perform analysis and generate input filters. [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ll skip doing a review of the banquet last night, so it&#8217;s on with the papers!</p>
<p><a id="more-27"></a></p>
<h2>Software Robustness</h2>
<h3>Bouncer: Securing Software by Blocking Bad Input</h3>
<ul>
<li>Filters block bad input before it is processed. Low overhead and no false positives. Programs can keep running even when under attack.</li>
<li>Performs symbolic execution of programs to perform analysis and generate input filters. Works on machine code.</li>
<li>Pre-condition slicing: compute a subsequence of trace instructions, executing which is sufficient to create a vulnerability. Find instruction with vulnerability (e.g. call to sprintf), then move backwards through the trace, adding the instructions on which the vulnerable instruction depends.</li>
<li>Deployment in a distributed scenario. When an exploit is found, run Bouncer, calculate an improved filter, and deploy this. Or run centrally on a cluster to find exploits in parallel.</li>
<li>Nirvana generates traces. Phoenix implements slicing.</li>
<li>Evaluated against four real vulnerabilities (including the Slammer worm). Ran experiment for 24 hours.</li>
<li>Filters have no false positives. Found perfect filters for two vulnerabilities. Some false negatives for two others.</li>
<li>Throughput is much closer to 100% when using filters, compared to having to restart on each attack. Can still maintain 80% throughput with 18000 attack probes per second.</li>
</ul>
<h3>Triage: Diagnosing Production Run Failures at the User&#8217;s Site</h3>
<ul>
<li>What to do with a production run failure (send error report)? At present, we just send a core dump, which takes a lot of manual effort to diagnose. At the same time, these failures cause damage to end users.</li>
<li>Reproduction is hard, because user environment is unique. Furthermore, there are privacy concerns.</li>
<li>Core dumps tell you what the failure is. Bug detection tells you about some errors. Diagnosis is a means of tracing back to the underlying fault, but existing tools are offline (e.g. requiring programmer input).</li>
<li>To diagnose: (i) need information about the failure (fault, error, propagation tree), but off-site techniques don&#8217;t work on-site; (ii) need guidance about what to do next, the kind of analysis to perform, the interesting variables to investigate (but there is no programmer to do this, so decisions must be taken automatically); (iii) need to be able to test &#8220;what-ifs&#8221;, such as changing input and seeing what happens, but most techniques focus on replaying the same thing.</li>
<li>Triage enables on-site diagnosis, using systems techniques to make offline analysis tools useful on-site, addressing the three challenges. It introduces a new technique called delta analysis. Tested in a human study with real programmers and real bugs.</li>
<li>Uses checkpointing and re-execution to capture the bug. Allows replaying the failure repeatedly, using different analysis tools. Little overhead in the normal-run case. Checkpoints every 200ms, keeping 20 previous checkpoints, in memory to lower overhead.</li>
<li>Uses a human-like &#8220;diagnosis protocol&#8221;. Repeated replay enables an incremental diagnosis. e.g. If the bug doesn&#8217;t always repeat, it may be a race condition.</li>
<li>But the replay isn&#8217;t identical: it can either be deterministic (plain), loose (tolerating some variance, such as the introduction of analysis tools which change the syscalls) or wild (including potentially large variations, maybe changing input or the code itself). Delta analysis is similar to diffing a pair of (successful and failing) runs. Triage doesn&#8217;t need to be safe when it replays execution: it only cares about why the bug happens.</li>
<li>Failure analysis variety of methods (bounds checking, taint analysis, symbolic execution, etc.) and delta generation (rearrange allocations, drop inputs, mutate inputs, drop code(!)).</li>
<li>Delta analysis: compute the basic block (of code) vector (1 if executed, 0 if not), and subtract them. The two closest runs are diffed (finding the edit distance and shortest edit script). So a basic block that differs could for example be to return NULL, and it is then dereferenced.</li>
<li>Human study: 15 programmers from faculty, researchers and grad students. Measured times to repair bugs with and without Triage. (People got core dumps, sample inputs, instructions for how to replicate, and access to debugging tools.) The evaluation didn&#8217;t expect the participants to work out how to replicate the bug, and set a maximum time.</li>
<li>47% time saving on diagnosing real bugs, significant with probability of null hypothesis < 0.01%.</li>
</ul>
<h3>/* iComment: Bugs or Bad Comments? */</h3>
<ul>
<li>Many bugs are due to mismatches between the code and the programmer&#8217;s assumptions. Such as assuming that a lock is held when a piece of code is executed. Comments express these assumptions. 20% of Linux is comments (excluding blank lines and copyrights), but they are not utilised by compilers or bug detection tools.</li>
<li>However, comments are imprecise, compared to code, and cannot be tested. Therefore they tend to become less reliable as software evolves. Nevertheless, they are easier to understand for programmers. So wrong comments are likely to mislead programmers. Difficult to infer assumptions from source code (but see the MUVI paper from yesterday).</li>
<li>Mismatches indicate either bugs or bad comments. Challenge is to understand the natural language of comments. NLP techniques are not enough: POS tagging (97% accurate), chunking (90%) and semantic role labelling (70%). But they assume correct grammar, which comments don&#8217;t always have.</li>
<li>Therefore, also use machine learning, staistics and program analysis (with NLP). 1832 rules extracted, detecting 60 new bugs or bad comments (19 of these confirmed). Looking at call and locking bugs only.</li>
<li>Focus on the comments that contain rules (rather than explaining code), as these are more likely to be inconsistent with the code.</li>
<li>Example rule:  must be held before entering .</li>
<li>Comment classifier is trained using a decision tree building algorithm. Some manual training is required (at least once per topic).</li>
<li>Validated using Linux, Mozilla, Wine and Apache, looking at lock- and call-related topics. Overall, 60 mismatches, 33 (12 confirmed) bugs, 27 (7) bad comments and 38 false positives, from 1832 rules. Training accuracy is between 90 and 100% (for lock-related comments). Cross-software accuracy is between 78 and 90% (saves the amount of training).</li>
</ul>
<h2>Distributed Systems</h2>
<h3>Sinfonia: A New Paradigm for Building Scalable Distributed Systems</h3>
<ul>
<li>The protocols in current distributed algorithms are complicated, error-prone, and often difficult to understand. We want to avoid such protocols. Focus is on data centre systems, with small and predictable latencies, and occasional crashes. Focus also on infrastructure applications, like cluster file systems, lock managers and communication services.</li>
<li>Sinfonia is a data sharing service, comprising a set of memory nodes which each export a linear address space (no structure imposed). Protocol design becomes the easier problem of shared data structure design. Minitransactions are used to access the memory nodes.</li>
<li>Minitranscations have ACID properties, balance power and efficiency. Reduce network round-trips, but are still flexible and easy to use. Semantics: perform a bunch of compare items (for equality over an address range), if all of these match then retrieve the read items and perform the write items. Execution of the transaction is piggybacked on the two-phase commit process, to minimise round-trips.</li>
<li>Commit coordinator runs at the application, which may crash, so a new two-phase commit protocol was necessary. Based on all memory nodes logging a &#8220;yes&#8221; vote, rather than the coordinator logging a &#8220;commit&#8221; decision.</li>
<li>Applications: sinfoniaFS (a cluster file system) and sinfoniaGCS (a group communication service - i.e. a chatroom with ordered notifications).</li>
<li>sinfoniaFS performs as well as LinuxNFS in the Andrew benchmark. sinfoniaGCS scales better than Spread.</li>
</ul>
<h3>PeerReview: Practical Accountability for Distributed Systems</h3>
<ul>
<li>Fault in a distributed system with distributed state can lead to incomplete information. In the general case, there could be multiple admins with different interests (and possible malicious intent).</li>
<li>Many faults are not fail-stop but instead change behaviour of a node. Dealing with general faults is difficult, because of a lack of control over the system. How can faults be detected or the faulty nodes identified. Then how do you convince others that a node is faulty (or not faulty) and must be fixed.</li>
<li>Real-world systems rely on accountability. e.g. signed receipts, double-entry bookkeeping and auditing. These can be used to detect, identify and convince about faults.</li>
<li>A fault is when a node deviates from expected behaviour. We want a system that generates a proof of misbehaviour against a faulty node. Difficult if we consider a fault that affects only a node&#8217;s internal state (requires an online trusted probe at each node). So we focus on observable faults: those that causally affect a correct node, and don&#8217;t require a trusted component.</li>
<li>Verifiable evidence: a proof of misbehaviour or a challenge that a faulty node cannot answer.</li>
<li>Accountability: whenever a fault is observed by a correct node, the system eventually generates verifiable evidence against a faulty node.</li>
<li>Implementation: as a library, PeerReview. Assumes that nodes modelled as a collection of deterministic state machines, with each node holding a reference implementation of all state machines. Also that correct nodes can eventually communicate and nodes can sign messages.</li>
<li>All nodes log all inputs and outputs. Each log is audited periodically by witnesses (a set of nodes for each node). If misbehaviour is detected, evidence is generated and distributed to other nodes.</li>
<li>Log entries form a hash chain to prevent forgery of a log. Keeping multiple logs or forking logs is prevented by checking the signed hashes to ensure that a single linear log is kept. Faults in a log are recognised by replaying the state machine with the known inputs and checking outputs against the log.</li>
<li>PeerReview guarantees that faults will be detected and good nodes cannot be accused.</li>
<li>Applicability: NFS server in the Linux kernel, overlay multicast, and P2P email system. Deals with small latency-sensitive requests, large transfers, and complex, large and decentralised systems.</li>
<li>Dominant cost depends on the number of witnesses per node.</li>
<li>Normal P2P email scheme scales as O(log n) with n being the number of nodes. PeerReview can scale O((log n)^2) if you accept a 99.999% probability of detecting faults. (100% accuracy is prohibitive (linear, I think)).</li>
</ul>
<h3>Dynamo: Amazon&#8217;s Highly Available Key-Value Store</h3>
<ul>
<li>Amazon architecture: loosely coupled SOA, with stateful services that manage their own state. Requirements on latency, availability and scale.</li>
<li>State management is the primary factory affecting scalability and availability. Data is only ever accessed by primary key, in self-describing blobs, so key-value is better than RDBMS (don&#8217;t need a query optimiser, triggers, etc.). RDBMS scale up, not out, and (strong, transactional) consistency is less important than availability.</li>
<li>Dynamo requirements: always writable (accept writes during failures), e.g. to add an item to a shopping cart must always be possible; user-perceived consistency (so can&#8217;t be too weak on consistency); performance with latency in the 99.9th percentile; low cost; incremental scalability; and tunable knobs for cost, consistency, durability and latency. No production-ready systems met these requirements</li>
<li>Replicated DHT with consistency management, using: consistent hashing, optimistic replication, sloppy quorum; anti-entropy mechanisms; and object versioning.</li>
<li>DHT is similar to Chord, using MD5 as the hash function. Clique for routing between replicas. Each storage node given many locations on the ring for load-balancing. Trade-off between balancing load and keeping membership tables concise.</li>
<li>Configurable for different scenarios e.g. consistent, durable, interactive user state; a high-performance read engine; or a distributed web cache.</li>
<li>Able to meet a 500ms SLA on latency, for the implementation of a shopping cart.</li>
<li>Drawbacks: no indefinite scaling, not appropriate if transactional semantics required, more challenging to program than using ACID properties.</li>
</ul>
<h2>System Maintenance</h2>
<h3>Staged Deployment in Mirage, an Integrated Software Upgrade Testing and Distribution System</h3>
<ul>
<li>Typically test on a couple of vendor machines, then a bunch of beta tests in the outside world, but despite this, the software will still fail on user machines in the outside world (due to dependencies, odd configurations, etc.). Mirage enables vendors to cluster outside world machines in terms of the their configurations/environment, so as to ensure coverage of beta testing before public release (and hence improve reliability). This reduces &#8220;upgrade overhead&#8221;: the number of machines that fail.</li>
<li>Environment must be identified and fingerprinted, before clustering. Then the order and speed of deployment must be chosen by the vendor.</li>
<li>In a cluster, all machines &#8220;behave identically wrt an upgrade&#8221;. Benefit depends on quality of clustering and the quality of testing.</li>
<li>Identification: instrument the application to determine the resources (libraries, config files, environment variables and dependent applications) that it uses during a normal run. Filter out noise (log files, temp files and data) based on heuristics and vendor-provided rules. A library becomes a (name, version, hash of binary) tuple. The hash is used in case there are different builds, for example.</li>
<li>Tradeoff between low upgrade overhead (few failures) and fast deployment. Choice between deploying in parallel to many clusters (fast, but high overhead), or sequentially (slow, but low overhead). Fixing a problem for one environment might also fix it for another environment (benefit of serialisation).</li>
<li>Evaluated the quality of clustering (identification and clustering), and staged deployment (the upgrade overhead and deployment speed).</li>
<li>Accurate identification: between 200 and 850 environment resources (on firefox, apache, php and mysql), between 0 and 7 vendor rules, yielded 0 errors in any case.</li>
<li>Accurate clustering: 21 different environments, with 2 distros of linux, PHP and Apache and various MySQL configurations. Yielded 0 misplaced machines within 15 clusters. Varying the fingerprinting granularity can vary the number of clusters.</li>
<li>Controlling the tradeoff: simulation with 100,000 machines, 3 problems and 2 staging protocols. Upgrade overhead reduced very significantly, with, in the worst case, a 25% increase in deployment time.</li>
<li>Studying &#8220;real machines&#8221; and deployments is the subject of future work.</li>
</ul>
<h3>AutoBash: Improving Configuration Management with Operating System Causality Analysis</h3>
<ul>
<li>Current approach to configuration management is to ask friends, search online, read manual, try potential solutions. Frustrating, time consuming and tedious! Hard to undo a wrong solution or know if the problem was solved. A solution can cause new problems.</li>
<li>AutoBash: tries many solutions at once, provides an undo capability, explains to the user how it solves a problem and automatically runs regression tests.</li>
<li>When a problem is detected, there are two modes: Replay mode which automatically searches for a solution; and Observation mode which helps the user to fix the problem. All the time, there is a Health monitoring mode.</li>
<li>Observation mode: a modified bash where user tries to fix a problem by typing a command then testing if the app works, then possibly undoing the command and rolling back the command, before trying again. Has &#8220;predicates&#8221; which test if an application functions correctly and returns true iff the test passes. Idea is for application to be shipped with a testsuite of predicates. Speculator (SOSP&#8217;05) is used to perform process-level speculative execution, and make predicate testing safe (undoable). Shell provides a rollback command, used in conjunction with the speculative execution, to undo the attempted command.</li>
<li>Regression testing is handled by running all predicates in some database. This can be slow, however, so causality tracking is used to find which predicates might be affected by a change.</li>
<li>Tracking causality: Output set of kernel objects that an action causally affects, and Input set of kernel objects on which a predicate causally depends. If the output set of an action and input set of a predicate intersect, then the predicate must be checked. The output set can be tracked by syscall tracing, and then trimmed by excluding temporary objects (such as processes, or temporary files). Similarly, this can be done to determine the input set of a predicate.</li>
<li>Generating a causal explanation: the important actions are the ones whose output sets intersect with the input sets of the newly-passing predicates.</li>
<li>Replay mode: assume that vendors ship &#8220;common solutions&#8221; with their software, so that these may be automatically tested (safely, using speculative execution) in the background, while the user gets on with work.</li>
<li>Evaluation: overhead of speculative execution and effectiveness of causality analysis? Looked at CVS, GCC cross compiler and webserver. Created 10 bugs and 10 solutions.</li>
<li>Very little overhead (non-significant) of speculative execution. Causality analysis almost halves the total replay time, with a slight increase in the predicate testing time.</li>
</ul>
<h2>Energy</h2>
<h3>Integrating Concurrency Control and Energy Management in Device Drivers</h3>
<ul>
<li>Concentrating on wireless sensor networks. Concurrency of I/O operations are considered (synchronous versus asynchronous). Energy management is considered in terms of the necessary power state of devices to perform I/O.</li>
<li>The more workload information the app can provide the scheduler, the less energy needs to be used.</li>
<li>Hard to tell OS about application workloads: API extensions for hints to energy management. Done for CPU voltage scaling and disk spin-down. Sensor networks need a unique solution, however: harsh requirements, small power source, and need to run unattended for periods ranging from months to years. 1st generation OSs push all decisions to the applications.</li>
<li>ICEM: a driver architecture that automatically manages energy, implemented in TinyOS 2. New concept of Power Locks: split-phase (asynchronous) locks with integrated energy and configuration management. Dedicated, shared and virtualised drivers and a component library for building them.v Energy efficiency is 98.4% of hand-tuned implementation. Also much easier to program.</li>
<li>Case study on the TMote platform: three interfaces to six total devices. Logging application with producer, which writes sensor samples out to flash every 5 minutes, and a consumer that sends all samples from flash over the radio every 12 hours. Code complexity of hand-tuned implementation is very complex: need to order the power-on and -off of each different device. ICEM hides these calls, greatly simplifying the code.</li>
<li>Split-phase I/O operations: single thread of control, where read() call is given a callback, readDone(), that allows the driver to poke the application when it is done. These can then be scheduled efficiently, because the driver has information about what has to be done before it considers switching the device off.</li>
<li>Virtualised driver: only a functional interface. Assume multiple concurrent users, buffering requests for concurrency management, and managing energy by looking at pending requests. Such drivers have longer latencies, because the underlying device has to have synchronous access. Suitable for high-level hardware (like a block device?)</li>
<li>Dedicated driver: functional and power control interfaces. A single user only with no concurrency control and explicit energy management. Suitable for low-level hardware.</li>
<li>Shared driver: functional and lock interfaces. Multiple users, explicit concurrency control (using the lock), and implicit energy management based on pending requests.</li>
<li>Power locks: hardware-specific configuration and power interfaces, and a lock interface. Talk to a dedicated driver. First, request access to the lock, which powers on the underlying device and configures it and calls back the application (split-phase). Other locking requests are queued after the current user. Power down only happens when all pending lock requests have been fulfilled and released.</li>
<li>Arbiter: performs concurrency control and queuing/scheduling (FCFS and round-robin). Configurator: calls h/w-specific configuration interface on the dedicated driver. Power Manager: powers down device when it falls idle, and powers it back up when a new lock request comes in (with immediate and deferred policies).</li>
<li>Evaluation: comparing hand-tuned, ICEM, optimal serial and worst-case serial orderings, in the sensor logging application described above. ICEM performance very close to hand-tuned, and better than both serial versions.</li>
<li>ICEM overhead is 5.60% of total sampling energy. Node lifetime for ICEM is between 98.4% and 100% of the hand-tuned version, as the sampling period is increased.</li>
</ul>
<h3>VirtualPower: Coordinated Power Management in Virtualized Enterprise Systems</h3>
<ul>
<li>Two key benefits of integrated power management: power savings are proportional to cost savings, and cooling capability is proportional to rack utilisation. Information and capabilities exported ACPI have led to application-specific power management policies. How do we do this in the context of virtualisation?</li>
<li>Problems: what should be exposed? Are there issues with isolation? Are there power benefits arising from resource sharing? VPM can give a 34% power improvement.</li>
<li>Data centres have a heterogeneous collection of platforms. Virtualisation should abstract this (e.g. because of migration), but this is tricky with different power characteristics in the underlying hardware. The workload stays the same, so we should virtualise the power management states: therefore we can have a consistent view of manageability across migrations.</li>
<li>What about isolation? Disassociate changes to virtual state from the management of the physical state: front-ends in DomUs, back-end in Dom0, the latter of which manages the physical state. Notion of soft scaling which allows the possibility of &#8220;scaling&#8221; a virtual CPU, which enables consolidation of soft-scaled virtual resources. (Two soft-scaled VCPUs on a single full-power CPU, but you might be able to turn off another physical CPU, for example.)</li>
<li>VPM rules in Dom0 which control the physical PM. Virtualisation layer policies are driven by VPM state requests from the VMs, which makes an implicit feedback loop. Might need some learning algorithms to derive rules.</li>
<li>e.g. Dom1 changes VPx state via a hypercall and creates a VPM event (comprising the VM soft state, and a shadow state (the physical manifestation of the current soft state). This leads to a new set of shadow states being calculated.</li>
<li>Workloads: tiered web service (RUBIS) (Linux on-demand gonvernor); transactional workload (select policy based on the transaction processing rate and the amount of slack); web service with quality of information metric (Travelport (Worldspan)) (policy based on the QoI and processing time of requests across different client classes).</li>
<li>Complex analysis has up to 4% degradation on throughput. Seems to work with multiple VMs too. [Lots of fairly convincing graphs but I think it&#8217;ll be necessary to take a good look at the paper to judge what we&#8217;re actually being presented.]</li>
</ul>
]]></content:encoded>
			<wfw:commentRSS>http://www.mrry.co.uk/blog/2007/10/16/sosp-2007-day-2/feed/</wfw:commentRSS>
		</item>
	</channel>
</rss>
