Archive for the ‘Data Compression’ Category

On Wookiees and providing unreasonable customer service.


It’s been a few weeks since I last blogged and I bet you thought I’d forgotten about you .. or was out frolicking in the snow with the other Storage Wookiees.

No, sorry .. wrong on both counts.  It’s just that Q4 really got me this year, but I’m back now so whether you’re cheering, groaning, or deleting .. I’ll be blogging regularly again.

Now .. there is no getting away from the fact that it is Q4 and, for many of us, there can be quite a bit of stress as we seek to close the year both personally and professionally.  But I have a confession to make .. I really love this time of year.

There are the Chrimbo parties and lunches when you get to meet up and spend time with people that perhaps you haven’t seen all year, the Hanukkah Armadillo, a snowcalypse and Arctic temperatures that bring the UK to a grinding halt .. I mean, hey .. what’s not to love?  London will be back to normal by mid January, and I hear Scotland is due to reopen in June.  And my favourite part of this time of year?  No, not an opportunity for me to use my fireplace app on my iPad. And if you don’t have an iPad, I don’t want you to feel left out so here you go.  Although with the way Adobe Flash lights up a laptop CPU you may not even need a fireplace to keep warm whilst you’re watching that.

But I digress.  My favourite part of this time of year is .. the Starbucks Gingerbread Latte.  And when I say favourite, I mean deep down and dirty wookiee love with a capital L .. L-O-V-E the Starbucks Gingerbread Latte.  If you don’t like Starbucks and think they are overpriced coloured water merchants, fine .. but I’m here to tell you that I am big time into their Gingerbread Latte.  And as a wise man once said .. ‘Let the wookiee win.’

But since it’s that time of year, here’s a video of Chewbacca singing ‘Silent Night’ just so there’s no hard feelings.

I went in to Starbucks to buy my first gingerbread latte of the season on the first day they were officially available in the UK .. I probably shouldn’t mention that I also buy the syrup from Starbucks so I can make them at home with my Siemens Nespresso machine .. but, to be fair, they really only taste completely ‘proper’ during the season and from a Starbucks.

And when I went to buy my latte, I got a very pleasant surprise.  The barista, whom I know quite well due to my somewhat severe Starbucks addiction, offered me a Starbucks card.  Why would I want that?, asked I .. It isn’t a credit card so why should I prepay you now for my future lattes?

Because you get free extra shots of coffee, soya milk, flavoured syrup .. and *drum roll* Wifi in all of our stores.  Free.  So that £4.05 grande triple shot sugar free hazelnut latte now costs about £3.50.


What’s this got to do with Data Storage & Protection?

The industry market is quickly consolidating through merger and acquisition .. the most recent being the acquisition of Compellent by Dell, but there have been dozens over the past eighteen months .. indeed, too many to list here.

What has become frightfully obvious through much of this M&A activity is that asking the question ‘is it a product or a feature’ has never been more important as what were once products .. e.g. Data Domain and Diligent with data deduplication, or Storwize and Ocarina for data compression .. were swiftly acquired by much larger storage companies like EMC, IBM, and Dell as they surmised that technologies such as dedupe and compression were actually features and not really products in their own right.

In other words, why not include dedupe, compression, thin provisioning, and automated tiering in the storage array(s) themselves as opposed to individual stand alone products?

Equally, the market seems to be segmenting into three distinct customer behaviours for data storage:

1. Let’s Optimise the Lot – I’m prepared to explore internal change and IT process evolution in the pursuit of lower IT costs and increased business agility, so perhaps a virtual datacentre [VDC] is the right solution as opposed to siloed solutions of storage plus server plus network and so on.  Put more simply .. optimise EVERYTHING, not just storage.

2. Let’s Optimise the Storage – I may want to optimise the lot in the fullness of time, but right now I need to optimise my data storage to reduce my storage costs specifically.

3. I Need a Bucket – I will optimise my storage and possibly everything else when I can, but right now I need an efficient and cost effective bit bucket.

Now, at Computacenter we have solutions which credibly and competitively address each of these areas, but this is where the lattes come in, in my opinion.

Just like Starbucks have realised that they need to provide unreasonable customer service to continue to get people to buy their coffees .. i.e. give away much of what, perhaps, their competition wouldn’t .. so too must we consider what we could do to give unreasonable customer service to our customers not just in Q4, but throughout the year.

If I’m buying a bucket .. what options and features might be available to me that I haven’t asked you for?  Might I be able to get data dedupe, data compression, or automated tiering to make my purchase even more cost effective?  And how will my purchase enable me to either optimise my storage and/or entire IT infrastructure next year?

If I’m looking to optimise the storage .. what vendor partners include the features as part of the array price versus those, for example, who will make me pay more just to automate the tiering of my data?  How can I get the most storage optimisation per spent pound?  And how can I leverage my optimised storage purchase when I seek to optimise the lot next year?

If I’m looking to optimise the lot .. how will my purchase enable me to connect to external service providers in the future?  How will I be able to retain my structured data internally in a fully optimised state whilst shipping out my unstructured data to an external service provider .. safely, reliably, and securely?

Starbucks with the freebies offered with their Starbucks card is but one example of offering unreasonable customer service .. I think Amazon including free 3G on their Kindle as part of the £149 purchase price is another.

I’m committed to helping Computacenter be another such example, so please feel free to contact me or Bill McGloin if we can help provide you unreasonable customer service either now in the last days of Q4 or in 2011.

For now, here’s a video of Chewbacca singing the blues .. which is what I hope Australia will be doing when we win the Ashes.

Look out next week for my 2011 predictions and a Happy Christmas/New Years to you and yours.



Come on baby compress my data! [with apologies to The Doors].


UPDATE 29 July 2010 – IBM enter into a definitive agreement to acquire Storwize.

If there is one thing in the world that absolutely makes my teeth itch and I would pay just about anything not to have to do, it is packing and unpacking for extended trips.  It would seem that I am not the only one as, prior to the recession, there were companies popping up which would come to your house, pack for you, and then ship your bags to your holiday destination …where one of their representatives would unpack for you!  Probably not a sustainable business model as they’ve since disappeared, but it was an intriguing idea …and, whilst pricey, still cheaper than the live-in butler I’ve always secretly wished for.

Extravagant you say?  Perhaps, but it could help avoid the inevitable rows in Casa PL as, whenever Mrs. PL and I go on hols with PL Junior, we end up having a very full and frank discussion regarding how much we need to take.

I would be more than happy to go on holiday with nothing more than a carry on.  Now that I have my geek lair at home set up such that I can access my personal data from anywhere with a WiFi connection, all I really need for a fortnight’s holiday is my wash bag, MacBook Air, iPod touch, Sony eBook Reader …and possibly a couple of pairs of knickers, T-shirts, and shorts.  I’d of course wash them well prior to them standing up and walking on their own.  Mrs. PL rolls her eyes, notes my objections to wanting to take anything more than this …and proceeds to tell me not to be ‘ridiculous’ and get on with packing what seems to be every stitch of clothing I’ve ever owned.  And don’t get me started on what Mrs. PL ends up packing for herself and PL Junior.  Do you really need to pack clothing which you ‘might feel like wearing’?  Nor do I think it the remotest possibility that Her Highness will have selected the same resort in Malta and invite us round for high tea, thus necessitating us to pack our finest …’just in case’.  But, as with all disagreements in Casa PL, Mrs. PL humours me just long enough for me to realise that she is right, state ‘yes dear’ …and just get on with it.

In fairness, there has been a bit of a truce called on this front and a reasonable compromise struck.  We now use vacuum bags to compress our packing and thus fit 25%-40% more than we could have otherwise.  Et voilà, Mrs. PL gets to take virtually our whole wardrobe …just in case …although the toothpaste made rather a mess when it got compressed this year.

What has this got to do with Data Storage and Protection?

Data deduplication has been a very prevalent buzz word in the storage industry for the past few years with the major vendors scrambling to introduce deduplication into their solutions through either invention or acquisition.  The IBM acquisition of Diligent in April 2008 for $200 million and the very public tussle in July 2009 between EMC and NetApp over the acquisition of Data Domain …with EMC eventually winning but at a costly $2.4 billion …are among the more interesting.

Why the rush and what would cause a $2.4 billion struggle?  Well, just as I’m not over the moon about taking everything we own on holiday and would prefer to leave the unneeded bits and bobs at home, our customers have a similar challenge as data storage requirements have continued to grow and, by extension, so too has the need to back up that data.  Problem is, not only are we storing lots of duplicate and dormant data …when we try to back it up we can see both the time to back up and, perhaps more importantly, the cost to back up …rise exponentially.  Data deduplication allows us to quickly investigate the data to be backed up at the block level …the zeroes and ones of data, essentially, as opposed to the file level, i.e. a ‘PPT’ or ‘Word’ document …and when we see a non-unique series of zeroes and ones, we can ‘drop’ them but leave a reference to where a future user can find the series of unique zeroes and ones.  With industry standard deduplication ratios of 40% …and many customers achieving much higher ratios of 60% or even 80% …data dedupe can have a hugely positive impact on a customer’s backup infrastructure by significantly reducing the amount of data storage and time required to back up data.  As a technology, data dedupe has one of the quickest ROIs and demonstrable cost benefits …great for us as we use our equation of ROI + CBA + DPB = CSS to show customers how we can save them dosh not just now, but for years to come.

But.  There’s always a but, isn’t there?  Some have openly questioned what the performance impacts would be if we then had to restore the data we have deduped.  Sometimes known as ‘rehydration’, I do think that it is indeed possible …nay, probable …that it will take a bit longer to restore deduped data as opposed to bog standard backups.  To my mind the cost benefits far outweigh any potential performance impact on restoration, so I believe that this risk can be mitigated by ensuring that our customers reset their service level agreements internally such that any added restoration time is expected and catered for.
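For the curious, here is a minimal Python sketch of block-level dedupe and rehydration as described above.  It is an illustration only and makes some loud assumptions: fixed 4 KB blocks, SHA-256 fingerprints, and made-up names (dedupe, rehydrate, store, recipe).  None of this is claimed to be how Data Domain, Diligent, or any other vendor actually implements it.

```python
import hashlib

BLOCK_SIZE = 4096  # assumption: fixed-size blocks; real products vary


def dedupe(data: bytes, block_size: int = BLOCK_SIZE):
    """Split data into blocks and keep only one copy of each unique block."""
    store = {}    # fingerprint -> the single stored copy of that block
    recipe = []   # ordered fingerprints needed to rebuild the original
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        fingerprint = hashlib.sha256(block).hexdigest()
        if fingerprint not in store:
            store[fingerprint] = block   # first sighting: keep the block
        recipe.append(fingerprint)       # always keep the reference
    return store, recipe


def rehydrate(store, recipe) -> bytes:
    """Rebuild the original data by following the references in order."""
    return b"".join(store[fingerprint] for fingerprint in recipe)


# Hypothetical backup stream: ten identical blocks plus two unique ones
backup = b"A" * BLOCK_SIZE * 10 + b"B" * BLOCK_SIZE + b"C" * BLOCK_SIZE
store, recipe = dedupe(backup)

stored_bytes = sum(len(block) for block in store.values())
saving = 1 - stored_bytes / len(backup)
print(f"{len(recipe)} blocks referenced, {len(store)} actually stored")
print(f"space saved: {saving:.0%}")
```

In this toy example twelve blocks shrink to three stored blocks plus a list of references, and rehydrate() has to look up every one of those references to rebuild the data, which is where any extra restore time would come from.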

But.  There’s that word again!  But if data deduplication is so great for backup, why wouldn’t we just go ahead and introduce dedupe into primary storage?  In other words, why stop there …why not have dedupe in our SANs and NAS?

Perhaps, although I’m not convinced this is the most appropriate way forward.  If we anticipate performance degradation when we rehydrate deduped data during data restores from backups, should we not also expect some performance impact if we introduce data dedupe into primary storage?  Yes, I think we should.  Indeed, data dedupe is effectively changing the data in that non-unique zeroes and ones are dropped and replaced by a much smaller ‘reference’ to the unique zeroes and ones so it would stand to reason that there would be some performance impact during future host access to data.

But let’s not throw the baby out with the bathwater …we could still get the ROI and CBA benefits of deduplication without changing the data.  Enter data compression for primary storage.

Just as Mrs. PL gets more packing space when we go on hols by using vacuum bags …and you get more space by using ZIP files and compression on your PC hard drive …so too can we conserve data space in primary data through compression.  Put simply, whilst data deduplication uses an algorithm to ‘drop’ non-unique zeroes and ones, data compression also uses an algorithm to compress non-unique data blocks.  I think it less likely for there to be a performance degradation in using compression as we’re not ‘changing’ the data, but merely compressing it.
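If you fancy seeing the vacuum bag in action, here is a small Python sketch using the standard library’s zlib module.  The sample data is invented and deliberately repetitive, so the saving it shows is an assumption of the example and not a prediction for real primary storage; it also says nothing about how Storwize or anyone else compresses data in their products.

```python
import zlib

# Hypothetical, highly repetitive 'primary storage' data
original = b"quarterly sales report, region EMEA\n" * 1000

compressed = zlib.compress(original, level=6)   # squeeze it down
restored = zlib.decompress(compressed)          # and get it back again

saving = 1 - len(compressed) / len(original)
print(f"{len(original)} bytes -> {len(compressed)} bytes, saving {saving:.0%}")
print("round trip intact:", restored == original)
```

The point of the round trip check is that what comes out of the bag is byte for byte what went in.  Real-world savings depend entirely on how repetitive the data is.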

One of the companies I’m watching in this space is Storwize.  Storwize have data compression products which can compress data with NAS devices, and often see ratios which aren’t dissimilar to data dedupe …40% or more of duplicate data compressed, in other words.  I am expecting them to be bringing out products in the near future which will allow for compression with SAN products …imagine reducing a corporate datacentre by ⅓ or more in a non-disruptive manner and you can see why I’m so excited by the prospect of saving our customers money through data compression within primary storage and data deduplication in backup.

Have a great weekend.

