<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Sentrana Blog &#187; Tech Trends</title>
	<atom:link href="http://blog.sentrana.com/category/tech-trends/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.sentrana.com</link>
	<description>Turning complexity into competitive advantage</description>
	<lastBuildDate>Thu, 12 Jan 2012 15:38:09 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Before You Build, Ask the Right Questions</title>
		<link>http://blog.sentrana.com/2011/06/30/before-you-build-ask-the-right-questions/</link>
		<comments>http://blog.sentrana.com/2011/06/30/before-you-build-ask-the-right-questions/#comments</comments>
		<pubDate>Thu, 30 Jun 2011 17:42:10 +0000</pubDate>
		<dc:creator>Katrina Lamb</dc:creator>
				<category><![CDATA[Managers View]]></category>
		<category><![CDATA[Tech Trends]]></category>
		<category><![CDATA[asking the right questions]]></category>
		<category><![CDATA[data architecture]]></category>
		<category><![CDATA[data granularity]]></category>
		<category><![CDATA[data infrastructure]]></category>
		<category><![CDATA[data integrity]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[organizational capabilities for data management]]></category>

		<guid isPermaLink="false">http://blog.sentrana.com/?p=565</guid>
		<description><![CDATA[An Approach for Robust Data Management
Building a robust data management environment is in many ways like building a house. There are three components to building a good house. First of all, there are some fundamental questions you need to ask before doing anything. Why are you building the house in the first place? What are [...]]]></description>
			<content:encoded><![CDATA[<p><span style="color: #333333;"><strong>An Approach for Robust Data Management</strong></span></p>
<p>Building a robust data management environment is in many ways like building a house. There are three components to building a good house. First of all, there are some <span style="text-decoration: underline;">fundamental questions</span> you need to ask before doing anything. Why are you building the house in the first place? What are the important goals and benefits you want to enjoy? What other things are you willing to trade off to realize those benefits? Asking and answering those questions will help with the second component: <span style="text-decoration: underline;">building a model</span>, or architectural blueprint. There are many different ways to build a house (or a data management system). Not all of them will be right for the needs you have in mind. There are efficiencies to designing and building in certain ways – and, as always, there are trade-offs with any given choice. Finally, once you have established a workable model, it’s time to <span style="text-decoration: underline;">build out the infrastructure</span>. That starts with the <em>plumbing</em>. Nothing else in the house is going to work well without good plumbing which, seamlessly and unobserved, harnesses the flow of water (or data, in our analogy) to efficient uses as and when needed. Then comes the <em>foundation</em> – the platform to support the house according to your model. Think of the plumbing and the foundation as the transmission pipes, the controls to regulate the flow of information, the storage repositories and the other critical supports for your data management platform.</p>
<p><em> </em></p>
<p><em> </em></p>
<div class="wp-caption alignleft" style="width: 327px"><em><img src="http://www.westernjournalism.com/wp-content/uploads/2011/04/blueprint.gif" alt="a blueprint is an architectural model" width="317" height="240" /></em><p class="wp-caption-text">robust systems need good blueprints</p></div>
<p><em><br />
</em></p>
<p><em> </em></p>
<p><em>Asking the Questions that Matter for You</em></p>
<p>It’s hard to imagine that someone would build a home without first asking and answering some basic questions about what purpose the home is meant to serve. But all too often enterprise managers think of their data intelligence needs in terms of generic, one-size-fits-all products and solutions. They may be driven by the perceived urgency of getting immediate results, so they do not put the extra time into thinking through all the details that have to be in place in order for a solution to best meet their targeted needs. They build up organizational IT resources but fail to adequately integrate these resources into business decision-making processes so that business goals and technological capabilities are aligned. By not asking the right questions up front, managers increase the likelihood that their IT investment will fail to achieve the specified goals. <span id="more-565"></span></p>
<p>The place to start is by asking what kind of business you are. What are the core elements of your business strategy? What kind of information – about customers, products, channels, territories, campaigns, transaction dates and so on – do you need to pull into your organization every day in order to make decisions that help accomplish that strategy? What performance measures do you seek to optimize – for example gross margin, market share or something else? Is there a single overriding imperative for all territories or are there more nuanced considerations for local markets? Who needs access to the data, both within the organization and among trade partners? These broad-based questions can then lead into more detailed questions around specific tactical opportunities that can help support the overall strategic goals.</p>
<p><em>Modeling Around the Right Questions</em></p>
<p>One homebuilder wants a large kitchen with lots of counter space for preparing gourmet meals. Another wants to optimize for green initiatives –creating spaces for solar panels and the like. Every homebuilder needs an architectural blueprint and model that defines the best tools, materials and processes to accomplish the job at hand. These decisions should flow naturally from the questions asked and answered in the previous stage.</p>
<p>Data models are designed for different purposes. Some place emphasis on providing a continual stream of information for decision-making in real time. Others focus on high quality analytical tools applied to batch data that are aggregated for a particular time period – for example daily, weekly or monthly. A critical driver of data architecture is the volume of information the system is expected to handle and the types of activities users expect to be performing when they access the databases. Information processing has a cost, which includes a time component and a money component. Thinking back to the homebuilder who wants a state-of-the-art kitchen space, the architectural  blueprint for that kitchen has to think through all the activities our homebuilder-chef is going to perform – how to optimize the process of retrieving ingredients, setting up preparation work stations, pulling out pots and pans and chef’s knives, and cooking on the stove or in the oven. The data architecture similarly has to visualize the user retrieving various bits of data from different locations at different times for different purposes, and figure out the configuration that works best.</p>
<p><em>Laying Pipes and Building a Platform</em></p>
<p>The most elegant model in the world will be of little use if the infrastructure is not solid. Living in a house we tend not to think about the plumbing as long as it is working – we wash dishes, water the plants and fill the ice trays without wondering about how the system works. But we will certainly notice when it doesn’t work – when water doesn’t  come out of the faucet or when it floods the basement. Likewise a business user at a computer workstation is probably not going to think about how all that information flowed from somewhere out there to show up at her desk – unless it doesn’t show up or is in an unusable format. Laying the transmission conduits and controls that regulate the flow of data to where, when and how it is needed in the organization is a serious undertaking.</p>
<p>When information comes into the organization from “out there” it is unfortunately a bit more complex in form than the municipal water that enters your home’s plumbing system. It is easily corruptible and subject to misuse if robust protocols and procedures are not in place to keep it clean and usable. In a surprisingly large number of cases data “formats” consist of little more than unprotected Excel spreadsheets with inconsistent definitions and naming conventions for product categories, subsets, SKUs and customer types. That will likely prove to be of little use in facilitating actionable analysis. To be usable data need to be <span style="text-decoration: underline;">clean</span> and <span style="text-decoration: underline;">granular</span> – in other words, accurate, in the right format and at the right level of detail for the analysis to be performed.</p>
<p><em>Organization Matters, Too</em></p>
<p>Business managers do not always take into account the organizational aspect of data management. The solutions architecture will not be effective if the enterprise is unable to successfully weave it into the fabric of the organizational culture and existing decision making processes. Empirical evidence suggests that rigorous analytical methods play a relatively minor role in the decision making practices of many organizations – especially those at higher strategic levels but also those related to daily tactical opportunities. Ask yourself how different levels of decisions are made in your organization. What role, if any, does data analysis play? How persuasive are data-supported arguments – for example in comparison with rigid top-down planning imperatives, an ad hoc “firefighting” culture or political and personality-centered negotiations? What needs to change in terms of both formal processes and the informal cultural context for actionable solutions architecture to make a lasting impact and lead to more informed decisions?</p>
<p>Like a house, a data management system is best if it can last for a long time and have the flexibility to adapt to changing needs and circumstances. Business strategies will change, as will technology. Far better that a system can adapt to this change without requiring an entire rebuilding effort from scratch. Start with the right questions, and you have a much better chance of building a data management capability that can successfully evolve with your organization.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.sentrana.com/2011/06/30/before-you-build-ask-the-right-questions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Changing Landscape of the Foodservice Industry &#8211; Part 2</title>
		<link>http://blog.sentrana.com/2011/04/28/the-changing-landscape-of-the-foodservice-industry-part-2/</link>
		<comments>http://blog.sentrana.com/2011/04/28/the-changing-landscape-of-the-foodservice-industry-part-2/#comments</comments>
		<pubDate>Thu, 28 Apr 2011 20:35:30 +0000</pubDate>
		<dc:creator>Katrina Lamb</dc:creator>
				<category><![CDATA[Tech Trends]]></category>
		<category><![CDATA[changing landscape of foodservice]]></category>
		<category><![CDATA[foodservice industry]]></category>
		<category><![CDATA[predictive technology in foodservice]]></category>
		<category><![CDATA[social networking]]></category>
		<category><![CDATA[using predictive technology to understand customer demand]]></category>
		<category><![CDATA[view into downstream demand]]></category>

		<guid isPermaLink="false">http://blog.sentrana.com/?p=560</guid>
		<description><![CDATA[This is the second installment in a two-part series on major changes taking place in the US foodservice industry. In the first installment we looked at some of the key challenges, deriving from traditional industry practices in sales &#38; marketing that impede optimal performance by manufacturers, distributors and operators in the sector. This second installment [...]]]></description>
			<content:encoded><![CDATA[<p><em>This is the second installment in a two-part series on major changes taking place in the US foodservice industry. In the first installment we looked at some of the key challenges, deriving from traditional industry practices in sales &amp; marketing that impede optimal performance by manufacturers, distributors and operators in the sector. This second installment will take a closer look at converging technologies that are poised to shake up the industry, and look at ways for industry players to benefit from these developments with intelligent, coordinated approaches to technology-driven solutions.</em></p>
<p>For manufacturers of foodservice products an important and often elusive goal is to gain visibility into the factors shaping and influencing downstream demand. The view from upstream is obscured by one or more layers of intermediation separating products from their end customers. Manufacturers typically set aside the largest part of their sales and marketing budgets for payments to trade partners, but evidence suggests that these expenditures do little to improve their understanding of actual downstream demand. Whether on their own or in collaboration with trade partners, manufacturers need to make better use of the data that can provide accurate intelligence about what is happening downstream. The good news is that the data are available, and new technologies are converging to enable manufacturers to capture information from which to make better sales &amp; marketing decisions. The challenge is to get around the obstacles that are preventing this from happening. <span id="more-560"></span></p>
<div class="wp-caption alignleft" style="width: 273px"><img src="http://www.businessdealentertainment.com/wp-content/uploads/2011/01/restaurant.jpg" alt="" width="263" height="241" /><p class="wp-caption-text">social networking is changing the way people decide where, when, what and with whom to eat</p></div>
<p>The end point of the foodservice value chain, of course, is the patron. In a perfect world restaurant operators would be able to predict how many patrons are showing up at their doors every night of the week, and what quantities of which menu offerings they would be ordering to eat and drink. They could then place orders upstream in total accordance with this knowledge. The world is not perfect, though, and no restaurant owner has such powers of clairvoyance. What they do have, increasingly, is the benefit of something that barely existed just a few years ago – <strong><em>social networking technology</em></strong>. More and more of the decisions that people make on a daily basis – including where, when and with whom to eat meals – are utilizing the many technological tributaries of the social networking phenomenon. For their part, restaurant operators have also been figuring out how to get into this game. Even the independent local establishments that lack the large corporate infrastructure of national chains are able to use these new digital tools to better understand their own markets and to control their own marketing – to reach out to potential diners with offers and other targeted inducements to patronize their tables. In so doing they are gaining actionable insights with which to better estimate demand. In turn, they can place orders to their suppliers with a higher level of confidence that these orders are a reasonably accurate reflection of what their customers will actually be ordering, when and in what quantity.</p>
<p>The presence of social networking technology is increasing the quantity and quality of digital signals coming from downstream markets. But there is another rapidly evolving technology that has the ability to amplify these signals even more powerfully up the value chain. That is <strong><em>predictive technology for sales and marketing decisions</em></strong>. Predictive technology applies rigorous quantitative methods to arm sales and marketing decision makers with recommendations to target the right customers with the right products, promotions, pricing and timing. This approach feeds on data signals – the more the better across as many customer-product combinations as possible – and helps to untangle the many diverse factors affecting demand at a very granular level. The algorithms that power predictive technology solutions can capture the digital demand signals coming from operators and patrons downstream, analyze them and provide useful intelligence to managers in corporate headquarters as well as sales professionals on the front line – information that can help them make decisions based on more than just guesswork. This can result in a higher likelihood of success for every offer presented to a customer.</p>
<p>This convergence of two powerful technology trends offers potential benefits throughout the foodservice value chain. For manufacturers the big question – going back to the discussion in the opening paragraph of this post – is how to get past the obstacles that impede the view downstream. There are different ways of doing this, and best practices are likely to evolve over time. One path to greater demand clarity can occur through better and more comprehensive collaborative processes with trade partners. Midstream distributors have their own set of challenges in the current environment – for example managing profit margin stability under conditions of unusually high volatility in week-to-week revenue movements alongside steadily increasing COGS trends for key products and categories. More accurate forecasting for price and other demand levers – assortment, promotions and sales force effort – can result in more predictable margins without the wild fluctuations seen today. That in turn can provide an incentive for distributors to be willing to look at alternative models to the stultifying “pay to play” template for trade spend that prevails today – including closer upstream collaboration.</p>
<p>Thanks to the growing capabilities afforded by social networking technology restaurant operators are steadily improving their ability to understand the intricacies of their markets and plan their foodservice purchases accordingly. Their upstream suppliers in turn can apply predictive technology solutions to identify where their promotions, sales efforts and deal bundling will most likely fall on the most receptive ears. Manufacturers who figure out how best to tap into these more accurate and better-amplified data signals will likely be rewarded with a clearer and farther view downstream.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.sentrana.com/2011/04/28/the-changing-landscape-of-the-foodservice-industry-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Missing the Ocean for the Stream: What We Can and Cannot Learn from IBM’s New Breakthrough</title>
		<link>http://blog.sentrana.com/2009/07/07/missing-the-ocean-for-the-stream-what-we-can-and-cannot-learn-from-ibms-new-breakthrough/</link>
		<comments>http://blog.sentrana.com/2009/07/07/missing-the-ocean-for-the-stream-what-we-can-and-cannot-learn-from-ibms-new-breakthrough/#comments</comments>
		<pubDate>Tue, 07 Jul 2009 15:23:45 +0000</pubDate>
		<dc:creator>Christian Bonilla</dc:creator>
				<category><![CDATA[Managers View]]></category>
		<category><![CDATA[Tech Trends]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[business analytics]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[decision process]]></category>
		<category><![CDATA[decision support]]></category>
		<category><![CDATA[forward looking analysis]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[IBM System S]]></category>
		<category><![CDATA[itemset detection]]></category>
		<category><![CDATA[link detection]]></category>
		<category><![CDATA[making better business decisions]]></category>
		<category><![CDATA[predict the highest price at which a customer would be willing to buy a product]]></category>
		<category><![CDATA[predicting the effect an advertisement will have in a market]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[stream computing]]></category>

		<guid isPermaLink="false">http://blog.sentrana.com/?p=314</guid>
		<description><![CDATA[IBM has dominated the airwaves with the roll-out of its sizable new business analytics and optimization division, along with a new stream computing platform called “System S”. But what exactly does this technological advancement do, and what does it mean for your business?]]></description>
			<content:encoded><![CDATA[<p>As part of its perpetual quest to reinvent and perfect its business model, IBM has made an aggressive push into the analytics market in the last half-dozen or so years. The company’s slick, though occasionally confusing ad campaigns (remember those ads with the mysterious red box being unveiled?) often announce its new initiatives, though it is not always clear that a new announcement is indeed a major one. In the analytics space, however, Big Blue does mean business. The announcement of its sizable new business analytics and optimization division is clearly intended to prove as much. Shortly after its announcement, IBM also unveiled a new stream computing platform called “System S” to much fanfare. The breathless enthusiasm of business journalists, technology bloggers and investment analysts has been palpable. But what exactly does this technological advancement do, and what does it mean for your business?</p>
<p>To answer this question, let’s begin briefly by dissecting what IBM has introduced. Imagine that you are receiving a continuous stream of data, such as stock prices on the Nasdaq. These figures must be quickly analyzed so that the proper buy and sell orders can be placed. Suppose that you also need to base your decisions not just on the Nasdaq prices but also the numbers figures coming in from dozens of other exchanges. <span id="more-314"></span></p>
<div id="attachment_318" class="wp-caption alignright" style="width: 390px"><img class="size-full wp-image-318" src="http://blog.sentrana.com/wp-content/uploads/2009/07/many_bloomberg_screens.jpg" alt="No Match for the Information at Hand" width="380" height="307" /><p class="wp-caption-text">No Match for the Information at Hand</p></div>
<p>Just for fun, let’s say that you also want to include up-to-the-second weather information of 20 cities in your decision process. We can safely say that there are not a great deal of human beings in the world that could look at all of this information in the blink of an eye, make the best possible decision, and then repeat the exact same process a split-second later with new information. Well, this example highlights the best application for stream computing. In the stock price analysis outlined above, we needed to evaluate information coming to us on a continuous basis from a number of different sources, evaluate it, and deploy the result in a desired manner. Let’s say that we wanted analyze the correlation between Motorola’s stock price movements and the price movements of 50 other stocks on 50 different exchanges, as well as the outdoor temperatures of every single world capitol. IBM’s breakthrough allows you to do this and also update your analysis in real-time as new information is received. That’s the good news.</p>
<p>The bad news is that there are real limits to the insight that can be obtained this way. There are really two areas in which stream computing can be applied: link detection and itemset detection. These two are related – link detection tries to identify events that are correlated, and itemset detection tries to identify events that are co-incident (i.e. the prices of GE and Sony stock went up, AND the price of Motorola went up).</p>
<div id="attachment_316" class="wp-caption alignleft" style="width: 183px"><img class="size-full wp-image-316" src="http://blog.sentrana.com/wp-content/uploads/2009/07/blindfolded-driver.jpg" alt="Not Quite Forward-Looking" width="173" height="120" /><p class="wp-caption-text">Not Quite Forward-Looking</p></div>
<p>Itemset detection algorithms have progressed to the point where they can be accurate and effective without taxing the computer’s resources too heavily. IBM’s accomplishment lies in deploying this algorithm for real-time itemset detection on unprecedented scale, but ultimately, the business insight that can be extracted from correlation and co-incidence is limited. Frankly, IBM’s press release statement that System S allows users to “…create a forward-looking analysis of data from any source,” is a bit of a fairy tale. Any competent manager will tell you that “forward looking analysis” based only on correlations in historical data is not a recipe for success. With itemset detection (which is closely related to and sometimes confused with association mining), you can now look at more data more quickly than ever before, but you can’t ultimately do much new with it.</p>
<p>Truly forward-looking analysis requires going several steps beyond itemset detection, because making accurate predictions about events that have never happened requires that you do more than analyze past correlations. For example, how could you predict the highest price at which a customer would be willing to buy a product that they have never purchased before using only their previous purchase information? Or how about predicting the effect an advertisement will have in a market that you have never advertised in before? Integrating techniques that are better geared for this analysis such as principal components analysis, collaborative filtering, and others make these types of predictive analysis possible, not to mention significantly more robust than correlative analytics. The downside of integrating of all of these techniques into your analysis is that it becomes extremely expensive computationally. Other related breakthroughs are required to make this feasible in a business setting (see <em><a href="http://blog.sentrana.com/2009/06/12/cheating-your-way-into-business-visibility/">Cheating Your Way Into Business Visibility</a></em>). To make better informed decisions with an understanding of the likelihood of possible outcomes, managers and policymakers alike need a way to reduce the uncertainty inherent in them. The aforementioned “virgin sale” process is a very micro-example that exemplifies this problem. Sadly, being able to comb through data more quickly by itself does not get us to that higher end-state. IBM can claim a notable achievement in computing, but hailing it as a quantum leap for businesses decision support understates the true challenge of making better business decisions.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.sentrana.com/2009/07/07/missing-the-ocean-for-the-stream-what-we-can-and-cannot-learn-from-ibms-new-breakthrough/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Cheating Your Way into Business Visibility</title>
		<link>http://blog.sentrana.com/2009/06/12/cheating-your-way-into-business-visibility/</link>
		<comments>http://blog.sentrana.com/2009/06/12/cheating-your-way-into-business-visibility/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 15:06:12 +0000</pubDate>
		<dc:creator>Christian Bonilla</dc:creator>
				<category><![CDATA[Managers View]]></category>
		<category><![CDATA[Tech Trends]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[business intelligence and predictive analytics at a person’s fingertips]]></category>
		<category><![CDATA[cheat the laws of physics]]></category>
		<category><![CDATA[computer architecture]]></category>
		<category><![CDATA[data visibility]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[enterprise data management]]></category>
		<category><![CDATA[high-performance computing]]></category>
		<category><![CDATA[I/O]]></category>
		<category><![CDATA[maximizing the computer’s potential throughput]]></category>
		<category><![CDATA[optimal prices]]></category>
		<category><![CDATA[pricing]]></category>
		<category><![CDATA[processing speed]]></category>
		<category><![CDATA[RAID stack]]></category>
		<category><![CDATA[sentrana research]]></category>
		<category><![CDATA[the i/o curse]]></category>

		<guid isPermaLink="false">http://blog.sentrana.com/?p=269</guid>
		<description><![CDATA[Better business visibility vis-à-vis cheating A) the laws of physics, B) the limitations of storage, and C) the laws of math. Implementing some clever tricks – ranging from the conventional to the profoundly innovative – can give us a quantum leap in the rapid accessibility of information. ]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal"><strong> <span style="font-weight: normal; ">Several weeks ago, I wrote a post about <a href="http://blog.sentrana.com/2009/03/29/what-happens-when-we-cant-keep-up-with-information/" target="_self">how the pace at which the world is accumulating information exceeds our ability to critically evaluate it</a>. For companies that make thousands or millions of marketing decisions every day in the form of price offerings, advertising placements and so on, this translates into making decisions that perpetually involve a greater amount of uncertainty relative to the amount of information we have. The root cause of this problem is a technological one: we do not have the computing power to slice and dice massive datasets in order to glean insight in time to support decisions. An even deeper explanation is that the gap between the rate of information accumulation in businesses and the pace of information transfer improvements will continue to widen at an increasing rate. This poses serious challenges to the capabilities offered by Business Intelligence, not to mention our ability to determine optimal prices. <span id="more-269"></span><br />
</span></strong></p>
<p class="MsoNormal"><span> An astute reader pointed out that solving this problem depends at least as much on software as hardware. Indeed, it is a deft blend of hardware as well as software optimization that provides the best hope for making vast reams of information imminently accessible. At the core of this information accessibility problem lies the most inescapable of all culprits – the Laws of Physics.<span> </span>Although the laws are immutable, there is hope.</span></p>
<p class="MsoNormal">No matter how fast the chips powering our computers become, there is a bottleneck between hard disk storage and main memory, or RAM. This condition is referred to as being “I/O bound” (I/O stands for input/output – essentially how fast information can be transferred from the disk to the processing units in a computer). Within a computer’s main memory, all activities are performed electronically, which essentially means varying levels of “rather fast.” The major disadvantage of typical disk storage systems is that reading information from them requires mechanical motion (full disclosure: this paradigm of mass storage is currently facing a major disruption from Solid State Disk (SSD) drives, although SSD drives are significantly more expensive today).</p>
<p class="MsoNormal">
<div id="attachment_270" class="wp-caption alignleft" style="width: 359px"><img class="size-full wp-image-270" src="http://blog.sentrana.com/wp-content/uploads/2009/06/hard_disk.jpg" alt="The Culprit - Read Head on a Hard Disk Drive" width="349" height="263" /><p class="wp-caption-text">The Culprit - Read Head on a Hard Disk Drive</p></div>
<p class="MsoNormal"><span>This mechanical motion significantly increases the time it takes to read and write information, slowing system performance. The computer’s throughput is thus bound by not by how fast electrons move, but by how fast the disk can rotate and the disk-head repositioned &#8211; to the tune of roughly 200 MB per second. We are physically bound to this mechanical limit, but through some clever tricks ranging from the conventional to the profoundly innovative can give us a quantum leap in the rapid accessibility of information.<br />
</span></p>
<p class="MsoNormal"><span> </span></p>
<p class="MsoNormal"><strong><span>Step 1: Cheat the Laws of Physics</span></strong></p>
<p class="MsoNormal"><span> </span></p>
<p class="MsoNormal"><span>If we are ever going to use data to make analytically-driven business decisions, we have to get into the technical weeds just a bit. A good first step is for us to put all of the information in our enterprise databases into what is referred to as a “RAID stack.” RAID stands for Redundant Array of Independent Disks. What this allows us to do (specifically, in what is termed a “RAID-0” configuration), is break up blocks of data and spread them across multiple disk drives. Breaking up information in this manner greatly improves I/O performance by distributing the load across many channels and drives. For reasons we won’t go into here, there is a limit to how many disks we can distribute data across and still get the desired results. Ultimately the attainable performance improvement tops out at 1.6 GB/s (8 disks at 200 MB/s). </span></p>
<div id="attachment_271" class="wp-caption alignright" style="width: 591px"><img class="size-full wp-image-271" src="http://blog.sentrana.com/wp-content/uploads/2009/06/raid.jpg" alt="Striping Data in a RAID-0 Array" width="581" height="265" /><p class="wp-caption-text">Striping Data in a RAID-0 Array</p></div>
<p class="MsoNormal">The computer’s RAM has a read/write speed of around 7 GB/s, but if we can only write information into memory at 1.6 GB/s, then we are still under-saturating RAM and the CPU. The CPU can perform calculations at roughly 10 GB/s, which means that a dual quad-core chip architecture (8 cores) allows for 80 GB/s of calculations. We are still not close to maximizing the computer’s potential throughput. Whether or not we can devise a way to maximize the potential of our machines, however, is the difference between only having the bandwidth to perform deep analytics on the top 10% of your product catalog on the one hand, and being able to quickly analyze not only all of your products, but all possible customer-product combinations. Optimizing your business, in fact, depends on the ability to overcome this challenge.</p>
<p class="MsoNormal">A second way to circumvent I/O boundedness is through compression. By compressing the text stored in the database tables, we can easily achieve a 5:1 gain in throughput as well. This means that the 1.6 GB that has to be picked up by the disk head now gets unpacked into 8 GB by the CPU (although the performance cost of decompression can be high in some cases, there are advances that we can use to get us past this hurdle as well). So by combining two well-known tools right off the bat, we have already achieved a performance improvement of several orders of magnitude compared to how much time it used to take to get 8GB of data to the CPU. Think about how much more of your customer data this allows you to analyze in the same amount of time. But we still have a long way to go before we max out how much information the computer is capable of processing. <span> </span></p>
<p class="MsoNormal"><strong><span>Step 2: Cheat the Limitations of Storage</span></strong></p>
<p class="MsoNormal"><span>Let’s say that over a two-year span, a company records 25 million individual transactions in its database, and that we are interested in knowing the total sales it made of a specific item: SKU5893. This section describes how a typical database would go about answering that query.</span></p>
<p class="MsoNormal">The manner in which we typically store data is not always the most conducive for high-performance computing. Most databases store information as collections of rows. Each row denotes a single unit of interest, such as a sales transaction. Each row has a number of columns that describe the attributes of that transaction, such as the data, customer’s name, item purchased, price, and so on. This can present some problems when data needs to be accessed in certain ways. If you wanted to find all the records involving a specific SKU, it might require scanning across 20 or more columns before getting to the column you need. The value in that column then has to be checked against the desired value to see if it matches, and then this search-and-check process is repeated for the next row until we have searched through the entire table. In massive data sets, all of this search time adds up and creates a crippling performance bottleneck. But there is a way out.</p>
<p class="MsoNormal">If it seems to you that there has got to be a way to store sales data more manageably, right? By taking advantage of what is called vertical fragmentation, we can do just that. Imagine that our transaction table only has five fields: Location, Customer Name, Item, Price, and Date, as in the example below.</p>
<p class="MsoNormal"><span> </span></p>
<div id="attachment_275" class="wp-caption alignnone" style="width: 464px"><img class="size-full wp-image-275" src="http://blog.sentrana.com/wp-content/uploads/2009/06/table_1c_blog.jpg" alt="Row-Based Table" width="454" height="151" /><p class="wp-caption-text">Row-Based Table</p></div>
<p>With tens of thousands of transactions each day, this table quickly accumulates a lot of rows. However, if we decide to orient this table by its columns instead of its rows, we would get the following:</p>
<p><img class="alignnone size-full wp-image-276" src="http://blog.sentrana.com/wp-content/uploads/2009/06/table_2b_blog.jpg" alt="table_2b_blog" width="519" height="137" /></p>
<p class="MsoNormal">We now have five two-column tables after adding a unique id field to each column that maps back to the information for each row. Several important results come from this new orientation. First, notice how much repetition of data we have in certain tables such as Date and Location. For instance, the value “1/1/2007” will be repeated thousands of times in this table. The gains we can achieve by compressing vertically fragmented tables far exceed what we can achieve with row-based tables because run-length encoding (and other compression techniques) because the data model better supports it. The second crucial point is that vertical fragmentation enables us to send to the CPU only the information that it needs to see. The disk-head does not need to scan across columns of data that it doesn&#8217;t need – so the 200MB/s that it is able to read is focused only on the column necessary for the query. Tack on another several orders-of-magnitude in I/O improvements.</p>
<p class="MsoNormal"><strong><span> <span>Step 3: Cheat the Laws of…Math</span></span></strong></p>
<p class="MsoNormal"><span>Now we’re getting somewhere. Data striping, compression, and vertical fragmentation provide a huge boost to the volume of information that we can access and process – indeed we are now getting to and probably exceeding the CPU&#8217;s number-crunching ability of 80GB/s.<span> </span>This brings us to our final bottleneck:<span> </span>we can’t speed up how quickly the chip can do the math required for heavy BI analytics. </span></p>
<p class="MsoNormal"><span>The solution lies not in the CPU, but in many CPUs. Distributed computing allows us to bring more CPUs into the mix while also feeding them from their own disks – what is referred to as a “shared nothing” architecture. If you use ten machines, you can distribute a a billion rows of sales data evenly across all ten machines, leaving 100 million rows on each. Now each machine is executing at warp speed on only a portion of the database, completing our search for sales of a specific SKU in a fraction of the normal processing time. </span></p>
<p class="MsoNormal"><span>Integrating all of the techniques that we have covered can be summed in the following steps:</span></p>
<ol type="1">
<li class="MsoNormal"><span>Distribute a database across multiple machines, so      that each gets some fraction of the total number of rows in the original</span></li>
<li class="MsoNormal"><span>On each machine, vertically fragment the database      section stored on it</span></li>
<li class="MsoNormal"><span>Stripe the data across multiple independent disks      on each machine</span></li>
<li class="MsoNormal"><span>Finally compress that data, </span></li>
</ol>
<p class="MsoNormal"><span> We have now achieved massively parallel and high performance processing. Each machine now runs through all of the information it has and sends the relevant information to the CPU, which can finally hit the processing limits of current chip architectures. The I/O curse long since in our rear-view mirror, we can finally begin to unlock the incredible amount of information latent in our very own data.</span></p>
<p class="MsoNormal"><span> </span></p>
<p class="MsoNormal"><strong><span>Sentrana’s Role</span></strong></p>
<p class="MsoNormal">I should note that the above story is an idealization. Real-world implementation of the interconnecting innovations that I have outlined here confront serious challenges of their own: decompression taxes the CPU; RAID is expensive and the energy drain of all those disks can be significant; management of multiple machines that are co-operating in a cluster is itself a sophisticated systems administration task, and the list goes on. Though valid concerns all, the more important point is that these are all solvable problems. These real-world implementation problems are the ones that Sentrana’s research has focused on in order to put business intelligence and predictive analytics at a person’s fingertips. Truly great business insights are like scientific discoveries – they stem from first asking an important question and then breaking it down into manageable pieces so that it can be answered. In order to support those moments of intuition in which the momentous questions are first asked, we have to be as fast as that other great computer at every decision-maker’s disposal.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.sentrana.com/2009/06/12/cheating-your-way-into-business-visibility/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>What Happens When We Can’t Keep Up with Information</title>
		<link>http://blog.sentrana.com/2009/03/29/what-happens-when-we-cant-keep-up-with-information/</link>
		<comments>http://blog.sentrana.com/2009/03/29/what-happens-when-we-cant-keep-up-with-information/#comments</comments>
		<pubDate>Sun, 29 Mar 2009 21:48:33 +0000</pubDate>
		<dc:creator>Christian Bonilla</dc:creator>
				<category><![CDATA[Tech Trends]]></category>
		<category><![CDATA[data storage]]></category>
		<category><![CDATA[demand for data storage]]></category>
		<category><![CDATA[economic downturn]]></category>
		<category><![CDATA[financial services]]></category>
		<category><![CDATA[HDDs]]></category>
		<category><![CDATA[historical data]]></category>
		<category><![CDATA[information]]></category>
		<category><![CDATA[microprocessor]]></category>
		<category><![CDATA[Moore's Law]]></category>
		<category><![CDATA[processing speed]]></category>
		<category><![CDATA[revenue optimization]]></category>
		<category><![CDATA[solid state and flash memory shipments]]></category>

		<guid isPermaLink="false">http://blog.sentrana.com/2009/03/29/what-happens-when-we-can%e2%80%99t-keep-up-with-information/</guid>
		<description><![CDATA[I ran into a former colleague the other day who, as it turns out, recently left his job and presently spends his days building options pricing models and trading from home on his own accounts. In turn, I described to him some of the recent work that we have done in revenue optimization and particularly [...]]]></description>
			<content:encoded><![CDATA[<p>I ran into a former colleague the other day who, as it turns out, recently left his job and presently spends his days building options pricing models and trading from home on his own accounts. In turn, I described to him some of the recent work that we have done in revenue optimization and particularly the breakthroughs that we have engineered for processing data. His face scrunched up a bit, and his response was uncharacteristically blunt: “You can always process numbers quickly if you need to,” he smirked.</p>
<p>Not so, in fact. When you start asking extremely detailed questions that require combing through years of detailed historical data and then performing mathematical transformations on each of those figures, you will find out rather quickly the limits of processing speed when your results finish compiling in a week or so. The thing is that most of us never push up against the processing speed frontier. We can see that every year computers get faster, chips get smaller, and Excel seems to have more rows. Moore’s Law prevails. The trouble is that all the while the rate at which the data universe expands is screaming past advances in processing capabilities, and that rate does not fluctuate with the economic downturn. Consider the markets for microprocessors, which allow us to perform those calculations and manipulate data, and hard drives, which allow for storage of data. Microprocessor sales have been dealt a sharp blow by the global downturn as computer sales have slowed, but worldwide shipments of hard disk drives (HDDs) roughly maintained 2007 levels even in the worst quarters of the recession (and the drives themselves contain more memory).  Solid state and flash memory shipments were down, but the evidence suggests that this is due to consumers substituting HDDs for other types of memory, rather than simply not storing more information. The demand for data storage, while not completely recession-proof, is nonetheless of the hardier variety.</p>
<p>Simply put, information of all kinds accumulates faster than we can analyze it. We are losing the race, and the gap is widening, not shrinking. As for what this ultimately means, I will now make a rather dour point. A fashionable explanation for the recession among both politicians and many “Main Street” types is that greed is what did us in. The greed of the bankers, the hedge funds, the fat cats, the small cats, whomever &#8211; greed is the culprit. But that doesn’t explain everything by a long shot. Even the greediest person doesn’t want the party to end and the money to stop coming in. Might it be possible that they weren’t able to ask the questions that might have led to certain debt instruments having never been created? Financial services employees have more information available to them than decision makers any other industry, and still here we find ourselves. Think about how many times each day similarly misinformed decisions are made inside corporations all across the world. The information is there, but we are more often than not letting it rot on the docks.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.sentrana.com/2009/03/29/what-happens-when-we-cant-keep-up-with-information/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Wanted: Intelligence (Information Need Not Apply)</title>
		<link>http://blog.sentrana.com/2009/03/19/wanted-intelligence-information-need-not-apply/</link>
		<comments>http://blog.sentrana.com/2009/03/19/wanted-intelligence-information-need-not-apply/#comments</comments>
		<pubDate>Thu, 19 Mar 2009 21:52:09 +0000</pubDate>
		<dc:creator>Christian Bonilla</dc:creator>
				<category><![CDATA[Tech Trends]]></category>
		<category><![CDATA[any piece of knowledge is information]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[ERP]]></category>
		<category><![CDATA[foreign intelligence community]]></category>
		<category><![CDATA[marketing and pricing]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[PI]]></category>
		<category><![CDATA[predictive intelligence]]></category>
		<category><![CDATA[pricing to become a fact-driven corporate discipline]]></category>

		<guid isPermaLink="false">http://blog.sentrana.com/2009/03/19/wanted-intelligence-information-need-not-apply/</guid>
		<description><![CDATA[Professionals in the foreign intelligence community take pains to distinguish between information and bona fide intelligence. Any piece of knowledge, no matter how trivial or irrelevant, is information. Intelligence, by contrast, is the subset of information valued for its relevance rather than simply its level of detail. That distinction is often lost in sector of [...]]]></description>
			<content:encoded><![CDATA[<p>Professionals in the foreign intelligence community take pains to distinguish between information and bona fide intelligence. Any piece of knowledge, no matter how trivial or irrelevant, is information. Intelligence, by contrast, is the subset of information valued for its relevance rather than simply its level of detail. That distinction is often lost in sector of the enterprise technology industry that is somewhat loosely referred to as Business Intelligence, or BI. This has become a bit of a catchall term for many different software applications and platforms that have widely different intended uses. I would argue that many BI tools that aggregate and organize a company’s information, such as transaction history or customer lists, more often provide information than intelligence. The lexicon is what it is, but calling something “intelligence” does not give it any more value. In order to sustainably outperform the competition, a company needs more than a meticulously organized and well-structured view of its history. Decision makers at all levels need a boost when making decisions amidst uncertainty and where many variables are exerting influence. They need what I would call predictive intelligence, or PI – the ability to narrow down the relevant variables for analysis and accurately measure their impact on the probability of a range of outcomes.</p>
<p>What makes the distinction between information and intelligence critical is that information is getting more accessible by the day. This democratization of BI is evidenced nowhere more so than at Microsoft. In 2008, Microsoft unveiled several projects in the late stages of development that it claims will put BI capabilities at the fingertips of more users than ever before. “Project Madison” will massively increase Microsoft’s information storage capabilities, while the “Kilimanjaro” and “Gemini” projects together will provide easy-to-use reporting and analysis tools designed to drastically reduce the complexity of using traditional BI tools &#8211; all at very low cost compared to large-scale ERP implementation. The possibilities abound. But I still ask the question: what are all of these newly empowered users going to do with all of this information once they can access it at the push of a button?</p>
<p>I am excited by the idea of so many more information workers being able to ask the questions that end up driving businesses to continuously reinvent and perfect themselves, but I worry about relevance. Will these capabilities actually increase the amount of intelligence available to decision makers? Any business decision can be thought of as a bet that some desired future state will materialize as a result of a present course of action. Business intelligence tools as we know them more often than not do not help us make more intelligent bets when it comes to the future. The problem is that we think they do. More data often makes the task of identifying the true predictors of business success and isolating their effects more difficult. In order for a company to get the most out of its data, it needs PI as well as BI capabilities at the fingertips of decision makers. For marketing and pricing to become a more fact-driven corporate discipline, we must recognize the need not for more data, but ways of evaluating the probability of outcomes based on only the factors that matter. This is not child’s play. Information alone, however well-groomed, is simply not sufficient to meet this need.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.sentrana.com/2009/03/19/wanted-intelligence-information-need-not-apply/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

