Managing big data: the two biggest mistakes companies make
There's probably not a day -- if not an hour -- that goes by when you don't hear about big data: managing big data,analyzing big data. Indeed, the term has been so widely and, at times, so inappropriately applied that it has started to lose its meaning.
And there's something to watch out for in all the talk about big data and its potential benefits. It's the big data lie -- the notion that big data can automatically solve any number of business problems and position businesses to compete more effectively with their rivals.
What I've consistently found is that those who believe the big data lie are making two big mistakes:
- Collecting data indiscriminately. Companies are collecting loads and loads of data. That's great. But they have no idea how to effectively make sense of it. They jump in by collecting big data -- extremely large, comprehensive, and often varied and fast-changing data sets -- to investigate and draw conclusions. But raw data says nothing to most business users, and placing even more data into bigger databases or copying increasing amounts of it into spreadsheets typically provides no value in and of itself.
- Categorizing data within an inch of its life. Conversely, companies may think they're doing well if they "scrub" all the big data they've collected and then categorize it. Sure, there is a benefit to normalizing data by centralizing, cleansing, classifying and structuring it. But it's hard to make a business case for doing that if, when the cleansing and organizing is done, the data doesn't provide answers to real business questions. This is especially true when a big data management effort turns into an IT-focused project that doesn't gain assurance from business users that the data in question has been validated and is actually usable.
The problem is that many companies today are collecting and managing big data with little to no forethought. And just as planning is key to any strategic business project, forethought is of utmost importance when dealing with big data. All the elements of the process must be purposeful and aligned closely with business goals.
So, what's the right way? You have to start with a strategic business question and then acquire the data needed to answer that question. Only then can you quickly start visualizing your business, performing business discovery and delivering actionable intelligence.

