Big Data Can Solve Small Problems
NEWS ANALYSIS: The focus of the Big Data Summit was on how businesses can most effectively use big data today—by applying it to solve small, but nagging issues.I went to the Big Data Summit hosted by the Mass Technology Leadership Council recently in Cambridge, Mass., and had a chance to get an update on big data from an East Coast perspective. This was the fourth year of the summit, and during that time big data has emerged from concept to buzzword applied to nearly every business segment. "The full potential of big data is still unknown," said Oracle's Big Data Strategist Paul Sonderegger in the conference's keynote presentation. And the theme of the potential of big data was consistent throughout the event, as data scientists, entrepreneurs and users, including the city of Boston interim CIO Justin Holmes, described how they see the potential unfolding. In an era when buzzword answers to business problems ("just Hadoop it") have become all too common, the Summit was refreshing in explaining how the nebulous concept of big data is currently most effective in being applied to solve small, but nagging issues, and the future is in companies figuring out how to meld existing internal data structures with data pulled in from nontraditional outside sources. Also, amid all the discussion regarding the potential benefits of big data—including advanced analytics and forecasting—the old standbys of privacy, security and compliance need to be remembered. Those considerations were part of the Cambridge panel discussions and reflected the experience of the panelists who have dealt with data and privacy since before the big data buzz started. "Our challenge is how to leverage big data without being big brother," said Holmes. City governments collect a lot of data, and the current task is figuring out how to make that data accessible without violating privacy rights. Projects range from collecting pothole problems using the sensors contained in smartphones, allowing residents to adopt a fire hydrant to uncover following snowstorms, providing restaurant inspection data for out-of-town visitors to collecting information that carries more gravity, such as 911 calls to highlight potential high crime areas.
In the case of the 911 scenario, the public data has to be both made anonymous and connected to broader areas than specific addresses. "Who owns the data? In what format and context was the data collected? And what is the value versus risk equation?" said Holmes as he outlined the issues surrounding the transition of data from collection to a useful application.