SAN JOSE, Calif. — Data analyzer and visualizer ClearStory Data and Hadoop provider Hortonworks have set up a new partnership for ClearStory’s platform to run natively on Hortonworks’ Hadoop Data Lakes deployment.
The companies made the announcement Feb. 19 at the O’Reilly Strata + Hadoop World 2015 conference here at the San Jose McEnery Convention Center.
The new partnership enables ClearStory’s fast-cycle, scalable analysis package to be deployed on Hadoop Data Lakes using the Hortonworks Data Platform.
Data lakes is a relatively new storage term referring to vast pools of object storage for corporate data, which can include any type of data—structured or unstructured. They are also sometimes called “bit buckets” or “the landing zone.”
Four-year-old ClearStory Data, based in Menlo Park, Calif., makes software for line-of-business users to find, analyze and use data at scale from corporate, Web and premium data sources for combined and up-to-date insights. Data resources include relational databases, Hadoop batches, Web and social application interfaces, and third-party data providers. It features an intuitive, easy-to-use user interface.
Hortonworks, based in Palo Alto, Calif., was among the earliest independent Hadoop batch processing providers to get up and running in 2011.
In technical terms, ClearStory’s integration includes a data inference and data streaming framework that provides faster access to data in HDP via Apache Hive (a data warehouse infrastructure built on top of Hadoop), fast blending of additional data sources with HDP for holistic insights, and a user interface for business users who need to see results directly, collaborate in real-time on analysis and take data-driven actions.
ClearStory Data has banked $30 million in three funding rounds from five investors in four years, including a $21 million Series B round in March 2014, according to Crunchbase. Investors include Andreessen Horowitz, Google Ventures, Khosla Ventures, Kleiner Perkins Caufield & Byers and DAG Ventures.