MarkLogic Server Adds ETL Database Tool
MarkLogic released a new version of its database Oct. 19 with a host of new features aimed at helping organizations deal with unstructured data.
In MarkLogic Server 4.2, the company has added a new ETL (extraction, transformation and loading) tool for managing the data. According to Ken Chestnut, vice president of product marketing at MarkLogic, the tool, Information Studio, is meant to simplify the loading and processing of information by allowing users to "drag and drop files and load them as [is] without normalizing or preprocessing them first."
Chestnut said, "Information Studio allows organizations to build sophisticated loading processes without having to write any code. This means less code to write and manage and fewer opportunities to introduce errors or performance problems. As your application and information processing requirements evolve, Information Studio provides a way extend the built-in functionality without having to reinvent all of the plumbing to do things like progress reporting and error handling."
In addition to Information Studio, the company added a number of other features, including replication, database rollback and failover. The company also added support for XSLT (Extensible Stylesheet Language Transformations) to allow organizations to "more quickly support new devices/channels or update existing ones all while leveraging their investment in existing XSLT style sheets and skills," Chestnut said.
MarkLogic's target market is industries dealing with volumes of unstructured information such as images, tweets, blogs and research data. "The information explosion makes MarkLogic 4.2 increasingly applicable to companies outside of MarkLogic's traditional key verticals of media, government and financial services," Matt Aslett, an analyst with 451 Group, said in a statement.
"As the increased amount of unstructured information makes the need for a specialized database more necessary than ever, we expect increased adoption of MarkLogic," Aslett said.
"Existing relational databases (RDBMS) are suited to manage data that is predefined and in standardized format such as payroll, accounts payable [and] contacts," Chestnut explained. "It is data that is highly structured and fits into rows and columns. With the explosion in ... volume as well as different forms of information such as contracts, Web content, e-mails, documents, e-books, videos, images, etc. (we call this unstructured), organizations are struggling with both managing and exploiting this information using existing technologies."