Close
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Logo
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Home Applications
    • Applications
    • Big Data and Analytics

    Informatica’s Analytics Package Brings Order to Big Data Lakes

    Written by

    David Needle
    Published September 25, 2016
    Share
    Facebook
    Twitter
    Linkedin

      eWEEK content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More.

      Computers, mobile devices and smart sensors are generating trillions of pieces of data around the world at an unprecedented rate. Tiny sensors collect soil data for farmers, wearable devices accumulate data about our health, scanners track consumer purchases. The list of data sources goes on and on.

      But how much of all that data is organized and easily accessible? The rise of big data also has seen a huge demand for data scientists to help companies gain insights from all the data they’re collecting. But data scientists are in short supply and custom data analysis applications aren’t always flexible enough to accommodate new data sources.

      The big data dilemma is often compared to bodies of water. Ideally you want a clear data lake that lets you see everything regardless of depth. But with disparate systems and means of collecting and processing data, what you often end up with is a murky data swamp.

      Data integration company Informatica has been in the data management business for decades. Later this year it plans to release a Data Lake Management system described as an out-of-the-box system for managing big data. The system will be previewed at the Strata-Hadoop conference in New York Sept. 27-29, with availability expected by the end of this year.

      “The biggest competition for this is the do-it-yourselfers, the idea that all you need is Hadoop and a few guys in a basement to make it work,” Murthy Mathiprakasam, director of product marketing for data lake management at Informatica, told eWEEK. “Dealing with terabytes of data is hard, especially as you have to scale.”

      Hadoop is an open-source, Java-based programming framework designed to support the processing and storage of extremely large data sets in a distributed computing environment. It’s been gaining popularity as a data analysis tool, but not every organization has the skills and knowledge to make it work effectively.

      Mathiprakasam said there are four problems Informatica was hearing from customers that led to the creation of its Data Lake Management application.

      “The first is that creating solutions on top of Hadoop is extremely difficult because at some point you need reusability, maintainability and some way to automate some of the processes,” said Mathiprakasam.

      Second is finding an easier way to understanding all the data you may be collecting.

      “Companies have enough data, but they don’t know all of what they have and the business analysts that need it aren’t getting the value of big data. We also talk to customers in regulated industries like health care and finance, where you need to know what you have and where it is,” he said.

      The third problem is time. Some companies have developed applications that work, but the preparation and cleansing of data from disparate sources so it can be presented to business analysts can take weeks.

      The fourth issue is what Mathiprakasam calls insufficient trust. There often is overlapping data collection—for example multiple customer accounts for the same person—that can be inaccurate. “A lot of times the data being collected doesn’t meet audit and compliance requirements regulated industries have,” he said.

      The Data Lake Management package is offered as an integrated suite brought together as a single metadata-driven platform.

      Components include the Informatica Enterprise Information Catalog that offers intelligent self-service tools powered by machine learning and artificial intelligence, according to the company.

      Informatica Intelligent Streaming is designed to help organizations more efficiently capture and process big data—such as machine, social media feeds and website click streams—and real-time events to gain timely insight for business initiatives, such as the internet of things (IoT), marketing and fraud detection.

      Another component, Informatica Blaze, is said to dramatically increase Hadoop processing performance with intelligent data pipelining, job partitioning, job recovery and scaling.

      For users of Microsoft’s Azure cloud, Informatica Big Management can be deployed with a click of a button via the Microsoft Azure Marketplace to integrate, govern and secure big data at scale in Hadoop, according to company officials.

      A related component, Informatica Cloud Microsoft Azure Data Lake Store Connector, helps customers achieve faster business insights with self-service connectivity to integrate and synchronize diverse data sets into Microsoft Azure Data Lake Store.

      “There are so many point solutions on the market that solve one issue like delivery, but not trust and then you have the challenge of integrating that solution with another one,” said Mathiprakasam.

      To that end he said Informatica is also working with other vendors to offer connectors to Informatica Data Lake Management. For example, the company has partnered with Tableau to offer a connector from Informatica to Tableau’s business intelligence application making the data and insights accessible to a business or marketing analyst without needing a data scientist to create a solution.

      David Needle
      David Needle
      Based in Silicon Valley, veteran technology reporter David Needle covers mobile, bi g data, and social media among other topics. He was formerly News Editor at Infoworld, Editor of Computer Currents and TabTimes and West Coast Bureau Chief for both InformationWeek and Internet.com.

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      MOST POPULAR ARTICLES

      Artificial Intelligence

      9 Best AI 3D Generators You Need...

      Sam Rinko - June 25, 2024 0
      AI 3D Generators are powerful tools for many different industries. Discover the best AI 3D Generators, and learn which is best for your specific use case.
      Read more
      Cloud

      RingCentral Expands Its Collaboration Platform

      Zeus Kerravala - November 22, 2023 0
      RingCentral adds AI-enabled contact center and hybrid event products to its suite of collaboration services.
      Read more
      Artificial Intelligence

      8 Best AI Data Analytics Software &...

      Aminu Abdullahi - January 18, 2024 0
      Learn the top AI data analytics software to use. Compare AI data analytics solutions & features to make the best choice for your business.
      Read more
      Latest News

      Zeus Kerravala on Networking: Multicloud, 5G, and...

      James Maguire - December 16, 2022 0
      I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
      Read more
      Video

      Datadog President Amit Agarwal on Trends in...

      James Maguire - November 11, 2022 0
      I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
      Read more
      Logo

      eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

      Facebook
      Linkedin
      RSS
      Twitter
      Youtube

      Advertisers

      Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

      Advertise with Us

      Menu

      • About eWeek
      • Subscribe to our Newsletter
      • Latest News

      Our Brands

      • Privacy Policy
      • Terms
      • About
      • Contact
      • Advertise
      • Sitemap
      • California – Do Not Sell My Information

      Property of TechnologyAdvice.
      © 2024 TechnologyAdvice. All Rights Reserved

      Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

      ×