Close
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Logo
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Home Applications
    • Applications
    • Big Data and Analytics
    • Innovation
    • IT Management
    • Storage

    IT Science Case Study: How Deutsche Börse Tamed Data Prep Costs

    Written by

    Chris Preimesberger
    Published December 7, 2017
    Share
    Facebook
    Twitter
    Linkedin

      eWEEK content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More.

      Here’s the latest edition of a new occasional feature in eWEEK called IT Science, in which we look at what really happens at the intersection of new-gen IT and legacy systems.

      These articles describe industry solutions. The idea is to look at real-world examples of how new-gen IT products and services are making a difference in production each day. Most of them are success stories, but there will also be others about projects that blew up. We’ll have IT integrators, system consultants, analysts and other experts helping us with these as needed.

      We’ve published similar articles to these in the past, but the format has evolved. We’ll keep them short and clean, and we’ll add relevant links to other eWEEK articles, whitepapers, video interviews and occasionally some outside expertise as we need it in order to tell the story.

      An important feature, however, is this: We will report available ROI of some kind in each article, whether it is income on the bottom line, labor hours saved or some other valued business asset.

      Today’s IT Science Feature: Deutsche Börse

      Germany-based Deutsche BörseAG (or Deutsche Börse Group) is a marketplace organizer for the trading of shares and other securities. It is also a transaction services provider. Information for this edition of IT Science was provided by Konrad Sippel, head of Content Lab and Senior Advisor at Deutsche Börse.

      Name the problem being solved: The Content Lab of Deutsche Börse is a data-driven R&D team that collects, analyzes and enriches data from the entire value chain of trading, clearing and settlement. In this function, the team consumes all sorts of structured and unstructured data sets from various areas of the business. Typically, the ingestion, cleansing and normalization of these data sets occupies a large part of the data scientists’ time spent on any given project. We brought in Trifacta to help our data science team with data ingestion, cleansing and preparation to allow the team to more efficiently spend their time on generating insights and analytics rather than formatting tables.

      Describe the strategy that went into finding the solution: As part of our regular activities, we trial new big-data tools and software on a regular basis. As a result, we ran PoCs with various data wrangling tool providers including Trifacta. During our PoC, we duplicated work that had previously been done on a particularly dirty dataset related to historical fixed-income reference data. During the trial we were able to re-do months of work in a matter of a few weeks.

      List the key components in the solution: We run Trifacta’s Data Wrangling software solution within our cloud based research set-up on AWS on top of a Cloudera-based cluster. Trifacta is a platform for exploring and preparing data for analysis. Trifacta works with cloud and on-premises data platforms.

      Trifacta is designed to allow analysts to explore, transform and enrich raw, diverse data into clean and structured formats for analysis through self-service data preparation.

      Describe how the deployment went, perhaps how long it took, and if it came off as planned: For the PoC installation, the software was installed in less than a day in a temporary environment. For the permanent installation, the set-up was done in parallel to the set-up of the R&D cluster with Cloudera, which also ran very smoothly within a few days.

      Describe the result, new efficiencies gained, and what was learned from the project: We have fully integrated Trifacta into our data science process and technology stack. New data that we acquire or access are ingested through Trifacta to ensure data scientists start off working on a clean set of data with minimal effort spent on cleaning and preparing the dataset. Additionally, data scientists may use further functionalities to combine or further modify datasets more efficiently using Trifacta.

      Describe ROI, carbon footprint savings, and staff time savings, if any: We are at an early stage of our usage of the software, so figures are hard to come by. We strongly believe that efficiency gains in data preparation and ingestion will lead to a massive increase in data scientist efficiency and reduce the time required to ingest new datasets from a process point of view.

      If you have an idea for an IT Science case study, email [email protected].

      Chris Preimesberger
      Chris Preimesberger
      https://www.eweek.com/author/cpreimesberger/
      Chris J. Preimesberger is Editor Emeritus of eWEEK. In his 16 years and more than 5,000 articles at eWEEK, he distinguished himself in reporting and analysis of the business use of new-gen IT in a variety of sectors, including cloud computing, data center systems, storage, edge systems, security and others. In February 2017 and September 2018, Chris was named among the 250 most influential business journalists in the world (https://richtopia.com/inspirational-people/top-250-business-journalists/) by Richtopia, a UK research firm that used analytics to compile the ranking. He has won several national and regional awards for his work, including a 2011 Folio Award for a profile (https://www.eweek.com/cloud/marc-benioff-trend-seer-and-business-socialist/) of Salesforce founder/CEO Marc Benioff--the only time he has entered the competition. Previously, Chris was a founding editor of both IT Manager's Journal and DevX.com and was managing editor of Software Development magazine. He has been a stringer for the Associated Press since 1983 and resides in Silicon Valley.
      Linkedin Twitter

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      Get the Free Newsletter!

      Subscribe to Daily Tech Insider for top news, trends & analysis

      MOST POPULAR ARTICLES

      Artificial Intelligence

      9 Best AI 3D Generators You Need...

      Sam Rinko - June 25, 2024 0
      AI 3D Generators are powerful tools for many different industries. Discover the best AI 3D Generators, and learn which is best for your specific use case.
      Read more
      Cloud

      RingCentral Expands Its Collaboration Platform

      Zeus Kerravala - November 22, 2023 0
      RingCentral adds AI-enabled contact center and hybrid event products to its suite of collaboration services.
      Read more
      Artificial Intelligence

      8 Best AI Data Analytics Software &...

      Aminu Abdullahi - January 18, 2024 0
      Learn the top AI data analytics software to use. Compare AI data analytics solutions & features to make the best choice for your business.
      Read more
      Latest News

      Zeus Kerravala on Networking: Multicloud, 5G, and...

      James Maguire - December 16, 2022 0
      I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
      Read more
      Video

      Datadog President Amit Agarwal on Trends in...

      James Maguire - November 11, 2022 0
      I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
      Read more
      Logo

      eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

      Facebook
      Linkedin
      RSS
      Twitter
      Youtube

      Advertisers

      Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

      Advertise with Us

      Menu

      • About eWeek
      • Subscribe to our Newsletter
      • Latest News

      Our Brands

      • Privacy Policy
      • Terms
      • About
      • Contact
      • Advertise
      • Sitemap
      • California – Do Not Sell My Information

      Property of TechnologyAdvice.
      © 2024 TechnologyAdvice. All Rights Reserved

      Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

      ×