Cloudera Unveils Open Source Workbench to Accelerate Data Science and Machine Learning

Press Release | Cloudera | May 1, 2017

Sees early customer adoption and deep learning extensibility, including the UK's Office of National Statistics

PALO ALTO, Calif., May 1, 2017 /PRNewswire/ -- Cloudera, Inc. (NYSE: CLDR), the provider of the leading modern platform for machine learning and advanced analytics built on the latest open source technologies, announced the general availability of the Cloudera Data Science Workbench, its self-service tool for data scientists. The workbench, announced in beta at Strata+Hadoop World San Jose 2017, enables fast, easy and secure self-service data science for the enterprise.

"We are entering the golden age of machine learning and it's all about the data. However, data scientists continue to struggle to build and test new analytics projects as fast as they would like, particularly in large scale environments," said Charles Zedlewski, senior vice president, Products at Cloudera. "The Data Science Workbench is a self-service tool that accelerates the ability to build, scale and deploy machine learning solutions using the most powerful technologies. This means that data scientists now have the freedom to share, collaborate and manage their data in a way that best suits them and their enterprise, resulting in an easier and faster path to production."

Charles ZedlewskiWith Python, R, and Scala directly in the web browser, Cloudera Data Science Workbench delivers a self-service data science experience. It gives users the ability to download and experiment with the latest libraries and frameworks in customizable project environments. Cloudera Data Science Workbench is both secure and compliant, with support for Hadoop authentication, authorization, encryption, and governance.

The Office of National Statistics (ONS), the UK's largest independent producer of official statistics, is aiming to use the Cloudera Data Science Workbench to create repeatable, accurate, and transferable statistical research. "We have seen a decreased time in developing models and better visibility in tracking progress and results," says Simon Sandford-Taylor, Chief Technology Officer.  "We think that Cloudera Data Science Workbench has the potential to accelerate our release calendar and better share best practices."

Cloudera's Data Science Workbench easily integrates with many deep learning frameworks including BigDL, a deep learning library for Apache Spark, open sourced by Intel. Built from the ground-up to run on distributed Spark/Hadoop infrastructure and performance-optimized to run on Intel® Xeon® processors (leveraging the Intel® Math Kernel Library), BigDL works directly within Cloudera's Data Science Workbench.

"Enterprise customers require a cohesive platform to scale their analytics solutions and maximize their investments. BigDL's native integration with Apache Spark brings the world of deep learning to the Apache Spark ecosystem and higher value to enterprise customers," said Michael Greene, vice president and general manager of the System Technologies and Optimization in the Software and Services Group, Intel Corporation. "The BigDL framework will help enterprise customers better utilize existing investments to build their analytics capabilities with optimized performance on Intel® architecture."

The benefits of BigDL integration into Data Science Workbench include the ability to leverage deep learning libraries and tactics on CPU architecture without any additional hardware considerations or separate environments. The combination provides a convenient way to create Spark data science pipelines natively and integrate them with deep learning library (BigDL) and other Spark/Hadoop components on the Cloudera Data Science Workbench.

About Cloudera

Cloudera delivers the leading modern platform for machine learning and advanced analytics built on the latest open source technologies. The world's leading organizations trust Cloudera to help solve their most challenging business problems with Cloudera Enterprise, the fastest, easiest and most secure data platform available for the modern world. Our customers efficiently capture, store, process and analyze vast amounts of data, empowering them to use advanced analytics and machine learning to drive business decisions quickly, flexibly and at lower cost than has been possible before. To ensure our customers are successful, we offer comprehensive support, training and professional services. 

Connect with Cloudera

About Cloudera: cloudera.com/content/cloudera/en/about/company-profile.html
Read our blogs: blog.cloudera.com/ and vision.cloudera.com/
Follow us on Twitter: twitter.com/cloudera
Visit us on Facebook: facebook.com/cloudera
Join the Cloudera Community: community.cloudera.com
Read about our customers' successes: cloudera.com/customers.html

Cloudera, , and associated marks are trademarks or registered trademarks of Cloudera Inc. All other company and product names may be trademarks of their respective owners.

SOURCE Cloudera, Inc.