IBM open-sources machine learning system SystemML

IBM is aiming to popularise its proprietary machine learning system, SystemML, through open-source communities.

Announcing the decision to share the system’s source code on the company blog, IBM’s Analytics VP Rob Thomas said application developers are in need of a good translator. This was a reference to the huge challenges developers face when combining information from different sources into data-heavy applications that run on a variety of computers, Thomas said. It is also a reference to the transformation of a little-used proprietary IBM system into a popular, widely adopted artificial intelligence tool for the big data market. The vehicle for this transformation, according to Thomas, will be the open-source community.

IBM says SystemML is now freely available to share and modify through the Apache Software Foundation, the open-source organisation. Apache, which manages 150 open-source projects, represents the first step to widespread adoption, Thomas said. The new Apache Incubator project will be named Apache SystemML.

The machine learning platform originally came out of IBM’s Almaden research lab ten years ago, when IBM was looking for ways to simplify the creation of customised machine learning software, Thomas said. Now that it is open source, it could be used by a developer of cloud-based services to create risk-modelling and fraud-prevention software for the financial services industry, he said.

The current version of SystemML could work well with the Apache project Spark, Thomas said, since Spark is designed for processing large amounts of data that stream in from continuous sources such as monitors and smartphones. SystemML will save companies valuable time by allowing developers to write a machine learning algorithm once and automatically scale it up using the open-source data analytics tools Spark and Hadoop.

MLlib, the machine learning library for Spark, provides developers with a rich set of machine learning algorithms, according to Thomas, and SystemML enables developers to translate those algorithms so that they can easily digest different kinds of data and run on different kinds of computers.

“We believe that Apache Spark is the most important new open-source project in a decade. We’re embedding Spark into our Analytics and Commerce platforms, offering Spark as a service on IBM Cloud, and putting more than 3,500 IBM researchers and developers to work on Spark-related projects,” said Thomas.

While other tech companies have open-sourced machine learning technologies, these are generally niche, specialised tools for training neural networks. IBM aims to popularise machine learning within Spark or Hadoop, and its ubiquity will be critical in the long run, said Thomas.

Originally published on the WeChat public account 智能计算时代 (intelligentinterconn)

Original publication date: 2015-12-02
