IBM's SystemML machine learning system becomes Apache project

There's a race between tech giants to open source machine learning systems and become a dominant platform. Apache SystemML has clear enterprise spin.

IBM on Monday said its machine learning system, dubbed SystemML, has been accepted as an open source project by the Apache Incubator.

The Apache Incubator is an entry to becoming a project of The Apache Software Foundation. The general idea behind the incubator is to ensure code donations adhere to Apache's legal guidelines and communities follow guiding principles.

IBM said it would donate SystemML as an open source project in June.

What's notable about IBM's SystemML milestone is that open sourcing machine learning systems is becoming a trend. To wit:

  1. Google recently open sourced its TensorFlow machine learning tool under an Apache 2.0 license.
  2. Facebook has also contributed its machine learning and artificial intelligence tools to the Torch open source project.

For enterprises, the upshot is that there will be a bevy of open source machine learning code bases to consider. Google TensorFlow and Facebook Torch are tools to train neural networks. SystemML is aimed a broadening the ecosystem to business use.

Why are tech giants going open source with their machine learning tools? The machine learning platform that gets the most data will learn faster and then become more powerful. That cycle will just result in more data to ingest. IBM is looking to work the enterprise angle on machine learning. Microsoft may be another entry on the enterprise side, but may not go the Apache route.

In addition, there are precedents to how open sourcing big analytics ideas can pay off. MapReduce and Hadoop started as open source projects and would be a cousin of whatever Apache machine learning system wins out.

IBM's SystemML, which is now Apache SystemML, is used to create industry specific machine learning algorithms for enterprise data analysis. IBM created SystemML so it could write one codebase that could apply to multiple industries and platforms. If SystemML can scale, IBM's Apache move could provide a gateway to its other analytics wares.

The Apache SystemML project has included more than 320 patches for everything from APIs, data ingestion and documentation, more than 90 contributions to Apache Spark and 15 additional organizations adding to the SystemML engine.

Here's the full definition of the Apache SystemML project:

SystemML provides declarative large-scale machine learning (ML) that aims at flexible specification of ML algorithms and automatic generation of hybrid runtime plans ranging from single node, in-memory computations, to distributed computations on Apache Hadoop and Apache Spark. ML algorithms are expressed in a R or Python syntax, that includes linear algebra primitives, statistical functions, and ML-specific constructs. This high-level language significantly increases the productivity of data scientists as it provides (1) full flexibility in expressing custom analytics, and (2) data independence from the underlying input formats and physical data representations. Automatic optimization according to data characteristics such as distribution on the disk file system, and sparsity as well as processing characteristics in the distributed environment like number of nodes, CPU, memory per node, ensures both efficiency and scalability.

The link to Apache:

http://systemml.incubator.apache.org/

原文发布于微信公众号 - 智能计算时代(intelligentinterconn)

原文发表时间:2015-12-04

本文参与腾讯云自媒体分享计划,欢迎正在阅读的你也加入,一起分享。

发表于

我来说两句

0 条评论
登录 后参与评论

相关文章

来自专栏不想当开发的产品不是好测试

spring boot 登录注册 demo (四) -- 体验小结

之前没有折腾过Spring,直接上来怼Spring Boot异常痛苦,参考着官网的guide(https://spring.io/guides)写了几个demo...

2139
来自专栏CreateAMind

commaai代码!

lateral plan : selfdrive.controls.lib.pathplanner longitudal plan: selfdrive.con...

841
来自专栏企鹅号快讯

成为Java高手的25个学习要点

. 很多人会问学java不知道该如何入手? 不知道学习的方向该怎么办? 有没有什么学习方法可以推荐? 想成为java高手,有没有一些可以衡量的标准呢? 本文就为...

2049
来自专栏dalaoyang

熔断器---Hystrix

Hystrix:熔断器,容错管理工具,旨在通过熔断机制控制服务和第三方库的节点,从而对延迟和故障提供更强大的容错能力。 说到熔断器,先要引入另外一个词,雪崩效应...

2806
来自专栏有刻

Java 小记 — Spring Boot 的实践与思考

3659
来自专栏程序员互动联盟

【编程技巧】成为Java技术大牛之25招必杀技

1. 你需要精通面向对象分析与设计(OOA/OOD)、设计模式(GOF,J2EEDP)以及综合模式。你应该了解UML,尤其是class、object、inter...

3796
来自专栏xingoo, 一个梦想做发明家的程序员

Java程序员的日常——SpringMVC+Mybatis开发流程、推荐系统

今天大部分时间都在写业务代码,然后算是从无到有的配置了下spring与mybatis的集成。 SpringMVC+Mybatis Web开发流程 配置数据...

1899
来自专栏斑斓

大数据流处理平台的技术选型参考

选择太多,是一件好事情,不过也容易乱花渐欲迷人眼。倘若每个平台(技术)都去动手操练一下,似乎又太耗时间。通过阅读一些文档,可以帮我们快速做一次筛选。在将选择范围...

3235
来自专栏纯洁的微笑

Spring Boot 2.0(三):Spring Boot 开源软件都有哪些?

2016年 Spring Boot 还没有被广泛使用,在网上查找相关开源软件的时候没有发现几个,到了现在经过2年的发展,很多互联网公司已经将 Spring Bo...

4457
来自专栏违法举报

有人利用服务器做违法生意

举报域名www.idaikan.com,经过查询,此域名ip119.28.49.236属于腾讯云

2.6K0

扫码关注云+社区

领取腾讯云代金券