【项目】2017年最佳的30个机器学习项目

AiTechYun

发布于 2018-03-06 11:24:18

6370

发布于 2018-03-06 11:24:18

文章被收录于专栏：ATYUN订阅号

本文比较了在过去的一年里8800个开源机器学习项目，并从中选取了前30个制成这份清单。它涵盖了2017年1月和12月之间发布的最佳开源机器学习库，数据集和应用程序。Mybridge AI通过受欢迎程度，参与度和新近度来评估质量（为了让你明白质量好是什么概念，我可以告诉你，它们的Github星的平均数量是3558）。

开源项目对于数据科学家来也很有用。你可以通过阅读源代码进行学习并在现有项目之上构建一些东西。你可以花很多时间来鼓捣你过去一年中可能错过的机器学习项目。在此之前我首先要推荐两个我认为比较好的机器学习课程（英文课程）。

榜单

NO1

FastText：用于快速文本表示和分类的库。[Github上11786颗星]。由Facebook Research提供。

[ 灵感来源于：基于快速文本的多语言无监督或监督的词嵌入。Github上695颗星。https://github.com/facebookresearch/MUSE]

https://goo.gl/VWGfCs

NO2

Deep-photo-styletransfer：论文的“Deep Photo Style Transfer”代码和数据 [Github上9747颗星]。由康奈尔大学Fujun Luan博士提供。

https://github.com/luanfujun/deep-photo-styletransfer?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO3

为Python和命令行提供的最简单的面部识别api。[Github上8672个星]。由Adam Geitgey提供。

https://github.com/luanfujun/deep-photo-styletransfer?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO4

Magenta：一代与机器智能[Github上8113颗星]。

https://github.com/luanfujun/deep-photo-styletransfer?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO5

Sonnet：基于TensorFlow的神经网络库[Github上5731颗星]。由Deepmind公司的Malcolm Reynolds提供。

https://github.com/deepmind/sonnet?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO6

deeplearn.js：为网络硬件加速的机器智能库[Github上5462颗星]。由Google Brain的Nikhil Thorat提供。

https://github.com/PAIR-code/deeplearnjs?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO7

在TensorFlow中快速转变风格[Github上4843颗星]。由MIT的Logan Engstrom提供。

https://github.com/lengstrom/fast-style-transfer?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO8

Pysc2：星际争霸II学习环境[Github上3683颗星]。由DeepMind的Timo Ewalds提供。

https://github.com/deepmind/pysc2?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO9

AirSim：基于Unreal引擎的开源模拟器，用于微软AI＆Research的自驾车[Github上3861颗星]。由微软的Shital Shah提供。

https://github.com/Microsoft/AirSim?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO10

Facets：机器学习数据集的可视化[Github上3371颗星]。由Google Brain提供。

https://github.com/PAIR-code/facets?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO11

Style2Paints:为图片上色的AI[Github上3310颗星]。

https://github.com/lllyasviel/style2paints?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO12

Tensor2Tensor：一个用于泛化seq2seq模型的库[Github上3087颗星]。由Google Brain的Ryan Sepassi提供。

https://github.com/tensorflow/tensor2tensor?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO13

在PyTorch中进行图片对图片的转化（例如horse2zebra，edges2cats等等）[Github上2847颗星]。由伯克利的Jun-Yan Zhu博士提供。

https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO14

Faiss:用于密集向量的高效相似性搜索和聚类的库。[Github上2629颗星]。由Facebook Research提供。

https://github.com/facebookresearch/faiss?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO15

Fashion-mnist：类似MNIST的时尚产品数据库[Github上2780颗星]。由Zalando Tech的研究科学家Han Xiao提供。

https://github.com/zalandoresearch/fashion-mnist?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO16

ParlAI：各种公开可用的对话数据集上训练和评估AI模型的框架[Github上2578颗星]。由Facebook Research的Alexander Miller提供。

https://github.com/facebookresearch/ParlAI?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO17

Fairseq：Facebook AI研究seq2seq工具包[Github上2571颗星]。

https://github.com/facebookresearch/fairseq?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO18

Pyro：Python和PyTorch的深度通用概率编制程序（Deep universal probabilistic programming）[Github上2387颗星]。由Uber的AI Labs提供。

https://github.com/uber/pyro?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO19

iGAN：由GAN支持的交互式图像生成器（Interactive Image Generation）[Github上2369颗星]。

https://github.com/junyanz/iGAN?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO20

深度图像优先：使用无监督学习的神经网络进行图像恢复[Github上2188颗星]。由Skoltech的Dmitry Ulyanov博士提供。

https://github.com/DmitryUlyanov/deep-image-prior?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO21

Face_classification：使用具有keras CNN模型和openCV的fer2013/imdb数据集的实时人脸检测和情感/性别分类。[Github上1967星]。

https://github.com/oarriaga/face_classification?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO22

Speech-to-Text-WaveNet：使用DeepMind的WaveNet和tensorflow进行端对端的句子级英语语音识别[在Github上1961星]。由来自Kakao Brain的Namju Kim提供。

https://github.com/buriburisuri/speech-to-text-wavenet?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO23

StarGAN：用于多领域图像对图像转化的，标准的生成式对抗网络[Github上1954颗星]。由韩国大学的Yunjey Choi提供。

https://github.com/yunjey/StarGAN?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO24

Ml-agents：Unity机器学习智能体[Github上1658颗星]。由Unity Juliani提供。

https://github.com/Unity-Technologies/ml-agents?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO25

DeepVideoAnalytics：分布式可视化搜索和可视化数据分析平台[Github上1494颗星]。由康奈尔大学的Akshay Bhat博士提供。

https://github.com/AKSHAYUBHAT/DeepVideoAnalytics/?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO26

OpenNMT：Torch的开源神经网络机器翻译系统[Github上1490颗星]。

https://github.com/OpenNMT/OpenNMT?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO27

Pix2pixHD：使用条件式GAN合成和处理2048×1024的图像[Github上1283颗星]。由Nvidia的AI研究科学家Ming-Yu Liu提供。

https://github.com/NVIDIA/pix2pixHD?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO28

Horovod：TensorFlow的分布式训练框架。[在Github上1188颗星]。由Uber Engineering提供。

https://github.com/uber/horovod?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO29

AI-Blocks：强大而直观的WYSIWYG（所见即所得）接口，允许任何人创建机器学习模型[Github上899颗星]。

https://github.com/MrNothing/AI-Blocks?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

NO30

Tensorflow中语音转化的深度神经网络（主要为语音风格的变换）[Github上845颗星]。由Kakao Brain AI研究院的Dabi Ahn提供。

https://github.com/andabi/deep-voice-conversion?utm_source=mybridge&utm_medium=blog&utm_campaign=read_more

本文参与腾讯云自媒体同步曝光计划，分享自微信公众号。

原始发表：2018-01-09，如有侵权请联系 cloudcommunity@tencent.com 删除

机器学习

本文分享自 ATYUN订阅号微信公众号，前往查看

如有侵权，请联系 cloudcommunity@tencent.com 删除。

本文参与腾讯云自媒体同步曝光计划，欢迎热爱写作的你一起参与！

机器学习

登录后参与评论

0 条评论

热度