SAMshare

272 articles
311,956 views
39 subscribers
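A 30,000-Character Deep Dive: An Introductory PySpark Tutorial with a Framework Mindset
PySpark is the Python interface to Spark: it lets us write Spark programs by calling a Python API, and it supports most Spark features, such as Spark DataFrame, Spark SQL, Streaming, and MLlib. With a grasp of basic Python syntax, harnessing Spark from Python becomes quite easy. Below I introduce PySpark from a relatively high level, so that we get a framework-level understanding of this powerful tool: what it can do, and where to look for answers. The aim is that reading this article makes getting started with PySpark smoother. Without further ado, let's begin!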
Sam Gor
2021-08-13
9.1K · 0
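Tuning Approaches When Developing with PySpark (Part 2)
Next, we go through some commonly used Spark resource configuration parameters. Understanding how each parameter works makes it easier to configure them according to the actual data.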
Sam Gor
2021-08-13
2K · 0
[013] 7 Common Data Structures Explained with Illustrations
Data structures are fundamental constructs that are used to build programs. Each data structure has its own way of organizing data, which may work efficiently in particular use cases. With their own particular structures, data structures offer alternative solutions to data organization, management, storage, access, and modification tasks.
Sam Gor
2021-02-03
413 · 0
[008] Not Sure Where to Start with Data Cleaning? This Checklist Is Highly Recommended
It was before the Stack Overflow era, so not much help was available online. Some people would print out cheatsheets of different kinds and hang them on the walls around their workstations. Having a couple of pages of frequently used code in front of the desk was an efficient way of correcting syntax errors.
Sam Gor
2020-12-02
423 · 0
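[007] Keep This Python Visualization Cheat Sheet Handy!
The author summarizes common demos for EDA visualization in Python and also walks us through a case study. The code is reusable and covers common chart types, including line charts, bar charts, column charts, stacked charts, and pie charts. Skim it, then bookmark it for later!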
Sam Gor
2020-11-23
325 · 0
[006] Your Model Isn't Delivering in Production: What Now?
The reasons for this are often down to three major factors:
Sam Gor
2020-11-19
281 · 0
[005] The Three Essentials of Python Exception Handling: Try, Except, and Assert!
The dream of every software programmer is to write a program that runs smoothly. However, this is not usually the case at first. Execution of the code stops when an error occurs.
Sam Gor
2020-11-19
819 · 0
[004] 5 Mistakes You Must Avoid When Using SQL!
SQL and Machine Learning have a few things in common. It’s easy to start with one as it doesn’t require a lot of coding. Also, code rarely crashes.
Sam Gor
2020-11-19
615 · 0
[003] 5 Types of Machine Learning Algorithms You Need to Know!
One of the most well-known and essential sub-fields of data science is machine learning. The term machine learning was first used in 1959 by IBM researcher Arthur Samuel. From there, the field of machine learning gained much interest from others, especially for its use in classifications.
Sam Gor
2020-11-19
394 · 0
[002] Common String Operations in Python in One Article
As the co-founder of Microsoft says, I invite you to continue stretching your mind in an effort to broaden your programming skills with potential applications in many domains. The purpose of the article is to serve as a cheat-sheet for built-in methods of one of the basic Python data types: strings. A string is a data type in the Python programming language that's used to represent a piece of text. Strings are super flexible and necessary to appropriately represent text inputs in code. As a result, learning how to make the most out of them is a must.
Sam Gor
2020-11-19
394 · 0
[001] 7 Simple Steps to Nail the Data Science "Take-Home Assignment"!
Congratulations, you passed the initial interview and have moved on to the data science project! The recruiter has given you a set of extremely vague instructions at this point. You might find yourself at 2 am staring at the computer, resembling the man in the photo above.
Sam Gor
2020-11-11
482 · 0
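Linux Basic - Incomplete Excerpts 1
When learning Linux, first decide whether you are learning the Linux operating system or a specific Linux distribution (Ubuntu, CentOS, Fedora, OpenSUSE, Debian, Mint, and so on). A distribution is an operating system assembled by an individual or a team; it usually includes system software and applications, plus an installer that simplifies the initial setup. A typical Linux distribution includes the Linux kernel, some GNU libraries and tools, a command-line shell, the X Window System for the graphical interface, and a corresponding desktop environment.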
Sam Gor
2020-07-16
669 · 0
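Brother Che Walks You Step by Step Through Setting Up an AI Project Environment
This series is a beginner's tutorial covering an entire AI project: setting up a dual-boot system, using Linux, installing the GPU version of Caffe, implementing object detection with Caffe, and porting the model to Android so that detection runs on a phone. This article covers installing the dual-boot system. The author guarantees it is original work and describes it as just about the most detailed hands-on tutorial available online at present.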
Sam Gor
2019-11-04
2.5K · 0
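An Incisive Linux Must-Know Guide, Worth Bookmarking
SamGor: 叉烧哥 recently wrote a set of Linux study notes. After reading them I found them remarkably incisive, and for anyone who has not yet used Linux they are a real blessing. Recommended reading for everyone.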
Sam Gor
2019-10-15
848 · 0
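8 Super Useful Python Programming Tips
When we use Python for machine learning modeling projects, everyone has their own habits for managing project files. I have my own approach too, distilled from pitfalls I have run into, and I am sharing it here in the hope that parts of it are useful to others.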
Sam Gor
2019-07-08
457 · 0