{{sellerTotalView > 1 ? __("sellers", {number: sellerTotalView}) : __("seller", {number: sellerTotalView}) }}, {{numTotalView > 1 ? __("items", {number: numTotalView}) : __("item", {number: numTotalView}) }}
free FREE

Change Your Zip Code

Inventory information and delivery speeds may vary for different locations.

Location History

{{email ? __('Got it!') : __('Restock Alert')}}

We will notify you by email when the item back in stock.

Cancel
Yami

Jingdong book

Spark MLlib机器学习:算法、源码及实战详解

{{buttonTypePin == 3 ? __("Scan to view more PinGo") : __("Scan to start")}}

Spark MLlib机器学习:算法、源码及实战详解

{{__(":people-members", {'people': item.limit_people_count})}} {{ itemCurrency }}{{ item.valid_price }} {{ itemCurrency }}{{ item.invalid_price }} {{ itemDiscount }}
Ends in
{{ itemCurrency }}{{ item.valid_price }}
{{ itemCurrency }}{{ priceFormat(item.valid_price / item.bundle_specification) }}/{{ item.unit }}
{{ itemDiscount }}
{{ itemCurrency }}{{ item.valid_price }} {{ itemCurrency }}{{ priceFormat(item.valid_price / item.bundle_specification) }}/{{ item.unit }} {{ itemCurrency }}{{ item.invalid_price }} {{itemDiscount}}
{{ itemCurrency }}{{ item.valid_price }}
Sale ends in
Sale will starts after Sale ends in
{{ getSeckillDesc(item.seckill_data) }}
{{ __( "Pay with Gift Card to get sale price: :itemCurrency:price", { 'itemCurrency' : itemCurrency, 'price' : (item.giftcard_price ? priceFormat(item.giftcard_price) : '0.00') } ) }} ({{ itemCurrency }}{{ priceFormat(item.giftcard_price / item.bundle_specification) }}/{{ item.unit }}) Details
Best before

Currently unavailable.

We don't know when or if this item will be back in stock.

Unavailable in your area.
Sold Out

Details

Full product details
Editer Recommend

本书系统、全面、深入地解析了SparkMLlib机器学习的相关知识,着力于探索分布式机器学习的底层实现。
以源码为基础,兼顾算法、理论与实战,帮助读者在实际工作中进行MLlib的应用开发和定制开发。
适合大数据、Spark、数据挖掘领域的从业人员阅读。
Content Description

《Spark MLlib机器学习:算法、源码及实战详解》以Spark 1.4.1版本源码为切入点,全面并且深入地解析Spark MLlib模块,着力于探索分布式机器学习的底层实现。
《Spark MLlib机器学习:算法、源码及实战详解》中本着循序渐进的原则,首先解析MLlib的底层实现基础:数据操作及矩阵向量计算操作,该部分是MLlib实现的基础;接着对各个机器学习算法的理论知识进行讲解,并且解析机器学习算法如何在MLlib中实现分布式计算;然后对MLlib源码进行详细的讲解;最后进行MLlib实例的讲解。相信通过《Spark MLlib机器学习:算法、源码及实战详解》的学习,读者可全面掌握Spark MLlib机器学习,能够进行MLlib实战、MLlib定制开发等。
《Spark MLlib机器学习:算法、源码及实战详解》适合大数据、Spark、数据挖掘领域的从业人员阅读,同时也为Spark开发者和大数据爱好者展现了分布式机器学习的原理和实现细节。
Author Description

黄美灵,久邦数码高级数据挖掘工程师,Spark爱好者,致力于分布式机器学习的研究与应用,现从事移动互联网的计算广告和数据变现工作,专注Spark机器学习在计算广告中的研究和实践。
Catalogue

第一部分Spark MLlib基础
第1章Spark机器学习简介 2
1.1机器学习介绍 2
1.2Spark介绍 3
1.3Spark MLlib介绍 4
第2章Spark数据操作 6
2.1Spark RDD操作 6
2.1.1Spark RDD创建操作 6
2.1.2Spark RDD转换操作 7
2.1.3Spark RDD行动操作 14
2.2MLlib Statistics统计操作 15
2.2.1列统计汇总 15
2.2.2相关系数 16
2.2.3假设检验 18
2.3MLlib数据格式 18
2.3.1数据处理 18
2.3.2生成样本 22
第3章Spark MLlib矩阵向量 26
3.1Breeze介绍 26
3.1.1Breeze创建函数 27
3.1.2Breeze元素访问及操作函数 29
3.1.3Breeze数值计算函数 34
3.1.4Breeze求和函数 35
3.1.5Breeze布尔函数 36
3.1.6Breeze线性代数函数 37
3.1.7Breeze取整函数 39
3.1.8Breeze常量函数 40
3.1.9Breeze复数函数 40
3.1.10Breeze三角函数 40
3.1.11Breeze对数和指数函数 40
3.2BLAS介绍 41
3.2.1BLAS向量-向量运算 42
3.2.2BLAS矩阵-向量运算 42
3.2.3BLAS矩阵-矩阵运算 43
3.3MLlib向量 43
3.3.1MLlib向量介绍 43
3.3.2MLlib Vector接口 44
3.3.3MLlib DenseVector类 46
3.3.4MLlib SparseVector类 49
3.3.5MLlib Vectors伴生对象 50
3.4MLlib矩阵 57
3.4.1MLlib矩阵介绍 57
3.4.2MLlib Matrix接口 57
3.4.3MLlib DenseMatrix类 59
3.4.4MLlib SparseMatrix类 64
3.4.5MLlib Matrix伴生对象 71
3.5MLlib BLAS 77
3.6MLlib分布式矩阵 93
3.6.1MLlib分布式矩阵介绍 93
3.6.2行矩阵(RowMatrix) 94
3.6.3行索引矩阵(IndexedRowMatrix) 96
3.6.4坐标矩阵(CoordinateMatrix) 97
3.6.5分块矩阵(BlockMatrix) 98

第二部分Spark MLlib回归算法
第4章Spark MLlib线性回归算法 102
4.1线性回归算法 102
4.1.1数学模型 102
4.1.2最小二乘法 105
4.1.3梯度下降算法 105
4.2源码分析 106
4.2.1建立线性回归 108
4.2.2模型训练run方法 111
4.2.3权重优化计算 114
4.2.4线性回归模型 121
4.3实例 123
4.3.1训练数据 123
4.3.2实例代码 123
第5章Spark MLlib逻辑回归算法 126
5.1逻辑回归算法 126
5.1.1数学模型 126
5.1.2 梯度下降算法 128
5.1.3正则化 129
5.2源码分析 132
5.2.1建立逻辑回归 134
5.2.2模型训练run方法 137
5.2.3权重优化计算 137
5.2.4逻辑回归模型 144
5.3实例 148
5.3.1训练数据 148
5.3.2实例代码 148
第6章Spark MLlib保序回归算法 151
6.1保序回归算法 151
6.1.1数学模型 151
6.1.2L2保序回归算法 153
6.2源码分析 153
6.2.1建立保序回归 154
6.2.2模型训练run方法 156
6.2.3并行PAV计算 156
6.2.4PAV计算 157
6.2.5保序回归模型 159
6.3实例 164
6.3.1训练数据 164
6.3.2实例代码 164

第三部分Spark MLlib分类算法
第7章Spark MLlib贝叶斯分类算法 170
7.1贝叶斯分类算法 170
7.1.1贝叶斯定理 170
7.1.2朴素贝叶斯分类 171
7.2源码分析 173
7.2.1建立贝叶斯分类 173
7.2.2模型训练run方法 176
7.2.3贝叶斯分类模型 179
7.3实例 181
7.3.1训练数据 181
7.3.2实例代码 182
第8章Spark MLlib SVM支持向量机算法 184
8.1SVM支持向量机算法 184
8.1.1数学模型 184
8.1.2拉格朗日 186
8.2源码分析 189
8.2.1建立线性SVM分类 191
8.2.2模型训练run方法 194
8.2.3权重优化计算 194
8.2.4线性SVM分类模型 196
8.3实例 199
8.3.1训练数据 199
8.3.2实例代码 199
第9章Spark MLlib决策树算法 202
9.1决策树算法 202
9.1.1决策树 202
9.1.2特征选择 203
9.1.3决策树生成 205
9.1.4决策树生成实例 206
9.1.5决策树的剪枝 208
9.2源码分析 209
9.2.1建立决策树 211
9.2.2建立随机森林 216
9.2.3建立元数据 220
9.2.4查找特征的分裂及划分 223
9.2.5查找最好的分裂顺序 228
9.2.6决策树模型 231
9.3实例 234
9.3.1训练数据 234
9.3.2实例代码 234

第四部分Spark MLlib聚类算法
第10章Spark MLlib KMeans聚类算法 238
10.1KMeans聚类算法 238
10.1.1KMeans算法 238
10.1.2演示KMeans算法 239
10.1.3初始化聚类中心点 239
10.2源码分析 240
10.2.1建立KMeans聚类 242
10.2.2模型训练run方法 247
10.2.3聚类中心点计算 248
10.2.4中心点初始化 251
10.2.5快速距离计算 254
10.2.6KMeans聚类模型 255
10.3实例 258
10.3.1训练数据 258
10.3.2实例代码 259
第11章Spark MLlib LDA主题模型算法 261
11.1LDA主题模型算法 261
11.1.1LDA概述 261
11.1.2LDA概率统计基础 262
11.1.3LDA数学模型 264
11.2GraphX基础 267
11.3源码分析 270
11.3.1建立LDA主题模型 272
11.3.2优化计算 279
11.3.3LDA模型 283
11.4实例 288
11.4.1训练数据 288
11.4.2实例代码 288

第五部分Spark MLlib关联规则挖掘算法
第12章Spark MLlib FPGrowth关联规则算法 292
12.1FPGrowth关联规则算法 292
12.1.1基本概念 292
12.1.2FPGrowth算法 293
12.1.3演示FP树构建 294
12.1.4演示FP树挖掘 296
12.2源码分析 298
12.2.1FPGrowth类 298
12.2.2关联规则挖掘 300
12.2.3FPTree类 303
12.2.4FPGrowthModel类 306
12.3实例 306
12.3.1训练数据 306
12.3.2实例代码 306

第六部分Spark MLlib推荐算法
第13章Spark MLlib ALS交替最小二乘算法 310
13.1ALS交替最小二乘算法 310
13.2源码分析 312
13.2.1建立ALS 314
13.2.2矩阵分解计算 322
13.2.3ALS模型 329
13.3实例 334
13.3.1训练数据 334
13.3.2实例代码 334
第14章Spark MLlib协同过滤推荐算法 337
14.1协同过滤推荐算法 337
14.1.1协同过滤推荐概述 337
14.1.2用户评分 338
14.1.3相似度计算 338
14.1.4推荐计算 340
14.2协同推荐算法实现 341
14.2.1相似度计算 344
14.2.2协同推荐计算 348
14.3实例 350
14.3.1训练数据 350
14.3.2实例代码 350

第七部分Spark MLlib神经网络算法
第15章Spark MLlib神经网络算法综述 354
15.1人工神经网络算法 354
15.1.1神经元 354
15.1.2神经网络模型 355
15.1.3 信号前向传播 356
15.1.4误差反向传播 357
15.1.5其他参数 360
15.2神经网络算法实现 361
15.2.1神经网络类 363
15.2.2训练准备 370
15.2.3前向传播 375
15.2.4误差反向传播 377
15.2.5权重更新 381
15.2.6ANN模型 382
15.3实例 384
15.3.1测试数据 384
15.3.2测试函数代码 387
15.3.3实例代码 388

Introduction

机器学习是一门多领域交叉学科,涉及概率论、统计学、逼近论、凸分析、计算复杂性理论等多门学科,其中大部分理论来源于18、19世纪,例如贝叶斯定理,是18世纪英国数学家托马斯·贝叶斯(Thomas Bayes)提出的重要概率论理论;而21世纪则侧重于如何将机器学习理论运用在工业化中,帮助改进性能及提升其效率。
机器学习理论主要是设计和分析一些让计算机可以自动“学习”的算法。机器学习算法是一类从数据中自动分析获得规律,并利用规律对未知数据进行预测的算法。在算法设计方面,机器学习理论关注可以实现的、行之有效的学习算法;机器学习研究的不是求解精确的结果,而是研究开发容易处理的近似求解算法。尤其是在21世纪,知识和数据量爆发的时代,机器学习面临大数据的求解难题。
随着数据量的增加,从传统的单机计算发展到大规模的集群计算,以至发展到今天的一种大规模、快速计算的集群平台—Apache Spark。Spark是一个开源集群运算框架,最初由加州大学伯克利分校AMP实验室开发。相对于Hadoop的MapReduce会在执行完工作后将中介资料存放到磁盘中,Spark使用了内存内运算技术,能在资料尚未写入硬盘时即在内存内分析运算。Spark在内存上的运算速度比Hadoop MapReduce的运算速度快100倍,即便是在磁盘上运行也能快10倍。Spark允许将数据加载至集群内存,并多次对其进行查询,非常适合用于机器学习算法。
本书侧重讲解Spark MLlib模块。Spark MLlib是一种高效、快速、可扩展的分布式计算框架,实现了常用的机器学习,如聚类、分类、回归等算法。本文循序渐进,从Spark的基础知识、矩阵向量的基础知识开始,然后再讲解各种算法的理论知识,以及Spark源码实现和实例实战,帮助读者从基础到实践全面掌握Spark MLlib分布式机器学习。
学习本书需要的基础知识包括:Spark基础入门、Scala入门、线性代数基础知识。
本书面向的读者:Spark开发者、大数据工程师、数据挖掘工程师、机器学习工程师、研究生和高年级本科生等。
在本书的编写过程中,何娟、何丹、王蒙、叶月媚参与了全书的编写、整理及校对工作,刘程辉、李俊、廖宏参与了Spark集群运维和第2章数据操作的实例部分工作,刘晓宏、方佳武、于善龙参与了全书的实例部分工作。
本书在写作的过程中,得到了很多朋友及同事的帮助和支持,在此表示衷心感谢!
感谢久邦数码大数据团队的同事们。在两年的工作中,笔者得到了很多同事的指导、支持和帮助,尤其感谢杨树清、周小平、梁宁、刘程辉、刘晓宏、方佳武、于善龙、王蒙、叶月媚、廖宏、谭钊承、吴梦玲、邹桂芳、曹越等。
感谢电子工业出版社的付睿编辑,她不仅积极策划和推动本书的出版,而且在写作过程中还给出了极为详细的改进意见。感谢电子工业出版社的李云静编辑为本书做了非常辛苦和专业的编辑工作。
感谢我的父母和妻子,有了你们的帮助和支持,我才有时间和精力去完成写作。
谨以此书献给热爱大数据技术的朋友们!

Specifications

Brand Jingdong book
Brand Origin China

Disclaimer

Product packaging, specifications and price are subject to change without notice. All information about the products on our website is provided for information purposes only. Please always read labels, warnings and directions provided with the product before use.

View Full Terms of Use
Add to favorites
{{ $isZh ? coupon.coupon_name_sub : coupon.coupon_ename_sub | formatCurrency }}
{{__("Buy Directly")}} {{ itemCurrency }}{{ item.directly_price }}
Quantity
{{ quantity }}
{{ instockMsg }}
{{ limitText }}
{{buttonTypePin == 3 ? __("Scan to view more PinGo") : __("Scan to start")}}
Sold by JD@CHINA
Ship to
{{ __("Ship to United States only") }}
Free shipping over 69
Genuine guarantee

Added to Cart

Keep Shopping

More to Consider

{{ item.brand_name }}

{{ item.item_name }}

{{ item.currency }}{{ item.market_price }}

{{ item.currency }}{{ item.unit_price }}

{{ item.currency }}{{ item.unit_price }}

Coupons

{{ coupon.coupon_name_new | formatCurrency }}
Clip Clipped Over
{{ getCouponDescStr(coupon) }}
{{ coupon.use_time_desc }}
Expires soon {{ formatTime(coupon.use_end_time) }}

Share this item with friends

Cancel

Yami Gift Card

Get this exclusive deal when paying with gift card

Terms and Conditions

Gift card deals are special offers for selected products;

The gift card deals will automatically be activated if a customer uses gift card balance at check out and the balance is sufficient to pay for the total price of the shopping cart products with gift card deals;

You will not be able to activate the gift card deals if you choose other payment methods besides gift card. The products will be purchased at their normal prices;

If your account balance is not enough to pay for the products with gift card deals, you can choose to reload your gift card balance by clicking on the Reload button at either shopping cart page or check out page;

Products that have gift card deals can be recognized by a special symbol showing 'GC Deal';

For any additional questions or concerns, please contact our customer service;

Yamibuy reserves the right of final interpretation.

Sold by Yami

Service Guarantee

Yami Free Shipping over $49
Yami Easy Returns
Yami Ships from United States

Shipping

  • United States

    Standard Shipping is $5.99 (Excluding Alaska & Hawaii). Free on orders of $49 or more.

    Local Express is $5.99 (Available in Parts of CA, NJ, MA & PA). Free on orders of $49 or more.

    2-Day Express (Includes Alaska & Hawaii) starts at $19.99.

Return Policy

Yami is committed to provide our customers with a peace of mind when purchasing from us. Most items shipped from Yamibuy.com can be returned within 30 days of receipt of shipment (For Food, Beverages, Snacks, Dry Goods, Health supplements, Fresh Grocery and Perishables Goods, within 7 days of receipt of shipment due to damages or quality issues; To ensure that every customer receives safe and high-quality products, we do not provide refunds or returns for beauty products once they have been opened or used, except in the case of quality issues; Some products may have different policies or requirements associated with them, please see below for products under special categories, or contact Yami Customer Service for further assistance).
Thank you for your understanding and support.

Learn More

Sold by Yami

Terms and Conditions of Yami E-Gift Card

If you choose “Redeem automatically” as your delivery method, your gift card balance will be reload automatically after your order has been processed successfully;

If you choose “Send to Email”as your delivery method, the card number and CVV will be sent to the email address automatically;

Any user can use the card number and CVV to redeem the gift card, please keep your gift card information safely. If you have any trouble receiving email, please contact Yami customer service;

Yami gift card can be used to purchase both Yami owned or Marketplace products;

Yami gift card will never expire;

Yami gift card balance does not have to be used up at once;

All rights reserved by Yami.

Return Policy

Gift card that has already been consumed is non-refundable.

Sold by JD@CHINA

Service Guarantee

Yami Free Shipping over $49
Yami Easy Returns
Yami Ships from United States

Shipping

  • United States

    Standard Shipping is $5.99 (Excluding Alaska & Hawaii). Free on orders of $49 or more.

    Local Express is $5.99 (Available in Parts of CA, NJ, MA & PA). Free on orders of $49 or more.

    2-Day Express (Includes Alaska & Hawaii) starts at $19.99.

Return Policy

You may return product within 30 days upon receiving the product. Items returned must be new in it's original packing, including the original invoice for the purchase. Customer return product at their own expense.

Sold by JD@CHINA

Service Guarantee

Yami Cross-store Free Shipping over $69
Yami 30-days Return

Yami-China FC

Yami has a consolidation warehouse in China which collects multiple sellers’ packages and combines to one order. Our Yami consolidation warehouse will directly ship the packages to your door. Cross-store free shipping over $69.

Return Policy

You may return products within 30 days upon receiving the products. Sellers take responsibilities for any wrong shipment or missing items. Packing needs to be unopened for any other than quality issues return. We promise to pack carefully, but because goods are taking long journey to destinations, simple damages to packaging may occur. Any damages not causing internal goods quality problems are not allowed to return. If you open the package and any quality problem is found, please contact customer service within three days after receipt of goods.

Shipping Information

Yami Consolidation Service Shipping Fee $9.99(Free shipping over $69)

Sellers in China will ship their orders within 1-2 business days once the order is placed. Packages are sent to our consolidation warehouse in China and combined there. Our Yami consolidation warehouse will directly ship the packages to you via UPS. The average time for UPS to ship from China to the United States is about 10 working days and it can be traced using the tracking number. Due to the pandemic, the delivery time may be delayed by about 5 days. The package needs to be signed by the guest. If the receipt is not signed, the customer shall bear the risk of loss of the package.

Sold by JD@CHINA

Service Guarantee

Free shipping over 69
Genuine guarantee

Shipping

Yami Consolidated Shipping $9.99(Free shipping over $69)


Seller will ship the orders within 1-2 business days. The logistics time limit is expected to be 7-15 working days. In case of customs clearance, the delivery time will be extended by 3-7 days. The final receipt date is subject to the information of the postal company.

Yami Points information

All items are excluding from any promotion or points events on Yamibuy.com

Return Policy

You may return product within 30 days upon receiving the product. Items returned must be new in it's original packing, including the original invoice for the purchase. Customer return product at their own expense.

Yami

Download the Yami App

Back Top

Recommended for You

About the brand

Jingdong book

为您推荐

Yami
欣葉
2种选择
欣叶 御大福 芋头麻薯 180g

周销量 600+

$1.66 $1.99 83折
Yami
欣葉
2种选择
欣叶 御大福 芋头麻薯 180g

周销量 600+

$1.66 $1.99 83折
Yami
欣葉
2种选择
欣叶 御大福 芋头麻薯 180g

周销量 600+

$1.66 $1.99 83折
Yami
欣葉
2种选择
欣叶 御大福 芋头麻薯 180g

周销量 600+

$1.66 $1.99 83折
Yami
欣葉
2种选择
欣叶 御大福 芋头麻薯 180g

周销量 600+

$1.66 $1.99 83折
Yami
欣葉
2种选择
欣叶 御大福 芋头麻薯 180g

周销量 600+

$1.66 $1.99 83折

Reviews{{'('+ commentList.posts_count + ')'}}

Have your say. Be the first to help other guests.

Write a review
{{ totalRating }} Write a review
  • {{i}} star

    {{i}} stars

    {{ parseInt(commentRatingList[i]) }}%

Yami Yami
{{ comment.user_name }}

{{ showTranslate(comment) }}Show Less

{{ strLimit(comment,800) }}Show more

Show Original

{{ comment.content }}

Yami
Show All

{{ formatTime(comment.in_dtm) }} VERIFIED PURCHASE {{groupData}}

{{ comment.likes_count }} {{ comment.likes_count }} {{ comment.reply_count }} {{comment.in_user==uid ? __('Delete') : __('Report')}}
Yami Yami
{{ comment.user_name }}

{{ showTranslate(comment) }}Show Less

{{ strLimit(comment,800) }}Show more

Show Original

{{ comment.content }}

Yami
Show All

{{ formatTime(comment.in_dtm) }} VERIFIED PURCHASE {{groupData}}

{{ comment.likes_count }} {{ comment.likes_count }} {{ comment.reply_count }} {{comment.in_user==uid ? __('Delete') : __('Report')}}

No related comment~

Review

Yami Yami

{{ showTranslate(commentDetails) }}Show Less

{{ strLimit(commentDetails,800) }}Show more

Show Original

{{ commentDetails.content }}

Yami
Show All

{{ formatTime(commentDetails.in_dtm) }} VERIFIED PURCHASE {{groupData}}

{{ commentDetails.likes_count }} {{ commentDetails.likes_count }} {{ commentDetails.reply_count }} {{commentDetails.in_user==uid ? __('Delete') : __('Report')}}

Please write at least one word

Comments{{'(' + replyList.length + ')'}}

Yami Yami

{{ showTranslate(reply) }}Show Less

{{ strLimit(reply,800) }}Show more

Show Original

{{ reply.reply_content }}

{{ formatTime(reply.reply_in_dtm) }}

{{ reply.reply_likes_count }} {{ reply.reply_likes_count }} {{ reply.reply_reply_count }} {{reply.reply_in_user==uid ? __('Delete') : __('Report')}}

Please write at least one word

Cancel

That’s all the comments so far!

Write a review
How would you rate this item?

Please add your comment.

  • A nice nickname will make your comments more popular!
  • The nickname in your account will be changed to the same as here.
Thanks for your review
Our community rely on great reviews like yours to find the best of Asia.

Report

If you find this content inappropriate and think it should be removed from the Yami.com site, let us know please.

Cancel

Are you sure to delete your review?

Cancel

You’ve Recently Viewed

About the brand

Jingdong book