Hello, Habr! In anticipation of the start of the course "Ecosystem Hadoop, Spark, Hive" we have prepared for you a translation of a useful article. We also offer to watch a free recording of a demo lesson on the topic: "Spark 3.0: What's New?" ...
(executor) - , (.. ) (node) . , EC2 . r5.4xlarge, , AWS EC2, 16 .
(job), (Cluster Manager). 16 . , Spark , 15 .
![](https://habrastorage.org/getpro/habr/upload_files/870/18a/bed/87018abed61662ea6acacd334faf1bac.png)
, , , , (core) Spark . (X * Y = 15), , , 15 Spark :
![Possible executor configurations Possible executor configurations](https://habrastorage.org/getpro/habr/upload_files/668/345/98e/66834598e0b2285cdb850f1979cf0e34.jpeg)
.
![](https://habrastorage.org/getpro/habr/upload_files/146/253/bd3/146253bd3c015b32e28ece129155ec99.jpeg)
, , - 15 . , , , , , (64 +), . .
![](https://habrastorage.org/getpro/habr/upload_files/492/24d/1ee/49224d1ee077d073be32ea575f845192.jpeg)
, β 15 , . , , , . , . .
![](https://habrastorage.org/getpro/habr/upload_files/0fa/9b6/314/0fa9b6314c7656f43fd582c96147e1b8.png)
10% 384 MB ( , ). big data , Qubole, , . , Β«EnvironmentsΒ» Spark spark.executor.memoryOverhead
.
Spark , . , , , . , , .
![](https://habrastorage.org/getpro/habr/upload_files/f82/fb8/b9c/f82fb8b9c0770c1617284bf2e0563154.jpeg)
, . Spark , 5 β . , , , . , . , .
--executor-cores 5
β , . , , . , β . , EC2 , , , . r5.4xlarge AWS , 128 .
![](https://habrastorage.org/getpro/habr/upload_files/326/6bb/303/3266bb303dabe56ac62fe72232c4db6b.png)
![](https://habrastorage.org/getpro/habr/upload_files/ba4/801/475/ba480147549a29fc4940cada7aa0e20c.gif)
, 112 .
, 112 , . , 3. .
( Qubole), . (112/3) = 37β2,3 = 34,7 = 34.
Spark , . (112/3) = 37 / 1,1 = 33,6 = 33.
Qubole.
--executor-memory 34G
, Spark . . , Spark Spark. , , , . , , 2, . , , , β , .
, , EC2, . , , .
. , , 5.
, , , . , - , 15 , , , . 3, .
. . , 3, 16- , , 14 . !
, , , 3 , .
--num-executors (3x - 1)
4 , .
data- . AWS , . , .
--driver-memory 34G
, , , . 2 , (3x - 2), .
. , , 500 Spark, , . . , , .
--driver-cores 5
?
, , 16 128 , .
--driver-memory 34G --executor-memory 34G --num-executors (3x - 1) --executor-cores 5
:
β , .
, . . , , , .
, , Expedia Group β’, . , , , .
, , . ? ? , , ? - ? 4: Β« Apache Spark Β».