Running more than one Spark Streaming job in Google Dataproc

274 views · Asked by passionate

How do I run more than one Spark Streaming job in a Dataproc cluster? I created multiple queues using capacity-scheduler.xml, but at this rate I will need 12 queues to run 12 different streaming aggregation applications. Any idea?

1 answer
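For context, the per-queue setup described in the question would look roughly like the following capacity-scheduler.xml sketch (queue names and capacity splits here are assumptions for illustration, not taken from the question):

```xml
<configuration>
  <!-- Define one queue per streaming job under root; this scales poorly
       as the number of jobs grows, which is the problem being asked about. -->
  <property>
    <name>yarn.scheduler.capacity.root.queues</name>
    <value>stream1,stream2,stream3</value>
  </property>
  <!-- Each queue gets a fixed slice of cluster capacity (must sum to 100). -->
  <property>
    <name>yarn.scheduler.capacity.root.stream1.capacity</name>
    <value>34</value>
  </property>
  <property>
    <name>yarn.scheduler.capacity.root.stream2.capacity</name>
    <value>33</value>
  </property>
  <property>
    <name>yarn.scheduler.capacity.root.stream3.capacity</name>
    <value>33</value>
  </property>
</configuration>
```

Each job is then submitted with `spark-submit --queue stream1 ...` (and so on), which is why 12 concurrent jobs would require 12 queues under this scheme.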
The Dataproc 1.2 image enables the "fair" ordering policy in YARN's capacity scheduler, which should do what you want without the overhead of maintaining one queue per job [1] [2].
[1] https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_yarn_resource_mgt/content/flexible_scheduling_policies.html
[2] https://community.hortonworks.com/questions/19342/yarn-fair-sharing-ordering-policy-for-capacity-sch.html
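As a hedged sketch of what the answer refers to: the fair ordering policy lets multiple applications share a single capacity-scheduler queue fairly, so all 12 streaming jobs can be submitted to one queue. On a cluster where it is not already enabled, it is set per queue in capacity-scheduler.xml (the queue path `root.default` below is an assumption; adjust to your queue):

```xml
<configuration>
  <!-- Order applications within the queue by fair share instead of FIFO,
       so concurrent streaming jobs each get a fair slice of the queue. -->
  <property>
    <name>yarn.scheduler.capacity.root.default.ordering-policy</name>
    <value>fair</value>
  </property>
</configuration>
```

On Dataproc this kind of property can also be supplied at cluster creation time via the `capacity-scheduler:` prefix, e.g. `gcloud dataproc clusters create my-cluster --properties 'capacity-scheduler:yarn.scheduler.capacity.root.default.ordering-policy=fair'` (cluster name is illustrative).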