List Question
20 TechQA 2017-12-06T22:53:48.333000pyspark 1.4 how to get list in aggregated function
160 views
Asked by Helen Z
Python versions in worker node and master node vary
1k views
Asked by Abhi
Inconsistent JSON schema guess with Spark dataframes
713 views
Asked by Victor
Spark: DecoderException: java.lang.OutOfMemoryError
1.9k views
Asked by user3646174
Spark worker node removed but not gone
989 views
Asked by user1342645
Cannot start spark-shell
8.6k views
Asked by worldterminator
Spark + Kafka integration - mapping of Kafka partitions to RDD partitions
984 views
Asked by jithinpt
Select values from a dataframe column
884 views
Asked by the3rdNotch
Custom Transformer in PySpark Pipeline with Cross Validation
2.5k views
Asked by vkoe
Slow or incomplete saveAsParquetFile from EMR Spark to S3
1.1k views
Asked by Kirk Broadhurst
Why can't YARN acquire any executor when dynamic allocation is enabled?
3.7k views
Asked by gunererd
Spark Scala how to execute
161 views
Asked by Ravi Reddy
DataFrame join optimization - Broadcast Hash Join
109.9k views
Asked by NNamed
In Apache Spark SQL, How to close metastore connection from HiveContext
1.8k views
Asked by tribbloid
Find size of data stored in rdd from a text file in apache spark
7.9k views
Asked by Pawan B
Unable to save an RDD[String] as a text file using saveAsTextFile
3.2k views
Asked by Ravitej Somayajula
Spark 1.4 Mllib LDA topicDistributions() returning wrong number of documents
473 views
Asked by smannan
Spark SQL + Streaming issues
539 views
Asked by Subhash Vaddiparty
Databricks - How to create a Library with updated maven artifacts
375 views
Asked by sag
Spark grouping and custom aggregation
4k views
Asked by Akash