List Question
20 TechQA 2020-12-14T05:08:33.303000Find Longest Continuous Streak In Spark
241 views
Asked by sovan
Spark spark.sql.session.timeZone doesn't work with JSON source
3.2k views
Asked by VB_
Cant save table to hive metastore, HDP 3.0
2.2k views
Asked by AudioBubble
Different behaviour of cache method for PySpark dataframes in Spark 2.3
637 views
Asked by max04
pyspark dataframe column value replace with index in another list in pyspark version 2.3
672 views
Asked by kavya
pyspark map each row in dataframe and apply UDF which return dataframe
1k views
Asked by Muhammed Saed
Janusgraph libs cant communicate with hbase in kerberos environment(Failed to specify server's Kerberos principal name)
734 views
Asked by ModdingFox
write pyspark dataframe to csv with out outer quotes
252 views
Asked by kavya
Spark 2.3 Stream-Stream Join lost left table key
169 views
Asked by Xu Yan
Spark - Operation not allowed: alter table replace columns
4k views
Asked by nir
Sharing data across executors in Apache spark
883 views
Asked by A Learner
Read specific file from multiple .gz file in Spark
378 views
Asked by Neeleshkumar S
use corelated subquery in pyspark sql
266 views
Asked by saahil shah
create new column in pyspark dataframe using existing columns
3.1k views
Asked by Shashank BR
Pyspark renaming file in HDFS
1.7k views
Asked by Achyut Vyas
How to build zeppelin 0.8.0 with spark 2.3.2 inbuilt
305 views
Asked by AlphaWolf
Spark(2.3) not able to identify new columns in Parquet table added via Hive Alter Table command
1.8k views
Asked by user2717470
Pyspark self-join with error "Resolved attribute(s) missing"
5k views
Asked by Maviles
Spark shuffle disk spill increase when upgrading versions
82 views
Asked by Barak Freiman
Repartitioning a pyspark dataframe fails and how to avoid the initial partition size
1.8k views
Asked by SarahData