kedro pyspark starter cannot load spark.SparkDataset

85 views Asked by At

The kedro starter project immediately fails to recognize the spark dataset

kedro new --starter=spaceflights-pyspark-viz
cd projectpath
pip install -r requirements.txt
kedro ipython
Class 'spark.SparkDataset' not found, is this a typo?

Tested with kedro 0.19.1 and .2

any idea?

1

There are 1 answers

4
datajoely On

if you explicitly install pip kedro-datasets[spark.SparkDataset] does it work? This should be part of the requirements.txt