PySpark createDataFrame() throws segmentation fault on Mac
I'm trying to learn PySpark. I finally got it installed by following this tutorial: https://sparkbyexamples.com/pyspark/install-pyspark-in-anaconda-jupyter-notebook/
However, even though I can now create a Spark session (I can access the web UI, create some RDDs, etc.), any time I try to create a DataFrame, no matter how tiny, I get a segmentation fault (in the shell) or my kernel crashes (in a Jupyter notebook).
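For reference, the kind of call that fails looks roughly like the sketch below. The rows, column names, app name, and the `make_tiny_dataframe` helper are placeholders for illustration, not my exact code. It only assumes the standard `SparkSession.builder` and `createDataFrame` APIs.

```python
def make_tiny_dataframe():
    """Build a two-row DataFrame from placeholder data (illustrative only)."""
    # Imported inside the function so the file still loads if PySpark is missing.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master("local[1]")          # single local worker thread
        .appName("tiny-df-repro")    # hypothetical app name
        .getOrCreate()
    )

    rows = [(1, "a"), (2, "b")]      # placeholder data
    df = spark.createDataFrame(rows, ["id", "label"])
    df.show()
    return df
```

Calling `make_tiny_dataframe()` from the shell or a notebook cell is enough to exercise the same code path.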
I'm on macOS Monterey (12.3.1) with Spark 3.1.2, Python 3.9.11 (conda distribution installed via Homebrew), and OpenJDK 11.0.13 (installed through conda).
My understanding is that a segmentation fault can be caused by runaway recursion (my recursion limit is the default of 1000), but I'm honestly not sure whether that is the cause here or how to deal with it.
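In case it helps anyone answering, here is a small standard-library sketch for collecting the interpreter details and for getting a Python traceback out of a crash. The helper names `environment_report` and `enable_crash_tracebacks` are made up for this post. Whether the traceback actually points at the cause is an assumption on my part.

```python
import faulthandler
import platform
import sys


def environment_report():
    """Collect interpreter details that are relevant to a native crash."""
    return {
        "python": platform.python_version(),
        "machine": platform.machine(),            # CPU architecture string
        "recursion_limit": sys.getrecursionlimit(),
    }


def enable_crash_tracebacks():
    """Ask Python to dump the current traceback to stderr on fatal signals."""
    faulthandler.enable()
    return faulthandler.is_enabled()


print(environment_report())
```

Calling `enable_crash_tracebacks()` before the DataFrame call (or starting Python with `python -X faulthandler`) should print the Python stack at the moment of the segmentation fault.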
Any help would be greatly appreciated.