Getting Started w/ Python, Spark, and Databricks
Revision as of 19:20, 2 August 2019
Loading Data
Viewing Contents of a Parquet Folder
%fs ls /mnt/training/crime-data-2016
Reading Parquet
df = spark.read.parquet("/mnt/training/crime-data-2016/Crime-Data-Boston-2016.parquet")
Viewing Results
Text Based (Generic)
df.show()
Databricks Native Viewer
display(df)
Combining Dataframes
Union
homicidesCombinedDF = homicidesNewYorkDF.union(homicidesBostonDF)