Spark 2.0 -Outer Join Java Example

In SPARK 2, datasets do not have api like leftouterjoin() or rightouterjoin() similar to that of RDD.So if we have to join two datasets, then we need write specialized code which would help us in achieving the outer joins. To the join API we need to pass the join type argument which can various values as below
‘inner’, ‘outer’, ‘full’, ‘fullouter’, ‘leftouter’, ‘left’, ‘rightouter’, ‘right’, ‘leftsemi’, ‘leftanti’ .

customer.csv

order.csv

Output

Leave a Reply

Your email address will not be published. Required fields are marked *