Join Spark tbls.
These functions are wrappers around their 'dplyr' equivalents that set Spark SQL-compliant values for the 'suffix' argument by replacing dots ('.') with underscores ('_'). See [join] for a description of the general purpose of the functions.
## S3 method for class 'tbl_spark' inner_join( x, y, by = NULL, copy = FALSE, suffix = c("_x", "_y"), auto_index = FALSE, ..., sql_on = NULL ) ## S3 method for class 'tbl_spark' left_join( x, y, by = NULL, copy = FALSE, suffix = c("_x", "_y"), auto_index = FALSE, ..., sql_on = NULL ) ## S3 method for class 'tbl_spark' right_join( x, y, by = NULL, copy = FALSE, suffix = c("_x", "_y"), auto_index = FALSE, ..., sql_on = NULL ) ## S3 method for class 'tbl_spark' full_join( x, y, by = NULL, copy = FALSE, suffix = c("_x", "_y"), auto_index = FALSE, ..., sql_on = NULL )
x |
A pair of lazy data frames backed by database queries. |
y |
A pair of lazy data frames backed by database queries. |
by |
A character vector of variables to join by. If To join by different variables on To join by multiple variables, use a vector with length > 1.
For example, To perform a cross-join, generating all combinations of |
copy |
If This allows you to join tables across srcs, but it's potentially expensive operation so you must opt into it. |
suffix |
If there are non-joined duplicate variables in |
auto_index |
if |
... |
Other parameters passed onto methods. |
sql_on |
A custom join predicate as an SQL expression.
Usually joins use column equality, but you can perform more complex
queries by supply |
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.