h2o: h2o.parseSetup – R documentation

Pricing

Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!

Get Started for Free

Documentation

h2o

h2o.parseSetup

Get a parse setup back for the staged data.

Description

Get a parse setup back for the staged data.

Usage

h2o.parseSetup(
  data,
  pattern = "",
  destination_frame = "",
  header = NA,
  sep = "",
  col.names = NULL,
  col.types = NULL,
  na.strings = NULL,
  parse_type = NULL,
  chunk_size = NULL,
  decrypt_tool = NULL,
  skipped_columns = NULL,
  custom_non_data_line_markers = NULL,
  partition_by = NULL,
  single_quotes = FALSE,
  escapechar = ""
)

Arguments

`data`	An H2OFrame object to be parsed.
`pattern`	(Optional) Character string containing a regular expression to match file(s) in the folder.
`destination_frame`	(Optional) The hex key assigned to the parsed file.
`header`	(Optional) A logical value indicating whether the first row is the column header. If missing, H2O will automatically try to detect the presence of a header.
`sep`	(Optional) The field separator character. Values on each line of the file are separated by this character. If `sep = ""`, the parser will automatically detect the separator.
`col.names`	(Optional) An H2OFrame object containing a single delimited line with the column names for the file. If skipped_columns are specified, only list column names of columns that are not skipped.
`col.types`	(Optional) A vector specifying the types to attempt to force over columns. If skipped_columns are specified, only list column types of columns that are not skipped.
`na.strings`	(Optional) H2O will interpret these strings as missing.
`parse_type`	(Optional) Specify which parser type H2O will use. Valid types are "ARFF", "XLS", "CSV", "SVMLight"
`chunk_size`	size of chunk of (input) data in bytes
`decrypt_tool`	(Optional) Specify a Decryption Tool (key-reference acquired by calling h2o.decryptionSetup.
`skipped_columns`	a list of column indices to be excluded from parsing
`custom_non_data_line_markers`	(Optional) If a line in imported file starts with any character in given string it will NOT be imported. Empty string means all lines are imported, NULL means that default behaviour for given format will be used
`partition_by`	(Optional) Names of the columns the persisted dataset has been partitioned by.
`single_quotes`	If set to true, the parser expects single quotes. False for double quotes (default).
`escapechar`	(Optional) One ASCII character used to escape other characters.

Erin LeDell [aut, cre], Navdeep Gill [aut], Spencer Aiello [aut], Anqi Fu [aut], Arno Candel [aut], Cliff Click [aut], Tom Kraljevic [aut], Tomas Nykodym [aut], Patrick Aboyoun [aut], Michal Kurka [aut], Michal Malohlava [aut], Ludi Rehak [ctb], Eric Eckstrand [ctb], Brandon Hill [ctb], Sebastian Vidrio [ctb], Surekha Jadhawani [ctb], Amy Wang [ctb], Raymond Peck [ctb], Wendy Wong [ctb], Jan Gorecki [ctb], Matt Dowle [ctb], Yuan Tang [ctb], Lauren DiPerna [ctb], Tomas Fryda [ctb], H2O.ai [cph, fnd]

Initial release

2021-04-29

h2o.parseSetup

Description

Usage

Arguments

See Also

h2o

We don't support your browser anymore