#include <string_ops.h>

Split elements of input based on delimiter into a SparseTensor.

Summary

Let N be the size of source (typically N will be the batch size). Split each element of input based on delimiter and return a SparseTensor containing the splitted tokens. Empty tokens are ignored.

delimiter can be empty, or a string of split characters. If delimiter is an empty string, each element of input is split into individual single-byte character strings, including splitting of UTF-8 multibyte sequences. Otherwise every character of delimiter is a potential split point.

For example: N = 2, input[0] is 'hello world' and input[1] is 'a b c', then the output will be

indices = [0, 0; 0, 1; 1, 0; 1, 1; 1, 2] shape = [2, 3] values = ['hello', 'world', 'a', 'b', 'c']

Args:

  • scope: A Scope object
  • input: 1-D. Strings to split.
  • delimiter: 0-D. Delimiter characters (bytes), or empty string.

Optional attributes (see Attrs):

  • skip_empty: A bool. If True, skip the empty strings from the result.

Returns:

  • Output indices: A dense matrix of int64 representing the indices of the sparse tensor.
  • Output values: A vector of strings corresponding to the splited values.
  • Output shape: a length-2 vector of int64 representing the shape of the sparse tensor, where the first value is N and the second value is the maximum number of tokens in a single input entry.

Constructors and Destructors

StringSplit(const ::tensorflow::Scope & scope, ::tensorflow::Input input, ::tensorflow::Input delimiter)
StringSplit(const ::tensorflow::Scope & scope, ::tensorflow::Input input, ::tensorflow::Input delimiter, const StringSplit::Attrs & attrs)

Public attributes

indices
operation
shape
values

Public static functions

SkipEmpty(bool x)

Structs

tensorflow::ops::StringSplit::Attrs

Optional attribute setters for StringSplit.

Public attributes

Public functions

Public static functions

SkipEmpty

Attrs SkipEmpty(
  bool x
)

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.

Last updated 2021-11-15 UTC.