DocumentWord Class
A word object consisting of a contiguous sequence of characters. For non-space delimited languages, such as Chinese, Japanese, and Korean, each character is represented as its own word.
Constructor
DocumentWord(*args: Any, **kwargs: Any)
Variables
| Name | Description |
|---|---|
| content | Text content of the word. Required. |
| polygon | Bounding polygon of the word, with coordinates specified relative to the top-left of the page. The numbers represent the x, y values of the polygon vertices, clockwise from the left (-180 degrees inclusive) relative to the element orientation. |
| span | Location of the word in the reading order concatenated content. Required. |
| confidence | Confidence of correctly extracting the word. Required. |
Methods
| Name | Description |
|---|---|
| as_dict | Return a dict that can be turned into json using json.dump. |
| clear | |
| copy | |
| get | |
| items | |
| keys | |
| pop | |
| popitem | |
| setdefault | |
| update | |
| values | |
as_dict
Return a dict that can be turned into json using json.dump.
as_dict(*, exclude_readonly: bool = False) -> Dict[str, Any]
Keyword-Only Parameters
| Name | Description |
|---|---|
| exclude_readonly | Whether to remove the readonly properties. Default value: False |
Returns
| Type | Description |
|---|---|
| Dict[str, Any] | A JSON-compatible dict. |
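As a minimal sketch of what "JSON compatible" means here, the example below serializes a plain dict shaped like the variables listed above. The dict is a hypothetical stand-in for the value `as_dict()` returns, and its field values are made up for illustration; the real output of the SDK may contain different or additional keys.

```python
import json

# Hypothetical stand-in for the dict returned by DocumentWord.as_dict().
# The keys follow the variables documented above; the values are invented.
word_dict = {
    "content": "Hello",
    "polygon": [1.0, 1.0, 2.0, 1.0, 2.0, 1.5, 1.0, 1.5],
    "span": {"offset": 0, "length": 5},
    "confidence": 0.98,
}

# A JSON-compatible dict contains only str, numbers, bool, None, lists,
# and nested dicts, so it serializes without a custom encoder.
serialized = json.dumps(word_dict, sort_keys=True)

# Parsing the string again yields an equal dict.
restored = json.loads(serialized)
```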
clear
clear() -> None
copy
copy() -> Model
get
get(key: str, default: Any = None) -> Any
Parameters
| Name | Description |
|---|---|
| key | Required |
| default | Default value: None |
items
items() -> ItemsView[str, Any]
keys
keys() -> KeysView[str]
pop
pop(key: str, default: Any = <object object>) -> Any
Parameters
| Name | Description |
|---|---|
| key | Required |
| default | |
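The `<object object>` default in the signature looks like a sentinel value, which would let the method tell "no default supplied" apart from "default is None". The sketch below shows that general Python idiom on a plain dict. The names `_MISSING` and `pop_field` are hypothetical, and the assumption that `pop` behaves like `dict.pop` (raising `KeyError` when the key is absent and no default is given) is not confirmed by this page.

```python
from typing import Any, Dict

# Hypothetical sentinel: a unique object that no caller can pass by accident.
_MISSING = object()


def pop_field(data: Dict[str, Any], key: str, default: Any = _MISSING) -> Any:
    """Remove key from data and return its value, mirroring dict.pop."""
    if default is _MISSING:
        # No default supplied: a missing key raises KeyError.
        return data.pop(key)
    # A default was supplied (possibly None): return it for a missing key.
    return data.pop(key, default)


fields = {"content": "Hello", "confidence": 0.98}
content = pop_field(fields, "content")
missing = pop_field(fields, "polygon", None)
```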
popitem
popitem() -> Tuple[str, Any]
setdefault
setdefault(key: str, default: Any = <object object>) -> Any
Parameters
| Name | Description |
|---|---|
| key | Required |
| default | |
update
update(*args: Any, **kwargs: Any) -> None
values
values() -> ValuesView[Any]
Attributes
confidence
Confidence of correctly extracting the word. Required.
confidence: float
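One common use of a per-word confidence is flagging words for manual review. The sketch below does this on plain `(content, confidence)` tuples rather than real `DocumentWord` objects. The helper name `low_confidence_words` and the 0.8 threshold are illustrative assumptions, not values recommended by the SDK.

```python
from typing import Iterable, List, Tuple


def low_confidence_words(
    words: Iterable[Tuple[str, float]], threshold: float = 0.8
) -> List[str]:
    """Return the content of every word whose confidence is below threshold."""
    return [content for content, confidence in words if confidence < threshold]


# Invented sample data standing in for (word.content, word.confidence) pairs.
sample = [("Invoice", 0.99), ("N0.", 0.42), ("1234", 0.91)]
flagged = low_confidence_words(sample)
```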
content
Text content of the word. Required.
content: str
polygon
Bounding polygon of the word, with coordinates specified relative to the top-left of the page. The numbers represent the x, y values of the polygon vertices, clockwise from the left (-180 degrees inclusive) relative to the element orientation.
polygon: List[float] | None
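Because the polygon is a flat list of alternating x and y values, it is often convenient to regroup it into vertex pairs. The following is a minimal sketch of that regrouping; `polygon_to_points` is a hypothetical helper, not part of the SDK, and the sample coordinates are invented.

```python
from typing import List, Optional, Tuple


def polygon_to_points(polygon: Optional[List[float]]) -> List[Tuple[float, float]]:
    """Group a flat [x1, y1, x2, y2, ...] list into (x, y) vertex tuples."""
    if not polygon:
        # The attribute is typed as optional, so treat None or empty as no vertices.
        return []
    if len(polygon) % 2 != 0:
        raise ValueError("polygon must contain an even number of values")
    # Even indices hold x values, odd indices hold y values.
    return list(zip(polygon[0::2], polygon[1::2]))


points = polygon_to_points([1.0, 1.0, 2.0, 1.0, 2.0, 1.5, 1.0, 1.5])
```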
span
Location of the word in the reading order concatenated content. Required.
span: _models.DocumentSpan
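A span locates the word inside the concatenated content of the whole document. The sketch below assumes a span carries an `offset` and a `length`, and that both count Python string characters; the `Span` dataclass is a hypothetical stand-in for `DocumentSpan`, and the actual unit of the offsets may depend on how the analysis was requested.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """Hypothetical stand-in for DocumentSpan (assumed fields: offset, length)."""

    offset: int
    length: int


def text_for_span(full_content: str, span: Span) -> str:
    """Slice the concatenated document content covered by span."""
    return full_content[span.offset : span.offset + span.length]


# Invented sample content; in practice this would be the document-level content.
full_content = "Hello world"
word_text = text_for_span(full_content, Span(offset=6, length=5))
```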