SetaPDF_Extractor_Result_Word This class represents a single word

File: /SetaPDF v2/Extractor/Result/Word.php

Class hierarchy

Implements

Summary

Static Properties

$_ignoreConstructor

Defines if the constructor logic will be ignored or not.


Properties

$_bounds

The bounds of this word.

$_filterId

The filter id of the first text item in.

$_parts

protected self[] SetaPDF_Extractor_Result_Word::$_parts = array()

The initial words that were used to create this word.

$_word

protected array SetaPDF_Extractor_Result_Word::$_word = array()

The resulting word in a specific encoding.


Static Methods

merge()

Merges two SetaPDF_Extractor_Result_Word instances into a new one.

The words text-content will be $resultingString.

Parameters
$a : SetaPDF_Extractor_Result_Word
 
$b : SetaPDF_Extractor_Result_Word
 
$resultingString

In UTF-8 encoding.


Methods

__construct()

public SetaPDF_Extractor_Result_Word::__construct (
array $glyphs
)

The constructor.

Parameters
$glyphs : array
 

cleanUp()

public SetaPDF_Extractor_Result_Word::cleanUp (
void
): void

Release memory and cycled references.

getBounds()

Get the bounds of this word.

getFilterId()

Get the id of the filter instance that accepted this result.

getParts()

Get all parts of this word.

If the word is not created by several words, this method will return itself.

getString()

public SetaPDF_Extractor_Result_Word::getString (
[ $encoding = 'utf-8' ]
):

Get the words string value in a specific encoding.

Parameters
$encoding