SetaPDF_Extractor

The main class of the SetaPDF-Extractor Component

File: /SetaPDF v2/Extractor.php

Constants

VERSION

public const string SetaPDF_Extractor::VERSION = '2.46.0.2006'

The version


Properties

$_document

The document instance

$_ignoreFaultyStreams

Defines whether to continue when a stream cannot be decoded.

$_strategy


Methods

__construct()

public SetaPDF_Extractor::__construct (
SetaPDF_Core_Document $document [, SetaPDF_Extractor_Strategy_AbstractStrategy|null $strategy = null [, bool $ignoreFaultyStreams = false ]]
)

The constructor.

Parameters
$document : SetaPDF_Core_Document
 
$strategy : SetaPDF_Extractor_Strategy_AbstractStrategy|null
 
$ignoreFaultyStreams : bool
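
The constructor call can be sketched as follows. This is a minimal illustration, not the library's own example: the helper name createLenientExtractor() is hypothetical, and the autoloader path, the file name and the use of SetaPDF_Core_Document::loadByFilename() in the usage comment are assumptions that do not appear in this section.

```php
<?php
// Assumption: the SetaPDF autoloader is available at a path like this one.
// require_once 'library/SetaPDF/Autoload.php';

/**
 * Hypothetical helper: creates an extractor that continues when a stream
 * cannot be decoded ($ignoreFaultyStreams = true).
 *
 * @param object      $document A SetaPDF_Core_Document instance
 * @param object|null $strategy An extraction strategy, or null for the default
 */
function createLenientExtractor($document, $strategy = null)
{
    // Argument order follows the constructor signature documented above.
    return new SetaPDF_Extractor($document, $strategy, true);
}

// Usage (assumes the library is loaded and input.pdf exists):
// $document  = SetaPDF_Core_Document::loadByFilename('input.pdf');
// $extractor = createLenientExtractor($document);
```

Passing null for $strategy leaves the choice of strategy to the extractor itself.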
 

cleanUp()

public SetaPDF_Extractor::cleanUp (
void
): void

Release cyclic references.
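
One way to make sure cleanUp() is always called is to wrap the work in try/finally. The sketch below assumes nothing about the extractor beyond the cleanUp() method documented here; the helper name withExtractor() is hypothetical.

```php
<?php
/**
 * Hypothetical helper: runs $task with the extractor and always calls
 * cleanUp() afterwards, even if the task throws.
 *
 * @param object   $extractor A SetaPDF_Extractor instance
 * @param callable $task      Receives the extractor, returns any value
 */
function withExtractor($extractor, callable $task)
{
    try {
        return $task($extractor);
    } finally {
        // Release the references held by the extractor.
        $extractor->cleanUp();
    }
}

// Usage:
// $result = withExtractor($extractor, function ($extractor) {
//     return $extractor->getResultByPageNumber(1);
// });
```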

getResultByPageNumber()

Get the result for a specific page, using the default or an individually set strategy.

Parameters
$pageNumber : integer
 
$boundaryBox : string

If set, the result is limited to the rectangle of the given page boundary. See the SetaPDF_Core_PageBoundaries::XXX_BOX constants for possible values.

Exceptions

Throws SetaPDF_Core_Exception
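
A page-by-page extraction loop might look like the sketch below. The helper name extractPages() is hypothetical, and it assumes that page numbers start at 1 and that $boundaryBox may be omitted. The calls in the usage comment (getCatalog()->getPages()->count() and the CROP_BOX constant) are not described in this section and should be checked against the SetaPDF-Core reference.

```php
<?php
/**
 * Hypothetical helper: collects the extraction result of every page.
 * A page whose extraction throws is recorded as null.
 *
 * @param object      $extractor   A SetaPDF_Extractor instance
 * @param int         $pageCount   Number of pages in the document
 * @param string|null $boundaryBox One of the XXX_BOX constant values, or null
 * @return array Results keyed by page number (assumed to start at 1)
 */
function extractPages($extractor, int $pageCount, ?string $boundaryBox = null): array
{
    $results = [];
    for ($pageNumber = 1; $pageNumber <= $pageCount; $pageNumber++) {
        try {
            $results[$pageNumber] = $boundaryBox === null
                ? $extractor->getResultByPageNumber($pageNumber)
                : $extractor->getResultByPageNumber($pageNumber, $boundaryBox);
        } catch (Exception $e) {
            // The method is documented to throw SetaPDF_Core_Exception;
            // catching the base Exception keeps this sketch library-independent.
            $results[$pageNumber] = null;
        }
    }
    return $results;
}

// Usage (assumed SetaPDF-Core calls):
// $pageCount = $document->getCatalog()->getPages()->count();
// $results   = extractPages($extractor, $pageCount, SetaPDF_Core_PageBoundaries::CROP_BOX);
```

The type of each result depends on the strategy in use, so the helper returns the values unchanged.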

setStrategy()

Set the extraction strategy.

Parameters
$strategy : SetaPDF_Extractor_Strategy_AbstractStrategy
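
Switching the strategy before extracting a page could be sketched as follows. The helper name extractWithStrategy() is hypothetical, and the concrete strategy class in the usage comment (SetaPDF_Extractor_Strategy_Word) is an assumption not covered by this section.

```php
<?php
/**
 * Hypothetical helper: sets the given strategy on the extractor and
 * returns the result of one page extracted with it.
 *
 * @param object $extractor  A SetaPDF_Extractor instance
 * @param object $strategy   A SetaPDF_Extractor_Strategy_AbstractStrategy instance
 * @param int    $pageNumber The page to extract
 */
function extractWithStrategy($extractor, $strategy, int $pageNumber)
{
    // The strategy stays active for later calls until it is replaced.
    $extractor->setStrategy($strategy);

    return $extractor->getResultByPageNumber($pageNumber);
}

// Usage (assumed strategy class):
// $words = extractWithStrategy($extractor, new SetaPDF_Extractor_Strategy_Word(), 1);
```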