SetaPDF_Extractor_Strategy_Glyph Extraction strategy for single glyphs.

File: /SetaPDF/Extractor/Strategy/Glyph.php

The result of this strategy is not sorted.

Properties

$_graphicState

$_items

$_lastMatrix

$_resources

The stream resources dictionary.

$_sorter

$_textCount

A text item counter.

$spaceWidthFactor

The font's space character width is divided by this factor to define the minimum gap that is treated as a character separator.
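
The threshold described above can be sketched as follows. This is only an illustration of the arithmetic, not the library's implementation: the function names `minimumSeparatorGap()` and `isSeparatorGap()` are hypothetical, and the assumption that a gap equal to the threshold already counts as a separator is ours.

```php
<?php
// Hypothetical helpers, not part of SetaPDF: they only illustrate how a
// space width and a factor could be combined into a minimum gap.

/**
 * Returns the smallest gap that would count as a character separator.
 */
function minimumSeparatorGap(float $spaceWidth, float $spaceWidthFactor): float
{
    if ($spaceWidthFactor <= 0.0) {
        throw new InvalidArgumentException('The factor must be greater than zero.');
    }

    return $spaceWidth / $spaceWidthFactor;
}

/**
 * Assumption: a gap equal to or larger than the threshold is a separator.
 */
function isSeparatorGap(float $gap, float $spaceWidth, float $spaceWidthFactor): bool
{
    return $gap >= minimumSeparatorGap($spaceWidth, $spaceWidthFactor);
}
```

With a space width of 250 units and a factor of 2, the threshold is 125 units, so a larger factor makes the detection of separators more sensitive.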


Methods

_accept()

Proxy method that forwards the call to a filter instance if available.

Parameters
$textItem : SetaPDF_Extractor_TextItem
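
A minimal sketch of the proxy idea, using hypothetical stand-in types (`SketchFilter`, `SketchStrategy`, `NonEmptyFilter`) and plain strings instead of text items. It assumes that every item is accepted when no filter is set, which the description above does not state explicitly.

```php
<?php
// Hypothetical stand-ins, not SetaPDF classes.
interface SketchFilter
{
    public function accept(string $item): bool;
}

final class SketchStrategy
{
    private ?SketchFilter $filter = null;

    public function setFilter(?SketchFilter $filter): void
    {
        $this->filter = $filter;
    }

    // Forwards the decision to the filter if one is available.
    public function accepts(string $item): bool
    {
        if ($this->filter === null) {
            return true; // assumption: no filter means "accept everything"
        }

        return $this->filter->accept($item);
    }
}

// Example filter that rejects items consisting only of whitespace.
final class NonEmptyFilter implements SketchFilter
{
    public function accept(string $item): bool
    {
        return trim($item) !== '';
    }
}
```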
 

_cleanResult()

public string SetaPDF_Extractor_Strategy_Plain::_cleanResult ( $result $result )

Callback to clean up the resulting text.

Parameters
$result : $result
 

_getParser()

Creates the content stream parser.

Parameters
$stream : string
 

_onAfterShowText()

public void SetaPDF_Extractor_Strategy_Plain::_onAfterShowText ( string $rawString )

Callback that is called after a show text operation was invoked.

Parameters
$rawString : string
 

_onBeforeShowText()

Callback that is called before a show text operation is invoked.

_onBeginOrEndText()

public void SetaPDF_Extractor_Strategy_Plain::_onBeginOrEndText ( array $arguments, string $operator )

Callback for begin or end text operators (BT/ET).

Parameters
$arguments : array
 
$operator : string
 

_onCurrentTransformationMatrix()

public void SetaPDF_Extractor_Strategy_Plain::_onCurrentTransformationMatrix ( array $arguments, string $operator )

Callback for changes of the current transformation matrix (cm).

Parameters
$arguments : array
 
$operator : string
 

_onFormXObject()

public void SetaPDF_Extractor_Strategy_Plain::_onFormXObject ( array $arguments, string $operator )

Callback for painting a specified XObject.

Parameters
$arguments : array
 
$operator : string
 
Exceptions

Throws SetaPDF_Exception_NotImplemented

_onGraphicStateChange()

public void SetaPDF_Extractor_Strategy_Plain::_onGraphicStateChange ( array $arguments, string $operator )

Callback for graphic state change operators (q/Q).

Parameters
$arguments : array
 
$operator : string
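
In PDF content streams, q saves the current graphic state and Q restores the most recently saved one. The following sketch shows that stack behaviour with a hypothetical `SketchGraphicStateStack` class that stores the state as a plain array. It is not the library's graphic state class, and silently ignoring an unbalanced Q is an assumption of this sketch.

```php
<?php
// Hypothetical stack, not a SetaPDF class.
final class SketchGraphicStateStack
{
    /** @var array[] Saved states, the last entry is the current one. */
    private array $stack;

    public function __construct(array $initial = [])
    {
        $this->stack = [$initial];
    }

    public function handle(string $operator): void
    {
        if ($operator === 'q') {
            // Save: push a copy of the current state.
            $this->stack[] = end($this->stack);
        } elseif ($operator === 'Q' && count($this->stack) > 1) {
            // Restore: drop the current state (unbalanced Q is ignored here).
            array_pop($this->stack);
        }
    }

    public function set(string $key, $value): void
    {
        $this->stack[count($this->stack) - 1][$key] = $value;
    }

    public function get(string $key)
    {
        return end($this->stack)[$key] ?? null;
    }
}
```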
 

_onInlineImage()

public void SetaPDF_Extractor_Strategy_Plain::_onInlineImage ( array $arguments, string $operator )

Callback for the inline image operator.

Parameters
$arguments : array
 
$operator : string
 

_onTextPosition()

public void SetaPDF_Extractor_Strategy_Plain::_onTextPosition ( array $arguments, string $operator )

Callback for text position operators.

Parameters
$arguments : array
 
$operator : string
 

_onTextShow()

public void SetaPDF_Extractor_Strategy_Glyph::_onTextShow ( string $arguments, mixed $operator )

Callback that is called if a text should be shown.

Parameters
$arguments : string
 
$operator : mixed
 

_onTextState()

public void SetaPDF_Extractor_Strategy_Plain::_onTextState ( array $arguments, string $operator )

Callback for text state operators.

All states have to be passed to the current graphic state, as defined in PDF 32000-1:2008, Table 52 on page 121.

Parameters
$arguments : array
 
$operator : string
 
Exceptions

Throws SetaPDF_Extractor_Exception

_saveLastMatrix()

protected void SetaPDF_Extractor_Strategy_Plain::_saveLastMatrix ( string $type )

Saves the last matrix by a specific type.

Parameters
$type : string
 

_showText()

protected void SetaPDF_Extractor_Strategy_Glyph::_showText ( $string $string )

Method that shows text.

Parameters
$string : $string
 

_showTextStrings()

public void SetaPDF_Extractor_Strategy_Glyph::_showTextStrings ( array $textStrings )

Callback that is called if text strings should be shown.

Parameters
$textStrings : array
 

getGraphicState()

getResult()

Get all resolved glyphs.

Parameters
$stream : string
 
$resources : SetaPDF_Core_Type_Dictionary
 

getSorter()

Get the sorter instance.

If none was set, a baseline sorter is created automatically.
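
The automatic creation described above is a lazy initialization. A minimal sketch, with the hypothetical classes `SketchBaselineSorter` and `SketchSorterHolder` standing in for the real sorter and strategy:

```php
<?php
// Hypothetical stand-ins, not SetaPDF classes.
final class SketchBaselineSorter
{
}

final class SketchSorterHolder
{
    private ?object $sorter = null;

    public function setSorter(object $sorter): void
    {
        $this->sorter = $sorter;
    }

    // Creates the default sorter on first access and reuses it afterwards.
    public function getSorter(): object
    {
        if ($this->sorter === null) {
            $this->sorter = new SketchBaselineSorter();
        }

        return $this->sorter;
    }
}
```

A sorter passed to the setter takes precedence, and the default instance is only created when the getter is called without one having been set.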

process()

Processes a stream through the plain text strategy.

Parameters
$stream : string
 
$resources : SetaPDF_Core_Type_Dictionary
 

setFilter()

setGraphicState()

Set the graphic state.

Parameters
$graphicState : SetaPDF_Core_Canvas_GraphicState
 

setSorter()

Set a sorter instance.

Parameters
$sorter : SetaPDF_Extractor_Sorter