Package com.itextpdf.text.pdf.parser

Interface Summary
ContentOperator Interface implemented by a series of content operators
LocationTextExtractionStrategy.TextChunkFilter Specifies a filter for filtering LocationTextExtractionStrategy.TextChunk objects during text extraction
RenderListener A callback interface that receives notifications from the PdfContentStreamProcessor as various render operations are required.
TextExtractionStrategy Defines an interface for RenderListeners that can return text
XObjectDoHandler  
 

Class Summary
ContentByteUtils  
FilteredRenderListener A text render listener that filters text operations before passing them on to a delegate
FilteredTextRenderListener A text render listener that filters text operations before passing them on to a delegate
GraphicsState Keeps all the parameters of the graphics state.
ImageRenderInfo Represents image data from a PDF
InlineImageInfo Represents an inline image from a PDF
InlineImageUtils Utility methods to help with processing of inline images
LineSegment Represents a line segment in a particular coordinate system.
LocationTextExtractionStrategy Development preview - this class (and all of the parser classes) are still experiencing heavy development, and are subject to change both behavior and interface.
LocationTextExtractionStrategy.TextChunk Represents a chunk of text, it's orientation, and location relative to the orientation vector
MarkedContentInfo Represents a Marked Content block in a PDF
MarkedContentRenderFilter A RenderFilter that only allows text within a specified marked content sequence.
Matrix Keeps all the values of a 3 by 3 matrix and allows you to do some math with matrices.
PdfContentReaderTool Tool that parses the content of a PDF document.
PdfContentStreamProcessor Processor for a PDF content Stream.
PdfImageObject An object that contains an image dictionary and image bytes.
PdfReaderContentParser A utility class that makes it cleaner to process content from pages of a PdfReader through a specified RenderListener.
PdfTextExtractor Extracts text from a PDF file.
RegionTextRenderFilter A RenderFilter that only allows text within a specified rectangular region
RenderFilter Interface for defining filters for use with FilteredRenderListener
SimpleTextExtractionStrategy A simple text extraction renderer.
TaggedPdfReaderTool Converts a tagged PDF document into an XML file.
TextMarginFinder Allows you to find the rectangle that contains all the text in a page.
TextRenderInfo Provides information and calculations needed by render listeners to display/evaluate text render operations.
Vector Represents a vector (i.e.
 

Enum Summary
PdfImageObject.ImageBytesType Different types of data that can be stored in the bytes of a PdfImageObject
 

Exception Summary
InlineImageUtils.InlineImageParseException Simple class in case users need to differentiate an exception from processing inline images vs other exceptions
 



Copyright © 2013. All Rights Reserved.