processors
Interfaces, Classes, Traits and Enums
- BmpProcessor
- Used to create crawl summary information
for BMP and ICO files
- CompressedProcessor
- Used to create crawl summary information
for a gz-compressed file whose uncompressed form is a
file type that one of our processors can index.
- DocProcessor
- Used to create crawl summary information
for binary DOC files
- DocxProcessor
- Used to create crawl summary information
for DOCX files
- EpubProcessor
- Used to create crawl summary information
for EPUB files (those served as application/epub+zip)
- GifProcessor
- Used to create crawl summary information
for GIF files
- GitXmlProcessor
- Parent class common to all processors used to create crawl summary
information that involves basically text data
- GopherProcessor
- Used to create crawl summary information
for gopher protocol pages
- HtmlProcessor
- Used to create crawl summary information
for HTML files
- IconProcessor
- Used to create crawl summary information
for ICO files
- ImageProcessor
- Base abstract class common to all processors used to create crawl summary
information from images
- JavaProcessor
- Used to create crawl summary information
for Java source files
- JpgProcessor
- Used to create crawl summary information
for JPEG files
- PageProcessor
- Base class common to all processors of web page data
- PdfProcessor
- Used to create crawl summary information
for PDF files
- PngProcessor
- Used to create crawl summary information
for PNG files
- PptProcessor
- Used to create crawl summary information
for PPT files
- PptxProcessor
- Used to create crawl summary information
for PPTX files
- PythonProcessor
- Used to create crawl summary information
for Python source files
- RobotProcessor
- Processor class used to extract information from robots.txt files
- RssProcessor
- Used to create crawl summary information
for RSS or Atom files
- RtfProcessor
- Used to create crawl summary information
for RTF files
- SitemapProcessor
- Used to create crawl summary information
for sitemap files
- SvgProcessor
- Used to create crawl summary information
for SVG files. This class is unusual in that it
generates thumbnails like the image processor
classes, but when it cannot handle the data as an
image it falls back to text processor handling.
- TextProcessor
- Parent class common to all processors used to create crawl summary
information from data that is essentially text
- VideoProcessor
- Base abstract class common to all processors used to create crawl summary
information from videos
- XlsxProcessor
- Used to create crawl summary information
for XLSX files
- XmlProcessor
- Used to create crawl summary information
for XML files (those served as text/xml)
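
Two of the behaviours described above, CompressedProcessor handing the uncompressed data to another processor and SvgProcessor falling back to text handling, can be illustrated with a small sketch. This is not the library's code: it is written in Python purely for illustration, and the method name `process`, its parameters, the extension-keyed registry, and the shape of the summary dictionary are all assumptions. Only the class names are taken from the list.

```python
import gzip
import xml.etree.ElementTree as ET


class PageProcessor:
    """Hypothetical base class: turns raw page bytes into a summary dict."""

    def process(self, data: bytes, url: str = "") -> dict:
        raise NotImplementedError


class TextProcessor(PageProcessor):
    """Summarizes data that is essentially text."""

    def process(self, data: bytes, url: str = "") -> dict:
        text = data.decode("utf-8", errors="replace")
        # Assumption: a summary is just the first 200 characters.
        return {"type": "text", "description": text[:200]}


class SvgProcessor(TextProcessor):
    """Treats SVG as an image when possible, otherwise as text."""

    def process(self, data: bytes, url: str = "") -> dict:
        try:
            root = ET.fromstring(data)
        except ET.ParseError:
            # Could not read the data as an image: fall back to text handling.
            return super().process(data, url)
        if not root.tag.endswith("svg"):
            return super().process(data, url)
        # Placeholder only; real thumbnail rendering is out of scope here.
        return {"type": "image", "thumb": "placeholder thumbnail for " + url}


class CompressedProcessor(PageProcessor):
    """Gunzips the data, then delegates to a processor for the inner type."""

    def __init__(self, processors_by_extension: dict):
        # Assumption: processors are looked up by file extension.
        self.processors = processors_by_extension

    def process(self, data: bytes, url: str = "") -> dict:
        inner_name = url[:-3] if url.endswith(".gz") else url
        extension = inner_name.rsplit(".", 1)[-1].lower()
        processor = self.processors.get(extension)
        if processor is None:
            # No processor is registered for the uncompressed form.
            return {"type": "unknown", "description": ""}
        return processor.process(gzip.decompress(data), inner_name)
```

In this sketch a caller would build a registry such as `{"txt": TextProcessor(), "svg": SvgProcessor()}`, wrap it in a `CompressedProcessor`, and call `process(data, "notes.txt.gz")`. Making `SvgProcessor` a subclass of `TextProcessor` is one simple way to get the fallback described in the list; how the actual classes are related is not stated here.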