<< OpenSSL 的 Heartbleed 漏洞的影响到底有多少? - 知乎 | 首页 | JVM系列四:生产环境参数实例及分析【生产环境实例增加中】 - redcreen - 博客园 >>

Apache Tika - Apache Tika

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. You can find the latest release on the download page. See the Getting Started guide for instructions on how to start using Tika.

支持的文件格式有:

Supported Document Formats

This page lists all the document formats supported by Apache Tika 0.6. Follow the links to the various parser class javadocs for more detailed information about each document format and how it is parsed by Tika.

阅读全文……

标签 :



发表评论 发送引用通报