h2. Summary
| Purpose | {excerpt}Detects and extracts metadata and text content from documents.{excerpt} |
| Homepage \\ | [http://tika.apache.org/] |
| Source Code Repository \\ | [https://github.com/apache/tika] |
| License \\ | Apache License, Version 2.0 \\ |
| Debian Package | |
h2. Description
Java based tool for detecting and extracting metadata and text content from documents.
h2. User Experiences
e.g. links to AQuA/SCAPE/Hackathon issues that use the tool
* [SP:IS25 Web Content Characterisation]
* [SP:SO11 The Tika characterisation Tool]
* [SO17 Web Archive Mime-Type detection workflow based on Droid and Apache Tika|SP:SO17 Web Archive Mime-Type detection workflow based on Droid and Apache Tika]
h2. News Feeds
h3. Release Feed
Link to any RSS feed that is updated when new releases occur, if any, e.g:
{rss:max=7|url=http://projects.apache.org/feeds/rss/tika.xml}
h3. Activity Feed
Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:
{rss:max=7|url=https://issues.apache.org/jira/activity?maxResults=10&streams=key+IS+TIKA}
h2. Searching for Tika on OPF Labs
{search:query=Tika\|type=page}
| Purpose | {excerpt}Detects and extracts metadata and text content from documents.{excerpt} |
| Homepage \\ | [http://tika.apache.org/] |
| Source Code Repository \\ | [https://github.com/apache/tika] |
| License \\ | Apache License, Version 2.0 \\ |
| Debian Package | |
h2. Description
Java based tool for detecting and extracting metadata and text content from documents.
h2. User Experiences
e.g. links to AQuA/SCAPE/Hackathon issues that use the tool
* [SP:IS25 Web Content Characterisation]
* [SP:SO11 The Tika characterisation Tool]
* [SO17 Web Archive Mime-Type detection workflow based on Droid and Apache Tika|SP:SO17 Web Archive Mime-Type detection workflow based on Droid and Apache Tika]
h2. News Feeds
h3. Release Feed
Link to any RSS feed that is updated when new releases occur, if any, e.g:
{rss:max=7|url=http://projects.apache.org/feeds/rss/tika.xml}
h3. Activity Feed
Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:
{rss:max=7|url=https://issues.apache.org/jira/activity?maxResults=10&streams=key+IS+TIKA}
h2. Searching for Tika on OPF Labs
{search:query=Tika\|type=page}